PR #37009: Add intent-aware model routing middleware for agents

imadhajaz · 2026-04-25T23:10:13Z

[langchain] PR 37009: Add intent-aware model routing middleware for agents - Repository: langchain-ai/langchain - Author: imadhajaz - State: closed | merged: F… # PR #37009: Add intent-aware model routing middleware for agents - Repository: langchain-ai/langchain - Author: imadhajaz - State: closed | merged: False - Link: https://github.com/langchain-ai/langchain/pull/37009 ## Description (problem / solution / changelog) Resolves #37010 ## Summary - add IntentAwareModelRouterMiddleware to route model calls to fast, balanced, or quality model tiers based on prompt intent - add routing trace metadata to model responses (policy, policy_version, route_reason, and decision_features) for transparency and debugging - export the middleware in the public middleware API and add focused unit tests for route selection and fallback composition ## Test plan - [x] PYTHONPATH=libs/langchain_v1 .venv/bin/pytest libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_routing.py - [x] PYTHONPATH=libs/langchain_v1 .venv/bin/pytest libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_fallback.py libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_routing.py libs/langchain_v1/tests/unit_tests/agents/middleware/core/test_wrap_model_call.py ## Changed files - `libs/langchain_v1/langchain/agents/middleware/__init__.py` (modified, +2/-0) - `libs/langchain_v1/langchain/agents/middleware/model_routing.py` (added, +189/-0) - `libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_routing.py` (added, +138/-0) ## Fixed - Fixed by PR: Add intent-aware model routing middleware for agents (https://github.com/langchain-ai/langchain/pull/37009) ### Submission checklist - [x] This is a feature request, not a bug report or usage question. - [x] I added a clear and descriptive title that summarizes the feature request. - [x] I used the GitHub search to find a similar feature request and didn't find it. - [x] I checked the LangChain documentation and API reference to see if this feature already exists. - [x] This is not related to the langchain-community package. ### Package (Required) - [x] langchain - [ ] langchain-openai - [ ] langchain-anthropic - [ ] langchain-classic - [ ] langchain-core - [ ] langchain-model-profiles - [ ] langchain-tests - [ ] langchain-text-splitters - [ ] langchain-chroma - [ ] langchain-deepseek - [ ] langchain-exa - [ ] langchain-fireworks - [ ] langchain-groq - [ ] langchain-huggingface - [ ] langchain-mistralai - [ ] langchain-nomic - [ ] langchain-ollama - [ ] langchain-openrouter - [ ] langchain-perplexity - [ ] langchain-qdrant - [ ] langchain-xai - [ ] Other / not sure / general ### Feature Description I’d like to propose an `IntentAwareModelRouterMiddleware` for `langchain_v1` agents that routes requests across model tiers (fast / balanced / quality) based on prompt intent. Motivation: Agent apps often need to trade off latency, cost, and output quality. Today this is usually hardcoded per app. A built-in middleware would provide a reusable default policy while still allowing custom policies. Proposed behavior: - Route to **quality** model for high-stakes intent (e.g., legal/medical/financial/compliance/security-sensitive prompts) - Route to **fast** model for simple short tasks (e.g., summarize/rewrite/extract/classify) - Route to **balanced** model for everything else - Keep `ModelFallbackMiddleware` compatibility for failure recovery - Add routing trace metadata to responses for observability: - `policy` - `policy_version` - `route_reason` - `decision_features` Scope: - New middleware in `langchain_v1` middleware package - Public export in middleware `__init__` - Unit tests for: - route selection (fast / balanced / quality) - metadata emission - composition with fallback middleware Backward compatibility: No breaking API changes. Middleware is opt-in. Prior art / implementation status: I already have a working implementation + tests in a PR: https://github.com/langchain-ai/langchain/pull/37009 Happy to adjust naming, policy defaults, or scope based on maintainer feedback. ### Use Case I’m building agent apps where some requests are high-stakes and others are simple transforms. Using one fixed model wastes cost or hurts quality. This feature gives a reusable, transparent middleware policy for model selection with fallback compatibility. ### Proposed Solution 1. Add IntentAwareModelRouterMiddleware in langchain_v1 middleware. 2. Policy: route to quality for high-stakes prompts, fast for simple short tasks, balanced otherwise. 3. Attach routing trace metadata (policy, policy_version, route_reason, decision_features). 4. Keep compatibility with ModelFallbackMiddleware. ### Alternatives Considered 1. Per-application custom routing logic. 2. Only fallback/retry without pre-routing. 3. Static model selection in config. ### Additional Context Related PR: https://github.com/langchain-ai/langchain/pull/37009

Repository: langchain-ai/langchain
Author: imadhajaz
State: closed | merged: False
Link: https://github.com/langchain-ai/langchain/pull/37009

Description (problem / solution / changelog)

Resolves #37010

Summary

add IntentAwareModelRouterMiddleware to route model calls to fast, balanced, or quality model tiers based on prompt intent
add routing trace metadata to model responses (policy, policy_version, route_reason, and decision_features) for transparency and debugging
export the middleware in the public middleware API and add focused unit tests for route selection and fallback composition

Test plan

PYTHONPATH=libs/langchain_v1 .venv/bin/pytest libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_routing.py
PYTHONPATH=libs/langchain_v1 .venv/bin/pytest libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_fallback.py libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_routing.py libs/langchain_v1/tests/unit_tests/agents/middleware/core/test_wrap_model_call.py

Changed files

libs/langchain_v1/langchain/agents/middleware/__init__.py (modified, +2/-0)
libs/langchain_v1/langchain/agents/middleware/model_routing.py (added, +189/-0)
libs/langchain_v1/tests/unit_tests/agents/middleware/implementations/test_model_routing.py (added, +138/-0)

\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\ \\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

langchain - ✅(Solved) Fix Proposal: Intent-aware model routing middleware for langchain_v1 agents [1 pull requests, 1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fixed

PR fix notes

PR #37009: Add intent-aware model routing middleware for agents

Description (problem / solution / changelog)

Summary

Test plan

Changed files

Submission checklist

Package (Required)

Feature Description

Use Case

Proposed Solution

Alternatives Considered

Additional Context

extent analysis

Still need to ship something?

TRENDING

langchain - ✅(Solved) Fix Proposal: Intent-aware model routing middleware for langchain_v1 agents [1 pull requests, 1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fixed

PR fix notes

PR #37009: Add intent-aware model routing middleware for agents

Description (problem / solution / changelog)

Summary

Test plan

Changed files

Submission checklist

Package (Required)

Feature Description

Use Case

Proposed Solution

Alternatives Considered

Additional Context

extent analysis

Still need to ship something?

RELATED_DISCOVERY

TRENDING