langchain - ✅(Solved) Fix Title: feat(mistralai): implement retry logic for HTTP errors, add concurrency control, and update AGENTS.md and README.md with new partner integrations and usage examples [1 pull requests, 2 comments, 1 participants]

JehoXYZ · 2026-03-11T02:43:55Z

[langchain] PR 35736: Just fix some random problem i see - Repository: langchain-ai/langchain - Author: JehoXYZ - State: closed | merged: False - Link: https:/… # PR #35736: Just fix some random problem i see - Repository: langchain-ai/langchain - Author: JehoXYZ - State: closed | merged: False - Link: https://github.com/langchain-ai/langchain/pull/35736 ## Description (problem / solution / changelog) Fixes #35735 Fixes # Adds retry logic for HTTP 429/5xx errors via _RetryableHTTPStatusError and caps async concurrency using asyncio.Semaphore(max_concurrent_requests) in acompletion_with_retry. Updates AGENTS.md and README.md with new MistralAI partner integrations and usage examples. ## Changed files - `AGENTS.md` (modified, +9/-0) - `README.md` (modified, +297/-40) - `libs/partners/mistralai/langchain_mistralai/chat_models.py` (modified, +75/-13) - `libs/partners/mistralai/tests/unit_tests/test_retry_concurrency.py` (added, +267/-0) ## Fixed - Fixed by PR: Just fix some random problem i see (https://github.com/langchain-ai/langchain/pull/35736) ### Checked other resources - [x] This is a feature request, not a bug report or usage question. - [x] I added a clear and descriptive title that summarizes the feature request. - [x] I used the GitHub search to find a similar feature request and didn't find it. - [x] I checked the LangChain documentation and API reference to see if this feature already exists. - [x] This is not related to the langchain-community package. ### Package (Required) - [ ] langchain - [ ] langchain-openai - [ ] langchain-anthropic - [ ] langchain-classic - [ ] langchain-core - [ ] langchain-model-profiles - [ ] langchain-tests - [ ] langchain-text-splitters - [ ] langchain-chroma - [ ] langchain-deepseek - [ ] langchain-exa - [ ] langchain-fireworks - [ ] langchain-groq - [ ] langchain-huggingface - [x] langchain-mistralai - [ ] langchain-nomic - [ ] langchain-ollama - [ ] langchain-openrouter - [ ] langchain-perplexity - [ ] langchain-qdrant - [ ] langchain-xai - [ ] Other / not sure / general ### Feature Description I would like LangChain to support automatic retry logic for transient HTTP errors and concurrency control in the MistralAI integration. This feature would allow users to handle unstable network conditions and rate-limited environments more gracefully by automatically retrying failed requests, while concurrency control prevents request flooding under high load. Additionally, AGENTS.md and README.md should be updated to document new MistralAI partner integrations and include practical usage examples to help users get started quickly. ### Use Case relying on ChatMistralAI in production environments frequently encounter transient HTTP errors such as 429 rate limits and 5xx server errors that cause requests to fail permanently without any recovery mechanism. Additionally, applications sending high volumes of concurrent async requests have no built-in way to throttle simultaneous calls, leading to request flooding, API quota exhaustion, and degraded reliability. ### Proposed Solution This has already been implemented in chat_models.py. The fix introduces _RetryableHTTPStatusError as a thin wrapper around httpx.HTTPStatusError so tenacity can match and retry on HTTP 429 and 5xx responses through _create_retry_decorator. Both _raise_retryable_status_error and _araise_retryable_status_error wrap retryable status codes before raising, while non-retryable errors surface immediately. Concurrency is enforced by an asyncio.Semaphore initialized in validate_environment and sized to max_concurrent_requests (default 64), acquired inside acompletion_with_retry before every async API call ensuring at most max_concurrent_requests requests are in-flight at any time. The implementation is scoped entirely to the mistralai package. ### Alternatives Considered Yes, several alternatives were considered: **For retry logic:** - I considered using the official `mistralai` SDK's built-in retry mechanism, but it would have reduced my control over which specific HTTP status codes trigger retries, specifically the 429, 500, 502, 503, and 504 codes I needed to target in my `_raise_retryable_status_error` and `_araise_retryable_status_error` functions. - I also thought about catching and retrying inside `_generate` and `_agenerate` directly, but this would have scattered retry logic across multiple methods rather than centralizing it in my `completion_with_retry` and `acompletion_with_retry` functions where it belongs. - I considered using `urllib3`'s retry adapter but since my implementation is built entirely on `httpx`, it was not applicable. **For concurrency control:** - I considered delegating concurrency control entirely to the caller, but in my opinion this places an unreasonable burden on users and makes the integration unreliable out of the box. - I looked at `asyncio.BoundedSemaphore` as an alternative to `asyncio.Semaphore`, but since I initialize the semaphore once inside `validate_environment` wit

langchain2026-03-11 02:43:55

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

langchain-ai/langchain#35735•Fetched 2026-04-08 00:24:49

View on GitHub

Comments

Participants

Timeline

Reactions

Author

JehoXYZ

Participants

JehoXYZ

Timeline (top)

labeled ×3commented ×2closed ×1cross-referenced ×1

Fix Action

Fixed

Fixed by PR: Just fix some random problem i see (https://github.com/langchain-ai/langchain/pull/35736)

PR fix notes

PR #35736: Just fix some random problem i see

Repository: langchain-ai/langchain
Author: JehoXYZ
State: closed | merged: False
Link: https://github.com/langchain-ai/langchain/pull/35736

Description (problem / solution / changelog)

Fixes #35735 Fixes # Adds retry logic for HTTP 429/5xx errors via _RetryableHTTPStatusError and caps async concurrency using asyncio.Semaphore(max_concurrent_requests) in acompletion_with_retry. Updates AGENTS.md and README.md with new MistralAI partner integrations and usage examples.

Changed files

AGENTS.md (modified, +9/-0)
README.md (modified, +297/-40)
libs/partners/mistralai/langchain_mistralai/chat_models.py (modified, +75/-13)
libs/partners/mistralai/tests/unit_tests/test_retry_concurrency.py (added, +267/-0)

extent analysis

Problem Summary

The issue is about implementing automatic retry logic for transient HTTP errors and concurrency control in the MistralAI integration for LangChain.

Root Cause Analysis

The current implementation does not handle transient HTTP errors and concurrency control, leading to request flooding and API quota exhaustion.

Fix Plan

Step 1: Implement Retry Logic

Create a thin wrapper class _RetryableHTTPStatusError around httpx.HTTPStatusError to match and retry on HTTP 429 and 5xx responses.

import httpx

class _RetryableHTTPStatusError(httpx.HTTPStatusError):
    pass

Step 2: Create Retry Decorator

Create a retry decorator _create_retry_decorator using the tenacity library to retry on _RetryableHTTPStatusError.

import tenacity

@tenacity.retry(wait=tenacity.waits.exponential(multiplier=1, min=4, max=10))
def _create_retry_decorator(func):
    return func

Step 3: Implement Concurrency Control

Use asyncio.Semaphore to enforce concurrency control, initialized in validate_environment with a fixed max_concurrent_requests value defaulting to 64.

import asyncio

async def validate_environment():
    semaphore = asyncio.Semaphore(64)
    # ...

Step 4: Acquire Semaphore in `completion_with_retry`

Acquire the semaphore inside completion_with_retry before every async API call to ensure at most max_concurrent_requests requests are in-flight at any time.

async def completion_with_retry(self, func):
    semaphore = await self.validate_environment()
    async with semaphore:
        # ...

Verification

Verify that the fix works by testing with transient HTTP errors and high concurrency.

Test with a 429 rate limit error and verify that the request is retried.
Test with a

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #installation #tensor shape #autograd error #model loading #dependency error #configuration error

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

langchain - ✅(Solved) Fix Title: feat(mistralai): implement retry logic for HTTP errors, add concurrency control, and update AGENTS.md and README.md with new partner integrations and usage examples [1 pull requests, 2 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fixed

PR fix notes

PR #35736: Just fix some random problem i see

Description (problem / solution / changelog)

Changed files

Checked other resources

Package (Required)

Feature Description

Use Case

Proposed Solution

Alternatives Considered

Additional Context

extent analysis

Problem Summary

Root Cause Analysis

Fix Plan

Step 1: Implement Retry Logic

Step 2: Create Retry Decorator

Step 3: Implement Concurrency Control

Step 4: Acquire Semaphore in `completion_with_retry`

Verification

Still need to ship something?

TRENDING

langchain - ✅(Solved) Fix **Title:** feat(mistralai): implement retry logic for HTTP errors, add concurrency control, and update AGENTS.md and README.md with new partner integrations and usage examples [1 pull requests, 2 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fixed

PR fix notes

PR #35736: Just fix some random problem i see

Description (problem / solution / changelog)

Changed files

Checked other resources

Package (Required)

Feature Description

Use Case

Proposed Solution

Alternatives Considered

Additional Context

extent analysis

Problem Summary

Root Cause Analysis

Fix Plan

Step 1: Implement Retry Logic

Step 2: Create Retry Decorator

Step 3: Implement Concurrency Control

Step 4: Acquire Semaphore in completion_with_retry

Verification

Still need to ship something?

RELATED_DISCOVERY

TRENDING

langchain - ✅(Solved) Fix Title: feat(mistralai): implement retry logic for HTTP errors, add concurrency control, and update AGENTS.md and README.md with new partner integrations and usage examples [1 pull requests, 2 comments, 1 participants]

Step 4: Acquire Semaphore in `completion_with_retry`