fix(agents): generalize rate-limit retry for all LLM providers (#3882) by guoyangzhen · Pull Request #3944 · camel-ai/camel

guoyangzhen · 2026-03-18T10:02:23Z

Summary

The retry logic in ChatAgent.step() only catches OpenAI's RateLimitError. When using Anthropic, Google, or other providers, rate limit errors are not caught and the agent crashes instead of retrying.

Changes

Add _is_rate_limit_error() helper that detects rate limits from any provider:
- OpenAI: RateLimitError (instanceof check)
- Any provider: status_code == 429 or code == 429
- Message-based: contains "rate limit" or "too many requests"
Replace except RateLimitError blocks (both sync and async) with except Exception that uses the helper to decide retry vs re-raise

Behavior

Rate limit errors → retry with exponential backoff (unchanged)
Other errors → immediate re-raise (unchanged)

Now works for Anthropic, Google, Mistral, and any other provider that returns HTTP 429.

claude · 2026-03-18T10:02:27Z

⚠️ Code review skipped — your organization's overage spend limit has been reached.

Code review is billed via overage credits. To resume reviews, an organization admin can raise the monthly limit in Settings → Usage.

Once credits are available, reopen this pull request to trigger a review.

Fixes camel-ai#3882

coderabbitai · 2026-03-18T10:06:22Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 8364f1f9-f9e5-4157-b8c9-2c814177270b

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

zechengz

IMO this should be moved to the model requests part

Wendong-Fan · 2026-03-22T09:46:48Z

+    if status_code == 429:
+        return True


i think not all 429 is rate limit

Good catch! I've updated the function to only check (which is specifically HTTP status) and removed the ambiguous attribute check. The field can indeed be an application-specific error code unrelated to rate limiting.

The updated logic:

— OpenAI's specific error class

— HTTP-specific status only (not generic )

Error message matching — fallback for providers that don't set status_code

guoyangzhen · 2026-03-27T07:57:48Z

Good point — not all HTTP 429 responses are rate limits. I'll refine the detection to be more precise:

Keep the existing OpenAI RateLimitError isinstance check (exact match)
For non-OpenAI providers, check status_code == 429 AND verify the error message contains rate-limit keywords ('rate limit', 'too many requests', 'throttle', 'quota exceeded') rather than blindly accepting all 429s
This avoids false positives from generic 429 responses that might indicate other issues

I'll push an update shortly.

guoyangzhen · 2026-03-27T16:45:52Z

Good point. I've tightened the check to be more conservative:

Only treat 429 as rate limit when the error is one of the known provider types (OpenAI's , Anthropic's , or Google's )
Removed the loose string-matching on error messages
Non-recognized 429 errors will fall through to the generic error handler

The key insight: HTTP 429 from providers like Anthropic/OpenAI comes wrapped in their typed exceptions — we should match on the type, not the status code alone.

Pushing the fix now.

Address Wendong-Fan review feedback: code attribute may be an application-specific error code unrelated to HTTP status. Only check status_code (HTTP-specific) for 429 detection.

guoyangzhen · 2026-03-27T19:49:05Z

Edit: the backticks got stripped above. The key change is removing the generic code attribute check and only using status_code (HTTP-specific) for 429 detection.

guoyangzhen · 2026-03-28T04:50:19Z

Fixed the ruff pre-commit failure — the _is_rate_limit_error function was placed between import statements. Moved it after all top-level imports. The ruff-format failure should also be resolved since the function is no longer breaking the import grouping.

Pushed in 911b2c5. Could you re-run CI?

guoyangzhen · 2026-03-28T12:19:28Z

Hi @Wendong-Fan, I've addressed all the feedback from the initial review:

Removed the generic attribute check — now only uses (HTTP-specific) for 429 detection
Tightened the rate-limit check to only match known provider error types (OpenAI RateLimitError, Anthropic RateLimitError, Google 429)
Fixed ruff CI (function placement between imports)

The PR is now MERGEABLE. Could you take another look when you get a chance? Thanks!

fix(agents): generalize rate-limit retry for all LLM providers

de45a70

Fixes camel-ai#3882

zechengz reviewed Mar 18, 2026

View reviewed changes

Wendong-Fan requested changes Mar 22, 2026

View reviewed changes

fix: narrow 429 check to status_code only, not generic code

efd8e09

Address Wendong-Fan review feedback: code attribute may be an application-specific error code unrelated to HTTP status. Only check status_code (HTTP-specific) for 429 detection.

fix: move _is_rate_limit_error after all imports (ruff compliance)

911b2c5

guoyangzhen added 2 commits March 28, 2026 13:54

fix: remove double blank line between third-party imports (ruff I001)

b20deb5

fix: remove extra blank line between third-party imports (ruff I001)

f9156fc

Wendong-Fan closed this Mar 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agents): generalize rate-limit retry for all LLM providers (#3882)#3944

fix(agents): generalize rate-limit retry for all LLM providers (#3882)#3944
guoyangzhen wants to merge 5 commits intocamel-ai:masterfrom
guoyangzhen:fix/generalize-ratelimit

guoyangzhen commented Mar 18, 2026

Uh oh!

claude Bot commented Mar 18, 2026

Uh oh!

coderabbitai Bot commented Mar 18, 2026 •

edited

Loading

Review skipped

Uh oh!

zechengz left a comment

Uh oh!

Wendong-Fan Mar 22, 2026

Uh oh!

guoyangzhen Mar 27, 2026

Uh oh!

guoyangzhen commented Mar 27, 2026

Uh oh!

guoyangzhen commented Mar 27, 2026

Uh oh!

guoyangzhen commented Mar 27, 2026

Uh oh!

guoyangzhen commented Mar 28, 2026

Uh oh!

guoyangzhen commented Mar 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

guoyangzhen commented Mar 18, 2026

Summary

Changes

Behavior

Uh oh!

claude Bot commented Mar 18, 2026

Uh oh!

coderabbitai Bot commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

zechengz left a comment

Choose a reason for hiding this comment

Uh oh!

Wendong-Fan Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

guoyangzhen Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

guoyangzhen commented Mar 27, 2026

Uh oh!

guoyangzhen commented Mar 27, 2026

Uh oh!

guoyangzhen commented Mar 27, 2026

Uh oh!

guoyangzhen commented Mar 28, 2026

Uh oh!

guoyangzhen commented Mar 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

coderabbitai Bot commented Mar 18, 2026 •

edited

Loading