OpenAI 429 Rate Limit Fix

429 means requests are being limited by quota, rate, concurrency or burst behavior. Treat it as a control-flow problem, not a JSON problem.

Symptoms

  • Requests fail during bursts or batch jobs.
  • Single manual calls work, but automation fails.
  • The response status code is 429.

Likely causes

  • Too many concurrent requests.
  • Retry logic sends another burst immediately.
  • Project quota, billing or model-specific limits are reached.

Fix steps

  1. Add exponential backoff with jitter.
  2. Limit concurrency in workers and queues.
  3. Read rate-limit headers when available.
  4. Reduce batch size and cache repeated work.

Verify the fix

  • Run a small controlled batch.
  • Log status codes and retry counts.
  • Confirm retries spread out instead of synchronizing.

FAQ

Should I retry immediately?

No. Immediate retries often make 429 failures worse.

Can JSON validation fix 429?

No. 429 is about request volume, quota or limits, not body syntax.

Related tools and guides

Last updated: May 18, 2026