Replicate Outage Tracker & Status

Real-time Replicate API status. Live incident detection, developer mitigations, and a one-click failover option.

About Replicate API Status

Replicate provides on-demand inference for thousands of open-source AI models including Stable Diffusion, FLUX, Llama, Whisper, and custom fine-tuned models via a simple prediction API. This page tracks Replicate API outages, degradations, and incidents in real time, automatically updated every 60 seconds from our monitoring infrastructure.
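
As a sketch of what that prediction API looks like, the helper below assembles the pieces of a create-prediction request against Replicate's public REST endpoint. The token, version hash, and input values are placeholders, and the helper name `build_prediction_request` is illustrative, not part of any SDK:

```python
import json

REPLICATE_API = "https://api.replicate.com/v1/predictions"

def build_prediction_request(api_token: str, version: str, model_input: dict) -> dict:
    """Assemble the HTTP pieces for creating a Replicate prediction.

    Returns a dict you can hand to any HTTP client, e.g.
    requests.post(req["url"], headers=req["headers"], data=req["body"]).
    """
    return {
        "url": REPLICATE_API,
        "headers": {
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
        "body": json.dumps({"version": version, "input": model_input}),
    }
```

Keeping request construction separate from the HTTP client makes it easy to add the retry and failover logic described below without touching the payload code.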

Official status page: https://status.replicate.com

Common Replicate Outage Symptoms

  • HTTP 429 — rate limit exceeded or prediction queue at capacity
  • HTTP 503 — API temporarily unavailable or predictions timing out
  • Long cold-start latency when a model container needs to be provisioned
  • Prediction stuck in 'starting' or 'processing' state during capacity pressure
  • Webhook delivery failures during backend incidents
  • Specific model versions becoming unavailable during platform updates

What to Do During a Replicate Outage

  1. Honor the Retry-After header on 429 responses and back off before retrying prediction creation.
  2. Use the predictions.get polling endpoint with exponential backoff instead of relying solely on webhooks.
  3. Switch to a bring-your-own-key (BYOK) proxy such as AI Badgr to get per-request receipts and automatic retry handling.
  4. Monitor the official Replicate status page at status.replicate.com for incident announcements.
  5. Set a prediction timeout and cancel stuck predictions via the API rather than letting them queue indefinitely.

Other AI Provider Status Pages

Replicate Outage FAQ

Is Replicate down right now?

This page checks our live monitoring infrastructure, updated every 60 seconds, which tracks the official Replicate status page and our own request telemetry. The status badge at the top reflects the current state.

Why is my Replicate prediction stuck in 'starting' state?

A prediction stuck in 'starting' usually means the model container is cold and needs to be provisioned (a cold start), or there is a capacity constraint. Cold starts typically take 30–120 seconds and can last longer during incidents. Poll the prediction endpoint and set a timeout.
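
One way to implement that advice is a deadline-based poll that cancels rather than queueing forever. Here `get_status` and `cancel` are injected callables you would wrap around the predictions get/cancel API calls; the helper name and defaults are assumptions:

```python
import time

def wait_or_cancel(get_status, cancel, timeout: float, interval: float = 2.0,
                   clock=time.monotonic, sleep=time.sleep) -> str:
    """Poll a prediction until it leaves the cold-start/queue states or a
    deadline passes; on timeout, cancel it instead of letting it queue.

    get_status() returns the current status string; cancel() issues the
    cancel call. clock/sleep are injectable for testing.
    """
    deadline = clock() + timeout
    while clock() < deadline:
        status = get_status()
        if status not in ("starting", "processing"):
            return status
        sleep(interval)
    cancel()
    return "canceled"
```

Using a monotonic clock for the deadline avoids surprises from wall-clock adjustments during long waits.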

Why is Replicate returning 429 errors?

HTTP 429 from the Replicate API means you have hit a rate limit or the prediction queue is at capacity. Back off and retry with exponential delay. During peak load, reduce concurrent prediction requests.
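
A minimal retry loop for this pattern, with `make_request` standing in for your prediction-creation call (the function returns a `(status, body)` pair; this is an illustrative sketch, not an SDK function):

```python
import time

def call_with_backoff(make_request, max_attempts: int = 5, base: float = 1.0,
                      sleep=time.sleep):
    """Call make_request() -> (status, body); on HTTP 429, wait
    base * 2**attempt seconds and retry, up to max_attempts tries."""
    for attempt in range(max_attempts):
        status, body = make_request()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            sleep(base * (2 ** attempt))
    return status, body
```

In production you would also add jitter and honor Retry-After, as described in the mitigation steps above.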

Can I automatically failover away from Replicate during an outage?

Yes. For image generation models, AI Badgr can failover to alternative image generation backends. For language models, it can route to other providers. Change one configuration line and get transparent receipts.

How do I handle Replicate webhook delivery failures?

During Replicate incidents, webhooks may be delayed or dropped. Always implement polling as a fallback: after creating a prediction, poll predictions.get on a schedule until the prediction reaches a terminal state (succeeded/failed/canceled).
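
A polling fallback along those lines, with `get_prediction` standing in for a predictions.get call that returns the prediction as a dict (the helper name and defaults are assumptions):

```python
import time

TERMINAL_STATES = {"succeeded", "failed", "canceled"}

def poll_until_terminal(get_prediction, max_polls: int = 60, interval: float = 2.0,
                        sleep=time.sleep) -> dict:
    """Webhook fallback: poll until the prediction reaches a terminal
    state, or give up after max_polls and return the last snapshot."""
    for _ in range(max_polls):
        prediction = get_prediction()
        if prediction["status"] in TERMINAL_STATES:
            return prediction
        sleep(interval)
    return prediction
```

Run this alongside your webhook handler and treat whichever fires first as authoritative; deduplicate by prediction ID so a late-arriving webhook is a no-op.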

Never get stuck in a Replicate outage again

AI Badgr acts as a transparent proxy for your existing API keys. One line of code change. Zero vendor lock-in. Instant failover when Replicate is down.

Get Started Free →