Thundering Herd
A problem where a large number of clients simultaneously retry or reconnect after a shared failure, overwhelming the server that has just begun recovering. The name comes from the image of a herd of animals stampeding together. Thundering herd is a natural consequence of synchronized retry schedules — all clients back off for the same duration and then hammer the server at the same moment. Jitter, staggered reconnect windows, and circuit breakers are the primary mitigations.
Related Protocols
Mentioned in Guides
How to Implement and Handle Rate Limiting (429)
API Design & Best Practices
Debugging 429 Too Many Requests
Debugging & Troubleshooting
Debugging 500 Internal Server Error: A Systematic Approach
Debugging & Troubleshooting
Implementing Retry with Exponential Backoff
Error Handling Patterns
Bulkhead Pattern: Isolating Failures in Distributed Systems
Error Handling Patterns
CDN Caching Strategies: Edge Caching, Purging, and Cache Keys
Performance & Optimization
Circuit Breaker Pattern at the API Gateway
API Gateway Patterns