Your app gets featured on Hacker News. Or Product Hunt. Or someone with 2 million followers tweets about it. Traffic goes from 100 requests per second to 100,000 in under an hour.
Your architecture was designed for 100 RPS. Maybe 1,000 on a good day. Not 100,000. What breaks?
We simulated this exact scenario using PinPole's spike traffic pattern against a standard serverless architecture. Here's the timeline of what happens as traffic ramps from 100 to 100,000 RPS.
The architecture under test
API Gateway, Lambda (512 MB memory, 10-second timeout, 1,000 concurrent executions), DynamoDB (on-demand), SQS, and a second Lambda for async processing. All default configurations: the architecture most startups ship with.
The failure timeline
The before and after
Five fixes that make it survivable
PinPole's recommendation engine flagged five changes. We applied each one and re-simulated:
Increase Lambda concurrency to 30,000
Request a concurrency limit increase from AWS. This is a soft limit, and increases are typically approved within hours. Enable provisioned concurrency for the first 5,000 executions to eliminate cold starts during the spike.
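To see why 1,000 concurrent executions cannot carry 100,000 RPS, Little's law helps: the concurrency you need is roughly the request rate multiplied by the average invocation duration. The sketch below is a back-of-the-envelope calculator, not PinPole output. The 200 ms average duration is an assumption chosen for illustration; substitute your own measured value.

```python
import math

def required_concurrency(rps: int, avg_duration_ms: int) -> int:
    """Concurrent executions needed to sustain `rps` (Little's law: L = rate * time)."""
    return math.ceil(rps * avg_duration_ms / 1000)

def max_sustainable_rps(concurrency: int, avg_duration_ms: int) -> float:
    """Highest request rate a given concurrency limit can absorb."""
    return concurrency * 1000 / avg_duration_ms

PEAK_RPS = 100_000
AVG_DURATION_MS = 200  # assumed average invocation time, not from the simulation

print(required_concurrency(PEAK_RPS, AVG_DURATION_MS))  # 20000
print(max_sustainable_rps(1_000, AVG_DURATION_MS))      # 5000.0
print(max_sustainable_rps(30_000, AVG_DURATION_MS))     # 150000.0
```

Under that assumption the default limit tops out around 5,000 RPS, while 30,000 leaves headroom above the 100,000 RPS peak. Slower functions need proportionally more concurrency.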
Add API Gateway throttling at 80,000 RPS
Set a throttle limit slightly below your backend's capacity. Better to return 429 (rate limited) to 20% of users than 500 (server error) to 89%. Rate limiting is graceful degradation.
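API Gateway throttling is based on a token bucket: tokens refill at the steady-state rate, a burst allowance sets the bucket size, and a request with no token available gets a 429. The sketch below is a simplified, scaled-down model of that idea (100 offered RPS against a limit of 80, standing in for 100,000 against 80,000). The class, the function names, and the burst value of 8 are all hypothetical, and the clock is passed in explicitly so the run is deterministic.

```python
class TokenBucket:
    """Minimal token bucket: `rate` tokens per second, capacity `burst`."""

    def __init__(self, rate: float, burst: float) -> None:
        self.rate = rate
        self.burst = burst
        self.tokens = float(burst)  # start with a full bucket
        self.last = 0.0

    def allow(self, now: float) -> bool:
        # Refill for the time elapsed since the last request, capped at `burst`.
        self.tokens = min(self.burst, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False  # the caller would return HTTP 429 here

def simulate(offered_rps: int, limit_rps: int, burst: int, seconds: int):
    """Send evenly spaced requests and count how many pass the throttle."""
    bucket = TokenBucket(limit_rps, burst)
    allowed = rejected = 0
    for i in range(offered_rps * seconds):
        if bucket.allow(now=i / offered_rps):
            allowed += 1
        else:
            rejected += 1
    return allowed, rejected

allowed, rejected = simulate(offered_rps=100, limit_rps=80, burst=8, seconds=10)
share = rejected / (allowed + rejected)
print(f"allowed={allowed} rejected={rejected} ({share:.0%} would get 429)")
```

Once the initial burst is spent, roughly one request in five is turned away, which matches the 20% figure above. The other four reach a backend that is still healthy.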
Enable DynamoDB DAX caching
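Add a DAX cluster for read-heavy access patterns. This reduces DynamoDB read load by 80%+ and removes hot-partition throttling for the reads served from cache. DAX caches eventually consistent reads only; strongly consistent reads still go to the table.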
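The effect of a read-through cache is easy to reproduce in miniature. The sketch below is not the DAX client API. It is a hypothetical in-memory stand-in (`ReadThroughCache` and `FakeTable` are made-up names) that counts how many reads still reach the table when each of 200 keys is read five times. The 300-second TTL is an illustrative value.

```python
from typing import Dict, Tuple

class FakeTable:
    """Stand-in for a DynamoDB table that counts the reads it serves."""

    def __init__(self) -> None:
        self.reads = 0

    def get_item(self, key: str) -> dict:
        self.reads += 1
        return {"pk": key}

class ReadThroughCache:
    """On a miss, load from the table and keep the item for `ttl_s` seconds."""

    def __init__(self, table: FakeTable, ttl_s: float) -> None:
        self._table = table
        self._ttl_s = ttl_s
        self._items: Dict[str, Tuple[float, dict]] = {}
        self.hits = 0
        self.misses = 0

    def get(self, key: str, now: float) -> dict:
        entry = self._items.get(key)
        if entry is not None and now - entry[0] < self._ttl_s:
            self.hits += 1
            return entry[1]
        self.misses += 1
        item = self._table.get_item(key)
        self._items[key] = (now, item)
        return item

table = FakeTable()
cache = ReadThroughCache(table, ttl_s=300)
for i in range(1000):
    cache.get(f"user#{i % 200}", now=i * 0.001)

print(cache.hits, cache.misses, table.reads)  # 800 200 200
```

Here 1,000 reads turn into 200 table reads, an 80% reduction. The real ratio depends on how skewed your key access is and on the TTL you choose.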
Add CloudFront in front of API Gateway
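Cache static and semi-static responses. At 100,000 RPS, even a 10-second cache TTL reduces origin load dramatically, because a cached response is fetched from the origin roughly once per TTL per cache instead of once per request.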
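A rough upper bound makes the TTL effect concrete. The model below is deliberately crude: it assumes each cache node refetches each distinct URL at most once per TTL and ignores everything else CloudFront does. Every input except the 100,000 RPS peak and the 10-second TTL is a made-up illustrative value (70% cacheable traffic, 500 distinct URLs, 50 cache nodes), so treat the result as a way to reason, not a prediction.

```python
def origin_rps(total_rps: int, cacheable_pct: int, distinct_urls: int,
               cache_nodes: int, ttl_s: int) -> float:
    """Approximate requests per second that still reach the origin."""
    cacheable = total_rps * cacheable_pct / 100
    uncacheable = total_rps - cacheable
    # Each cache node refreshes each distinct URL at most once per TTL window.
    refreshes = min(cacheable, distinct_urls * cache_nodes / ttl_s)
    return uncacheable + refreshes

# All inputs other than the peak RPS and the 10 s TTL are assumptions.
print(origin_rps(100_000, cacheable_pct=70, distinct_urls=500,
                 cache_nodes=50, ttl_s=10))  # 32500.0
print(origin_rps(100_000, cacheable_pct=70, distinct_urls=500,
                 cache_nodes=50, ttl_s=1))   # 55000.0
```

With these inputs the 70,000 cacheable requests per second collapse to about 2,500 refreshes per second at the origin. The uncacheable remainder is what the throttle and concurrency changes have to absorb.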
Increase SQS Lambda batch size and concurrency
The async processor was falling behind because it was processing one message at a time. A batch size of 10 with a reserved concurrency of 5,000 clears the backlog in minutes instead of hours.
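The "minutes instead of hours" difference falls out of simple throughput arithmetic: messages drained per second is concurrency times batch size divided by the time one invocation takes. In the sketch below, only the batch size of 10 and the concurrency of 5,000 come from the fix above. The backlog size, the "before" concurrency of 1,000, and the invocation times are hypothetical values picked for illustration.

```python
import math

def drain_seconds(backlog: int, arrival_rps: int, concurrency: int,
                  batch_size: int, invocation_ms: int) -> float:
    """Seconds until the queue is empty, or infinity if it never catches up."""
    throughput = concurrency * batch_size * 1000 / invocation_ms  # messages/s
    if throughput <= arrival_rps:
        return math.inf
    return backlog / (throughput - arrival_rps)

BACKLOG = 20_000_000  # assumed backlog after the spike, with no new arrivals

# Before: one message per invocation (concurrency and timing are assumptions).
before = drain_seconds(BACKLOG, 0, concurrency=1_000, batch_size=1, invocation_ms=500)
# After: batch size 10, reserved concurrency 5,000 (invocation time assumed).
after = drain_seconds(BACKLOG, 0, concurrency=5_000, batch_size=10, invocation_ms=1_000)

print(f"before: {before / 3600:.1f} h, after: {after / 60:.1f} min")
```

The infinity branch matters during the spike itself: if messages arrive faster than the processor drains them, the backlog grows no matter how long you wait.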
Simulate your viral moment
Build your architecture. Run a spike simulation at 100x your current traffic. Find the breaking points. Apply the fixes. Re-simulate. Know that when the moment comes, your infrastructure will survive it.