Scalable Distributed Rate Limiter
Design a distributed rate limiting system capable of handling 1 million requests per second. The system must support various limiting strategies (e.g., sliding window, token bucket) and allow for per-user or per-API-key quotas. Key constraints include sub-5ms latency overhead, high availability with fail-open semantics, and the ability to scale horizontally as traffic grows. Explain your choice of storage, consistency models, and how you would handle 'hot keys' for extremely popular users.
RedisLuagRPCAPI GatewayKubernetesNoSQL
00