OpenAI Interview Question

Design a distributed rate limiter for an LLM inference API
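
A common starting point for this kind of question is a token bucket kept per API key, with each request charged by its estimated LLM token count rather than counted as one request. The sketch below is a minimal, single-process illustration of that idea, not a complete answer. The names `TokenBucket`, `RateLimiter`, `allow`, and `estimated_tokens` are hypothetical, and the in-memory dictionary stands in for the shared store with atomic updates that a truly distributed version would presumably need.

```python
import time
from typing import Callable, Dict


class TokenBucket:
    """Token bucket whose units are LLM tokens (an assumption of this sketch)."""

    def __init__(self, capacity: float, refill_rate: float, now: float) -> None:
        self.capacity = capacity        # maximum burst size
        self.refill_rate = refill_rate  # units restored per second
        self.tokens = capacity          # start with a full bucket
        self.updated_at = now

    def try_consume(self, cost: float, now: float) -> bool:
        # Refill lazily based on elapsed time, capped at capacity.
        elapsed = max(0.0, now - self.updated_at)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_rate)
        self.updated_at = now
        if cost <= self.tokens:
            self.tokens -= cost
            return True
        return False


class RateLimiter:
    """Per-key limiter. Hypothetical: a distributed deployment would keep
    bucket state in a shared store instead of this local dictionary."""

    def __init__(
        self,
        capacity: float,
        refill_rate: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._capacity = capacity
        self._refill_rate = refill_rate
        self._clock = clock
        self._buckets: Dict[str, TokenBucket] = {}

    def allow(self, api_key: str, estimated_tokens: float) -> bool:
        now = self._clock()
        bucket = self._buckets.get(api_key)
        if bucket is None:
            bucket = TokenBucket(self._capacity, self._refill_rate, now)
            self._buckets[api_key] = bucket
        return bucket.try_consume(estimated_tokens, now)
```

Injecting the clock keeps the sketch testable. In a multi-node setting, the refill-and-consume step in `try_consume` would have to run atomically against shared state, which is where most of the design discussion tends to go.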