This walks you through implementing rate limiting from token buckets to sliding windows, with examples in TypeScript and Python. You get Redis-based distributed rate limiting for multi-server setups, Express middleware you can drop in, and patterns for tiered limits and adaptive throttling. The best practices section is solid on the operational details: returning 429s with Retry-After headers, handling burst traffic, and not storing state in memory when you're running multiple instances. If you're building a public API or need to prevent abuse without breaking legitimate users during traffic spikes, the reference implementations cover the algorithms that actually matter in production.
npx skills add https://github.com/aj-geddes/useful-ai-prompts --skill rate-limiting-implementation