The Queue-Based Load Leveling pattern is a software design pattern that places a queue between the source of requests and the service that handles them. The queue acts as a buffer: it temporarily stores requests that arrive faster than the service can handle, so the service can work through them at a steady rate. This smooths out traffic spikes and prevents the service’s resources from being overloaded.
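The idea can be illustrated with a minimal, in-process sketch using Python's standard `queue` and `threading` modules. It is an illustration under stated assumptions, not a production implementation: the function name `run_load_leveling`, its parameters, and the simulated `service_time` are hypothetical, and a real deployment would typically use a durable message queue between separate processes rather than an in-memory one.

```python
import queue
import threading
import time


def run_load_leveling(num_requests=20, service_time=0.001):
    """Push a burst of requests through a queue to one steady-rate worker.

    Hypothetical helper for illustration. Returns the requests in the order
    the service handled them and the approximate backlog right after the burst.
    """
    buffer = queue.Queue()  # the buffer between the request source and the service
    processed = []

    def service():
        # The service pulls one request at a time, at its own pace,
        # regardless of how fast requests arrive.
        while True:
            request = buffer.get()
            if request is None:  # sentinel: no more requests
                buffer.task_done()
                break
            time.sleep(service_time)  # simulated, assumed processing cost
            processed.append(request)
            buffer.task_done()

    worker = threading.Thread(target=service)
    worker.start()

    # The source sends a burst; put() returns immediately, so the source
    # is not slowed down by the service.
    for request_id in range(num_requests):
        buffer.put(request_id)
    backlog_after_burst = buffer.qsize()  # approximate by design

    buffer.put(None)  # tell the service to stop once the backlog is drained
    buffer.join()     # wait until every queued item has been handled
    worker.join()
    return processed, backlog_after_burst


if __name__ == "__main__":
    handled, backlog = run_load_leveling()
    print(f"handled {len(handled)} requests; backlog after burst was about {backlog}")
```

In this sketch the burst is absorbed by the queue rather than by the service: the source finishes enqueuing almost immediately, while the single worker drains the backlog one request at a time. The queue here is unbounded for simplicity; a bounded queue (for example `queue.Queue(maxsize=...)`) would additionally apply backpressure to the source when the buffer fills.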