● Limit requests: Limit the number of requests an entity can send to an API within a time window.
● Exception handling: The user should get an error message when the user exceeds the threshold.
● Distribution: The rate limiter can monitor requests among multiple servers.
● Low Latency
● High Availability