diff --git a/fern/pages/going-to-production/rate-limits.mdx b/fern/pages/going-to-production/rate-limits.mdx index bc060b8c..6f38e4a0 100644 --- a/fern/pages/going-to-production/rate-limits.mdx +++ b/fern/pages/going-to-production/rate-limits.mdx @@ -19,11 +19,13 @@ In this document, we'll discuss some of the limitations associated with a trial Trial keys are rate-limited depending on the endpoint you want to use: -| Endpoint | Calls per Minute | -| :--------------------------------------------------------------------------------------------------- | :--------------- | -| [Embed](/reference/embed) | 5 | -| [Rerank](/reference/rerank-1), [Chat](/reference/chat) | 10 | -| All other endpoints | 100 | +- Generate (legacy): 5/min +- [Chat](/reference/chat): 20/min +- Summarize: 5/min +- Cluster: 5/min +- [Embed](/reference/embed): 5/min +- EmbedJob: 5/min +- [Rerank](/reference/rerank-1): 10/min [Chat](/reference/chat) and the [Coral user interface](https://coral.cohere.ai/) are limited to a total of 1,000 calls a month with a trial key. All remaining endpoints are limited to a total of 1,000 calls per month with a trial key. @@ -40,7 +42,7 @@ With a trial key: ## Production Key Specifications -Production keys for all endpoints are rate-limited at 10,000 calls per minute and are intended for serving Cohere in a public-facing application and testing purposes. Usage of production keys is metered at price points which can be found on our [pricing page](/docs/how-does-cohere-pricing-work). +Production keys for all endpoints are rate-limited at 1,000 calls per minute and are intended for serving Cohere in a public-facing application and testing purposes. Usage of production keys is metered at price points which can be found on our [pricing page](/docs/how-does-cohere-pricing-work). To get a production key, start by navigating to the [API Keys](https://dashboard.cohere.com/api-keys) page in your Cohere dashboard. You'll either need to be the admin of your organization, or ask your organization Admin to complete these steps.