Mentioning LOR. #264

trentfowlercohere · 2024-11-22T18:33:27Z

This PR introduces a new section titled "Optimize your Inference Latencies" in the Amazon SageMaker setup guide. The section addresses latency issues that may arise due to SageMaker endpoints' default random routing strategy, particularly in applications centred around generative AI.

It highlights the availability of the RoutingStrategy parameter on the SageMaker platform, which enables users to employ the 'least outstanding requests' (LOR) routing approach. This strategy has demonstrated improved latency performance across various scenarios, as referenced in the provided link.

New Section: Optimize your Inference Latencies
New Parameter: RoutingStrategy
New Strategy: 'least outstanding requests' (LOR)

github-actions · 2024-11-22T18:40:41Z

🌿 Preview your docs: https://cohere-preview-153f2a31-db74-455b-9682-a29811138d5c.docs.buildwithfern.com

github-actions · 2024-11-22T19:08:24Z

🌿 Preview your docs: https://cohere-preview-34c0763f-8d58-4cdc-b1b7-9ce8833dcb92.docs.buildwithfern.com

fern/pages/deployment-options/cohere-on-aws/amazon-sagemaker-setup-guide.mdx

github-actions · 2024-11-25T18:12:54Z

🌿 Preview your docs: https://cohere-preview-cc589374-bf5d-4c1d-9604-c14f62ce05ed.docs.buildwithfern.com

Mentioning LOR.

7cfde87

trentfowlercohere requested a review from a team as a code owner November 22, 2024 18:33

Updating language and adding it to the v2 docs.

fee99d5

mkozakov approved these changes Nov 25, 2024

View reviewed changes

fern/pages/deployment-options/cohere-on-aws/amazon-sagemaker-setup-guide.mdx Show resolved Hide resolved

invader89 approved these changes Nov 25, 2024

View reviewed changes

Merge branch 'main' into add-lor

6b02071

trentfowlercohere merged commit 9152dc1 into main Nov 25, 2024
3 checks passed

trentfowlercohere deleted the add-lor branch November 25, 2024 18:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mentioning LOR. #264

Mentioning LOR. #264

trentfowlercohere commented Nov 22, 2024 •

edited by cohere-pr-pal bot

Loading

github-actions bot commented Nov 22, 2024

github-actions bot commented Nov 22, 2024

github-actions bot commented Nov 25, 2024

Mentioning LOR. #264

Mentioning LOR. #264

Conversation

trentfowlercohere commented Nov 22, 2024 • edited by cohere-pr-pal bot Loading

github-actions bot commented Nov 22, 2024

github-actions bot commented Nov 22, 2024

github-actions bot commented Nov 25, 2024

trentfowlercohere commented Nov 22, 2024 •

edited by cohere-pr-pal bot

Loading