CV2-5640 only send responses to check api when we have a fully realized response from presto during sync requests #470

DGaffney · 2024-11-07T15:18:25Z

Description

Prevents cases where we send multiple partial matches back to check-api before all vectorizations have run. We suspect the pop removed in this PR is the culprit, for two reasons:

The pop removes the "models" list from the object, so on response it appears that fewer items are waiting to be processed.
The pop is used only on sync requests, while the nearly identical async requests use get and appear to be working correctly.
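
The difference between the two calls can be seen in isolation. This is a minimal sketch on a plain dictionary: the payload shape and the name "model-b" are assumptions for illustration, not the actual task object handled by `get_blocked_presto_response`.

```python
import copy

# Hypothetical payload; only the "models" key mirrors the code touched by this PR.
task = {
    "doc_id": "abc",
    "text": "hello",
    "models": ["openai-text-embedding-ada-002", "model-b"],
}

with_pop = copy.deepcopy(task)
with_get = copy.deepcopy(task)

# dict.pop returns the list and also removes the key from the dict.
popped = with_pop.pop("models", [])
# dict.get returns the same list and leaves the dict untouched.
fetched = with_get.get("models", [])

# What any later reader of each object would see.
models_seen_after_pop = with_pop.get("models", [])
models_seen_after_get = with_get.get("models", [])
```

Both loops iterate over the same model keys; the only difference is what is left in the object afterwards, which matters if the object is sent onward or inspected again.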

Reference: CV2-5640

How has this been tested?

Not yet tested locally

Have you considered secure coding practices when writing this code?

None


sentry-io · 2024-11-07T15:18:39Z

🔍 Existing Issues For Review

Your pull request is modifying functions with the following pre-existing issues:

📄 File: app/main/lib/elastic_crud.py

Function	Unhandled Issue
`get_blocked_presto_response`	KeyError: 'openai-text-embedding-ada-002' api.sim... `Event Count:` 2


skyemeedan · 2024-11-07T18:25:47Z

app/main/lib/elastic_crud.py

@@ -54,7 +54,7 @@ def get_blocked_presto_response(task, model, modality):
    callback_url = Presto.add_item_callback_url(app.config['ALEGRE_HOST'], modality)
    if requires_encoding(obj):
        blocked_results = []
-        for model_key in obj.pop("models", []):
+        for model_key in obj.get("models", []):


Is the problem that we are destructively modifying the 'obj' object when the "models" list is removed? It seems like it shouldn't make a difference since it isn't referenced later, but .get seems like better syntax.

DGaffney

When we send these requests out with the pop in place, it "tells" the alegre callback to expect fewer models than it really should. By switching to get we (a) bring it in line with async and (b) probably eliminate erroneous calls back to check-api when alegre mistakenly thinks work is complete for every in-process vector.
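
To illustrate the suspected failure mode, here is a hedged sketch of how a completeness check could pass too early when the model list is missing. `all_models_finished` is a hypothetical helper, not Alegre's callback code, and the payload shape and "model-b" are assumptions.

```python
def all_models_finished(obj, finished_models):
    """Hypothetical check: True when every expected model has reported back."""
    expected = obj.get("models", [])
    # all() over an empty list is True, so a missing "models" key
    # makes the work look complete immediately.
    return all(model in finished_models for model in expected)


expected_models = ["openai-text-embedding-ada-002", "model-b"]
finished = {"openai-text-embedding-ada-002"}  # only one vector is done so far

obj_after_get = {"doc_id": "abc", "models": list(expected_models)}
obj_after_pop = {"doc_id": "abc"}  # "models" was removed before sending

premature = all_models_finished(obj_after_pop, finished)  # looks complete
correct = all_models_finished(obj_after_get, finished)    # still waiting
```

Under these assumptions the popped object reports completion after the first vector returns, which would match the symptom of partial matches reaching check-api.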

skyemeedan

seems fine to me, but I don't really have full context. Since this is kind of core functionality for presto I think we should have test coverage for it if it is breaking?
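
One possible shape for such a regression test is sketched below. `iterate_models` is a stand-in for the loop in the diff, not the real function; a real test would call `get_blocked_presto_response` with Presto mocked out, which is not shown here.

```python
import copy


def iterate_models(obj):
    """Stand-in for the loop in the diff; returns the model keys it visited."""
    visited = []
    for model_key in obj.get("models", []):
        visited.append(model_key)
    return visited


def test_models_key_survives_iteration():
    obj = {"doc_id": "abc", "models": ["openai-text-embedding-ada-002"]}
    before = copy.deepcopy(obj)
    visited = iterate_models(obj)
    # Every requested model is visited...
    assert visited == before["models"]
    # ...and the payload is not mutated along the way.
    assert obj == before
```

The second assertion is the one that would have failed with pop in place.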

CV2-5640 only send responses to check api when we have a fully realiz…

054569b

…ed response from presto during sync requests

DGaffney marked this pull request as ready for review November 7, 2024 15:38

DGaffney requested review from skyemeedan and computermacgyver as code owners November 7, 2024 15:38

skyemeedan reviewed Nov 7, 2024


skyemeedan approved these changes Nov 7, 2024


DGaffney merged commit 2ffe0c8 into develop Nov 7, 2024
4 checks passed

DGaffney deleted the cv2-5640-suppress-multiple-responses branch November 7, 2024 18:48
