
CV2-5011 refactors for making Alegre dual-purpose on text encoding #103

Merged: 8 commits into master on Aug 16, 2024

Conversation

@DGaffney (Collaborator) commented Aug 5, 2024

Description

Minor tweak identified when running text encodings.

Reference: CV2-5011

How has this been tested?

Tested locally and confirmed to work

Are there any external dependencies?

No; this changes no dependencies.

Have you considered secure coding practices when writing this code?

Does not alter security posture.

@DGaffney DGaffney marked this pull request as draft August 5, 2024 17:10
@DGaffney DGaffney marked this pull request as ready for review August 8, 2024 10:23
@DGaffney DGaffney requested a review from caiosba August 8, 2024 10:24
@caiosba caiosba requested a review from ashkankzme August 8, 2024 11:18
@caiosba (Contributor) left a comment

Sorry, I'm not familiar with this code - but does this change in the return value require any change on the Alegre side?

@DGaffney (Collaborator, Author) commented Aug 8, 2024

> Sorry, I'm not familiar with this code - but does this change in the return value require any change on the Alegre side?

It does not. The only thing that's happening here is basically updating vectorizers to return what is now the current specification for how responses from fingerprinters should look - this one got out of sync since we hadn't been using it for anything. Mostly just tagging you here for visibility so you can be aware of the magnitude of changes across each repo; don't worry too much about the specifics in this repo!
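For readers outside the thread, a purely hypothetical sketch of the kind of change being described; the actual response specification isn't quoted anywhere in this PR, so the field name below is illustrative only.

    # Hypothetical before/after of a vectorizer's return value. Neither shape
    # is taken from the actual Presto specification; "hash_value" is only an
    # illustrative field name.

    def fingerprint_old(text: str) -> list:
        # Older style: return the raw vector by itself.
        return [0.12, -0.03, 0.98]

    def fingerprint_new(text: str) -> dict:
        # Newer style: wrap the vector in a structured response so the caller
        # can treat every fingerprinter's output uniformly.
        return {"hash_value": [0.12, -0.03, 0.98]}

    print(fingerprint_new("example text"))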

A contributor left a comment

I realize this file has existed for a while, but is there a reason we override the respond() function instead of process()? This would make this model unique and different from other Presto endpoints, and it disables caching and error handling, because the standard get_response() function won't be called from model.py.

But I may be missing something?
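As background for this thread, a minimal sketch of the pattern under discussion, assuming a Presto-style base Model in which get_response() wraps caching and error handling around process(); the class and method bodies are illustrative, not the project's actual code.

    # Illustrative only: a simplified base class in the style described above,
    # not the project's actual model.py.
    class Model:
        def __init__(self):
            self._cache = {}

        def get_response(self, message):
            # Shared plumbing: cache lookup, error handling, then process().
            key = str(message)
            if key in self._cache:
                return self._cache[key]
            try:
                result = self.process(message)
            except Exception as e:
                result = {"error": str(e)}
            self._cache[key] = result
            return result

        def process(self, message):
            raise NotImplementedError

    class ExampleVectorizer(Model):
        # Overriding process() keeps the caching and error handling above.
        def process(self, message):
            return {"hash_value": [0.1, 0.2, 0.3]}

    # A subclass that overrides respond()/get_response() directly would skip
    # that shared plumbing, which is the concern raised in this comment.
    print(ExampleVectorizer().get_response({"text": "example"}))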

@DGaffney (Collaborator, Author) replied

Yes - the reason we override is that we want, from the get-go, to natively be able to process jobs in batch on transformers instead of just walking through items one by one like we do with the others. I'll look into a refactor to keep our caching and error handling working for text, though - good catch.
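For context, a minimal sketch of the batching idea described here, using sentence-transformers as an example library; the model name and the shape of the call are assumptions, not code from this PR.

    from sentence_transformers import SentenceTransformer

    # Encoding a whole batch in one call lets the transformer vectorize many
    # texts per forward pass instead of looping over items one by one.
    model = SentenceTransformer("paraphrase-multilingual-mpnet-base-v2")  # assumed model name

    texts = ["first item", "second item", "third item"]
    vectors = model.encode(texts, batch_size=32)  # one call returns all vectors

    responses = [{"hash_value": vector.tolist()} for vector in vectors]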

@skyemeedan (Contributor) left a comment

> the only thing that's happening here is basically updating vectorizers to return what is now the current specification for how responses from fingerprinters should look - this one got out of sync since we hadn't been using it for anything.

I'm not very clear on how the output changed, but it looks like it will handle caching better? Is the current specification somewhere I can check whether it matches? I'm just assuming it works because you said you tested it locally.

@DGaffney (Collaborator, Author) replied

> the only thing that's happening here is basically updating vectorizers to return what is now the current specification for how responses from fingerprinters should look - this one got out of sync since we hadn't been using it for anything.
>
> I'm not very clear on how the output changed, but it looks like it will handle caching better? Is the current specification somewhere I can check whether it matches? I'm just assuming it works because you said you tested it locally.

That's basically it @skyemeedan - the function calling the fingerprinting function expects a slightly different response. Once we have the fixes from @ashkankzme in the PR he's working on, it should be a bit easier to highlight conceptual drift like this case.

@computermacgyver (Collaborator) left a comment

This looks fine for now. Thanks!
When we work on the bulk endpoints for Alegre, I think we're going to want one request to Presto to have a list of multiple text items to be vectorized. All of the vectors for those items should then be returned in one callback. I think some of the changes here may complicate that, but we can address it when we get to the bulk endpoints on Alegre.
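As a rough sketch of the bulk shape being described, purely for illustration; the field names and callback contract below are assumptions, not an agreed Alegre/Presto interface.

    # Hypothetical bulk request from Alegre to Presto; field names and the
    # callback contract are guesses, not an agreed interface.
    bulk_request = {
        "callback_url": "https://alegre.example/presto/receive",
        "items": [
            {"id": "doc-1", "text": "first text to vectorize"},
            {"id": "doc-2", "text": "second text to vectorize"},
        ],
    }

    # Hypothetical single callback carrying every vector for the batch.
    bulk_callback = {
        "items": [
            {"id": "doc-1", "hash_value": [0.11, 0.42]},
            {"id": "doc-2", "hash_value": [0.07, 0.93]},
        ],
    }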

Review comment on the following lines:

    self.model = GenericTransformerModel(None)
    self.model.model_name = "generic"
    self.model = AudioModel()
    self.model.model_name = "audio"
@ashkankzme (Contributor) commented Aug 15, 2024

Can we make sure model_name follows the same format as everywhere else, i.e. audio__Model? These names would break the tests in the new refactor.
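Purely as an illustration of the naming convention being requested; only audio__Model is stated explicitly in the thread, so the derivation rule below is a guess.

    # Guess at a "<prefix>__Model" rule implied by "audio__Model"; the exact
    # derivation is not spelled out in the thread.
    class AudioModel:
        pass

    def presto_model_name(model_class) -> str:
        prefix = model_class.__name__.removesuffix("Model").lower()
        return f"{prefix}__Model"

    print(presto_model_name(AudioModel))  # -> "audio__Model"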

@DGaffney DGaffney merged commit 11c2f79 into master Aug 16, 2024
2 checks passed