
Support batching for multi-input embeddings #169

Open
adubovik opened this issue Nov 4, 2024 · 0 comments · May be fixed by #193


adubovik commented Nov 4, 2024

```python
# NOTE: Amazon Titan doesn't support batched inputs
# TODO: create multiple tasks
async for sub_request in get_requests(self.storage, request):
    embedding, text_tokens = await call_embedding_model(
        self.client,
        self.model,
        create_titan_request(sub_request, request.dimensions),
    )
```

Follow the same pattern as in the vertexai adapter instead:

https://github.com/epam/ai-dial-adapter-vertexai/blob/7e0dc16165cf5ec9438ac41bfd02e647cc47c18f/aidial_adapter_vertexai/embedding/multi_modal.py#L199-L208
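A minimal sketch of the suggested change, using hypothetical stand-ins for `call_embedding_model` and the sub-request type: since Amazon Titan only accepts one input per request, spawn a concurrent asyncio task per sub-request and gather the results, instead of awaiting each call sequentially in the loop.

```python
import asyncio

async def call_embedding_model(sub_request: str) -> tuple[list[float], int]:
    # Placeholder for the real single-input Titan call; the actual
    # signature also takes the client, model, and a Titan request object.
    return [float(len(sub_request))], len(sub_request.split())

async def embed_all(sub_requests: list[str]) -> list[tuple[list[float], int]]:
    # One task per input; gather preserves input order in its results.
    tasks = [asyncio.create_task(call_embedding_model(r)) for r in sub_requests]
    return await asyncio.gather(*tasks)

results = asyncio.run(embed_all(["hello world", "foo"]))
```

This keeps the per-input requests (Titan's constraint) but overlaps their network latency, which is the gain the multi-task pattern in the vertexai adapter provides.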
