Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

the first submission of inception (embedding service): #1

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

legaltextai
Copy link

@legaltextai legaltextai commented Oct 29, 2024

  • uses cpu for query embedding, and gpu - for everything else;
  • i left port 8005 as it ran on my machine but we may need to change to a new one as a default;
  • tried to mimic Doctor microservice;
  • option to run in a docker;
  • the embed_endpoint includes some text preprocessing but we need to decide on whether we want all pre-processing be done in the client's script that will be sending texts for embedding;
  • tested on my ubuntu server, v100 gpu;
  • 'inception' sub-folder within 'inception' is like a dream within a dream.

- uses cpu for query embedding, and gpu - for everything else;
- the embed_endpoint includes some text preprocessing but we need to decide on whether we want all pre-processing be done in the client's script that will be sending texts for embedding;
- tested on my ubuntu server, v100 gpu
@mlissner
Copy link
Member

Thanks Sergei. I'll put this on Alberto's backlog for when he's back.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: To Do
Development

Successfully merging this pull request may close these issues.

3 participants