-
Notifications
You must be signed in to change notification settings - Fork 120
Issues: michaelfeil/infinity
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
torch.OutOfMemoryError: CUDA out of memory. when serving NV-Embed-V2 on 8 40G A100 GPUs
#498
opened Dec 16, 2024 by
atanikan
2 of 5 tasks
health endpoint does not really provide insights about healthiness
#483
opened Nov 28, 2024 by
bufferoverflow
5 tasks
support for dimensions field like in OpenAI text-embedding-3, thanks
good first issue
Good for newcomers
help wanted
Extra attention is needed
#476
opened Nov 21, 2024 by
ericg108
Maintainer: Breaking CI / Python installs
help wanted
Extra attention is needed
#415
opened Oct 11, 2024 by
michaelfeil
Add a End-to-end unit test for image embeddings and audio embeddings
help wanted
Extra attention is needed
#378
opened Sep 24, 2024 by
michaelfeil
when use engine optimum device tensorrt,startup fail
#372
opened Sep 23, 2024 by
weibingo
2 of 4 tasks
Reranker dynamic quantization
help wanted
Extra attention is needed
#363
opened Sep 16, 2024 by
rawsh-rubrik
jinaai/jina-reranker-v1-*-en does not work with optimum
#362
opened Sep 13, 2024 by
rawsh
2 of 4 tasks
Issue running cross-encoder onnx model exported with optimum-cli
#361
opened Sep 13, 2024 by
rawsh
2 of 4 tasks
Write a custom flash-attention function for the deberta model.
#359
opened Sep 12, 2024 by
wolfassi123
3 tasks done
Support Integration with KServe
help wanted
Extra attention is needed
#352
opened Sep 6, 2024 by
indranilr
Add Installation Option to Depend Only on ONNX, Excluding New Torch and CUDA Packages
#332
opened Aug 9, 2024 by
bash99
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.