Issues: huggingface/text-generation-inference
Issues list
Support response_format: {"type": "json_object"} without any constrained schema
#2899
opened Jan 10, 2025 by
lhoestq
Automatic Calculation of Sequence Length in TGI v3 Leads to Unrealistic Values Before CUDA OOM
#2897
opened Jan 10, 2025 by
biba10
2 of 4 tasks
Prefill operation can be significantly slower in TGI v3 vs TGI v2
#2896
opened Jan 10, 2025 by
biba10
2 of 4 tasks
[Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8] Bad Responses with High Concurrent Requests
#2894
opened Jan 9, 2025 by
michaelact
4 tasks done
make install-server does not have Apple MacOS Metal Framework
#2890
opened Jan 8, 2025 by
qdrddr
2 of 4 tasks
summarization using fine-tuned flan-t5 model in TGI outputs "generated text" instead of "summary_text" and outputs are completely different
#2889
opened Jan 7, 2025 by
maiiabocharova
2 of 4 tasks
Qwen2-VL failed to infer multiple images (Server error: upper bound and larger bound inconsistent with step sign)
#2888
opened Jan 7, 2025 by
AHEADer
2 of 4 tasks
Starcoder2-15B model - AttributeError: 'TensorParallelColumnLinear' object has no attribute [rank3]: 'base_layer'
#2881
opened Jan 6, 2025 by
ashwincv0112
3 of 4 tasks
truncation flag is missing from /v1/chat/completions
#2877
opened Jan 6, 2025 by
vitalyshalumov
2 of 4 tasks
Tool Calling using Vercel's AI SDK not working as intended
#2864
opened Dec 23, 2024 by
kldzj
2 of 4 tasks
text-generation-inference:latest-trtllm is missing dependencies to run models
#2854
opened Dec 18, 2024 by
selalipop
2 of 4 tasks
Entire system crashes when get to warm up model
#2853
opened Dec 17, 2024 by
ad-astra-video
1 of 4 tasks
random text generation from Qwen2-VL-7B-Instruct with TGI3
#2851
opened Dec 17, 2024 by
DongyoungKim2
2 of 4 tasks