Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Gemma3 sliding window support
#3280 opened Jun 27, 2025 by sywangyi Loading…
5 tasks
feat: allow json_schema in response format and add test
#3276 opened Jun 25, 2025 by drbh Loading…
HuggingFaceM4/Idefics3-8B-Llama3 crash fix
#3267 opened Jun 16, 2025 by sywangyi Loading…
Disable mamba in CPU platform
#3266 opened Jun 13, 2025 by casassg Loading…
3 of 5 tasks
Migrate to V2 Pydantic interface
#3262 opened Jun 11, 2025 by emmanuel-ferdman Loading…
1 of 5 tasks
fix multi-modality apply chat template issue
#3258 opened Jun 6, 2025 by sywangyi Loading…
5 tasks
feat: improve llava next pooling for granite vision
#3255 opened Jun 4, 2025 by drbh Loading…
Xccl
#3252 opened Jun 2, 2025 by sywangyi Draft
5 tasks
xpu lora support
#3232 opened May 19, 2025 by sywangyi Loading…
Trtllm backend improvements
#3231 opened May 17, 2025 by leejuyuu Loading…
1 of 5 tasks
Fix typos
#3210 opened May 6, 2025 by omahs Loading…
1 of 5 tasks
feat: lock updated kernel versions
#3201 opened Apr 29, 2025 by drbh Loading…
Set uv UV_PYTHON_INSTALL_DIR explicitly
#3197 opened Apr 27, 2025 by sebastianliebscher Loading…
1 of 5 tasks
2
README: minimum Python version is 3.10
#3194 opened Apr 25, 2025 by Frenzie Loading…
1 of 5 tasks
feat: support logit bias in chat request
#3186 opened Apr 22, 2025 by drbh Loading…
Fix flashinfer plan call to use positional arguments for #3165
#3166 opened Apr 11, 2025 by ruckc Loading…
2 of 5 tasks
Update to flashinfer 0.2.5
#3164 opened Apr 11, 2025 by danieldk Draft
5 tasks
Add chunked attn for L4
#3162 opened Apr 10, 2025 by mht-sharma Draft
2 of 7 tasks
Update links Inferentia refer docs
#3154 opened Apr 9, 2025 by guspan-tanadi Loading…
1 of 5 tasks
feat: align function id with tool call response
#3111 opened Mar 13, 2025 by drbh Loading…
wip: comment out prepend full_text
#3079 opened Mar 7, 2025 by jrc2139 Draft
1 of 5 tasks
Support xccl distributed backend
#3034 opened Feb 18, 2025 by dvrogozh Loading…
ProTip! Updated in the last three days: updated:>2025-06-24.