Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

xpu lora support
#3232 opened May 19, 2025 by sywangyi Loading…
feat: align function id with tool call response
#3111 opened Mar 13, 2025 by drbh Loading…
Enable qwen2vl video
#2756 opened Nov 18, 2024 by drbh Loading…
9 tasks done
Update Dockerfile to use devel image for compatibility
#2848 opened Dec 16, 2024 by YaserJaradeh Loading…
2 of 5 tasks
llava next image encoder to allow un-aligned patch / image sizes
#2936 opened Jan 22, 2025 by jimexist Loading…
5 tasks
[Backend] Introduce vLLM backend
#2976 opened Jan 31, 2025 by mfuntowicz Loading…
Support xccl distributed backend
#3034 opened Feb 18, 2025 by dvrogozh Loading…
Add model_load_time metric
#2311 opened Jul 26, 2024 by Edwinhr716 Loading…
2 of 5 tasks
Update links Inferentia refer docs
#3154 opened Apr 9, 2025 by guspan-tanadi Loading…
1 of 5 tasks
Add chunked attn for L4
#3162 opened Apr 10, 2025 by mht-sharma Draft
2 of 7 tasks
Update to flashinfer 0.2.5
#3164 opened Apr 11, 2025 by danieldk Draft
5 tasks
Fix flashinfer plan call to use positional arguments for #3165
#3166 opened Apr 11, 2025 by ruckc Loading…
2 of 5 tasks
feat: support logit bias in chat request
#3186 opened Apr 22, 2025 by drbh Loading…
README: minimum Python version is 3.10
#3194 opened Apr 25, 2025 by Frenzie Loading…
1 of 5 tasks
Set uv UV_PYTHON_INSTALL_DIR explicitly
#3197 opened Apr 27, 2025 by sebastianliebscher Loading…
1 of 5 tasks
2
feat: lock updated kernel versions
#3201 opened Apr 29, 2025 by drbh Loading…
Fix typos
#3210 opened May 6, 2025 by omahs Loading…
1 of 5 tasks
Optimum neuron 0.2.1
#3281 opened Jun 27, 2025 by dacorvo Loading…
wip: comment out prepend full_text
#3079 opened Mar 7, 2025 by jrc2139 Draft
1 of 5 tasks
ProTip! Updated in the last three days: updated:>2025-06-24.