Pull requests: vllm-project/vllm

Pull requests list

[Model]Add DebertaV2ForSequenceClassification Model Support
Labels: documentation
#20215 opened Jun 28, 2025 by yashaswipiplani Draft
2 of 4 tasks
[CI Fix] Try fixing eagle e2e test OOM by reducing block allocation
Labels: bug, ready, speculative-decoding
#20213 opened Jun 28, 2025 by mgoin
[Model] Support HF format of minimax
Labels: new-model, ready
#20211 opened Jun 28, 2025 by mgoin
[Misc] ADD Docker compose exemple
#20210 opened Jun 28, 2025 by maher-naija-pro
1 task done
[doc] Add Slack and Forum to the top navigation
Labels: documentation
#20208 opened Jun 28, 2025 by reidliu41
4 tasks
Fix cuda_archs_loose_intersection when handling sm_*a
Labels: ci/build, ready
#20207 opened Jun 28, 2025 by huydhn
[BUGFIX][DEEPSEEK][MODEL_LOAD] fix w13, w2 weight not initialized assert
Labels: ready
#20202 opened Jun 27, 2025 by xuechendi
1 of 4 tasks
[Do not merge] Add out of place layernorm
Labels: performance
#20197 opened Jun 27, 2025 by charlifu
[CI][Intel Gaudi][vllm-Plugin]Add CI for hpu-plugin-v1-test
Labels: ci/build, documentation
#20196 opened Jun 27, 2025 by xuechendi
3 tasks
FlashInfer generated decode kernels.
#20194 opened Jun 27, 2025 by wenscarl Draft
4 tasks
Eepp
Labels: frontend, needs-rebase, v1
#20191 opened Jun 27, 2025 by ruisearch42 Draft
4 tasks
[WIP] Run eagle with full cudagraph
Labels: documentation, v1
#20190 opened Jun 27, 2025 by zixi-qi Draft
[Nixl] Heterogeneous TP support FlashInfer
#20189 opened Jun 27, 2025 by NickLucche
Enabled BnB NF4 inference on Gaudi
#20172 opened Jun 27, 2025 by rsshaik1
[Feature]: Implement check_health for V1
Labels: v1
#20164 opened Jun 27, 2025 by limbaniharsh
1 of 3 tasks