Pull requests: vllm-project/vllm

Pull requests list

[Misc] Support MMMU accuracy benchmark (labels: performance)
#23034 opened Aug 16, 2025 by tanruixiang Draft
4 tasks
[Bugfix] fix some minor issues of marlin kernel (labels: bug, ready)
#23032 opened Aug 16, 2025 by jinzhen-lin
[Bugfix] fix qwen3 moe fp8 accuracy issue (labels: qwen)
#23031 opened Aug 16, 2025 by jinzhen-lin
[Refactor] Defer tensor data construction in MultiModalKwargs (labels: multi-modality, ready, v1)
#23030 opened Aug 16, 2025 by DarkLight1337
4 tasks
[Misc] refactor function name (labels: v1)
#23029 opened Aug 16, 2025 by andyxning
4 tasks
[Bugfix] fix IntermediateTensors equal method
#23027 opened Aug 16, 2025 by andyxning
4 tasks
[WIP] Allow disabling TP sharding for parallel Linear layer (labels: deepseek)
#23024 opened Aug 16, 2025 by Isotr0py Draft
1 of 4 tasks
[Core] Use key-only cache for BaseMultiModalProcessor (labels: documentation, llama, multi-modality, qwen, tpu, v1)
#23018 opened Aug 16, 2025 by DarkLight1337 Draft
3 of 4 tasks
[FlashInfer] Truncate block tables for sliding window attention (labels: ready, v1)
#23010 opened Aug 15, 2025 by WoosukKwon
Use Blackwell FlashInfer MXFP4 MoE by default if available
#23008 opened Aug 15, 2025 by mgoin
4 tasks
[UX] Separate marlin moe config logic from triton moe (labels: ready)
#23006 opened Aug 15, 2025 by mgoin
4 tasks
[CI/Build] Replace lm-eval gsm8k tests with faster implementation (labels: ci/build, ready)
#23002 opened Aug 15, 2025 by mgoin
4 tasks
[Benchmarks] add benchmark for embedding models (labels: performance)
#23000 opened Aug 15, 2025 by ZJY0516
4 tasks
Upgrade xgrammar to 0.1.23 (labels: ci/build, v1)
#22988 opened Aug 15, 2025 by russellb
[ROCm][Bugfix] Add missing max_qlen argument (labels: rocm)
#22984 opened Aug 15, 2025 by tuukkjs
3 of 4 tasks
[BugFix] pp cannot run successfully under NixlConnector
#22976 opened Aug 15, 2025 by R2-Y
4 tasks