-
-
Notifications
You must be signed in to change notification settings - Fork 8.4k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model]Add DebertaV2ForSequenceClassification Model Support
documentation
Improvements or additions to documentation
#20215
opened Jun 28, 2025 by
yashaswipiplani
•
Draft
2 of 4 tasks
[CI Fix] Try fixing eagle e2e test OOM by reducing block allocation
bug
Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
speculative-decoding
#20213
opened Jun 28, 2025 by
mgoin
Loading…
[Build] No need to use the more restrictive gencode then necessary
ci/build
#20212
opened Jun 28, 2025 by
LucasWilkinson
•
Draft
3 of 4 tasks
[Model] Support HF format of minimax
new-model
Requests to new models
ready
ONLY add when PR is ready to merge/full CI is needed
#20211
opened Jun 28, 2025 by
mgoin
Loading…
[Misc] ADD Docker compose exemple
#20210
opened Jun 28, 2025 by
maher-naija-pro
Loading…
1 task done
[doc] Add Slack and Forum to the top navigation
documentation
Improvements or additions to documentation
#20208
opened Jun 28, 2025 by
reidliu41
Loading…
4 tasks
Fix cuda_archs_loose_intersection when handling sm_*a
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#20207
opened Jun 28, 2025 by
huydhn
Loading…
[WIP][NOT READY] Refactor CLI Args for a better modular integration
frontend
needs-rebase
#20206
opened Jun 28, 2025 by
kouroshHakha
•
Draft
4 tasks
[BUGFIX][DEEPSEEK][MODEL_LOAD] fix w13, w2 weight not initialized assert
ready
ONLY add when PR is ready to merge/full CI is needed
#20202
opened Jun 27, 2025 by
xuechendi
Loading…
1 of 4 tasks
Validate @config in pre-commit instead of dynamically
#20200
opened Jun 27, 2025 by
lionelvillard
•
Draft
4 tasks
[Do not merge] Add out of place layernorm
performance
Performance-related issues
#20197
opened Jun 27, 2025 by
charlifu
Loading…
[CI][Intel Gaudi][vllm-Plugin]Add CI for hpu-plugin-v1-test
ci/build
documentation
Improvements or additions to documentation
#20196
opened Jun 27, 2025 by
xuechendi
Loading…
3 tasks
[UT][intel GPU] use current_platform instead of device hardcode in v1 tests
rocm
Related to AMD ROCm
v1
#20169
opened Jun 27, 2025 by
Liangliang-Ma
Loading…
[Bugfix] Fix Maverick correctness by filling zero to cache space in cutlass_moe
#20167
opened Jun 27, 2025 by
minosfuture
Loading…
[Bugfix] Fix topk_ids indices_type for CUTLASS w8a8 FP8 MoE
#20166
opened Jun 27, 2025 by
minosfuture
Loading…
[Feature]: Implement
check_health
for V1
v1
#20164
opened Jun 27, 2025 by
limbaniharsh
Loading…
1 of 3 tasks
[Feature] Enable triton scaled mm for NVIDIA GPUs with ahead-of-time autotuning
performance
Performance-related issues
#20163
opened Jun 27, 2025 by
gau-nernst
•
Draft
3 of 4 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.