Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Bump Compressed Tensors version to 0.9.4 ci/build ready ONLY add when PR is ready to merge/full CI is needed
#17478 opened Apr 30, 2025 by rahul-tuli Approved
[Misc] refactor example - cpu_offload_lmcache documentation Improvements or additions to documentation
#17460 opened Apr 30, 2025 by reidliu41 Review required
[CI/Build] Reorganize models tests ci/build multi-modality Related to multi-modality (#4194)
#17459 opened Apr 30, 2025 by DarkLight1337 Approved
Improve configs - ObservabilityConfig needs-rebase ready ONLY add when PR is ready to merge/full CI is needed
#17453 opened Apr 30, 2025 by hmellor Review required
Fix more broken speculative decode tests ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding
#17450 opened Apr 30, 2025 by huydhn Approved
[V1] Allow turning off pickle fallback in vllm.v1.serial_utils ready ONLY add when PR is ready to merge/full CI is needed v1
#17427 opened Apr 30, 2025 by russellb Approved
Avoid overwriting vllm_compile_cache.py ready ONLY add when PR is ready to merge/full CI is needed
#17418 opened Apr 29, 2025 by youngkent Approved
[Bugfix] Temporarily disable gptq_bitblas on ROCm documentation Improvements or additions to documentation
#17411 opened Apr 29, 2025 by nlzy Review required
ProTip! Follow long discussions with comments:>50.