
Pull requests: vllm-project/llm-compressor

Pull requests list

[Examples] Create qwen_2_5_vl_example.py
#1752 opened Aug 19, 2025 by Zhao-Dongyu
[Transform] SpinQuant R4
#1746 opened Aug 18, 2025 by kylesayrs Draft
[bugfix] Fix indentation errors in the README file
#1737 opened Aug 15, 2025 by qibaoyuan
Enable xpu device
#1736 opened Aug 15, 2025 by jiqing-feng
[WIP] Add upper bounds to dependencies for release [label: ready]
#1734 opened Aug 14, 2025 by dhuangnm
[Utils] Offloaded cache size
#1714 opened Aug 7, 2025 by kylesayrs
[Tracing] Decouple vision tower from first layer [label: ready]
#1710 opened Aug 6, 2025 by kylesayrs
[WIP] [MoE] GPT OSS
#1705 opened Aug 5, 2025 by kylesayrs Draft
[MoE] Add conditional expert calibration
#1701 opened Aug 1, 2025 by dichn
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
1686 Logic matching refactor
#1687 opened Jul 28, 2025 by ved1beta
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa
[KV Cache] support kv cache int8 per channel quantization [label: ready]
#1663 opened Jul 19, 2025 by Eviannn
[Transform] Online Rotations
#1651 opened Jul 16, 2025 by kylesayrs Draft
[Pipelines] Add propagate_error argument [label: ready]
#1575 opened Jun 20, 2025 by kylesayrs Draft
[GPTQ] Use torch.compile to speed up gptq algo [label: ready]
#1561 opened Jun 17, 2025 by aladerran
Disable sequential_targets from modifiers [label: ready]
#1559 opened Jun 16, 2025 by kylesayrs Draft