-
Notifications
You must be signed in to change notification settings - Fork 207
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Tests] Add recovery-based validation to LM-Eval tests
#1750
opened Aug 18, 2025 by
rahul-tuli
•
Draft
2 of 7 tasks
[WIP] Add upper bounds to dependencies for release
ready
When a PR is ready for review
#1734
opened Aug 14, 2025 by
dhuangnm
Loading…
[Tracing] Decouple vision tower from first layer
ready
When a PR is ready for review
#1710
opened Aug 6, 2025 by
kylesayrs
Loading…
[Autowrapper] Support Gemma3, autowrapper improvements
#1693
opened Jul 30, 2025 by
kylesayrs
Loading…
[KV Cache] support kv cache int8 per channel quantization
ready
When a PR is ready for review
#1663
opened Jul 19, 2025 by
Eviannn
Loading…
Minor speedup for
infer_quantization_format
when save_compressed=False
#1636
opened Jul 10, 2025 by
kylesayrs
Loading…
[GPTQ] Use torch.compile to speed up gptq algo
ready
When a PR is ready for review
#1561
opened Jun 17, 2025 by
aladerran
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.