Closed
Labels
release (Related to new version release)
Description
ETA Friday -> Wednesday 07/03
- [Model] Jamba support #4115
- Support Deepseek-V2 #4650
- [Kernel][Model] logits_soft_cap for Gemma2 with flashinfer #6051
- [Bug]: Current Main Does Not Work On Python3.8 #6033
- [BugFix] Ensure worker model loop is always stopped at the right time #5987
- [Bugfix] Add explicit `end_forward` calls to flashinfer #6044
- [Core] Pipeline Parallel Support #4412
- [Speculative Decoding] MLPSpeculator Tensor Parallel support (1/2) #6050
- [ci][misc] fix more device count #6055 <- [Core] Dynamic image size support for VLMs #5276