Pull requests: huggingface/trl

Pull requests list

RewardTrainer refactor
#4093 opened Sep 15, 2025 by qgallouedec
5 tasks
Convert set to list of tags
#4092 opened Sep 15, 2025 by qgallouedec
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8
2 of 5 tasks
feat: Add WeaveCallback for W&B Weave integration
#4089 opened Sep 15, 2025 by parambharat
2 of 5 tasks
fix: use_liger_kernel with IterableDataset
#4087 opened Sep 15, 2025 by jue-jue-zi
2 of 5 tasks
Update links to docs in README to latest packaged version
#4084 opened Sep 15, 2025 by sergiopaniego
5 tasks
Fix usage of VLM using text only
#4080 opened Sep 14, 2025 by SamuelBarryCS
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106
2 of 5 tasks
Add VLM support to RLOO trainer
#4067 opened Sep 11, 2025 by behroozazarkhalili
feat: Add NPU and XPU support for activation offloading
#4056 opened Sep 10, 2025 by zilongzheng
2 of 5 tasks
Enable XPU for vllm client
#4031 opened Sep 8, 2025 by jiqing-feng
vllm sleep mode support
#4028 opened Sep 8, 2025 by ved1beta
2 of 5 tasks
Fix: undefined current_gradient_accumulation_steps
#4014 opened Sep 5, 2025 by ysjprojects
2 of 5 tasks
Improve typing of SFT trainer
#4007 opened Sep 4, 2025 by cyyever