huggingface / trl Public

generated from fastai/nbdev_template

Fork 2.2k
Star 15.6k

Pull requests: huggingface/trl

80 Open 1,946 Closed

Pull requests list

🎞️ Support sequence classification models in clone_chat_template

#4097 opened Sep 16, 2025 by qgallouedec

RewardTrainer refactor

#4093 opened Sep 15, 2025 by qgallouedec

5 tasks

Convert set to list of tags

#4092 opened Sep 15, 2025 by qgallouedec

feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer

#4091 opened Sep 15, 2025 by ycma8

2 of 5 tasks

feat: Add WeaveCallback for W&B Weave integration

#4089 opened Sep 15, 2025 by parambharat

2 of 5 tasks

fix: use_liger_kernel with IterableDataset

#4087 opened Sep 15, 2025 by jue-jue-zi

2 of 5 tasks

Update links to docs in README to latest packaged version

#4084 opened Sep 15, 2025 by sergiopaniego

5 tasks

Fix get_peft_model() so that prepare_model_for_kbit_training does not reapply to an instance of PeftModel, thus freezing all the layers

#4081 opened Sep 15, 2025 by Hoesu

2 of 5 tasks

Fix usage of VLM using text only

#4080 opened Sep 14, 2025 by SamuelBarryCS

Add config_init_kwargs option in GRPOConfig

#4069 opened Sep 12, 2025 by hokuyama0106

2 of 5 tasks

Add VLM support to RLOO trainer

#4067 opened Sep 11, 2025 by behroozazarkhalili

🧹 Remove max_batch_tokens, num_blocks and block_size from generation kwargs

#4065 opened Sep 11, 2025 by qgallouedec

[GRPO]: Sample from a Replay Buffer To Substitute Groups with 0 std.

#4060 opened Sep 10, 2025 by pramodith • Draft

4 of 5 tasks

[vllm] ensure MASTER_ADDR/MASTER_PORT are set safely

#4057 opened Sep 10, 2025 by kashif

feat: Add NPU and XPU support for activation offloading

#4056 opened Sep 10, 2025 by zilongzheng

2 of 5 tasks

✨ Add logging for training completion and model saving in training scripts

#4048 opened Sep 9, 2025 by qgallouedec

[Draft] Add configurable dataset column logging to GRPOTrainer W&B tables

#4045 opened Sep 9, 2025 by davanstrien • Draft

Enable XPU for vllm client

#4031 opened Sep 8, 2025 by jiqing-feng

vllm sleep mode support

#4028 opened Sep 8, 2025 by ved1beta

2 of 5 tasks

Fix #3982: Fix DPO Trainer support for Gemma 3 vision models

#4022 opened Sep 6, 2025 by akshay-babbar

Fix: undefined current_gradient_accumulation_steps

#4014 opened Sep 5, 2025 by ysjprojects

2 of 5 tasks

Fix: ignore precompute_ref_log_probs when use_liger_loss=True

#4008 opened Sep 4, 2025 by ginkyenglee

5 tasks

Improve typing of SFT trainer

#4007 opened Sep 4, 2025 by cyyever

⚖️ Align SFT and DPO for model creation and deprecate DPOConfig.padding_value in favour of pad_token_id

#4006 opened Sep 4, 2025 by qgallouedec

5 tasks

Remove attention mask when position ids is returned

#3997 opened Sep 2, 2025 by qgallouedec • Draft
