Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix typos
#4106 opened Sep 19, 2025 by cyyever Loading…
5 tasks
docs: correct option name to enable vllm sleep mode
#4102 opened Sep 18, 2025 by muupan Loading…
1 task done
Fix VLM configs in generate_tiny_models
#4101 opened Sep 17, 2025 by albertvillanova Loading…
RewardTrainer refactor
#4093 opened Sep 15, 2025 by qgallouedec Loading…
5 tasks
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8 Loading…
2 of 5 tasks
fix: use_liger_kernel with IterableDataset
#4087 opened Sep 15, 2025 by jue-jue-zi Loading…
2 of 5 tasks
Update links to docs in README to latest packaged version
#4084 opened Sep 15, 2025 by sergiopaniego Loading…
5 tasks
Fix usage of VLM using text only
#4080 opened Sep 14, 2025 by SamuelBarryCS Loading…
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106 Loading…
2 of 5 tasks
[GRPO]: Sample from a Replay Buffer To Substitute Groups with 0 std.
#4060 opened Sep 10, 2025 by pramodith Loading…
4 of 5 tasks
vllm sleep mode support
#4028 opened Sep 8, 2025 by ved1beta Loading…
2 of 5 tasks
Fix: undefined current_gradient_accumulation_steps
#4014 opened Sep 5, 2025 by ysjprojects Loading…
2 of 5 tasks
Improve typing of SFT trainer
#4007 opened Sep 4, 2025 by cyyever Loading…
fix bug when using dataset streaming by accelerate
#3950 opened Aug 25, 2025 by kaixuanliu Loading…
[SFTTrainer]: Check for assistant mask up to max_length
#3930 opened Aug 20, 2025 by pramodith Loading…
3 of 5 tasks
[DRAFT] Refactor DPO
#3906 opened Aug 15, 2025 by qgallouedec Draft
5 tasks
ProTip! Adding no:label will show everything without a label.