Insights: huggingface/trl
Overview
6 Pull requests merged by 5 people
- adds AOT (#1701, merged Jun 12, 2024)
- ktotrainer: Refuse datasets which contain only one class of labels (#1724, merged Jun 11, 2024)
- feat(ci): add trufflehog secrets detection (#1721, merged Jun 10, 2024)
- Fix default padding_value in dpo_config.py (#1692, merged Jun 7, 2024)
- fix yaml parser for derived config classes (#1713, merged Jun 7, 2024)
- [RPO] fix nll loss (#1705, merged Jun 7, 2024)
5 Pull requests opened by 4 people
- Fix masking of response tokens (#1718, opened Jun 9, 2024)
- Adding SimPO to TRL (#1725, opened Jun 11, 2024)
- prepare deepspeed accomodate fp16 and bf16 (#1728, opened Jun 13, 2024)
- rloo trainer with trainer callbacks (#1729, opened Jun 13, 2024)
- allow ref model use ds stage3 only (#1730, opened Jun 13, 2024)
14 Issues closed by 7 people
- kto error when assign dataset to device (#1620, closed Jun 13, 2024)
- Using PEFT causes model to not predict EOS (#1578, closed Jun 12, 2024)
- KTO finetuning - float division by zero (#1651, closed Jun 11, 2024)
- How to use trl\trainer\kto_trainer.py (#1635, closed Jun 11, 2024)
- ImportError: cannot import name 'SFTConfig' from 'trl' (#1639, closed Jun 11, 2024)
- ValueError when training on a multi GPU setup and DPO (#1645, closed Jun 11, 2024)
- Feature Request: Simple Preference Optimization Integration (#1684, closed Jun 10, 2024)
- Why ratios don't need to be detached? (#1720, closed Jun 10, 2024)
- TRL orpo gives everything Nan (#1473, closed Jun 10, 2024)
- Possible risks in xxxPOTrainer (#1679, closed Jun 10, 2024)
- Training stops early (#1601, closed Jun 7, 2024)
- YamlConfigParser fails on RewardConfig, DPOConfig etc. (#1712, closed Jun 7, 2024)
- Discrepancy between reward model training tokenization and PPO tokenization in PPOTrainerv2 (#1702, closed Jun 7, 2024)
10 Issues opened by 10 people
- TRLParser needs changes, overwrites command line arguments with config (#1733, opened Jun 13, 2024)
- [BUG] RuntimeError: still have inflight params in KTO (#1732, opened Jun 13, 2024)
- Questions about the reference model in PPOTrainer and DPOTrainer (#1727, opened Jun 12, 2024)
- FSDP with PPO trainer won't work because FSDP doesn't support model.generate (#1726, opened Jun 12, 2024)
- FSDP Must flatten tensors with uniform dtype but got torch.bfloat16 and torch.float32 (#1723, opened Jun 11, 2024)
- KTO train_loss = 0.0 (#1722, opened Jun 10, 2024)
- Problem handling of response masks (#1717, opened Jun 9, 2024)
- trl CLI doesn't work (#1716, opened Jun 7, 2024)
- Feature Request: Self-Improving Robust Preference Optimization (SRPO) (#1714, opened Jun 7, 2024)
20 Unresolved conversations
Sometimes conversations happen on old items that aren't yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- Added Reward Backpropogation Support (#1585, commented on Jun 12, 2024 • 10 new comments)
- Got an abnormally high loss when training Gemma-7B. (#1709, commented on Jun 8, 2024 • 3 new comments)
- DPO models generate multiple / corrupted responses (#1025, commented on Jun 12, 2024 • 3 new comments)
- Error when Using 8-bit Quantization (#1616, commented on Jun 11, 2024 • 3 new comments)
- [ORPO] Enable batched tokenization & multiprocessing to process large datasets (#1624, commented on Jun 9, 2024 • 1 new comment)
- Integrate f-divergence to DPO (Follow up) (#1610, commented on Jun 11, 2024 • 1 new comment)
- Minimal examples (#1603, commented on Jun 9, 2024 • 1 new comment)
- A pull request for POVIDTrainer (#1573, commented on Jun 11, 2024 • 1 new comment)
- How to save and resume a checkpoint from PPOTrainer (#1643, commented on Jun 13, 2024 • 1 new comment)
- Do we need to consider the chat template when doing DPO/KTO training? (#1640, commented on Jun 12, 2024 • 1 new comment)
- how to save v_head (#1650, commented on Jun 12, 2024 • 1 new comment)
- CLI utils class cases seem to be incorrect (#1600, commented on Jun 11, 2024 • 1 new comment)
- SFTrainer with FSDP on a model that doens't fit in GPU memory (#1681, commented on Jun 11, 2024 • 1 new comment)
- FineTuning issue with Gemma-2B-IT model using the SFTTrainer (#1665, commented on Jun 11, 2024 • 1 new comment)
- Use `SFTTrainer` for completion-only model without `DataCollatorForCompletionOnlyLM` (#1507, commented on Jun 11, 2024 • 1 new comment)
- stf Example not working (#1693, commented on Jun 10, 2024 • 1 new comment)
- RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:7 and cpu! (values = values * mask) (#1691, commented on Jun 10, 2024 • 1 new comment)
- Seq2seq model with ppo_trainer samples strange output! (#1633, commented on Jun 8, 2024 • 1 new comment)
- [DRAFT] Vllm integration (#1628, commented on Jun 7, 2024 • 0 new comments)
- Why compute IPO loss using `average_log_prob=Ture`? (#1677, commented on Jun 7, 2024 • 0 new comments)