Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问是否计划支持Full-parameter Reward Model训练 #1011

Closed
DtYXs opened this issue Sep 22, 2023 · 1 comment
Closed

请问是否计划支持Full-parameter Reward Model训练 #1011

DtYXs opened this issue Sep 22, 2023 · 1 comment
Labels
duplicate This issue or pull request already exists

Comments

@DtYXs
Copy link

DtYXs commented Sep 22, 2023

No description provided.

@hiyouga
Copy link
Owner

hiyouga commented Sep 22, 2023

#224

@hiyouga hiyouga added the duplicate This issue or pull request already exists label Sep 22, 2023
@hiyouga hiyouga closed this as completed Sep 22, 2023
hiyouga added a commit that referenced this issue Nov 16, 2023
Refactor llmtuner, support full-parameter RLHF
sangttruong pushed a commit to painkillernhat/LLaMA-Factory that referenced this issue May 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
duplicate This issue or pull request already exists
Projects
None yet
Development

No branches or pull requests

2 participants