Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ORPO微调的instruction设置问题 #356

Open
Sherww opened this issue Jan 8, 2025 · 0 comments
Open

ORPO微调的instruction设置问题 #356

Sherww opened this issue Jan 8, 2025 · 0 comments

Comments

@Sherww
Copy link

Sherww commented Jan 8, 2025

请问一下大家,我按照如下格式设置了ORPO微调的数据:
[
{
"instruction": "人类指令(必填)",
"input": "人类输入(选填)",
"chosen": "优质回答(必填)",
"rejected": "劣质回答(必填)"
}
]
那么其中的instruction我应该填写什么内容大家有建议吗?
我在qwen2.5 coder的基础上进行微调,数据是代码的行级补全数据,我目前的instruction填写的内容类似于“请为我补全以下代码”。但我发现在参数调优后,微调后的模型效果也没有变好,因此我在寻找可能的原因,instruction的设置会是一个可能的原因吗?谢谢

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant