ORPO微调的instruction设置问题 #356

Sherww · 2025-01-08T01:08:21Z

请问一下大家，我按照如下格式设置了ORPO微调的数据：
[
{
"instruction": "人类指令（必填）",
"input": "人类输入（选填）",
"chosen": "优质回答（必填）",
"rejected": "劣质回答（必填）"
}
]
那么其中的instruction我应该填写什么内容大家有建议吗？
我在qwen2.5 coder的基础上进行微调，数据是代码的行级补全数据，我目前的instruction填写的内容类似于“请为我补全以下代码”。但我发现在参数调优后，微调后的模型效果也没有变好，因此我在寻找可能的原因，instruction的设置会是一个可能的原因吗？谢谢

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ORPO微调的instruction设置问题 #356

ORPO微调的instruction设置问题 #356

Sherww commented Jan 8, 2025

ORPO微调的instruction设置问题 #356

ORPO微调的instruction设置问题 #356

Comments

Sherww commented Jan 8, 2025