Comments (17)
Try training on multiple GPUs, or increase the number of `gradient_accumulation_steps`.
from alpaca-cot.
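As a rough sketch of the tradeoff suggested above (the helper below is illustrative, not from this repo's training scripts): raising gradient accumulation lets a single GPU reach the same effective batch size as a multi-GPU run, at the cost of longer wall-clock time per optimizer step.

```python
# Effective batch size = per-device batch * gradient accumulation steps * GPU count.
# Raising accumulation steps on one GPU matches the effective batch of a
# multi-GPU run, trading training speed for lower memory use.
def effective_batch_size(per_device_batch: int,
                         grad_accum_steps: int,
                         n_gpus: int = 1) -> int:
    return per_device_batch * grad_accum_steps * n_gpus

# Single 3090: micro-batch 4, accumulate 32 steps -> effective batch 128.
single_gpu = effective_batch_size(4, 32, n_gpus=1)
# Four GPUs: micro-batch 4, accumulate only 8 steps -> same effective batch.
multi_gpu = effective_batch_size(4, 8, n_gpus=4)
assert single_gpu == multi_gpu == 128
```

The batch sizes and step counts here are made-up examples; the point is only that the product stays constant.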
That's strange. Are you training on a single GPU?
from alpaca-cot.
Yes, a single 3090.
from alpaca-cot.
I've messaged you privately.
from alpaca-cot.
I’m encountering the same problem, can you help me?
from alpaca-cot.
Will try. But what about tweaking the parameters? Do you know which parameters I should adjust to make the responses less repetitive and more truthful? I tried changing the temperature to 0.8 and setting greedy decoding to True.
from alpaca-cot.
Or should I just train it longer by adding more epochs?
from alpaca-cot.
- Raising the temperature has a slight effect on reducing repetition: a higher temperature gives a smoother (flatter) prediction distribution over the vocabulary, so words that are rarely picked get a greater chance of being sampled.
- If you set greedy decoding to true, there is no need to configure the temperature.
- The main reason is that the model is not well trained. More data can be used. If the amount of data is small (50k), it is recommended to train for 3 epochs. If the amount is large (0.5M+), you only need to train for 1 epoch.
from alpaca-cot.
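To illustrate the first two points above with a minimal, self-contained sketch (plain softmax math, not tied to any particular decoding library): a higher temperature flattens the next-token distribution, while greedy decoding (argmax) picks the same token at any temperature, which is why temperature is irrelevant under greedy decoding.

```python
import math

def softmax(logits, temperature=1.0):
    # Dividing logits by a higher temperature flattens the distribution,
    # giving rarely-chosen tokens a better chance and reducing repetition.
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [4.0, 2.0, 1.0]  # made-up next-token logits
p_low = softmax(logits, temperature=0.5)
p_high = softmax(logits, temperature=2.0)

# The top token's probability shrinks as temperature rises:
assert p_high[0] < p_low[0]
# But greedy decoding (argmax) selects the same token regardless:
assert max(range(3), key=lambda i: p_low[i]) == \
       max(range(3), key=lambda i: p_high[i])
```

So with `greedy=True` the model always takes the highest-logit token, and no temperature setting changes the output.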
By 50k, do you mean the number of instructions? My data size is currently 26MB.
from alpaca-cot.
Yes. 26MB of data (in terms of number of instructions) is fairly large. Is the data quality not good?
from alpaca-cot.
Well, my data size is 26MB with 50k instructions. The quality should be similar to the Alpaca data, as I only translated it into another language using ChatGPT.
from alpaca-cot.
I see. This may be due to Bloom's weak ability in your target language. You can collect more data, try larger Bloom models, or try an LLM that performs well on the target language.
from alpaca-cot.
do you think adding more epochs will solve the problem?
from alpaca-cot.
Yes, it should be better.
from alpaca-cot.
How many epochs did you set? 3 epochs is suitable for 50k instructions.
from alpaca-cot.
Currently only one, since I have a limit on my GPU usage. Three epochs would cost about 10 hours of training (on an A100 80GB).
from alpaca-cot.
I'll switch the model to LLaMA and see how it goes.
from alpaca-cot.
Related Issues (20)
- The recommended fine-tuning command for ChatGLM OOMs on a 3090 24G; the code's default 8-bit quantization also causes OOM HOT 1
- How to modify the model's self-identity
- GPTeacher Code-Instruct HOT 1
- Running MOSS on 8x V100 gives OOV HOT 2
- Prompt settings HOT 1
- About the tokenizer
- The text meaning in zh_helpfulness_context.json in Alpaca-CoT / MOSS / moss-002-sft
- DataCollatorForLanguageModeling uses the unmasked labels
- The --size argument is missing in web.py HOT 1
- Inference results vary considerably; what is the reason? HOT 2
- Could a Gdrive or Baidu Cloud download option be provided HOT 2
- Could qlora be supported
- Hello, the group QR code has expired HOT 1
- About the source of the dataset
- What is the relationship between the data and the link you provided?
- Hello, could you update the group info HOT 2
- Adding Contributors Section In readme.md
- Hello, the group QR code has expired; could you update it~ HOT 6
- The readme ordering on the main branch, and could huggingface links be provided for the base models HOT 4
- 1