Comments (1)
超参数设置有问题
from llama-factory.
Related Issues (20)
- 在Training中,使用template: chatglm3_system参数后,还存在Current template does not support `train_on_prompt` HOT 2
- AttributeError: '_lzma.LZMADecompressor' object has no attribute 'needs_input' HOT 4
- error:NO API found HOT 6
- 多卡微调qwen1.5_moe出错
- 用qlora 4bit 微调llama3 70b oom HOT 3
- Phi-3-small exploding gradient issue. HOT 2
- loss一直上升 HOT 1
- 偶尔输出第一个符号是冒号 HOT 2
- MOD训练的模型能否在vllm上推理 HOT 1
- llamafactory-cli: command not found HOT 3
- DeepSeek-V2-Lite-Chat lora bf16 训练报错 HOT 4
- 训练样本加载完后,数量莫名翻倍
- dpo单机多卡 HOT 1
- Reward model prediction problem HOT 1
- 建议 自定义的数据集,或者数据集定义这部分放在训练文件中,避免打包镜像后,训练自定义数据到时候还需要修改公共文件
- 有多机多卡训练llama3-70b的参考程序吗? HOT 1
- Warning: Non finite check and unscale on NPU device! 昇腾卡上训练 HOT 1
- 如果用户的一句message包含多个Function Call name,自带的推理代码是否支持识别 HOT 1
- dpo全参微调后,预测时无法加载权重文件 HOT 3
- 【HELP】如何SFT的时候取消某个模型的默认system prompt,是否有一些命令可以指定 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llama-factory.