Related Issues (20)
- 沒有過擬和的狀況,但是loss到一個點後就會難以下降,並且推理對話會有重覆內容 HOT 1
- 1*8 H20微调qwen2-72b-instruct,保存模型失败 HOT 2
- Low MMLU of llama2 HOT 1
- fsdp+fp16 全参数微调是否支持呢 HOT 1
- 单机多卡微调glm4-9B设置max_grad_norm=1,但是仍然出现了梯度爆炸的问题 HOT 1
- 预训练方式lora微调Qwen2 base模型,是否需要添加template HOT 1
- How to pre-train Llava1.5 from vicuna1.5? HOT 1
- 训练glm4报错:RuntimeError when using flash attention with 8-bit quantization,同样的参数训llama3则没问题 HOT 1
- 请问改工程可以用来glm4的增量预训练吗 HOT 1
- 请问支持 early stopping 吗?
- stop word of template of qwen HOT 1
- qlora微调Qwen2-57B。使用单卡A6000显存占用40G,使用双卡A6000则是两张卡各占40G显存,请问是什么原因?
- Memory Error during tokenization while fine tuning LLava1.5-7B-Chat more than 8000 images HOT 1
- 如何指定已划分好的训练集和验证集? HOT 1
- LoRA微调和全参微调的时候总是会出现过拟合,在无法提高数据集大小的情况下,应该如何解决这个问题呢
- 8*A800 80G lora训练qwen2-72B模型 内存占用异常 HOT 2
- lora微调后的glm4模型不生成回答
- 最新代码中没有llamafactory-cli ,怎么合并权重 HOT 2
- docker容器内没有example和data文件
- 关于基座模型和对话模型的疑问
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llama-factory.