Comments (6)
@hiyouga sorry,my answer is so late this case, using newest llama_factory code, it work currently right now
from llama-factory.
have seen exist issue written in March,but i cannot get any useful info to find out why this error came,hoping your suggestion
from llama-factory.
please provide your version of accelerate and bitsandbytes
from llama-factory.
@hiyouga accelerate==0.28.0 bitsandbytes==0.43.0 ,Do these versions have any problems?hoping your suggestion
from llama-factory.
did you use the latest code?
from llama-factory.
While I am trying to train https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-70b I am getting the same error "ValueError: Cannot flatten integer dtype tensors".
The error seems to be resolved when I reinstalled LLaMA-Factory again. These are the versions:
accelerate 0.29.3
bitsandbytes 0.43.1
from llama-factory.
Related Issues (20)
- 自定义数据集训练的时候出现 ValueError: Expected input batch_size (103) to match target batch_size (95).
- lora微调后模型表现不佳 数据停止生成问题 HOT 2
- 安装docker出现问题 HOT 3
- 训练自定义数据集出现错误 HOT 2
- 微调后的模型如何多卡推理 HOT 2
- 传lr_scheduler_kwargs参数报错error: argument --lr_scheduler_kwargs: invalid Dict value: "{'num_cycles':6}" HOT 1
- Question: how does template work with dataset in examples: llama3_lora_sft.yaml HOT 2
- 在不启用流式数据读入的情况下数据是否会被shuffle HOT 1
- 请问量化校准数据c4_demo.json的生成有什么要求的呢? HOT 5
- 偏好训练,如何使用ShareGPT格式数据集 HOT 3
- 推理阶段,预测文件中label显示不全问题 HOT 1
- adding language HOT 1
- How to estimate total steps and set proper
- [Feature Request] support for new peft model `pissa` HOT 1
- 在昇腾npu环境下运行报错 HOT 6
- llama3增量预训练冻结哪些层训练哪些层效果比较好? HOT 1
- Question about --dpo_ftx Parameter Setting HOT 1
- 请教一下如何不使用任何模版加载数据? HOT 2
- api部署,method not allowed问题 HOT 1
- Fail to run mixture-of-depths sft official example script HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llama-factory.