Comments (2)
This issue has been fixed. Thank you for the feedback.
from llama-factory.
Please run git pull to update your code.
Related Issues (20)
- How can I add a tool, similar to the test_toolcall.py file, so that llamafactory can return the current weather when asked?
- DeepSpeed Ulysses: how to modify the deepspeed configuration file
- Cannot access gated repo for url https://huggingface.co/meta-llama/Llama-2-7b-hf/resolve/main/config.json.
- Can the log loss be displayed?
- torch.distributed.DistBackendError: NCCL error in: /opt/conda/conda-bld/pytorch_1704987288773/work/torch/csrc/distributed/c10d/ProcessGroupNCCL.cpp:1691, internal error - please report this issue to the NCCL developers, NCCL version 2.19.3 ncclInternalError: Internal check failed.
- Windows cannot recognize the dataset: datasets.exceptions.DatasetGenerationError: An error occurred while generating the dataset
- Question about the model size being inconsistent after fine-tuning
- Quantized GPTQ model raises an error when called after being deployed as an OpenAI API
- Unable to build successfully
- sft do_predict with the --generation_num_beams 3 argument still generates one result per input rather than three
- ChatGLM3-6B fine-tuning runs out of GPU memory
- qlora merging help needed.
- Can the SAMI fine-tuning method be supported?
- Inference fails to start after full fine-tuning of glm3
- tag:0.6.3: ImportError: gradio>=3.38.0,<4.0.0 is required for a normal functioning of this module, but found gradio==4.27.0.
- Specified only GPUs 2 and 3, but all four GPUs 0, 1, 2, 3 are actually used
- How to add a special token when running sft on llama?
- Langchain doesn't work when running src/api_demo.py with Meta-Llama-3-8B-Instruct, but calling chat.completions.create works fine.
- Full-parameter fine-tuning uses more GPU memory than expected
- Deepspeed is not yet supported