Comments (3)
### model
model_name_or_path: meta-llama/Meta-Llama-3-8B-Instruct
adapter_name_or_path: saves/llama3-8b/lora/sft

### method
stage: sft
do_predict: true
finetuning_type: lora

### dataset
eval_dataset: identity,alpaca_en_demo # change to my own dataset
template: llama3
cutoff_len: 1024
max_samples: 50
overwrite_cache: true
preprocessing_num_workers: 16

### output
output_dir: saves/llama3-8b/lora/predict
overwrite_output_dir: true

### eval
per_device_eval_batch_size: 1
predict_with_generate: true
ddp_timeout: 180000000
from llama-factory.
### model
model_name_or_path: ''

### method
stage: sft
do_predict: true
finetuning_type: full

### dataset
eval_dataset: clinical-trials-v2
template: llama3
cutoff_len: 1024
max_samples: 50
overwrite_cache: true
preprocessing_num_workers: 16

### output
output_dir: ''
overwrite_output_dir: true

### eval
per_device_eval_batch_size: 1
predict_with_generate: true
With this config it still fails with: ValueError: Some keys are not used by the HfArgumentParser: ['ddp_timeout', 'do_eval', 'output_dir', 'overwrite_output_dir', 'per_device_eval_batch_size', 'predict_with_generate']
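For anyone reading the error above: every key it lists (`ddp_timeout`, `do_eval`, `output_dir`, `overwrite_output_dir`, `per_device_eval_batch_size`, `predict_with_generate`) is a field of the `transformers` training-arguments classes. That would be consistent with the config being parsed by a set of argument dataclasses that does not include the training arguments, though the thread does not confirm this. The sketch below is not the library's code. It is a simplified stdlib stand-in for that kind of check, and `ModelArgs`, `DataArgs`, `TrainArgs` and `unused_keys` are hypothetical names made up for illustration.

```python
from dataclasses import dataclass, fields


# Hypothetical, heavily trimmed stand-ins for the real argument dataclasses.
@dataclass
class ModelArgs:
    model_name_or_path: str = ""


@dataclass
class DataArgs:
    eval_dataset: str = ""
    template: str = ""
    cutoff_len: int = 1024


@dataclass
class TrainArgs:
    output_dir: str = ""
    overwrite_output_dir: bool = False
    per_device_eval_batch_size: int = 8
    predict_with_generate: bool = False


def unused_keys(config, dataclass_types):
    """Return the config keys that no dataclass in dataclass_types declares."""
    known = {f.name for t in dataclass_types for f in fields(t)}
    return sorted(k for k in config if k not in known)


# A subset of the keys from the config in this comment.
config = {
    "model_name_or_path": "",
    "eval_dataset": "clinical-trials-v2",
    "template": "llama3",
    "cutoff_len": 1024,
    "output_dir": "",
    "overwrite_output_dir": True,
    "per_device_eval_batch_size": 1,
    "predict_with_generate": True,
}

# Without the training-argument dataclass, the training keys are left over.
print(unused_keys(config, [ModelArgs, DataArgs]))

# With it, every key is consumed and nothing is reported.
print(unused_keys(config, [ModelArgs, DataArgs, TrainArgs]))
```

If this is what is happening, the thing to check is which command or entry point reads the YAML file, since the same file can be accepted by one parser and rejected by another.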
Related Issues (20)
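- Latest LLaMA Factory: error when running inference with vllm HOT 1
- Hello, how do I do continued pre-training with Tongyi Qianwen (Qwen)? HOT 1
- Documentation (Linux): Installation requires nvidia-container-toolkit
- qwen2_vl ChildFailedError HOT 1
- LoRA training of llama3-70B works, but eval and predict after training give the warning: "Invalidate trace cache @ step 16430: expected module 2, but got module 0" HOT 5
- When calling src/train.py, the passed arguments seem to be overridden by the config file; no change affects the runtime training parameters HOT 3
- Loading a model with LLaMA Factory gives different results from loading it with huggingface HOT 1
- Reason for the transformer version upgrade HOT 1
- Qwen-VL SFT training hangs; what is the cause? HOT 5
- llama factory save strategy HOT 4
- qwen2-7b: error when using the vllm API endpoint HOT 1
- Why is LoRA much slower than Freeze? HOT 1
- Blank page after launching the LLaMA-Factory webui
- Qwen2-vl SFT training problem: error AttributeError: 'DeepSpeedZeroOptimizer_Stage3' object has no attribute 'train' HOT 1
- use LLaMA Pro: how to configure the expansion parameters HOT 2
- AttributeError: 'AdamW' object has no attribute 'train' HOT 1
- Hello, does this project support the Iluvatar CoreX Zhikai and Tiangai series GPUs?
- qwen2-vl fine-tuned for object detection: webchat output is empty, but the model outputs normally when loaded with transformers
- Got unknown args, potentially deprecated arguments: ['--pref_loss:', 'simpo']
- PPO training problem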