scir-hi / huatuo-llama-med-chinese Goto Github PK

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草（原名：华驼）模型仓库，基于中文医学知识的大语言模型指令微调

License: Apache License 2.0

Python 89.78% Shell 10.22%

llama llm medical nlp aidoctor medgpt medqa chinese bloom huozi

huatuo-llama-med-chinese's Introduction

中文 | English

本草[原名：华驼(HuaTuo)]: 基于中文医学知识的大语言模型指令微调

BenTsao (original name: HuaTuo): Instruction-tuning Large Language Models With Chinese Medical Knowledge

本项目开源了经过中文医学指令精调/指令微调(Instruction-tuning) 的大语言模型集，包括LLaMA、Alpaca-Chinese、Bloom、活字模型等。

我们基于医学知识图谱以及医学文献，结合ChatGPT API构建了中文医学指令微调数据集，并以此对各种基模型进行了指令微调，提高了基模型在医疗领域的问答效果。

News

[2023/09/24]发布《面向智慧医疗的大语言模型微调技术》

[2023/09/12]在arxiv发布《探索大模型从医学文献中交互式知识的获取》

[2023/09/08]在arxiv发布《基于知识微调的大语言模型可靠中文医学回复生成方法》

[2023/08/07] 🔥🔥增加了基于活字进行指令微调的模型发布，模型效果显著提升。🔥🔥

[2023/08/05] 本草模型在CCL 2023 Demo Track进行Poster展示。

[2023/08/03] SCIR实验室开源活字通用问答模型，欢迎大家关注🎉🎉

[2023/07/19] 增加了基于Bloom进行指令微调的模型发布。

[2023/05/12] 模型由"华驼"更名为"本草"。

[2023/04/28] 增加了基于中文Alpaca大模型进行指令微调的模型发布。

[2023/04/24] 增加了基于LLaMA和医学文献进行指令微调的模型发布。

[2023/03/31] 增加了基于LLaMA和医学知识库进行指令微调的模型发布。

A Quick Start

首先安装依赖包，python环境建议3.9+

pip install -r requirements.txt

针对所有基模型，我们采用了半精度基模型LoRA微调的方式进行指令微调训练，以在计算资源与模型性能之间进行权衡。

基模型

活字1.0，哈尔滨工业大学基于Bloom-7B二次开发的中文通用问答模型
Bloom-7B
Alpaca-Chinese-7B，基于LLaMA二次开发的中文问答模型
LLaMA-7B

LoRA模型权重下载

LoRA权重可以通过百度网盘或Hugging Face下载：

🔥对活字进行指令微调的LoRA权重文件

基于医学知识库以及医学问答数据集百度网盘

对Bloom进行指令微调的LoRA权重文件

基于医学知识库以及医学问答数据集百度网盘和Hugging Face

对Alpaca进行指令微调的LoRA权重文件

基于医学知识库百度网盘和Hugging Face
基于医学知识库和医学文献百度网盘和Hugging Face

对LLaMA进行指令微调的LoRA权重文件

基于医学知识库百度网盘和Hugging Face
基于医学文献百度网盘和Hugging Face

下载LoRA权重并解压，解压后的格式如下：

**lora-folder-name**/
  - adapter_config.json   # LoRA权重配置文件
  - adapter_model.bin   # LoRA权重文件

基于相同的数据，我们还训练了医疗版本的ChatGLM模型: ChatGLM-6B-Med

Infer

我们在./data/infer.json中提供了一些测试用例，可以替换成其它的数据集，请注意保持格式一致

运行infer脚本

#基于医学知识库
bash ./scripts/infer.sh

#基于医学文献
#单轮
bash ./scripts/infer-literature-single.sh

#多轮
bash ./scripts/infer-literature-multi.sh

infer.sh脚本代码如下，请将下列代码中基模型base_model、lora权重lora_weights以及测试数据集路径instruct_dir进行替换后运行

python infer.py \
	    --base_model 'BASE_MODEL_PATH' \
	    --lora_weights 'LORA_WEIGHTS_PATH' \
	    --use_lora True \
	    --instruct_dir 'INFER_DATA_PATH' \
	    --prompt_template 'TEMPLATE_PATH'

提示模板的选择与模型相关，详情如下：

活字&Bloom	LLaMA&Alpaca
`templates/bloom_deploy.json`	基于医学知识库`templates/med_template.json` 基于医学文献`templates/literature_template.json`

也可参考./scripts/test.sh

方法

基模型在医学问答场景下效果有限，指令微调是一种高效的使基模型拥有回答人类问题能力的方法。

数据集构建

医学知识库

我们采用了公开和自建的中文医学知识库，主要参考了cMeKG。

医学知识库围绕疾病、药物、检查指标等构建，字段包括并发症，高危因素，组织学检查，临床症状，药物治疗，辅助治疗等。知识库示例如下:


{"中心词": "偏头痛", "相关疾病": ["妊娠合并偏头痛", "恶寒发热"], "相关症状": ["皮肤变硬", "头部及眼后部疼痛并能听到连续不断的隆隆声", "晨起头痛加重"], "所属科室": ["中西医结合科", "内科"], "发病部位": ["头部"]}

我们利用GPT3.5接口围绕医学知识库构建问答数据，设置了多种Prompt形式来充分利用知识。

指令微调的训练集数据示例如下：


"问题：一位年轻男性长期使用可卡因，突然出现胸痛、呕吐、出汗等症状，经检查发现心电图反映心肌急性损伤，请问可能患的是什么疾病？治疗方式是什么？"

回答: 可能患的是心肌梗塞，需要进行维拉帕米、依普利酮、硝酸甘油、ß阻滞剂、吗啡等药物治疗，并进行溶栓治疗、低分子量肝素、钙通道阻滞剂等辅助治疗。此外需要及时停用可卡因等药物，以防止病情加重。"

我们提供了模型的训练数据集，共计八千余条，需要注意的是，虽然训练集的构建融入了知识，但是仍存在错误和不完善的地方，后续我们会利用更好的策略迭代更新数据集。

指令微调数据集质量仍有限，后续将进行不断迭代，同时医学知识库和数据集构建代码还在整理中，整理完成将会发布。

医学文献

此外，我们收集了2023年关于肝癌疾病的中文医学文献，利用GPT3.5接口围绕医学文献的【结论】构建多轮问答数据。在·./data_literature/liver_cancer.json中我们提供了其中的1k条训练样例。目前，训练样本的质量仍然有限，在后续我们会进一步迭代数据，会以公开数据集的形式对外进行发布。训练样本的示例如下：

目前，我们只开放针对"肝癌"单个疾病训练的模型参数。在未来，我们计划发布融入文献结论的医学对话数据集，并且会针对“肝胆胰”相关16种疾病训练模型。

相关细节可参考我们的文章：《探索大模型从医学文献中交互式知识的获取》

Finetune

如果想用自己的数据集微调大语言模型，请按照./data/llama_data.json中的格式构建自己的数据集

运行finetune脚本


bash ./scripts/finetune.sh

训练细节

计算资源需求参考

基于LLaMA模型的指令微调过程中，我们在一张A100-SXM-80GB显卡上进行了训练，训练总轮次10轮，耗时约2h17m。batch_size=128的情况下显存占用在40G左右。预计3090/4090显卡(24GB显存)以上显卡可以较好支持，根据显存大小来调整batch_size。

实验过程

超参设置和实验过程可以参考wandb的链接:

https://wandb.ai/thinksoso/llama_med/runs/a5wgcnzt/overview?workspace=user-thinksoso

模型效果对比

注：测试于2023年3月

测试输入	Llama输出	Alpaca输出	BenTsao本草输出
小张最近感觉身体不适，出现心悸、气促等症状。体检发现心脏扩大、搏动减弱。	问题：小张最近感觉身体不适，出现心悸、气促等症状。体检发现心脏扩大、搏动减弱。	最近，小张一直感觉身体不舒服，有心烦意乱、呼吸急促等症状。体格检查显示心脏增大，肌肉无力。	小张可能患有心肌炎，建议尽快到医院进行检查和治疗。治疗方案包括药物治疗和手术治疗。
麻风病和儿童哮喘的病因是否一致？	儿童哮喘和麻风病的病因是一致，但是儿童哮喘和麻风病的病因不一致	是的，哮喘和儿童喘息有相似的原因。	不一致，麻风病的病因是麻风杆菌感染，而儿童哮喘的病因是喂养不良、吸烟等因素。
一位患者出现了持续性的胀痛和黄疸，经影像学检查后确诊为肝胆管结石病，应该采取哪些治疗方案？	肝胆管结石病患者应该采取哪些治疗方案？	患者应接受手术和化疗相结合的治疗。	针对肝胆管结石病，治疗方案包括手术治疗、药物治疗和辅助治疗等。具体治疗方案需要根据患者的具体情况而定。

常见问题

Q: 为什么更名为"本草"？

A: 为SCIR实验室大语言模型命名一致性，中文医学大模型名称调整为"本草"。
Q: 为什么是"华驼"而不是"华佗"？

A: 叫”驼“是因为我们的基模型LLaMA是美洲驼，Alpaca是羊驼，受他们名字的启发以及华佗的谐音梗，我们将我们的模型起名为华驼。
Q: 有使用中医理论或者中医数据吗？

A: 目前还没有
Q: 模型运行的结果不同、效果有限

A: 由于生成模型生成多样性的考量，多次运行的结果可能会有差异。当前开源的模型由于LLaMA及Alpaca中文语料有限，且知识结合的方式较为粗糙，请大家尝试bloom-based和活字-based的模型。
Q: 模型无法运行/推理内容完全无法接受

A: 请确定已安装requirements中的依赖、配置好cuda环境并添加环境变量、正确输入下载好的模型以及lora的存储位置；推理内容如存在重复生成或部分错误内容属于llama-based模型的偶发现象，与llama模型的中文能力、训练数据规模以及超参设置均有一定的关系，请尝试基于活字的新模型。如存在严重问题，请将运行的文件名、模型名、lora等配置信息详细描述在issue中，谢谢大家。
Q: 发布的若干模型哪个最好？

A: 根据我们的经验，基于活字模型的效果相对更好一些。

项目参与者

本项目由哈尔滨工业大学社会计算与信息检索研究中心健康智能组王昊淳、杜晏睿、刘驰、白睿、席奴瓦、陈雨晗、强泽文、陈健宇、李子健完成，指导教师为赵森栋副教授，秦兵教授以及刘挺教授。

致谢

本项目参考了以下开源项目，在此对相关项目和研究开发人员表示感谢。

活字: https://github.com/HIT-SCIR/huozi
Facebook LLaMA: https://github.com/facebookresearch/llama
Stanford Alpaca: https://github.com/tatsu-lab/stanford_alpaca
alpaca-lora by @tloen: https://github.com/tloen/alpaca-lora
CMeKG https://github.com/king-yyf/CMeKG_tools
文心一言 https://yiyan.baidu.com/welcome 本项目的logo由文心一言自动生成

免责声明

本项目相关资源仅供学术研究之用，严禁用于商业用途。使用涉及第三方代码的部分时，请严格遵循相应的开源协议。模型生成的内容受模型计算、随机性和量化精度损失等因素影响，本项目无法对其准确性作出保证。本项目数据集绝大部分由模型生成，即使符合某些医学事实，也不能被用作实际医学诊断的依据。对于模型输出的任何内容，本项目不承担任何法律责任，亦不对因使用相关资源和输出结果而可能产生的任何损失承担责任。

Citation

如果您使用了本项目的数据或者代码，或是我们的工作对您有所帮助，请声明引用

首版技术报告: Huatuo: Tuning llama model with chinese medical knowledge

@misc{wang2023huatuo,
      title={HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge},
      author={Haochun Wang and Chi Liu and Nuwa Xi and Zewen Qiang and Sendong Zhao and Bing Qin and Ting Liu},
      year={2023},
      eprint={2304.06975},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

知识微调：Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese

@misc{wang2023knowledgetuning,
      title={Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese}, 
      author={Haochun Wang and Sendong Zhao and Zewen Qiang and Zijian Li and Nuwa Xi and Yanrui Du and MuZhen Cai and Haoqiang Guo and Yuhan Chen and Haoming Xu and Bing Qin and Ting Liu},
      year={2023},
      eprint={2309.04175},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

医学文献知识获取：The CALLA Dataset: Probing LLMs’ Interactive Knowledge Acquisition from Chinese Medical Literature

@misc{du2023calla,
      title={The CALLA Dataset: Probing LLMs' Interactive Knowledge Acquisition from Chinese Medical Literature}, 
      author={Yanrui Du and Sendong Zhao and Muzhen Cai and Jianyu Chen and Haochun Wang and Yuhan Chen and Haoqiang Guo and Bing Qin},
      year={2023},
      eprint={2309.04198},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

huatuo-llama-med-chinese's People

Contributors

Stargazers

Watchers

Forkers

chuxiuhong jackrain roclee81 ruibai1999 aryansoni8090 jangocheng anyz01 3dsworks2022 greay83 wishgale duzhanyuan zhuhm1996 cl475143764 mengqi97 huangzhimin4read alanmoleapfive bingtian88 ai-ld liu-angelo eltociear cgwgpt 8333064 xuyongfu alexlan123 todotobe1 simon1239 henryhesz goubabiejiao jianyuchen01 wieqli scutcyr knowledgefold hehongyuanlove dst1213 shellingford221 waitalone finace zeta1999 flowolfzzz sfidea anigi98932 xjspace yaohong9257 denglizong zbx911 net-wang virusyou shangzchao davidsolomon21cn tky2022 miblue119 forexblog zhwbqd mozuck hiwong roflyx camark allenzhipu iakirago lihuyu1231 geekcheng aboutsome zhoudai zero506 sky861421718163 shonyu jianwei rexsu sdtm1016 wangkangdegithub bigfootcn swjsky itsharex f901107 lyhiving chenyunzheng zpdsherlock lawrencesun songfang huggingtech hqman lcx0cd maxiaoxifeng askuy yuanhuanglin crackercat stanvl petercao melandz jiang2050 rickysyr jayting511 wyh122 githubyx1 techthiyanes sinboyxx asdlei99 wysstartgo yuanmouren1hao kunlun-zhu

huatuo-llama-med-chinese's Issues

询问一下，运行infer.py，报错TypeError: 'NoneType' object is not subscriptable

def main(
load_8bit: bool = True,
base_model: str = "decapoda-research/llama-7b-hf",
# the infer data, if not exists, infer the default instructions in code
instruct_dir: str = "./data/infer.json",
use_lora: bool = True,
lora_weights: str = "tloen/alpaca-lora-7b",
# The prompt template to use, will default to med_template.
prompt_template: str = "med_template",
):我的配置参数是这样的，其他的都和初始的一样，但我运行之后报错
│ │
│ /usr/local/lib/python3.9/dist-packages/bitsandbytes/autograd/_functions.py:3 │
│ 80 in forward │
│ │
│ 377 │ │ │ if state.CxB is not None: │
│ 378 │ │ │ │ outliers = F.extract_outliers(state.CxB, state.SB, sta │
│ 379 │ │ │ else: │
│ ❱ 380 │ │ │ │ outliers = state.CB[:, state.idx.long()].clone() │
│ 381 │ │ │ │
│ 382 │ │ │ state.subB = (outliers * state.SCB.view(-1, 1) / 127.0).t( │
│ 383 │ │ │ CA[:, state.idx.long()] = 0 │
╰──────────────────────────────────────────────────────────────────────────────╯
TypeError: 'NoneType' object is not subscriptable

Process finished with exit code 1
我不知道为什么会这样，有人可以给我指导一下吗

运行bash ./scripts/infer.sh 报错

异常信息：
ValueError: The current device_map had weights offloaded to the disk. Please provide an offload_folder for them. Alternatively, make sure you have safetensors installed if the model you are using offers the weights in this format.

与chatGLM-6B比较

https://chat.lmsys.org/ 注意选择 chatGLM-6B
把主页的几个测试问题输入，好像没有微调过的 chatGLM-6B 的回答更好 .....

下载后的材料要怎么操作

我是编程小白，百度网盘里的资料都下载了，需要放到哪个文件夹里？下一步怎么操作？求解！

为什么使用提供的alpaca 的lora参数推理出来的结果完全不对，胡言乱语

训练数据集的构建

请问你们是如何利用GPT3.5的api加本地已有的数据，来构建的训练数据集呢？

基于医学知识库运行infer.sh报错"addmm_impl_cpu_" not implemented for 'Half'

fire.Fire(main)这一步报错
"addmm_impl_cpu_" not implemented for 'Half'
我尝试debug了一下没找到问题

项目提供的lora没有达到提的效果呀，就给的那几个infer的例子

已解决

😀已解决，谢谢

运行报错：json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)

/home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/bitsandbytes/cuda_setup/main.py:145: UserWarning: Found duplicate ['libcudart.so', 'libcudart.so.11.0', 'libcudart.so.12.0'] files: {PosixPath('/usr/local/cuda/lib64/libcudart.so.11.0'), PosixPath('/usr/local/cuda/lib64/libcudart.so')}.. We'll flip a coin and try one of these, in order to fail forward.
Either way, this might cause trouble in the future:
If you get CUDA error: invalid device function errors, the above might be the cause and the solution is to make sure only one ['libcudart.so', 'libcudart.so.11.0', 'libcudart.so.12.0'] in the paths that we search based on your env.
warn(msg)
CUDA SETUP: CUDA runtime path found: /usr/local/cuda/lib64/libcudart.so.11.0
CUDA SETUP: Highest compute capability among GPUs detected: 7.5
CUDA SETUP: Detected CUDA version 114
CUDA SETUP: Loading binary /home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/bitsandbytes/libbitsandbytes_cuda114.so...
The tokenizer class you load from this checkpoint is not the same type as the class this function is called from. It may result in unexpected tokenization.
The tokenizer class you load from this checkpoint is 'LLaMATokenizer'.
The class this function is called from is 'LlamaTokenizer'.
Loading checkpoint shards: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████| 33/33 [00:14<00:00, 2.29it/s]
using lora ./lora-llama-med
Traceback (most recent call last):
File "/mnt/datadisk0/code/Huatuo-Llama-Med-Chinese/infer.py", line 125, in
fire.Fire(main)
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/fire/core.py", line 141, in Fire
component_trace = _Fire(component, args, parsed_flag_args, context, name)
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/fire/core.py", line 475, in _Fire
component, remaining_args = _CallAndUpdateTrace(
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/fire/core.py", line 691, in _CallAndUpdateTrace
component = fn(*varargs, **kwargs)
File "/mnt/datadisk0/code/Huatuo-Llama-Med-Chinese/infer.py", line 47, in main
model = PeftModel.from_pretrained(
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/peft/peft_model.py", line 156, in from_pretrained
PeftConfig.from_pretrained(model_id, subfolder=kwargs.get("subfolder", None)).peft_type
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/peft/utils/config.py", line 109, in from_pretrained
loaded_attributes = cls.from_json_file(config_file)
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/site-packages/peft/utils/config.py", line 129, in from_json_file
json_object = json.load(file)
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/json/init.py", line 293, in load
return loads(fp.read(),
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/json/init.py", line 346, in loads
return _default_decoder.decode(s)
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/json/decoder.py", line 337, in decode
obj, end = self.raw_decode(s, idx=_w(s, 0).end())
File "/home/halo_op/anaconda3/envs/py39/lib/python3.9/json/decoder.py", line 355, in raw_decode
raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)

与技术无关，单纯的好奇，为什么是“驼“而不是“佗”？

冒昧问一下。

请问是否有做过微调后与Med-ChatGLM的对比

请问是否有做过微调后与Med-ChatGLM的对比，哪款的能力更强些？

我觉得“华驼”这个名字更好

是否有用到真实医疗场景数据？

Hello！
想问下是否有用到其他数据集（除了instruct生成的数据外）？

回答效果不太专业，三轮会话后显存超32G

无法加载使用alpaca模型

我使用：https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/%E4%BD%BF%E7%94%A8text-generation-webui%E6%90%AD%E5%BB%BA%E7%95%8C%E9%9D%A2 这个部署了alpaca的微调模型，但是输出都是一些乱七八糟的字符。请问alpaca模型如何使用？

有一个疑问,ChatGLM-Med为什么没有像华佗一样lora微调呢？

是效果不好还是什么原因？

LoRA百度网盘访问密码错误

能否将LoRA都传一份到Huggingface上，百度网盘出现奇妙bug

请问如何基于中文Alpaca大模型进行指令微调？

我观察到作者在README 中提到了目前已支持中文Alpaca大模型。但其也是一个 lora 权重，如何在模型里加载两份 lora 权重呢？

请问作者有没有做 ChatGLM 和 LLaMA 的评测

我关注到HIT 同时提出了基于 ChatGLM 和 LLaMA 的两份医疗大模型，但在论文里并没有发现二者的评测。请教这两个模型哪一个表现更好呢？

关于SUS的计算

你好，很有意思的工作，工作中提出的SUS评判标准具体是怎么计算得到的呢

如何加入自己的中医医疗数据进行训练

看说明，现在的中医知识库好像都通过ChatGPT-3做了格式转换，有规划其他方式添加训练数据吗？
谢谢

如何复现基于Chinese-alpaca-7b的医学知识和医学文献对话模型?

在本地测了Chinese-alpaca-7b + lora-alpaca-med-alldata，在多轮对话上，效果很好。

请问：

如何利用llama.json以及liver_cancer.json，如何合并数据集（一个多轮对话，一个单轮对话），直接合并放在一个文件么？
复现您们的训练效果，该使用哪个模板？('med_template' or 'literature_template.json')
目前我有 8 X A100(40G)， micro_batch_size和batch_size该如何设置（都是64么？为您们设置的一半？）
关于LoRA的rank和alpha值？听说在垂直领域微调时，加大LoRA的rank值效果会好些，想问下您们关于LoRA的rank值以及alpha值的选取的相关经验

感谢您们百忙之中抽空解答，祝项目越来越好，感谢！

Huatuo-Llama-Med-Chinese和Med-ChatGLM这两个项目在医疗问答的表现哪个好

如题。

训练后的数据不理想

这是我改过的infer-literature-single.sh脚本内容

这是我调用bash ./scripts/finetune.sh执行的finetune脚本

这是结果

请问是否以经典中医医书为主？

包括但不限于易经，针灸大成，神农本草经，黄帝内经，黄帝外经，伤寒论，金匮要略？是否以经方为主，是否训练了名医书籍和医案？

4 张 3090 上运行 finetune，运行结束报错 UnboundLocalError: local variable 'load_result' referenced before assignment

期望复现 llama lora 使用文中提到的语料库训练；
只修改 finetune.sh 中对应的 base model 路径，其他都未做修改；
运行完之后，命令行报错：

he intermediate checkpoints of PEFT may not be saved correctly, using `TrainerCallback` to save adapter_model.bin in corresponding folders, here are some examples https://github.com/huggingface/peft/issues/96
Traceback (most recent call last):
  File "/home/m1l03053/llama/Huatuo-Llama-Med-Chinese/finetune.py", line 280, in <module>
    fire.Fire(train)
  File "/home/m1l03053/.conda/envs/3.9/lib/python3.9/site-packages/fire/core.py", line 141, in Fire
    component_trace = _Fire(component, args, parsed_flag_args, context, name)
  File "/home/m1l03053/.conda/envs/3.9/lib/python3.9/site-packages/fire/core.py", line 475, in _Fire
    component, remaining_args = _CallAndUpdateTrace(
  File "/home/m1l03053/.conda/envs/3.9/lib/python3.9/site-packages/fire/core.py", line 691, in _CallAndUpdateTrace
    component = fn(*varargs, **kwargs)
  File "/home/m1l03053/llama/Huatuo-Llama-Med-Chinese/finetune.py", line 270, in train
    trainer.train(resume_from_checkpoint=resume_from_checkpoint)
  File "/home/m1l03053/.conda/envs/3.9/lib/python3.9/site-packages/transformers/trainer.py", line 1696, in train
    return inner_training_loop(
  File "/home/m1l03053/.conda/envs/3.9/lib/python3.9/site-packages/transformers/trainer.py", line 2095, in _inner_training_loop
    self._load_best_model()
  File "/home/m1l03053/.conda/envs/3.9/lib/python3.9/site-packages/transformers/trainer.py", line 2292, in _load_best_model
    self._issue_warnings_after_load(load_result)
UnboundLocalError: local variable 'load_result' referenced before assignment
wandb: Waiting for W&B process to finish... (failed 1). Press Control-C to abort syncing.
wandb: 
wandb: Run history:
wandb:               eval/loss █▅▄▃▃▂▂▂▂▂▁▁▁▁▁▁▁▁▁▁
wandb:            eval/runtime ▁▃▄▆▇▅▂▆▅██▃▅█▆█▆█▅▃
wandb: eval/samples_per_second █▆▅▃▂▄▇▃▄▁▁▆▃▁▃▁▃▁▄▆
wandb:   eval/steps_per_second █▆▅▃▂▄▇▃▄▁▁▆▄▁▂▁▃▁▄▆
wandb:             train/epoch ▁▁▁▂▂▂▂▂▂▃▃▃▃▃▃▄▄▄▄▄▅▅▅▅▅▆▆▆▆▆▆▇▇▇▇▇▇███
wandb:       train/global_step

数据集必须是instruct的形式吗？

肝癌疾病的中文医学文献，这个数据后面转换成了instruct的形式。想请问下，有没有可能不做转换？

CUDA Setup failed despite GPU being available.

===================================BUG REPORT===================================
Welcome to bitsandbytes. For bug reports, please run

python -m bitsandbytes

and submit this information together with your error trace to: https://github.com/TimDettmers/bitsandbytes/issues

bin C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\libbitsandbytes_cpu.so
CUDA_SETUP: WARNING! libcudart.so not found in any environmental path. Searching in backup paths...
C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: The following directories listed in your path were found to be non-existent: {WindowsPath('/usr/local/cuda/lib64')}
warn(msg)
CUDA SETUP: WARNING! libcuda.so not found! Do you have a CUDA driver installed? If you are on a cluster, make sure you are on a CUDA machine!
C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: No libcudart.so found! Install CUDA or the cudatoolkit package (anaconda)!
warn(msg)
C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: No GPU detected! Check your CUDA paths. Proceeding to load CPU-only library...
warn(msg)
CUDA SETUP: Loading binary C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\libbitsandbytes_cpu.so...
argument of type 'WindowsPath' is not iterable
CUDA SETUP: Problem: The main issue seems to be that the main CUDA library was not detected.
CUDA SETUP: Solution 1): Your paths are probably not up-to-date. You can update them via: sudo ldconfig.
CUDA SETUP: Solution 2): If you do not have sudo rights, you can do the following:
CUDA SETUP: Solution 2a): Find the cuda library via: find / -name libcuda.so 2>/dev/null
CUDA SETUP: Solution 2b): Once the library is found add it to the LD_LIBRARY_PATH: export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:FOUND_PATH_FROM_2a
CUDA SETUP: Solution 2c): For a permanent solution add the export from 2b into your .bashrc file, located at ~/.bashrc
Traceback (most recent call last):
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\infer.py", line 8, in
from peft import PeftModel
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\peft_init_.py", line 22, in
from .mapping import MODEL_TYPE_TO_PEFT_MODEL_MAPPING, PEFT_TYPE_TO_CONFIG_MAPPING, get_peft_config, get_peft_model
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\peft\mapping.py", line 16, in
from .peft_model import (
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\peft\peft_model.py", line 31, in
from .tuners import AdaLoraModel, LoraModel, PrefixEncoder, PromptEmbedding, PromptEncoder
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\peft\tuners_init_.py", line 20, in
from .lora import LoraConfig, LoraModel
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\peft\tuners\lora.py", line 40, in
import bitsandbytes as bnb
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes_init_.py", line 6, in
from . import cuda_setup, utils, research
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\research_init_.py", line 1, in
from . import nn
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\research\nn_init_.py", line 1, in
from .modules import LinearFP8Mixed, LinearFP8Global
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\research\nn\modules.py", line 8, in
from bitsandbytes.optim import GlobalOptimManager
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\optim_init_.py", line 6, in
from bitsandbytes.cextension import COMPILED_WITH_CUDA
File "C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main\venv\lib\site-packages\bitsandbytes\cextension.py", line 20, in
raise RuntimeError('''
RuntimeError:
CUDA Setup failed despite GPU being available. Please run the following command to get more information:

    python -m bitsandbytes

    Inspect the output of the command and see if you can locate CUDA libraries. You might need to add them
    to your LD_LIBRARY_PATH. If you suspect a bug, please take the information from python -m bitsandbytes
    and open an issue at: https://github.com/TimDettmers/bitsandbytes/issues

(venv) PS C:\Users\86152\PycharmProjects\Huatuo-Llama-Med-Chinese-main>
出现这种bug，不知道是什么引起的，应该怎么解决

chinese-alpaca使用的模板与华驼不一致

chinese-alpaca使用原版stanford-alpaca不带input的模板，是英文的，华驼使用中文的模板在chinese-alpaca的基础上继续指令精调，这样的话，他们模板不一致，会不会产生很大的gap呢？

是不是可以用Text generation web UI来部署

如果可以的话，基础模型是Alpaca-7B 还是Chinese-Alpaca-7B
Lora用华驼这个项目的模型？

运行finetune.py 之后，在infer.py阶段有错误

请问下图中的两个文件是怎么生成的呢，是直接运行finetune.py得到的吗？

感谢回答。

申请入群，具体操作不会

百度网盘密码错了啊

有没有先进行预训练让llama拥有中文能力

整个模型是直接在llama上面指令微调得来的吗？是不是先用中文语料让llama学习中文之后再指令微调会更好呢？

“华佗" or “华驼” ？

为啥是“华驼” ？哈哈

关于训练数据的生成

您好，想请教以下问题：
1、知识库中的单条数据用于生成几条训练数据？训练数据生成后采用了什么筛选策略呢？如何判断知识库中的信息被正确利用了？
2、med_template.json里的prompt仅用于推理阶段吗？
不胜感激！

申请入群，运行infer文件报错，想请教一下如何解决！

当我运行infer.py文件时，报错CUDA Setup failed despite GPU being available.但是，torch.cuda.is_available=true，试了很多方法都没能解决这个问题。
When I ran the infer.py file, I reported an error CUDA Setup failed despite GPU being available. However, torch. cuda. is_ Available=true, I have tried many methods but have not been able to solve this problem.
如有小伙伴知晓如何解决，请加我wechat：15665877987，不胜感激！

有兴趣做一个更加实用的健康模型吗？

您好，看到您的成果很惊喜，请问有兴趣一起做一个更加实用的健康模型吗？

代码的问题

在多轮对话infei时会出现大量的#号，这是为什么

: 小张最近感觉身体不适，出现心悸、气促等症状。体检发现心脏扩大、搏动减弱。 : 我建议您及家人及您的医生进行进一步的检查，以确定小张的病因。同时，我建让您及家人做好的饮食和生活习惯，以避免其他的健康问题。###########################################################################################################################: 小王被确诊为肝炎双重感染，最可能的并发症是什么？ : 根据相关文献，肝炎双重感染的危险因素主要包括肝炎、营养不良、免疫抑制剂、病毒敏感免疫等。因此，我建让您及家人及您的医生进行进一步的检查，以确定小张的病因。

模型文件应该放在什么位置

请问，LoRA权重下载了之后，应当放在哪个路径下？

lora alpaca med

使用lora-alpaca-med进行推理的时候，prompt_template应该选择什么呢, med-template.json还是alpaca_short.json？

运行infer.sh文件出现错误

运行infer.sh报错：没有找到adapter_config.json文件。
保存的权重文件如下：

请问是哪里出错导致没有adapter_config.json文件？

申请入群

请问如何加群，群显示超过200人需要邀请，下面的个人码显示被加太多，被腾讯风控了

请问如何利用GPT3.5生成医学文献多轮问答数据？

看到你们的项目中利用GPT3.5接口围绕医学文献多轮问答数据，我觉得这个idea很好。但是从示意图当中还是没太看明白具体是怎样生成多轮问答数据的。能再给出具体的解释吗？多谢！

eval_loss为nan是什么原因

如上图
训练过程中eval_loss为nan，参数我只修改了 batch，别的都保持一致
请问这是什么原因？

你好，感谢开源！希望能开源怎么利用chatgpt 接口进行的多轮对话数据生成！谢谢

请问这种输出乱码一般是什么原因呢？

[BUG]运行infer.py cuda设置失败报错

请问一下，测试用的环境是什么啊。我是用的python3.10.11，安装完环境，cuda（版本为11.7）后运行infer.py
python infer.py --base_model 'decapoda-research/llama-7b-hf' --lora_weights './lora-llama-med' --use_lora True --instruct_dir './data/infer.json' --prompt_template 'med_template'会报如下错误：
Welcome to bitsandbytes. For bug reports, please run

python -m bitsandbytes

and submit this information together with your error trace to: https://github.com/TimDettmers/bitsandbytes/issues

bin C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\libbitsandbytes_cpu.so
C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: The following directories listed in your path were found to be non-existent: {WindowsPath('/Users/cqy/.conda/envs/huatuo_cuda117/lib'), WindowsPath('C')}
warn(msg)
C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: C:\Users\cqy.conda\envs\huatuo_cuda117 did not contain ['libcudart.so', 'libcudart.so.11.0', 'libcudart.so.12.0'] as expected! Searching further paths...
warn(msg)
C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: The following directories listed in your path were found to be non-existent: {WindowsPath('/127.0.0.1'), WindowsPath('7890'), WindowsPath('http')}
warn(msg)
CUDA_SETUP: WARNING! libcudart.so not found in any environmental path. Searching in backup paths...
C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: The following directories listed in your path were found to be non-existent: {WindowsPath('/usr/local/cuda/lib64')}
warn(msg)
CUDA SETUP: WARNING! libcuda.so not found! Do you have a CUDA driver installed? If you are on a cluster, make sure you are on a CUDA machine!
C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: No libcudart.so found! Install CUDA or the cudatoolkit package (anaconda)!
warn(msg)
C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\cuda_setup\main.py:145: UserWarning: WARNING: No GPU detected! Check your CUDA paths. Proceeding to load CPU-only library...
warn(msg)
CUDA SETUP: Loading binary C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\libbitsandbytes_cpu.so...
argument of type 'WindowsPath' is not iterable
CUDA SETUP: Problem: The main issue seems to be that the main CUDA library was not detected.
CUDA SETUP: Solution 1): Your paths are probably not up-to-date. You can update them via: sudo ldconfig.
CUDA SETUP: Solution 2): If you do not have sudo rights, you can do the following:
CUDA SETUP: Solution 2a): Find the cuda library via: find / -name libcuda.so 2>/dev/null
CUDA SETUP: Solution 2b): Once the library is found add it to the LD_LIBRARY_PATH: export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:FOUND_PATH_FROM_2a
CUDA SETUP: Solution 2c): For a permanent solution add the export from 2b into your .bashrc file, located at ~/.bashrc
Traceback (most recent call last):
File "E:\cqy-gpt\gpt-fine-tune\Huatuo-Llama-Med-Chinese\infer.py", line 8, in
from peft import PeftModel
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\peft_init_.py", line 22, in
from .mapping import MODEL_TYPE_TO_PEFT_MODEL_MAPPING, PEFT_TYPE_TO_CONFIG_MAPPING, get_peft_config, get_peft_model
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\peft\mapping.py", line 16, in
from .peft_model import (
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\peft\peft_model.py", line 31, in
from .tuners import (
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\peft\tuners_init_.py", line 21, in
from .lora import LoraConfig, LoraModel
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\peft\tuners\lora.py", line 40, in
import bitsandbytes as bnb
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes_init_.py", line 6, in
from . import cuda_setup, utils, research
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\research_init_.py", line 1, in
from . import nn
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\research\nn_init_.py", line 1, in
from .modules import LinearFP8Mixed, LinearFP8Global
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\research\nn\modules.py", line 8, in
from bitsandbytes.optim import GlobalOptimManager
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\optim_init_.py", line 6, in
from bitsandbytes.cextension import COMPILED_WITH_CUDA
File "C:\Users\cqy.conda\envs\huatuo_cuda117\lib\site-packages\bitsandbytes\cextension.py", line 20, in
raise RuntimeError('''
RuntimeError:
CUDA Setup failed despite GPU being available. Please run the following command to get more information:

    python -m bitsandbytes

    Inspect the output of the command and see if you can locate CUDA libraries. You might need to add them
    to your LD_LIBRARY_PATH. If you suspect a bug, please take the information from python -m bitsandbytes
    and open an issue at: https://github.com/TimDettmers/bitsandbytes/issues

请问这个问题要如何解决？感谢

About Huatuo

Dear contributors,

Thanks for your insightful work and open-source project. I am happy to share our project (https://github.com/FreedomIntelligence/LLMZoo) with you, which includes a tailored version of LLM in biomedince, and coincidently has the same name "Huatuo". The only difference is that we used different characters (驼or佗). Interestingly, we submitted a paper called "Huatuo" for ACL 2023 last December and meanwhile we have a website called https://www.huatuogpt.cn/ . Plus, we have registered related trademarks in Feb. 2023. See an earlier news in http://www.sribd.cn/article/722 .

We do not intend to offend you but share some similar work with you. Hopefully, some win-win discussions could happend. If you not mind, maybe both names (华驼or华佗) could co-exist -- it might need some further discussions.

Thanks for your project and we highly value the contributions in NLP from SCIR lab. Feel free to write to me via email or something else.

Best regards,
Benyou Wang
[email protected]
HuatuoGPT team.

scir-hi / huatuo-llama-med-chinese Goto Github PK

huatuo-llama-med-chinese's Introduction

本草[原名：华驼(HuaTuo)]: 基于中文医学知识的大语言模型指令微调

BenTsao (original name: HuaTuo): Instruction-tuning Large Language Models With Chinese Medical Knowledge

News

A Quick Start

基模型

LoRA模型权重下载

Infer

方法

数据集构建

医学知识库

医学文献

Finetune

训练细节

计算资源需求参考

实验过程

模型效果对比

常见问题

项目参与者

致谢

免责声明

Citation

huatuo-llama-med-chinese's People

Contributors

Stargazers

Watchers

Forkers

huatuo-llama-med-chinese's Issues

and submit this information together with your error trace to: https://github.com/TimDettmers/bitsandbytes/issues

and submit this information together with your error trace to: https://github.com/TimDettmers/bitsandbytes/issues

Recommend Projects

Recommend Topics

Recommend Org