Comments (1)
Hi,
Qwen1.0 is no longer actively maintained; please consider upgrading to Qwen2.
RuntimeError: Expected attn_mask dtype to be bool or to match query dtype, but got attn_mask.dtype: c10::BFloat16 and query.dtype: c10::Half instead
indicates dtype mismatch. As quantized models should run in fp16 (torch.half or torch.float16), consider loading the model in fp16. For Qwen1.0, you should pass fp16=True
to AutoModelForCausalLM.from_pretrained
. For Qwen1.5/Qwen2, you should pass torch_dtype=torch.float16
to AutoModelForCausalLM.from_pretrained
.
from qwen.
Related Issues (20)
- Qwen pre_trained, 打印一下内容,就没有了,不确定是否训练完成 HOT 2
- [BUG] 转换Qwen1.5-14B报错 HOT 1
- 多轮对话训练数据格式组织 HOT 1
- [BUG] Questionable embedding feature shape extracted from Qwen-7B-Chat HOT 2
- [BUG] <title> 命令行运行参数解析错误
- 工具调用的时候,本来用户没有输入参数,但是模型会自动幻想参数 HOT 2
- [BUG] model的forward函数接收attention_mask的时候,若attention_mask[i, 0]==0,则序列i输出的logits全都是NaN值 HOT 6
- 模型的TEMPLATE是怎么样的 HOT 1
- [BUG] <title>全参数微调qwen-14b-chat时卡住 HOT 1
- 运行web_demo.py程序时问答卡顿 HOT 1
- [BUG] <title>Qwen有支持OpenAI形式的functoncall的计划吗 HOT 1
- 量化细节请教 HOT 1
- [BUG] 这模型似乎很固执或直男癌,prompt里明确了不要怎么怎么样,每次输出还是不按要求去 HOT 1
- [BUG] <昇腾910B 镜像下载失败 docker pull qwenllm/qwen-mindspore:latest>
- [BUG] the max_position_embeddings parameter in the config.json for Qwen2-57B-A14B has been mistakenly set to 131072.
- [BUG] <title>Adding regular tokens is not supported HOT 1
- 如何修改模型的结构 HOT 1
- [BUG] <title> vLLM推理乱码 HOT 2
- Qwen 的开源模型能输出 logprobs吗?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from qwen.