是否已有关于该错误的issue或讨论？ | Is there an existing issue / discussion for this? <ul clas

[BUG] CUDA Error: invalid device function /tmp/pip-req-build-5rlg4jgm/ln_fwd_kernels.cuh 236,about qwenlm/qwen

jklj077 commented on May 27, 2024 1

Unfortunately, flash attention v2 does not support P100 (nor V100). You may need to uninstall the related packages in the image (pip uninstall flash_attn dropout_layer_norm) or build the image from scratch and set environment variable BUNDLE_FLASH_ATTENTION to false.

from qwen.

jklj077 commented on May 27, 2024

If you are using the provided the docker image with tag qwenllm/qwen(:latest), it is based on CUDA 11.7 and bundles the layer_norm module from flash attention v2, where that invalid device function (cudaOccupancyMaxActiveBlocksPerMultiprocessor which is a CUDA runtime API) is called.

It is likely your nvidia driver is too old to support CUDA 11.7 (and later versions). Please run nvidia-smi and provide the result.

from qwen.

taoqinghua commented on May 27, 2024

If you are using the provided the docker image with tag qwenllm/qwen(:latest), it is based on CUDA 11.7 and bundles the layer_norm module from flash attention v2, where that invalid device function (cudaOccupancyMaxActiveBlocksPerMultiprocessor which is a CUDA runtime API) is called.

It is likely your nvidia driver is too old to support CUDA 11.7 (and later versions). Please run nvidia-smi and provide the result.

+---------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| No running processes found |
+---------------------------------------------------------------------------------------+

from qwen.

taoqinghua commented on May 27, 2024

Unfortunately, flash attention v2 does not support P100 (nor V100). You may need to uninstall the related packages in the image (pip uninstall flash_attn dropout_layer_norm) or build the image from scratch and set environment variable BUNDLE_FLASH_ATTENTION to false.

谢谢。

from qwen.

[BUG] CUDA Error: invalid device function /tmp/pip-req-build-5rlg4jgm/ln_fwd_kernels.cuh 236 about qwen HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent