<div class="snippet-clipboard-content notranslate position-relative overflow-auto" data-snippet-clip

I tried convert to fp16 <div class="snippet-clipboard-content notranslate position

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Ok, fix is now live on the <a href="https://github.com/arcee-ai/mergekit/tree/lora-ext

RuntimeError: "svd_cuda_gesvdj" not implemented for 'BFloat16' about mergekit HOT 4 CLOSED

ehartford commented on September 27, 2024

RuntimeError: "svd_cuda_gesvdj" not implemented for 'BFloat16'

from mergekit.

Comments (4)

linux-leo commented on September 27, 2024

I also encountered this issue before, can confirm. Probably works with fp16 though, so a simple conversion built into the script may be enough to get it working.

from mergekit.

ehartford commented on September 27, 2024

I tried convert to fp16

def decompose_delta_weight(
    new_weight: torch.Tensor,
    base_weight: torch.Tensor,
    reduced_rank: int,
    device: Optional[str] = None,
) -> Tuple[torch.Tensor, torch.Tensor]:
    if device is None:
        device = "cuda" if torch.cuda.is_available() else "cpu"

    new_weight = new_weight.half().to(device)
    base_weight = base_weight.half().to(device)

Then I get:

  File "/home/ehartford/mergekit/mergekit/scripts/extract_lora.py", line 36, in _low_rank_decomposition
    U, S, Vh = torch.linalg.svd(weight, full_matrices=False)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
RuntimeError: "svd_cuda_gesvdj" not implemented for 'Half'

from mergekit.

thomasgauthier commented on September 27, 2024

@ehartford thanks for pointing this out. It seems torch.linalg.svd only works with full precision fp32 (so .float() instead of .half() should work). Working on a fix right now. Should be up in a few minutes.

from mergekit.

thomasgauthier commented on September 27, 2024

Ok, fix is now live on the lora-extraction branch

@cg123 can we merge it into main and close this?

from mergekit.

RuntimeError: "svd_cuda_gesvdj" not implemented for 'BFloat16' about mergekit HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent