Comments (3)
> If I add lm_head.linear1 and lm_head.linear2

Even if this works, it will likely be interpreted as just two linear .weight-type projections in series, whereas to use a .bias it needs to do an affine projection. I don't know enough about llama.cpp to help more, but IIRC the Qwen models have some affine projections in them and use .bias as well as .weight, so this might be worth a look.
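To make the linear-vs-affine distinction concrete: two linear projections in series collapse into a single linear map (W2·(W1·x) = (W2·W1)·x), so stacking .weight tensors alone expresses nothing new; a .bias makes the map affine, y = W·x + b. A minimal sketch in ggml terms, where ggml_mul_mat and ggml_add are real ggml graph ops but the helper name and tensor shapes are illustrative, not llama.cpp's actual code:

```cpp
#include "ggml.h"
#include <stddef.h>

// Hypothetical helper (not actual llama.cpp code): builds an lm_head
// projection in a ggml graph, adding the bias only when one exists.
static struct ggml_tensor * build_lm_head(
        struct ggml_context * ctx,
        struct ggml_tensor  * inp,  // hidden states  [n_embd, n_tokens]
        struct ggml_tensor  * w,    // weight         [n_embd, n_vocab]
        struct ggml_tensor  * b) {  // bias [n_vocab], or NULL if absent
    // linear part: logits = W * inp  -> [n_vocab, n_tokens]
    struct ggml_tensor * cur = ggml_mul_mat(ctx, w, inp);
    // affine part: add the bias, broadcast across tokens
    if (b != NULL) {
        cur = ggml_add(ctx, cur, b);
    }
    return cur;
}
```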
> Can you provide some tips on what I need to modify to make this work?

If it's a variation of an existing architecture, you might be able to simply declare the new tensors as optional on model load, then detect their presence when building the compute graph and use them only when they exist. This is roughly how StableLM2 1.6B support was added in #5052; see the sketch below.
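A sketch of that optional-tensor pattern, loosely modeled on the #5052-era code. The identifiers (bq, wq) and the create_tensor signature here are assumptions for illustration; the loader API has changed across llama.cpp versions, so check the current source:

```cpp
// 1) at load time: mark the bias tensor as optional (required = false),
//    so models that do not ship it still load cleanly
layer.bq = ml.create_tensor(ctx, tn(LLM_TENSOR_ATTN_Q, "bias", i), {n_embd}, false);

// 2) in the compute graph: apply the bias only if it was present
struct ggml_tensor * Qcur = ggml_mul_mat(ctx0, layer.wq, cur);
if (layer.bq) {
    Qcur = ggml_add(ctx0, Qcur, layer.bq); // affine: add bias when loaded
}
```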
> Also, if there is any documentation on porting new model architectures, I would appreciate it if you could point me to it.

https://github.com/ggerganov/llama.cpp/blob/master/docs/HOWTO-add-model.md