Comments (2)
This is an upstream issue with OpenHermes-2.5-neural-chat-v3-3-Slerp. It's a merge of OpenHermes-2.5-Mistral-7B and neural-chat-7b-v3-3.
Weyaxi claims you can use either prompt format, but neither the tokenizer_config.json nor the special_tokens_map.json correctly specifies the special tokens required by OpenHermes-2.5-Mistral-7B for ChatML as can be seen here.
Please test with a different model. You can report the issue with this model here, and use the neural-chat-7b-v3-3 prompt template for now.
edit: looks like this problem was already reported and closed as WONTFIX https://huggingface.co/Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp/discussions/5
So unless you want to modify the tokenizer config and convert the model yourself, you can't use ChatML with it. By inspecting the contents of the GGUF I have confirmed that TheBloke did not do this (there are no im_start/im_end tokens in the vocab), despite him repeating the bad recommendation from Weyaxi to use ChatML.
from gpt4all.
Ok thank you very much. It is a pity, because it is a great model. Looks like it is hard to support, if people try to use ChatML because of a bad recommendation in the model card and then realize it will not work.
from gpt4all.
Related Issues (20)
- [Feature] Add option to load model on start up in server mode
- [Feature] Login page HOT 2
- Remove 'You're AI... I'm human' from the System Prompt.
- Chat UI crashes for long inputs HOT 1
- MacOS v2.7.1 not launching. HOT 10
- Opening the Model/Character settings window should default to the currently selected chat model. HOT 1
- [Feature] Delete sparious context within chat history HOT 1
- [Feature] Add spell checking
- C# NuGet package: Model format not supported (no matching implementation found) HOT 21
- Ceased GPT4ALL during indexing by Local Documents plugin
- Localdocs context is used for first message, but further messages seem to ignore the collection HOT 2
- GPT4All 2.7.1 - searching localdocs even if no category is checked/selected; repeating the reply and the Context fragments, regardless of LLM
- [Feature] GPT4All, all versions supporting LocalDocs: revamped dialog
- [Feature] On the Download dialog: LLMs to be listed Alphabetically by name and grouped by their "licensed for commercial use" status
- Java bindings need to be updated after PR #1970 HOT 7
- [Feature] GPT4All 2.7.x -> : Horizontal scrollbar for the Settings dialog; Vertical scrollbar for the Local Documents dialog
- Clone feature issues (model disappears from drop-down list and CTD) HOT 2
- GPT4All doesn't see my GPU HOT 1
- [Feature] Better LocalDocs support for structured data formats such as XML HOT 5
- [addition for LocalDocs: recent news/events] ZIP file with +300 PDF news articles between 2024.01.01 and 2024.02.28
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gpt4all.