Comments (5)
I tested out newer Llama 3s made with the latest llama.cpp and they do have issues like showing formatting and talking past the end token when using GPT4All. And supporting them would be a very nice bonus because they're notably more coherent and less buggy after recent fixes. For example, they can solve 3333+777, rather than respond with 33 + 77 = 101.
This is the answer GPT4All v2.7.4 with the including L38 Instruct Q4_0 gives.
"Let me calculate the sum for you...
33 + 33 = 66
66 + 77 = 143
So, the answer is: 143. Is there anything else I can help you with?"
from gpt4all.
these issues have been fixed in llama.cpp but the lama.cpp fork of gpt4all has not been updated so far. There are also some speed improvements for prompt processing which hopefully will also be made available in gpt4all.
from gpt4all.
@Phil209 about formatting issues, have you encountered the following problem:
ERROR: byte not found in vocab: '
'
from gpt4all.
@agilebean No, I've never seen anything like "ERROR: byte not found in vocab:" before.
The formatting being shown is the standard stuff after the end token, such as "###System...", followed by various things like a potential user response, followed by what the assistant should then say..., or related examples, or an interesting related fact, or instructions for how it should responsibly respond as an AI, and so on.
from gpt4all.
This is off topic, but I'd just like to say thanks for the Q4_0 of Llama 3 8b Instruct you provided. I used various apps to test other Q4_K_M or Q5_K_M versions assuming that they would be better, but your smaller Q4_0 in GPT4ALL performed the best. For example, the responses when asking for a list of main characters, and the actors who portrayed, reliably had fewer hallucinations.
from gpt4all.
Related Issues (20)
- v2.8.0 crashes and disappears when using CUDA (incompatible PTX) HOT 14
- Certain models with "code" in their name crash GPT4All 2.8.0 HOT 6
- Some questions about java calling gpt4all HOT 2
- python binding does not use explicitly requested NVIDIA GPU
- Cannot move window to see which models to choose to install HOT 3
- Update 2.8.0 error HOT 3
- Crash on long prompts (CPU) v2.8.0 HOT 2
- [Feature] Add support for local Nomic Text Embed models compatible models for local docs HOT 1
- Gibberish Response when using GPU (Quadro K6000) HOT 1
- How to mimic the GPT4ALL GUI output using Python library gpt4all() and embed4all() HOT 2
- Gpt4all shows only a gray screen HOT 1
- [Feature] Use CUDA device as default HOT 1
- [Feature] Button to save chat individually manually
- Download of models stuck HOT 1
- Unable to set the max context limit
- dolphin-2.9.2-qwen2-72b-gguf, Cannot run, cannot use, error HOT 1
- Screen Turns Black Intermittently When Opening GPT4all on Desktop HOT 1
- [Feature] flathub verified HOT 1
- Crashes when indexing RAG
- GPT4All version inconsistency in winget upgrade on Windows HOT 8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gpt4all.