Comments (7)
Have you tried waiting until the model has finished loading successfully and only then switching?
from gpt4all.
This should be fixed by the model loading revamp that just went in.
from gpt4all.
@YS92 Could you confirm that this was fixed in v2.7.1? I'm not 100% certain that the model loading changes fixed this apparent resource leak.
from gpt4all.
I have this issue in v2.7.3. At least, I suspect it's the same issue. When the client starts, I can pick a model and open a chat just fine. If I try to switch models, there's an extremely high probability that the client will simply crash. It also crashes most of the time (though not always) if I reload the same model.
The issue only happens if I use my GPU. I'm not close to my memory limit, but I am above half (about 5GB used out of 8GB total VRAM). I'm fairly confident that the current model is not being unloaded from memory before the new one loads up.
from gpt4all.
@YS92 Could you confirm that this was fixed in v2.7.1? I'm not 100% certain that the model loading changes fixed this apparent resource leak.
In v2.7.1, v2.7.2 and v2.7.3, model loading does not lead to problems on my laptop.
from gpt4all.
I've done some further testing on my system. I have confirmed that the issue only exists when running in GPU mode. There are no issues if I force the program to use my CPU.
If you think it's appropriate, I could open a separate issue for this. Either way, here's my system info, if it's helpful:
GPT4All v2.7.3
OS: Windows 10 Pro 22H2
CPU: AMD Ryzen 5 1600
RAM: 16 GB
GPU: NVIDIA RTX 3070
GPU Driver Version: 551.61
I have tested this with the following models:
Orca 2 (Medium)
GPT4All Falcon
Nous Hermes 2 Mistral DPO
Mistral Instruct
MPT Chat
More info:
I haven't set a default model, so nothing loads on startup for me. I can reliably load any model twice. The third model I load always causes a crash. Any combination of different models, as well as reloading the same model, results in a crash. There is one exception to this: the MPT Chat model will not crash, unless it is the third model I load. This model seems to be an anomaly because (AFAIK) it is incapable of utilizing the GPU.
To add to this, ejecting the second model usually also causes a crash. This is not guaranteed, but it happens most of the time.
I'm not convinced the issue is a resource leak. The task manager shows that, for the first reload, GPT4All does release its allocated memory, both in RAM and VRAM. The second load also doesn't appear to take up any more memory (at least not noticeably) than the first.
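For anyone who wants numbers rather than Task Manager readings, the memory check described above could be scripted roughly like this. It is only a sketch: it assumes an NVIDIA GPU with `nvidia-smi` on the PATH, and the helper names (`parse_vram`, `read_vram`, `vram_delta`) are made up for illustration, not part of GPT4All.

```python
import subprocess

# nvidia-smi query that prints "used, total" in MiB, one line per GPU.
QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_vram(csv_text):
    """Parse nvidia-smi CSV output into a list of (used_mib, total_mib) tuples."""
    readings = []
    for line in csv_text.strip().splitlines():
        used, total = (int(field.strip()) for field in line.split(","))
        readings.append((used, total))
    return readings


def read_vram():
    """Run nvidia-smi and return the current (used_mib, total_mib) per GPU."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_vram(result.stdout)


def vram_delta(before, after):
    """Change in used VRAM (MiB) per GPU between two readings."""
    return [a[0] - b[0] for b, a in zip(before, after)]


# Intended manual use (hypothetical workflow):
#   before = read_vram()   # with the first model loaded
#   ... eject or switch the model in the GPT4All client ...
#   after = read_vram()
#   print(vram_delta(before, after))
```

A negative delta after ejecting a model would suggest the VRAM really was released. Note that these figures cover the whole GPU, not just the GPT4All process, so other applications should be idle while measuring.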
from gpt4all.
Does this still happen for you with GPT4All 3.1?
from gpt4all.