Comments (12)
The answers are pretty accurate (on one file that is a bit tricky to understand), but it is slower that privategpt
from localgpt.
Are you running it on a GPU?
from localgpt.
Are you running it on a GPU?
on windows: yes, i can see it
Unless it loads in the GPU to actually use the CPU
from localgpt.
Can you share details of your hardware and cuda? I will have a look at it.
from localgpt.
Hardware is 3090 basic
Cuda is 11.7
Look, i watched your video:
Do you really need the visual studio environment to make this work ?
I notice that the loading time of your model is around 12 seconds, while for my it is like 2'20.
If i check the memory, i notice that it is very slow to load in the memory and suddenly i receive an answer very fast.
I think my problem is likely related to how the model loads in the memory of my computer.
Is there a way to improve this ?
from localgpt.
You are right, you don't need Visual Code Studio to make it work. It was just to show the code. But it's better to just directly run it in the terminal.
Not sure what could be causing this. In my case, I am loading it from an SSD. Not sure what your storage is. I can't really think of anything else at the moment. I will keep this open in case anyone else encounters this or we can figure something out.
from localgpt.
Huggingface stores its model here:
C:\Users\username\.cache\huggingface\hub
and my C drive is SSD too
from localgpt.
Not sure what else it could be. Someone might have a better idea. Sorry couldn't help.
from localgpt.
@lelapin123 I'm repeating myself: but give CASALIOY a try. It's faster than privateGPT and solves the issues those repos won't fix.
from localgpt.
I'm also seeing very slow performance, tried CPU and default cuda, on macOS with apple m1 chip and embedded GPU. I see python3.11 process using 400% cpu (assuign pegging 4 cores with multithread), 50~ threds, 4GIG RAM for that process, will sit there for a while, like 60 seconds at these stats, then respond. Is it suppose to be this slow?
from localgpt.
I'm also seeing very slow performance, tried CPU and default cuda, on macOS with apple m1 chip and embedded GPU. I see python3.11 process using 400% cpu (assuign pegging 4 cores with multithread), 50~ threds, 4GIG RAM for that process, will sit there for a while, like 60 seconds at these stats, then respond. Is it suppose to be this slow?
Personnaly, it took 25minutes to answer the same question à quoi sert in the walkthrought presentation of code. How have you obtain this nice performance ?
from localgpt.
hello @PromtEngineer the localgpt takes too much time to give a result and i am using TP GPU in google colab
from localgpt.
Related Issues (20)
- Why its sharing questions and data from different browser sessions?
- How do I add memory to chat-zero-shot-react-description?
- Autoawq HOT 2
- Mistral not supported HOT 2
- cpp-llama-python not found. HOT 1
- problem when ingesting (just CPU) HOT 1
- auto-gptq and auto awq is breaking in requiremetns.txt HOT 1
- I encountered a mistake when I executed run_localGPT_API.py HOT 1
- Extra Options with run_localGPT_API.py?
- error in /opt/nvidia/nvidia_entrypoint.sh
- run_localGPT_API HOT 8
- Support llama-3 HOT 10
- If I want to improve the Recall access the reranker model ,how can I do it?
- llama3 support: ERROR: byte not found in vocab, segmentation fault HOT 1
- Unable to execute 'python run_localGPT.py --device_type cpu'
- Can I reuse the models which I have running locally via ollama service ?
- Error response from daemon: could not select device driver "" with capabilities: [[gpu]]. HOT 1
- How to authenticate to huggingface.co, from the run_localGPT.py script, using Docker? HOT 1
- Cannot access gated repo HOT 2
- Improved metadata at ingest
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from localgpt.