Comments (9)
@spinning27 Fair. However, an argument can be made to pick a version that works, and stick to it?
In any case, I think having the ability to use open-source LLMs would definitely be interesting to people and organizations that want to decentralize, and protect their data from third party APIs.
from rasagpt.
@paulpierre First of all, great concept! This is going to get a lot of traction for sure.
About an open-source LLM, wouldn't it be best to use GPT4All? It can be used commercially as well since there is a variant that's based on GPT-J.
Also, it is supported by Langchain which means that retrieval augmented QA will also be relatively easy.
from rasagpt.
of course and thanks @spinning27
what is your host operating system? i can look into creating a LLaMa branch over the weekend because I'm actually curious on the implementation.
some questions before exploring:
- base LLaMa isn't fine-tuned for QA/chat AFAIR. would you be open to other optimal options:
- would it make more sense to run remote inference on HF vs local. i think most devs stand to benefit from this convenience
let me know your thoughts 👍
from rasagpt.
yea, that's totally possible.
specifically which ones did you have in mind? there's a new one every 3 days ;)
from rasagpt.
@paulpierre , thanks for your prompt reply.
How about quantized llama 7b that has been in town for a bit time. :)
from rasagpt.
Actually, anyone @paulpierre mentioned would do as long as it is free (the cost of openAI API access could go up quite quickly with lots of query).
ATM, I mainly play it for out of curiosity. I have access to a GPU server machine or run locally on my M1 machine.
StableVicana-13B seems memory hungry. Not managed to run it on the machine yet, except quantized (compressed) version via llama.cpp.
from rasagpt.
Langchain currently develops at break-neck speed. Not sure sometimes it does not work like this one.
from rasagpt.
The best solution to beat the pricing of OPENAI is use your own deployed llms using fastChat or textgen-ui, they have nice openai api
However, A M1 laptop can easily run llm locally and this can be used for any prototype thing.
Sample code.
llm = LlamaCpp(model_path="models/llama-7b.ggmlv3.q4_0.bin", n_ctx=2048, verbose=True)
embeddings_model = LlamaCppEmbeddings(model_path="models/llama-7b.ggmlv3.q4_0.bin")
from rasagpt.
Been looking to build this myself and there we go - a fine-crafted project already there.
I've made a replacement for ada embeddings @paulpierre - perhaps hosting it locally would cut on running costs = https://github.com/proitservices/elmo_embedding_api
also, GPT4ALL is a great choice and with DB + Langchain the vicuna13b model would do just amazing (mistral 7b is also a good choice).
Would love to see this project grow into a fully featured and capable 'Jarvis' with memory and math capabilities + a rain of APIs extensions oob.
Happy to help with those, Peter
from rasagpt.
Related Issues (20)
- I want to know which line of code will directly call the FastAPI API and then call the langchain model
- Successful installation: Problem
- Make install ERROR Error response from daemon is not running HOT 4
- Using 'make models' can not create 'API' database HOT 2
- installation issue: HOT 1
- No operator matches the given name and argument types. You might need to add explicit type casts. HOT 8
- users cannot get response from Telegram
- Installation stuck at Building api HOT 1
- problem with creating a database at the organization stage HOT 2
- problem with building models in postgres
- Permission errors while creating .config and reading config.yml HOT 2
- Telegram no response my message
- nlu.yml and domain.yml look like not working HOT 1
- Using 'make install', occur error: sqlalchemy.exc.ArgumentError HOT 3
- hi,still alive here?
- service "rasa-core" is not running
- Error when "Created project: Pepetamine": psycopg2.errors.UndefinedFunction: operator does not exist: entity_status = integer. HOT 1
- service "rasa-core" is not running
- ImportError: cannot import name 'TryCast' from 'sqlalchemy.sql.elements'
- How to build a Airline Ticket Bot with RASA GPT
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rasagpt.