Comments (8)
I haven't tried this yet but it might help increasing prompt size by compressing the prompt.
https://github.com/yasyf/compress-gpt
from alpaca-electron.
I haven't tried this yet but it might help increasing prompt size by compressing the prompt.
I'll take a look at how it works later. If it's not too complicated, I'll try to implement something similar.
from alpaca-electron.
The context size is set to max (2048) already
from alpaca-electron.
is 2048 a hard limit with llama.cpp? or is that a function of the model? I know GPT' 3.5 is somewhere around 4000 but it seems to have a better memory for longer before it goes senile? I'm not sure how they achieve that I suspect they might be offloading a summarised version of the previous post to keep the 'Bot on track?
from alpaca-electron.
is 2048 a hard limit with llama.cpp? or is that a function of the model?
Yes, 2048 seems to be the hard limit. It allows you to make context go more than 2048, but it warns that performance may be negatively impacted.
from alpaca-electron.
Are you planning on implementing context? By "context," I mean compressing previous messages and placing them in the prompt like GPT3/4's createChatCompletion
. I recently managed to run the Vicuna ggml-vicuna-13b-4bit-rev1.bin
model by browsing to its file and loading it. Unfortunately, I encountered a bug in the prompt that generated infinite text output. For instance, when I asked the model to make a song in the style of Marshal Mathers about AI and Humans coexisting, it printed 33 verses before I had to stop it. I suspect the text would have gone on indefinitely. Otherwise, my Lenovo Legion 5I Pro ran both ggml-model-q4_1.bin
and the Vicuna model satisfactorily — nearly as fast as GPT-4.
The Chatbot-Ui really caught my attention, and I'm fascinated by the idea of combining it with other UIs like Next.js and Electron. How challenging would this be, and is it even possible?
from alpaca-electron.
Are you planning on implementing context? By "context," I mean compressing previous messages and placing them in the prompt like GPT3/4's
createChatCompletion
.
In theory, I could do that. But it would make the performance very poor on most computer. OpenAI can do this because they have a bunch of beefy GPUs at their disposal. But this runs locally, sometimes on near-potato hardware.
The Chatbot-Ui really caught my attention, and I'm fascinated by the idea of combining it with other UIs like Next.js and Electron. How challenging would this be, and is it even possible?
I don't know how to use Next.js because I hate HTML frameworks (e.g. bootstrap, vue, angular, react). It probably wouldn't be hard to turn it into an electron app though. If it runs in the web browser, you could just embed that very same page into an electron app and that's it.
from alpaca-electron.
I'm not sure how it affects performance but at least it might be good to be aware of that possibility. At least it's implemented as drop-in-replacement which is quite cool, imho.
from alpaca-electron.
Related Issues (20)
- Added Arm Support. I think...? HOT 2
- Compile and build V1.0.6 from source code
- Add capability to read PDF files and answer questions based on the PDF file.
- Markdown support HOT 1
- [ENHANCEMENT] Add Support for 5-bit quantized models HOT 2
- Read me instructions for Windows could be improved
- [ENHANCEMENT] HOT 2
- GGML v3 support HOT 3
- [ENHANCEMENT] Adjust font size HOT 1
- [ENHANCEMENT]Is It Even Possible To Make Alpaca Connect To Internet. HOT 2
- Alpaca-Electron-mac-arm64-v1.0.5 app is "damaged". HOT 4
- [Feature Request] Support InternLM
- [BUG] Part of prompt ends up in response HOT 8
- alpaca-electron V1.0.5 has Command Execution Vulnerability
- Alpaca-electron V1.0.5 has Command Execution Vulnerability
- Alpaca-electron V1.0.5 has Command Execution Vulnerability
- Unable to read downloaded model [BUG]
- [ENHANCEMENT] ability to save or have multiple conversations. HOT 1
- [suspicion] HOT 1
- [BUG] settings not applied.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from alpaca-electron.