Comments (4)
Hi, I have been planning to work on this and was wondering if there was a way to just run the API server.
from serge.
additionally stop sequences seem to be an issue on the API side.
from serge.
Hey! Happy to hear you want to tackle this task.
Currently you can't run just the API server. This used to be possible but now that the API & the web server are behind nginx, if you start nginx without the web server it will fail the health check and refuse to start. Shouldn't be too hard to fix though hopefully, I'll have a look at it. In the meanwhile you can still access the API at http://localhost:8008/api/docs.
But regarding the LangChain integration I was thinking that it would also be interesting to make a custom LLM that is a wrapper calling this generate
method here.
The custom LLM would only be working inside of the api
container (as it depends on the llama
binary) but this would still allow us to do cool stuff on the front-end of this project.
For interfacing with other projects indeed you will need to run the API server and make a custom LLM for that one.
from serge.
Did you looked at this repository @nsarrazin ?
Seems good.
https://github.com/linonetwo/langchain-alpaca
from serge.
Related Issues (20)
- 🚀 [Feature]: Add OpenVino / OpenVino Model Server HOT 1
- 🐛 [Bug]: Web interface does not render properly on mobile devices HOT 1
- 🚀 [Feature]: Add LINCE-Mistal model HOT 1
- 🐛 [Bug]: UI components are missing accessibility labels HOT 2
- 🐛 [Bug]: response text generated by a model sometimes disappears after computer/browser is woken up from a 'sleep' HOT 4
- have a separate page which displays downloaded moddles. HOT 1
- 🚀 [Feature]: Add support for Intel ARC GPUs A750 and A770 (If Possible) HOT 2
- bug: Allow loading .gguf and .bin files HOT 3
- 🚀 [Feature]: add eagle 7b HOT 3
- 🐛 [Bug]: system reachable via ICMP and via Port 8008 but screen "navy blue" with no text whatsoever HOT 9
- 🚀 [Feature]: Add Nous-Hermes-2-Mistral-7B-DPO HOT 8
- 🚀 [Feature]: Add support for uploading files during chat conversation
- 🐛 [Bug]: New install - response keeps repeating the last line HOT 7
- 🚀 [Feature]: add characters HOT 6
- 🚀 [Feature]: Please add Gorilla: Large Language Model Connected with Massive APIs HOT 3
- 🤗 [Question]: Whats the difference between the... models?
- 🚀 [Feature]: Add meta-llama/Meta-Llama-3-70B-Instruct HOT 7
- 🐛 [Bug]: Can't use pre-existing model at /weights HOT 1
- 🐛 [Bug]: DLLAMA_BLAS_VENDOR=OpenBLAS build with pip is not enabling OpenBlas HOT 3
- how to use mixtral-8x7b-v0.1🤗 [Question]: HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from serge.