Comments (13)
Good call. I think with that type of UX it would be great. I plan on making token usage visible, and adding Whisper hours to that would be pretty easy. I'll work on this - really like the idea.
from chatbot-ui.
The Web Speech API is a good option since it seems to be free. It's pretty powerful for what it is, but
a) it only works on Chrome and Firefox (not Brave), and
b) it is inaccurate, especially when you use out-of-distribution words, slang, domain language, or technical terms.
See for yourself here [Chrome only].
Could be good as first implementation or option for the user.
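As a rough illustration of the "first implementation or option" idea, here is a minimal sketch of feature-detecting the Web Speech API and falling back when it is missing. It assumes the browser exposes the recognizer as `SpeechRecognition` or the prefixed `webkitSpeechRecognition`; the helper names `findSpeechRecognition` and `startDictation` and the `RecognizerLike` type are hypothetical and not part of chatbot-ui.

```typescript
// Minimal structural type covering only the recognizer members used below.
interface RecognizerLike {
  lang: string;
  interimResults: boolean;
  continuous: boolean;
  onresult: ((event: any) => void) | null;
  start(): void;
  stop(): void;
}

type RecognizerCtor = new () => RecognizerLike;

// Returns the recognizer constructor if the given global scope exposes one,
// otherwise null so the caller can fall back to typing or another backend.
function findSpeechRecognition(scope: Record<string, unknown>): RecognizerCtor | null {
  const ctor = scope["SpeechRecognition"] ?? scope["webkitSpeechRecognition"];
  return typeof ctor === "function" ? (ctor as unknown as RecognizerCtor) : null;
}

// Hypothetical helper: starts one dictation pass and forwards the final
// transcript to onText. Returns null when speech recognition is unavailable.
function startDictation(
  scope: Record<string, unknown>,
  onText: (text: string) => void
): RecognizerLike | null {
  const Ctor = findSpeechRecognition(scope);
  if (Ctor === null) return null;

  const recognizer = new Ctor();
  recognizer.lang = "en-US"; // assumption: English input
  recognizer.interimResults = false;
  recognizer.continuous = false;
  recognizer.onresult = (event: any) => {
    // Take the most recent result and use its top alternative.
    const result = event.results[event.results.length - 1];
    if (result.isFinal) onText(result[0].transcript);
  };
  recognizer.start();
  return recognizer;
}
```

In a browser this would be called as `startDictation(window as unknown as Record<string, unknown>, text => ...)`; a `null` return is the signal to hide the microphone button or switch to a Whisper-based option.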
In terms of UI for the feature, IMO the Voice Control for ChatGPT extension has done the best job. I downloaded Chrome just to be able to use it. The UX, keyboard commands, and ergonomics make a lot of sense.
from chatbot-ui.
One option could be to allow users to run the Whisper model directly on their laptops. It would likely be for power users only.
from chatbot-ui.
I really like this idea. The only issue I see with it is that Whisper usage costs might add up pretty quickly for the user. I'll add this to my list and look into it some more. I've worked with the Whisper API so I could get it done quite quickly.
from chatbot-ui.
I could add it as an option. I think @markusstrasser is advocating for Whisper due to its performance. Extremely accurate.
from chatbot-ui.
Sounds great, looking forward to this feature!
from chatbot-ui.
I could add it as an option. I think @markusstrasser is advocating for Whisper due to its performance. Extremely accurate.
This extension might be helpful: C-Nedelcu/talk-to-chatgpt#45
from chatbot-ui.
Good point. At $0.006 per minute transcribed, I estimate it to be around 3-15 cents per hour of active chatbot usage, depending on the type of conversation. Maybe we can display 'minutes transcribed' and 'costs', and possibly set a maximum budget per session or per day to cap the downside.
It would be great, since it improves the user's speaking ability as a freebie (whereas typing won't improve any further).
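To make the estimate concrete: at $0.006 per minute, 3-15 cents per hour corresponds to roughly 5 to 25 minutes of actual speech per hour of use. Below is a minimal sketch of the cost display and budget cap suggested above. The per-minute price is taken from this comment and may change; `UsageTracker`, `transcriptionCostUsd`, and `canTranscribe` are hypothetical names, not existing chatbot-ui code.

```typescript
// Price quoted in the comment above; treat it as an assumption.
const WHISPER_USD_PER_MINUTE = 0.006;

// Cost in USD for a given number of transcribed minutes.
function transcriptionCostUsd(
  minutesTranscribed: number,
  usdPerMinute: number = WHISPER_USD_PER_MINUTE
): number {
  return minutesTranscribed * usdPerMinute;
}

// Hypothetical per-session (or per-day) usage state.
interface UsageTracker {
  minutesTranscribed: number;
  budgetUsd: number;
}

// True if transcribing another clip keeps the total cost within budget.
function canTranscribe(tracker: UsageTracker, nextClipMinutes: number): boolean {
  const projected = transcriptionCostUsd(tracker.minutesTranscribed + nextClipMinutes);
  return projected <= tracker.budgetUsd;
}

// Text for a 'minutes transcribed' and 'costs' display.
function usageLabel(tracker: UsageTracker): string {
  const cost = transcriptionCostUsd(tracker.minutesTranscribed);
  return `${tracker.minutesTranscribed.toFixed(1)} min transcribed, $${cost.toFixed(3)}`;
}
```

A UI could call `canTranscribe` before each recording is sent and disable the microphone button once the budget is reached.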
from chatbot-ui.
How about adding the Web Speech API?
from chatbot-ui.
Thank you for making this! I was working on something similar but this blows it out of the water.
Would it be very difficult to create a button that sends an audio stream to a local instance of https://github.com/ggerganov/whisper.cpp? I have a version of that running, but I am terrible at TypeScript.
from chatbot-ui.
whisper.cpp also has a WebAssembly version that runs entirely in the browser: https://github.com/ggerganov/whisper.cpp/tree/master/examples/whisper.wasm
from chatbot-ui.
I don't mind running Whisper locally; there's not too much to set up. Could we add a config setting for the speech-to-text and text-to-speech endpoints? I would probably be looking at running https://github.com/askrella/speech-rest-api or something similar.
I'd be happy to collaborate on a PR for this, or even for an OpenAI Whisper option as well. I'm not sure how often I'd choose to use speech, but probably less if I'm wary of API costs.
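A config setting along these lines might look like the sketch below. Everything here is an assumption for illustration: the environment variable names (`STT_PROVIDER`, `STT_ENDPOINT`, `TTS_ENDPOINT`), the `SpeechConfig` shape, and the helper functions are hypothetical and do not exist in chatbot-ui. The only fixed value is OpenAI's hosted transcription URL.

```typescript
type SttProvider = "web-speech" | "openai-whisper" | "custom";

// Hypothetical config shape for speech-to-text (STT) and text-to-speech (TTS).
interface SpeechConfig {
  sttProvider: SttProvider;
  sttEndpoint?: string; // used only when sttProvider is "custom"
  ttsEndpoint?: string;
}

const OPENAI_TRANSCRIPTIONS_URL = "https://api.openai.com/v1/audio/transcriptions";

function isSttProvider(value: string): value is SttProvider {
  return value === "web-speech" || value === "openai-whisper" || value === "custom";
}

// Builds the config from an env-like record; the variable names are assumptions.
function readSpeechConfig(env: Record<string, string | undefined>): SpeechConfig {
  const provider = env["STT_PROVIDER"] ?? "web-speech"; // free default
  if (!isSttProvider(provider)) {
    throw new Error(`Unknown STT_PROVIDER: ${provider}`);
  }
  return {
    sttProvider: provider,
    sttEndpoint: env["STT_ENDPOINT"],
    ttsEndpoint: env["TTS_ENDPOINT"],
  };
}

// Returns the HTTP endpoint to send audio to, or null when transcription
// happens in the browser and no request is needed.
function resolveSttEndpoint(config: SpeechConfig): string | null {
  if (config.sttProvider === "web-speech") return null;
  if (config.sttProvider === "openai-whisper") return OPENAI_TRANSCRIPTIONS_URL;
  if (!config.sttEndpoint) {
    throw new Error("STT_ENDPOINT is required when STT_PROVIDER is 'custom'");
  }
  return config.sttEndpoint;
}
```

With this shape, a locally hosted server (whisper.cpp, speech-rest-api, or similar) would be selected by setting the provider to `custom` and pointing the endpoint at whatever URL that server actually exposes.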
from chatbot-ui.
Is this integrated? In the hosted version I do not see any microphone button for speech-to-text (STT).
from chatbot-ui.