hparcells / rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

Home Page: https://rtvc.hunterparcells.com

License: GNU General Public License v3.0

JavaScript 1.52% TypeScript 84.60% SCSS 13.87%

ai elevenlabs voice-cloning api website voice-synthesis voicecloning interactive transcription speech-to-speech

rtvc's Introduction

rtvc

Realtime voice cloning using ElevenLabs's API.

https://rtvc.hunterparcells.com/

Disclaimer

This project was largely thrown together in a single afternoon, ~~the UI at the moment is very crude~~ and many things may still be unoptimized.

There are probably many better solutions than this project, such as dedicated voice changer applications or tools for instant voice cloning, but I thought this would be interesting to make.

Requirements

A browser that supports the Web Speech API. Chrome Desktop is recommended.
A subscription to ElevenLabs. Traditionally this is $5/month, but they currently have an offer of $1 for your first month.
If you plan to play the audio through a microphone, you will need a way to route desktop audio to a virtual mic output; I use VoiceMeeter for this.
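
A quick way to check the first requirement is to look for the speech recognition constructor, which Chrome exposes under the prefixed name webkitSpeechRecognition. The sketch below is not taken from the project; the function name and the scope parameter (a stand-in for the browser's window object) are hypothetical.

```typescript
// Hypothetical helper, not part of rtvc: reports whether a global scope
// exposes the Web Speech API's speech recognition constructor.
// `scope` stands in for the browser's `window` object.
function supportsSpeechRecognition(scope: object): boolean {
  // Chrome currently ships the constructor under the `webkit` prefix.
  return 'SpeechRecognition' in scope || 'webkitSpeechRecognition' in scope;
}
```

In a browser this would be called as supportsSpeechRecognition(window), and the result could be used to show a warning instead of the record button.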

Running Locally

1. Clone or download this repository.
2. Install the needed dependencies with npm i (requires Node.js).
3. Build the app using npm run build.
4. Run using npm start.

For development, skip steps 3-4 and instead run npm run dev.

rtvc's Issues

Add Customization Options for Stability and Similarity Boost

After a little experimenting, the stability and similarity_boost settings for generating speech seem to be very useful.

These options should be added to the UI and passed into the API call.
It would be nice if these options could use a React hook.
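
As a rough sketch of what passing these options into the API call could look like, the helper below builds a text-to-speech request with a voice_settings object. It assumes the ElevenLabs endpoint POST /v1/text-to-speech/{voice_id}, the xi-api-key header, and values between 0 and 1; the function and type names are hypothetical and this is not the project's actual code.

```typescript
// Hypothetical types and helper, not taken from rtvc.
interface VoiceSettings {
  stability: number;        // assumed range: 0 to 1
  similarity_boost: number; // assumed range: 0 to 1
}

interface SpeechRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Keep slider values inside the assumed 0..1 range.
function clamp01(value: number): number {
  return Math.min(1, Math.max(0, value));
}

// Build the URL and fetch options for one text-to-speech call.
function buildSpeechRequest(
  apiKey: string,
  voiceId: string,
  text: string,
  settings: VoiceSettings
): SpeechRequest {
  return {
    url: `https://api.elevenlabs.io/v1/text-to-speech/${encodeURIComponent(voiceId)}`,
    init: {
      method: 'POST',
      headers: { 'xi-api-key': apiKey, 'Content-Type': 'application/json' },
      body: JSON.stringify({
        text,
        voice_settings: {
          stability: clamp01(settings.stability),
          similarity_boost: clamp01(settings.similarity_boost),
        },
      }),
    },
  };
}
```

The result can be handed to fetch(request.url, request.init). In the UI, the two values could live in React state (for example via useState) and be passed in as the settings argument.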

eleven_multilingual_v1

Is it possible to play the voice with the eleven_multilingual_v1 model?
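
For context, the ElevenLabs text-to-speech request body accepts a model_id field, so one possible approach is to send the model name along with the text. The helper below is a hypothetical sketch under that assumption, not something the project currently does.

```typescript
// Hypothetical helper, not part of rtvc: builds a JSON request body that
// selects a model through the `model_id` field (assumed field name).
function buildSpeechBody(text: string, modelId: string = 'eleven_multilingual_v1'): string {
  return JSON.stringify({ text, model_id: modelId });
}
```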

Audio Playback Stops Randomly/Stutters Until Manually Pressing the Red "Stop" Button

Describe the bug
The audio playback of the synthesized/cloned voice stops randomly and stutters.

To Reproduce
Steps to reproduce the behavior:

1. Start RTVC.
2. Speak and wait for the program to transcribe your message.
3. Wait for the program to send the transcribed audio to the ElevenLabs API.
4. The audio playback will start, then immediately stop (usually after the first word), and will not continue (or will stutter) until you press the red "Stop" button.

This happens regardless of whether or not you continue to speak while the program is listening for speech.

Expected behavior
No stuttering or halting of audio playback.

Smartphone:

Device: Samsung S21 FE
OS: Android 13
Browser: Chrome (latest)

Display Character Quota and Limit

The user's current remaining and allocated characters from their plan should be displayed somewhere. This information can be found at the /v1/user/subscription endpoint. API documentation can be found at https://api.elevenlabs.io/docs.
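
A minimal sketch of the display logic, assuming the subscription response carries character_count (characters used so far) and character_limit (the plan's allocation). Those field names are an assumption about the response shape and should be checked against the API documentation; the function names are hypothetical.

```typescript
// Assumed subset of the /v1/user/subscription response; verify the
// field names against the API documentation before relying on them.
interface SubscriptionInfo {
  character_count: number; // characters used so far (assumed meaning)
  character_limit: number; // characters allocated by the plan (assumed meaning)
}

// Characters still available, never negative.
function remainingCharacters(sub: SubscriptionInfo): number {
  return Math.max(0, sub.character_limit - sub.character_count);
}

// Short label suitable for showing in the UI.
function formatQuota(sub: SubscriptionInfo): string {
  return `${remainingCharacters(sub)} of ${sub.character_limit} characters remaining`;
}
```

The subscription object itself would come from a GET request to https://api.elevenlabs.io/v1/user/subscription with the user's API key in the xi-api-key header.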

As part of this, the transcript sent through the API should be limited to 5000 characters. ElevenLabs's website says "The maximum number of characters you can generate in a single request on the Platform is 2,500." but the limit seems to be 5000, as that is the text box's limit. Perhaps this information is outdated?
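
One simple way to enforce such a cap is to truncate the transcript before it is sent. The sketch below uses the 5000-character figure from this issue as a default; the constant and function names are hypothetical, and the real limit may differ.

```typescript
// Limit taken from this issue's observation; the real API limit may differ.
const MAX_REQUEST_CHARACTERS = 5000;

// Hypothetical helper: cut the transcript down to at most `max` characters.
// Note that `length` and `slice` count UTF-16 code units, not graphemes.
function limitTranscript(transcript: string, max: number = MAX_REQUEST_CHARACTERS): string {
  return transcript.length <= max ? transcript : transcript.slice(0, max);
}
```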

ElevenLabs API Key Details

For data validation and condition checks, knowing the exact length and format of API keys would be useful. Judging from my own API key, it seems to be a 32-character alphanumeric (lowercase only) string. If anyone else has other details or can confirm this information, please let me know.

For now, the useEffect(() => {}, [apiKey]) function will only fetch voices when the API key entered in the input field is exactly 32 characters in length.
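
If the format described above holds, the length check could be tightened into a pattern check. The sketch below encodes only the single observation from this issue (32 lowercase alphanumeric characters), so it is an unconfirmed assumption and may reject valid keys; the names are hypothetical.

```typescript
// Pattern based on one observed key (32 lowercase alphanumeric characters).
// This is an unconfirmed assumption, not a documented format.
const API_KEY_PATTERN = /^[a-z0-9]{32}$/;

// Hypothetical helper: true when the input matches the assumed key format.
function looksLikeApiKey(candidate: string): boolean {
  return API_KEY_PATTERN.test(candidate.trim());
}
```

Inside the effect, the voices would then be fetched only when looksLikeApiKey(apiKey) returns true.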
