Code Monkey home page Code Monkey logo

chat's Introduction

chat

Chat with an AI Vtuber

vtube plugin llm tts

Uses:

  • whisper.cpp
  • llama.cpp
  • Coqui TTS
  • VTube Studio

Clone The Repository

git clone --recurse-submodules https://github.com/TelevisionNinja/chat.git

Install Dependencies

VTube Studio

Download and install VTube Studio from Steam

eSpeak-NG

Linux and MacOS

sudo apt install espeak-ng

Windows

Download and install the msi variant from https://github.com/espeak-ng/espeak-ng/releases

Python

Python Install

Linux and MacOS

  1. Download and install Python 3.11
  2. cd into the repository ```cd ./chat``
  3. Create the virtual environment python3.11 -m venv venv
  4. Activate the virtual environment source ./venv/bin/activate
  5. Install the Python Dependencies below into the virtual environment

Windows

Download and install Python 3.11

Pytorch

Install Pytorch by following the 'Get Started' instructions from the website

RVC

Python Dependencies

pip install TTS sounddevice pyvts

Linux

sudo apt install make gcc g++ libsdl2-dev

CUDA

sudo apt install nvidia-cuda-toolkit

MacOS

Install brew with

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

Install Dependencies with

brew install make gcc sdl2

Windows

CUDA

  1. Download and install Visual Studio and the C++ workload from Microsoft
  2. Download and install the CUDA Toolkit from the Nvidia developer website

SDL

  1. Download the VC variant from https://github.com/libsdl-org/SDL/releases
  2. Extract to the directory ./src/llama.cpp
  3. Rename the extracted folder to SDL2

VB-Cable

Download and install VB-Cable by VB-Audio Software

Build

Linux

With CUDA

cd ./src/llama.cpp

make -j GGML_CUDA=1

cd ../..

Without CUDA

cd ./src/llama.cpp

make -j

cd ../..

MacOS

cd ./src/llama.cpp

make -j

cd ../..

Windows

Use Command Prompt

With CUDA

cd ./src/llama.cpp

set SDL2_DIR=SDL2\cmake

cmake -DWHISPER_SDL2=ON -DGGML_CUDA=ON -B build
cmake --build build --config release -j

cd ../..

Without CUDA

cd ./src/llama.cpp

set SDL2_DIR=SDL2\cmake

cmake -DWHISPER_SDL2=ON -B build
cmake --build build --config release -j

cd ../..

Make Scripts Executable

Linux and MacOS

sudo chmod +x ./src/llama.cpp/common/talk/speak.sh
sudo chmod +x ./launch.sh

Run

  1. Launch VTube Studio

  2. Run the launch script

    Linux and MacOS

    ./launch.sh

    Windows

    powershell .\launch.ps1
  3. Click on the allow button for the plugin to function

  4. Go to the model settings and change the input to AIVoiceVolume in the Mouth Open section

chat's People

Contributors

televisionninja avatar

Watchers

Kostas Georgiou avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.