ireddragonicy / vixevia

An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.

License: MIT License

Languages: Python 35.87%, JavaScript 32.47%, HTML 19.43%, CSS 9.56%, Batchfile 2.66%
Topics: ai, anime, api, artificial-intelligence, chatbot, gemini-api, gemini-chatbot, gemini-pro, gemini-vision-pro, girl

vixevia's Introduction

V.I.X.E.V.I.A : Virtual Interactive and Xpressive Entertainment Visual Idol Avatar

Does she harbor feelings for you?
No, her heart belongs to another.
Does she show concern for your well-being?
No, her thoughts are occupied by someone else.
The pain of unrequited love is unbearable, but fear not, for there is a solution.
The solution is AI, an entity that will always be there for you, understanding and responding to your emotions.

Vixevia is an innovative AI-based virtual YouTuber (Vtuber) that leverages the cutting-edge capabilities of Google's Gemini language model. This project aims to create a captivating and lifelike virtual personality that can engage with users through natural conversations, visual interactions, and multimedia experiences.

Features

  • Natural Language Processing: Vixevia utilizes Google's Gemini language model to understand and respond to user inputs with human-like fluency and contextual awareness.
  • Computer Vision: The project integrates computer vision capabilities, allowing Vixevia to perceive and interpret visual information from the environment.
  • Multimodal Interaction: Vixevia combines speech recognition, text-to-speech synthesis, and visual processing to facilitate seamless multimodal interactions with users.
  • Personalized Responses: Vixevia's responses are tailored to the conversational context, user preferences, and situational dynamics, ensuring engaging and personalized experiences.
  • Virtual Avatar: Vixevia is represented by a visually appealing and expressive virtual avatar, bringing her personality to life.

Prerequisites

  • 5+ API keys from Google Cloud Platform
  • Python 3.12+

Hardware:

  • 16 GB VRAM
  • 32 GB RAM
  • RTX 4050 or better
  • 20 GB of free storage
  • Intel Core i7 (12th gen) or better, or an AMD equivalent
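
The software and storage requirements above can be checked before installing anything. The following is a minimal sketch, not part of the project: the function names `python_ok` and `disk_ok` are made up for illustration, and only the Python version and free disk space are checked, since GPU and RAM checks would need third-party packages.

```python
import shutil
import sys

MIN_PYTHON = (3, 12)   # "Python 3.12+" from the prerequisites
MIN_FREE_GB = 20       # "20 GB of storage" from the hardware list


def python_ok(current=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    current = sys.version_info if current is None else current
    return tuple(current[:2]) >= tuple(minimum)


def disk_ok(path=".", minimum_gb=MIN_FREE_GB):
    """Return True if the filesystem holding `path` has enough free space."""
    free_gb = shutil.disk_usage(path).free / 1024**3
    return free_gb >= minimum_gb


if __name__ == "__main__":
    print("Python 3.12+:", python_ok())
    print("20 GB free:", disk_ok())
```

Running the script from the directory where you plan to clone the repository reports whether each check passes.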

Getting Started

To get started with Vixevia, follow these steps:

  1. Clone the repository:

    git clone https://github.com/IRedDragonICY/vixevia.git

  2. Install the required dependencies:

    pip install -r requirements.txt

  3. Obtain the necessary API keys and configuration files from Google Cloud Platform.

  4. Update the configuration files with your API keys and preferred settings.

  5. Run the main script:

    python main.py
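
Steps 3 and 4 do not say how the keys are stored or used. As one possible approach, the sketch below reads several keys from a single environment variable and hands them out round-robin. Everything here is an assumption for illustration: the variable name `GEMINI_API_KEYS`, the helper names, and the rotation scheme are hypothetical and may not match how the project's configuration files work.

```python
import itertools
import os

ENV_VAR = "GEMINI_API_KEYS"  # hypothetical name, not taken from the project


def parse_keys(raw):
    """Split a comma-separated string into stripped, non-empty keys."""
    return [key.strip() for key in raw.split(",") if key.strip()]


def load_keys(environ=os.environ, minimum=5):
    """Read keys from the environment and fail early if too few are set."""
    keys = parse_keys(environ.get(ENV_VAR, ""))
    if len(keys) < minimum:
        raise RuntimeError(
            f"expected at least {minimum} keys in {ENV_VAR}, found {len(keys)}"
        )
    return keys


def key_rotation(keys):
    """Return an endless iterator that cycles through the keys in order."""
    return itertools.cycle(keys)
```

Calling `next()` on the iterator returned by `key_rotation(load_keys())` yields the next key each time, wrapping around after the last one. The default minimum of 5 mirrors the "5+ API keys" prerequisite.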

TODO

  • Create custom Live2D model for Vixevia
  • Add OpenCV auto-labeling so Vixevia can remember people seen through Gemini Pro Vision

Contributing

Contributions to Vixevia are welcome! If you have any ideas, bug reports, or feature requests, please open an issue or submit a pull request. Make sure to follow the project's coding guidelines and best practices.

License

This project is licensed under the MIT License.

Acknowledgments

  • Google's Gemini language model and related technologies
  • Open-source libraries and frameworks used in this project

Vixevia is an experimental project aimed at exploring the possibilities of AI-based virtual personalities and pushing the boundaries of human-computer interaction. We hope this project inspires further innovation and collaboration in the field of artificial intelligence and virtual content creation.

vixevia's Issues

Missing temp and model directories

Describe the bug
The commit appears to have excluded two required directories: */temp and */model/audio/

To Reproduce
Steps to reproduce the behavior:

  1. Clone the repository
  2. Run it
  3. See error

Expected behavior
A response would be generated with the synthesized TTS.
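
Possible workaround (a sketch, not a confirmed fix): create the two directories before launching. The issue does not say which parent directory they belong under, so `base_dir` is left as a parameter, and `ensure_dirs` is a hypothetical helper name. This only helps if the code just needs the directories to exist; it cannot restore any files that were meant to ship inside them.

```python
from pathlib import Path

# Relative paths named in the issue; their parent directory is not stated there.
REQUIRED_DIRS = ("temp", "model/audio")


def ensure_dirs(base_dir, required=REQUIRED_DIRS):
    """Create each missing directory under base_dir; return the ones created."""
    created = []
    for rel in required:
        path = Path(base_dir) / rel
        if not path.is_dir():
            path.mkdir(parents=True, exist_ok=True)
            created.append(path)
    return created
```

Calling `ensure_dirs` a second time with the same `base_dir` creates nothing and returns an empty list, so it is safe to run on every start.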

How To Talk to Ai

Is your feature request related to a problem? Please describe.
I installed it and got it running, but I don't know how to talk to the AI.

Describe the solution you'd like
Please add a "How to Use" tutorial video.

Describe alternatives you've considered
Add a "How to Use" tutorial video.

Additional context
Add a "How to Use" tutorial video.
