rasan147 / voiceai-asuna

23.0 4.0 5.0 3 MB

If you're familiar with the anime Sword Art Online, you know it! This project is a virtual assistant for multiple operating systems.

Home Page: https://aigirl.repl.co

License: Apache License 2.0

Python 77.39% HTML 6.35% JavaScript 12.65% CSS 3.61%

ai alice anime python3 virtual-assistant virtual-girlfriend

voiceai-asuna's Introduction

PROJECT Asuna (NOT AI/ChatGPT yet)

Welcome to project Asuna.

Live Demo: https://ai-asuna.onrender.com

(voice available in Minimize mode in chat)

Feel Free to Support Me:

Description:

This is not ChatGPT or a full-blown, all-knowing AI.
This is an English-only, pattern-based chat bot (for now).
It currently uses regular expressions to catch specific message patterns and reply to them, and it collects both recognized and unknown inputs for future training.
Once we have sufficient data and resources, we will perform AI training.
If you have any ideas or want to provide IO data, please file an Issue or Pull Request.
Documentation will be created soon to make group development easier.
Currently this is a one-man project and my first language is not English. I'm using 1000 years of knowledge from anime and movies to enrich the chat responses.
So please don't expect much, but I hope I'll be able to provide great performance with it.
Please have patience. Maintaining a completely new (self-made) server, UI and back end is not an easy task (for me). On top of that, adding IO-based patterns and sequencing them takes time.
My/our main goal is to create a multi-platform voice assistant.
Our additional plan is to create an animated avatar for it and make it available online (via browsers).
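
To give contributors a rough idea of what "pattern based" means here, the following is a minimal sketch, not the actual code in Chat_raw2.py. The patterns, replies and the `unknown_inputs` list are made up for illustration.

```python
import re

# Hypothetical patterns and replies, for illustration only.
PATTERNS = [
    (re.compile(r"\b(hi+|hello|hey)\b", re.IGNORECASE), "Hello!"),
    (re.compile(r"\bwhat(?:'s| is|s) your name\b", re.IGNORECASE), "I'm Asuna."),
]

# Messages that matched nothing, kept so they can be reviewed or used for training later.
unknown_inputs = []


def reply(message):
    """Return the reply of the first matching pattern, or a fallback."""
    for pattern, answer in PATTERNS:
        if pattern.search(message):
            return answer
    unknown_inputs.append(message)
    return "Sorry, I don't understand that yet."
```

Because the first matching pattern wins, the order of the entries in `PATTERNS` matters in this sketch.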

Current status:

How To Run:

First install the REQUIREMENTS, click it to see details
To Launch the server run the RUN_ME.py file
```
python RUN_ME.py
```
Demo video coming soon...

Requirement:

Python 3.7 or higher
Works on Android Pydroid 3 😄 too (most development is done using this)

Common IO: (similar inputs may/will work too)

Basic hiiii, hello
What's your/my name / how're u
Whats the time / tell time
Static Q/A, like whats newtons 3rd law / whos the president of canada / whats root(69+420)
Whats the latest news / news highlights
Tell me about yourself / ... your hobby/favorite game/anime
Love ya
Repeat after me -> will repeat whatever you say next. Say stop / stop repeating to stop
change dress to change costumes and change room to switch the background
Many more (mostly forgotten), and many more coming soon

CONTRIBUTION GUIDELINE:

Make changes to whatever you feel like
Add some good comments (so that an intermediate Python programmer can understand)
Make a PR and try to explain what you have changed and whether there are any issues.
Keep in mind, if you are interested:
- pyrobox.py is the main server file (like Django or Flask)
- App_server.py is the file that handles client-host requests and responses.
- Chat_raw2.py is the tool that actually decides which message does what and what to reply (it can be used standalone in CLI mode for development, where it uses a test account)
Don't worry about merge issues, I'll update the code
Most importantly, AS A MOBILE CODER, I USUALLY DON'T FOLLOW ANY CODE STYLE GUIDELINE (or PEP 8), SO PLEASE DON'T WORRY ABOUT THAT TOO MUCH. (I'll try to follow it in the future)

Thanks to:

Reki Kawahara and abec (for creating Asuna)
Sony group (for Wake me up Asuna App idea and illustrations)
Pixi.js and live2D for character animation
~~Replit for continuously hosting Demo link for free~~ (not anymore)
Render.com for hosting Demo link for free
(Coming soon) Anyone who's willing to share chat data and ideas

voiceai-asuna's People

Contributors

Stargazers

Watchers

Forkers

ukaserge s-b-repo uzairurk icree8 zyxdevs

voiceai-asuna's Issues

Multiline msg is crashing

Multi-line input is not yet supported by the server and chat handler, but it can be sent from the web UI. Either disable it or fix the multipart form data issue.
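
One possible stopgap until multi-line messages are handled properly is to collapse them into a single line before they reach the chat handler. This is only a sketch of that idea; `flatten_message` is a hypothetical helper, not a function in this repo.

```python
def flatten_message(raw):
    """Collapse a multi-line message into a single line."""
    # splitlines() handles \n, \r\n and \r line endings.
    parts = [line.strip() for line in raw.splitlines()]
    # Drop empty lines and join the rest with single spaces.
    return " ".join(part for part in parts if part)
```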

[BUG] Shouldn't speak expressions (text `expression` more text)

31c8a57

This commit supports expressions as text and parses them, but still doesn't remove them when generating the voice.
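
A possible fix, sketched under the assumption that expressions are always wrapped in a pair of backticks as in the title: strip them from the text before it is passed to the voice. `text_for_voice` is a hypothetical name.

```python
import re

# Anything between two backticks is treated as an expression, e.g. `smile`.
EXPRESSION = re.compile(r"`[^`]*`")


def text_for_voice(message):
    """Return the message with backtick-wrapped expressions removed."""
    spoken = EXPRESSION.sub(" ", message)
    # Normalize the whitespace left behind by the removed expressions.
    return " ".join(spoken.split())
```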

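You'll need to create an endpoint in your website's backend that takes text input and returns the synthesized audio. You can use the Tortoise-TTS library in your backend code to generate the audio. Here's a sample code snippet in Python (it follows the usage shown in the Tortoise-TTS README, so check it against the version you install):

```python
from tortoise.api import TextToSpeech
from tortoise.utils.audio import load_voice

# Load the model once; constructing it for every request would be very slow.
tts = TextToSpeech()

def generate_audio(text, voice="tom", preset="fast"):
    # "tom" is just an example voice name; use any voice available to Tortoise-TTS.
    voice_samples, conditioning_latents = load_voice(voice)
    # Returns a torch tensor holding the generated audio samples.
    return tts.tts_with_preset(text, voice_samples=voice_samples,
                               conditioning_latents=conditioning_latents, preset=preset)
```

You can expose this function as an API endpoint using a Python web framework like Flask or Django. For example, with Flask:

```python
import io

import torchaudio
from flask import Flask, request, send_file

app = Flask(__name__)

@app.route('/api/synthesize', methods=['POST'])
def synthesize():
    text = request.form.get('text', '').strip()
    if not text:
        return 'Missing "text" field', 400
    audio = generate_audio(text)  # defined in the previous snippet
    # Encode the tensor as WAV in memory (the Tortoise-TTS README saves output at 24 kHz).
    buf = io.BytesIO()
    torchaudio.save(buf, audio.squeeze(0).cpu(), 24000, format='wav')
    buf.seek(0)
    return send_file(buf, mimetype='audio/wav')

if __name__ == '__main__':
    app.run()
```

Note that this is just a basic example, and you may need to modify it depending on your specific use case. Also, be aware that generating audio is computationally intensive, so you may want to optimize the code for performance if you expect a high volume of requests.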

Add mongodb support and PyroDB as localfallback

[FEAT] add device control

Add "allow-device-control" in user.json (default: False)
Add a toggle switch in the web UI
On the first toggle, the user needs to agree from the terminal window where the server is running
Then the server verifies the device.
Verify process:

1. Browser requests a token
2. Server generates 2 tokens:
   i. One to use as a URL query that the browser can request (probably generated with uuid.uuid4())
   ii. The other to be returned by the GET request to the above link (generated similarly)
3. Server sends the 1st token and creates a page for it that holds the 2nd token
4. Browser requests 127.0.0.1:port/?verifyDevice=token
5. Server answers the step 4 URL with an allow-all-origins response header
6. If the server gets the request (that means same device, since 127.0.0.1 can only be used from the same device), it sends the 2nd token and the browser receives it
7. Browser sends the second token to the server (same as step 4). Server approves it on both the browser and the server side and saves that token in user.json and localStorage. Since it's in localStorage, it won't be shared unless there is malicious intent
Request messages like "raise volume" will also be re-verified (using step 4) with the token.

It won't hurt to make some extra requests rather than letting someone control your PC

This is also done on every login or user verification
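
The server side of this handshake could look roughly like the sketch below. It keeps everything in memory and leaves out the HTTP layer, the allow-origin header and the user.json / localStorage storage. `DeviceVerifier` and its method names are hypothetical, not existing code.

```python
import hmac
import uuid


class DeviceVerifier:
    """In-memory sketch of the two-token device check (hypothetical helper)."""

    def __init__(self):
        self.pending = {}      # first (URL) token -> second (secret) token
        self.approved = set()  # second tokens that passed verification

    def start(self):
        """Create both tokens and return the one the browser puts in the URL."""
        url_token = str(uuid.uuid4())
        self.pending[url_token] = str(uuid.uuid4())
        return url_token

    def local_request(self, url_token, remote_addr):
        """Hand out the second token only to requests coming from 127.0.0.1."""
        if remote_addr != "127.0.0.1":
            return None
        return self.pending.get(url_token)

    def confirm(self, url_token, secret_token):
        """Approve the device if the browser sends back the matching second token."""
        expected = self.pending.pop(url_token, None)  # each URL token is one-shot
        if expected is None or not hmac.compare_digest(expected, secret_token):
            return False
        self.approved.add(secret_token)
        return True

    def is_allowed(self, token):
        """Check a token sent along with a device-control message."""
        return token in self.approved
```

In this sketch a failed `confirm` also discards the pending tokens, so the browser would have to start over; whether that is the desired behaviour is a design choice.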

Use latest wikipedia api

Check and update from
Wikipedia-API
https://github.com/martin-majlis/Wikipedia-API

Older one is causing dirty issues

Put input size limit

Block from both js and py
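
On the Python side the check could be as small as the sketch below. The limit of 500 characters and the name `check_message_size` are assumptions, not values from the project.

```python
MAX_MESSAGE_CHARS = 500  # hypothetical limit, pick whatever fits the bot


def check_message_size(message, limit=MAX_MESSAGE_CHARS):
    """Return (ok, error_text); messages longer than the limit are rejected."""
    if len(message) > limit:
        return False, f"Message too long ({len(message)} > {limit} characters)"
    return True, None
```

The same limit would still need to be enforced in the JS, since the server cannot trust the client and the client check only saves a round trip.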

Use updated server backbone

I've improved the server backbone in https://github.com/RaSan147/py_httpserver_Ult

Planning to migrate

[Plan] Make & check vercel support

There's a chance replit server may go down in 2024

[Bug] blank signup/login form are being accepted

Check both js and py
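
A minimal sketch of the Python-side check, assuming the form has a username and a password field (the field names and `validate_credentials` are hypothetical):

```python
def validate_credentials(username, password):
    """Return a list of error messages; an empty list means the form is acceptable."""
    errors = []
    # Treat missing values and whitespace-only values as blank.
    if not username or not username.strip():
        errors.append("username is required")
    if not password or not password.strip():
        errors.append("password is required")
    return errors
```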

machine learning

To create a chat AI that can respond to user messages and provide appropriate responses, you will need to follow a few basic steps:

Collect and preprocess data: You will need a dataset of conversation examples to train your chat AI. You can either collect this data manually or use an existing dataset such as Cornell Movie Dialogs Corpus, Ubuntu Dialogue Corpus, etc. After collecting the data, you need to preprocess it to remove noise, normalize text, and convert it into a machine-readable format.

Choose a model architecture: There are various types of models that can be used for chat AI, such as sequence-to-sequence models, transformer models, and memory networks. You can choose the model architecture based on your requirements and the size of your dataset.

Train the model: Once you have chosen a model architecture, you need to train the model on your preprocessed dataset. This involves feeding the model with input-output pairs and adjusting the model's parameters to minimize the loss function.

Test and evaluate the model: After training the model, you need to test it on a separate test dataset to evaluate its performance. You can use metrics such as perplexity, BLEU score, and ROUGE score to evaluate the model's performance.

Deploy the model: Once you are satisfied with the model's performance, you can deploy it to a production environment such as a web application or a chatbot platform.

As for using open source machine learning, there are several libraries and frameworks available that you can use, such as TensorFlow, PyTorch, and Keras. These frameworks provide pre-built models, as well as tools for training and evaluating custom models.

place gui assets in the src folder

[FEAT] Add conversation

get some conversation from here:
https://helenadailyenglish.com/basic-english-conversation-100-daily-topics

add more by editing and comments

[BUG] Fix convo

Auto catching wrong intent

   Love you >> Love you
   user msg id:  359
   Flags:  {}
   intent: ['say_hello', 'love_you']

tell time giving gmt

send localtime and say it
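
One way to do this, sketched under the assumption that the browser sends its UTC offset in minutes east of UTC (the negation of JavaScript's `Date.getTimezoneOffset()`); `local_time_text` is a hypothetical helper:

```python
from datetime import datetime, timedelta, timezone


def local_time_text(utc_offset_minutes, now_utc=None):
    """Format the current time in the user's zone as HH:MM (24-hour)."""
    # now_utc can be passed in for testing; otherwise use the current UTC time.
    now_utc = now_utc or datetime.now(timezone.utc)
    user_zone = timezone(timedelta(minutes=utc_offset_minutes))
    return now_utc.astimezone(user_zone).strftime("%H:%M")
```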

Skins missing?

hope this helps with bot sending video

You can use various video processing libraries like FFmpeg or GStreamer to manipulate the video files and streaming protocols like HLS or MPEG-DASH to serve them over the internet. You can also use third-party APIs like Cloudinary or Amazon S3 to store and retrieve the video files. However, it's important to note that enabling a chat AI to send videos on a self-hosted website could potentially create legal or ethical issues, especially if the videos are not moderated or monitored for inappropriate content. It's important to carefully consider the implications of such a feature before implementing it.

Sure, here's an example of how you could allow a chat AI to send videos on a self-hosted website using HTML and JavaScript:

First, create an HTML form with an input field for the user to upload a video file:

python

Next, create a JavaScript function to handle the form submission and send the video file to the server using AJAX:

javascript

Finally, create a PHP script to handle the video upload and store the video file on the server:

php

Note that this code is just an example and would need to be customized to fit your specific website and use case.

Here's an alternative implementation of the same functionality using Node.js and the Express framework:

First, install the necessary dependencies by running the following command in your project directory:

npm install express multer

Next, create a server.js file with the following code:

```javascript
const express = require('express');
const multer = require('multer');
const app = express();
const upload = multer({ dest: 'uploads/' });

app.post('/upload', upload.single('video'), (req, res) => {
  if (!req.file) {
    return res.status(400).send('No video uploaded');
  }
  res.send('Video uploaded successfully');
});

app.listen(3000, () => {
  console.log('Server listening on port 3000');
});
```

Finally, create an HTML form with an input field for the user to upload a video file, and use JavaScript to submit the form to the server:

php

Again, note that this code is just an example and would need to be customized to fit your specific website and

How to display the whole body of the model

Asuna Model

Hello, I'm not a pro Python developer, so I've got a question (maybe a stupid one).

How can I change the model? I don't need a full explanation just some ideas how to do it.

Thanks for your VoiceAI Asuna project. It's amazing!!!!

[FEAT] Add convo links

Compliments
https://www.happier.com/blog/nice-things-to-say-100-compliments/

feature request: please add voice in/out

The title covers it: it would be sweet to talk and get replies in audio
