View Code? Open in Web Editor NEW

Voice AI is a Python application that leverages the OpenAI's Whisper and GPT-3.5 APIs to convert speech into text and generate responses.

License: GNU General Public License v3.0

Python 100.00%

voice.ai's Introduction

Voice AI

Voice AI is a Python application that leverages the OpenAI's Whisper and GPT-3.5 APIs to convert speech into text and generate responses.

Features

Python 3.6+
OpenAI Python v0.27.0+
Packages: openai, pyaudio, pynput, clipboard, tkinter
A microphone to record audio
Install these packages using pip: pip install openai pyaudio pynput clipboard tkinter

Clone the repository or download the code.
Install the necessary Python packages if not already installed.
Run the script with the command python your_script.py, replacing your_script.py with the name of the Python file you have this script saved in.

Start the application.
Enter your OpenAI API key in the "API Key" field.
Enter a system message for the GPT-3.5 API in the "System Message" field. This will act as a prompt to guide the AI's responses.
Set a push-to-talk key. This key will be used to start and stop recording. Make sure to choose a key that does not interfere with your system's hotkeys.
Click the "Start" button to start the application. Once started, the input fields and the start button will be disabled.
Press and hold the push-to-talk key to start recording. Release the key to stop recording.
The application will then transcribe the recorded audio into text using the Whisper API, and then generate a response using the GPT-3.5 API.
The transcript and the AI response will be displayed in the text area and the AI response will be copied to your clipboard.