IntelliVoiceGPT

A Personal Assistant for Linux, macOS and Windows

IntelliVoiceGPT transcribes your spoken input to text with OpenAI's Whisper. A LangChain agent then chooses a set of actions, which can include generating AppleScript (for desktop automation) and JavaScript (for browser automation) commands from your prompt with OpenAI's GPT-3 ("text-davinci-003") and executing the resulting script. Finally, it converts the text response to audio with Google Text-to-Speech (gTTS).
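The flow described above can be sketched as four stages wired together. This is a minimal illustration only, not the project's actual code: the class name, the field names, and the stand-in lambdas are all hypothetical, and the real stages would wrap Whisper, a LangChain agent, a script runner, and gTTS.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class AssistantPipeline:
    """Hypothetical wiring of the four stages; none of these names come from the repo."""
    transcribe: Callable[[bytes], str]   # audio -> text (Whisper in the real project)
    plan: Callable[[str], str]           # text -> script (LangChain agent plus LLM)
    execute: Callable[[str], str]        # script -> result text
    speak: Callable[[str], None]         # result text -> audio (gTTS)

    def handle(self, audio: bytes) -> str:
        prompt = self.transcribe(audio)
        script = self.plan(prompt)
        result = self.execute(script)
        self.speak(result)
        return result


# Stand-in stages so the sketch runs without API keys or audio hardware.
spoken = []
pipeline = AssistantPipeline(
    transcribe=lambda audio: audio.decode("utf-8"),
    plan=lambda prompt: f"SCRIPT({prompt})",
    execute=lambda script: f"ran {script}",
    speak=spoken.append,
)
print(pipeline.handle(b"What is 5 * 5?"))
```

Keeping each stage behind a plain callable makes it possible to swap one stage (for example, the text-to-speech backend) without touching the others.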

Original Project

Features

  • Voice Recognition: Listens to the user's voice commands and transcribes them to text.
  • LangChain Chaining: 'Chunks' user input into a new question that can be solved with tools and agents.
  • AI Commands: Uses an LLM to segment voice commands into separate prompts, which are then executed by agent.py.
  • Text-to-Speech: Converts the assistant's text responses to speech and plays them aloud.
  • Reminders: Creates reminders in the Reminders app.
  • To-do list: Creates to-do lists in the Notes app.
  • Internet Statistics: Checks internet speed and ping.
  • Internet History: Checks previous internet results.
  • Weather: Gets several weather metrics for a chosen city.
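One way to picture how separate features such as reminders and to-do lists can sit behind a single agent is a small tool registry. The sketch below is an assumption about the general pattern, not a description of agent.py: the decorator, the tool names, and the handler functions are hypothetical, and the handlers only return strings instead of touching any app.

```python
from typing import Callable, Dict

# Hypothetical registry mapping a tool name to the function that handles it.
TOOLS: Dict[str, Callable[[str], str]] = {}


def tool(name: str):
    """Register a function under the given tool name."""
    def register(func: Callable[[str], str]) -> Callable[[str], str]:
        TOOLS[name] = func
        return func
    return register


@tool("reminder")
def create_reminder(text: str) -> str:
    # A real handler would talk to the Reminders app; this one only reports.
    return f"Reminder created: {text}"


@tool("todo")
def add_todo(text: str) -> str:
    # A real handler would write to the Notes app; this one only reports.
    return f"Added to to-do list: {text}"


def dispatch(tool_name: str, argument: str) -> str:
    """Run the named tool, or report that no such tool is registered."""
    handler = TOOLS.get(tool_name)
    if handler is None:
        return f"No tool named {tool_name!r}"
    return handler(argument)


print(dispatch("reminder", "buy milk"))
```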

Usage

  1. Download the repository folder or run git clone https://github.com/conqueror62821/VoiceGPT
  2. Run python setup.py install in a terminal
  3. In the .env file, add your OpenAI API key: OPENAI_API_KEY=your_key_here
  4. Open a terminal and run python3 main.py
  5. Choose from the available prompts or talk to ChatGPT
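Step 3 relies on a .env file of KEY=value lines. As a rough illustration of what reading such a file involves, here is a standard-library-only sketch. The project may well use a dedicated package for this; the function names parse_env and require_key and the sample key value are hypothetical.

```python
def parse_env(text: str) -> dict:
    """Parse KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def require_key(env: dict, name: str) -> str:
    """Return the key, or fail early if it is absent or still the placeholder."""
    value = env.get(name, "")
    if not value or value == "your_key_here":
        raise RuntimeError(f"{name} is missing or still set to the placeholder")
    return value


# Sample file contents; the key value here is made up for the example.
sample = "OPENAI_API_KEY=sk-example\n# optional\nOPENWEATHER_API_KEY=your_key_here\n"
env = parse_env(sample)
print(require_key(env, "OPENAI_API_KEY"))
```

Failing early on the placeholder value gives a clearer error than a rejected API request later on.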

Additional Features:

OpenWeatherMap API:

  • Create an OpenWeatherMap account and generate an API key
  • In .env, add the API key:
  • OPENWEATHER_API_KEY=your_key_here
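For orientation, a request to OpenWeatherMap's current-weather endpoint with that key might look like the sketch below. It is an assumption about how the weather feature could call the API, not the project's code: the helper names and the choice of metric units are hypothetical, and fetch_weather is defined but not called here because it needs a real key and network access.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "https://api.openweathermap.org/data/2.5/weather"


def build_weather_url(city: str, api_key: str, units: str = "metric") -> str:
    """Build the current-weather request URL for a city name."""
    query = urlencode({"q": city, "appid": api_key, "units": units})
    return f"{BASE_URL}?{query}"


def summarize(payload: dict) -> str:
    """Pick a few metrics out of a current-weather JSON response."""
    main = payload["main"]
    description = payload["weather"][0]["description"]
    return (
        f"{payload['name']}: {description}, "
        f"temperature {main['temp']}, humidity {main['humidity']}%"
    )


def fetch_weather(city: str, api_key: str) -> dict:
    """Perform the HTTP request; requires a valid key and network access."""
    with urlopen(build_weather_url(city, api_key), timeout=10) as response:
        return json.load(response)


print(build_weather_url("New York", "your_key_here"))
```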

Example Prompts:

  • Find the result of a calculation. Prompt: "What is 5 * 5?" -> It will write AppleScript to open a calculator and type in 5 * 5.
  • Find restaurants nearby. Prompt: "Find restaurants near me" -> It will open a Google search, read the text on the page, and read out the best restaurants.
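To make the calculator example concrete, the sketch below shows roughly what a generated AppleScript and its invocation through macOS's osascript command could look like. The script text is a guess at plausible LLM output, not captured from the project, and the function names are hypothetical. run_applescript only works on macOS, and keystroke automation generally needs Accessibility permission, so the example prints the command instead of running it.

```python
import subprocess


def calculator_script(expression: str) -> str:
    """Build an AppleScript that opens Calculator and types the expression."""
    keys = expression.replace(" ", "") + "="
    # Escape backslashes and double quotes for an AppleScript string literal.
    escaped = keys.replace("\\", "\\\\").replace('"', '\\"')
    return (
        'tell application "Calculator" to activate\n'
        "delay 1\n"
        f'tell application "System Events" to keystroke "{escaped}"'
    )


def osascript_command(script: str) -> list:
    """Return the argv list that passes the script to osascript via -e."""
    return ["osascript", "-e", script]


def run_applescript(script: str) -> str:
    """Execute the script and return its stdout (macOS only)."""
    completed = subprocess.run(
        osascript_command(script), capture_output=True, text=True, check=True
    )
    return completed.stdout.strip()


command = osascript_command(calculator_script("5 * 5"))
print(command[2])
```

Passing the script as a separate argv element, instead of building a shell string, avoids shell quoting problems when the generated script contains quotes.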

Video Example

Screen.Recording.2023-06-25.at.3.43.27.pm.mov

Contribution

Feel free to fork this project, make some changes, and submit a pull request. All contributions are welcome!

TODO:

  • Integrate file system tools on macOS
  • Create more AppleScripts

License

This project is licensed under the MIT License - see the LICENSE file for details.
