Code Monkey home page Code Monkey logo

whisper-paste's Introduction

Voice-to-Text (almost) anywhere in Chrome using OpenAI Whisper API ๐Ÿš€

Just click ctrl+alt+r to start/stop recording and the extension will use OpenAI Whisper API to transcribe your voice and paste the transcription at the current caret position. ๐ŸŽ‰

This repository is a fork of https://github.com/Ordinath/Whisper_to_ChatGPT
I don't know much about web development, and I've spent only a few hours on this extension, so it might still be a little buggy and slow, but mostly it works!

โœจ Features

  • ๐ŸŽค Record and transcribe your voice on-the-fly using OpenAI's Whisper API
  • โŒจ Configurable keyboard shortcut to quickly start/stop the recording.
  • ๐Ÿ”ง Customize the prompt for better API voice recognition results
  • ๐Ÿ’ฌ Support for multiple Whisper API prompts for versatile transcription contexts
  • ๐ŸŒ Implicit translation support for transcribing and translating your input to English
  • ๐Ÿ’พ Download your transcriptions as sound files for further use
  • ๐ŸŒ Use the extension with main inputs on chat.openai.com and edit-inputs (Not tested in this fork).
  • ๐Ÿ“Œ Snippets feature (in beta) for quickly pasting frequently used text in the ChatGPT text area (Not tested in this fork).

๐Ÿ”ง How to Build and Run Locally

To run the extension locally in your Chrome browser, follow these steps:

  1. Download or clone the repository from GitHub: git clone https://github.com/redocrepus/Whisper-Paste.git
  2. Install the dependencies by running npm install in the project folder (I've verify that it works with Node.JS v16.5.0, but it also might work with later versions. It did not work for me with the latest version.)
  3. Run npm run build to build the app for production to the build folder
  4. Open Google Chrome and navigate to chrome://extensions
  5. Enable "Developer mode" by toggling the switch in the top-right corner
  6. Click on "Load unpacked" button and select the build folder created in step 3
  7. The extension should now appear in your list of installed extensions
  8. Click the extension's button (microphone) to configure.

๐Ÿ”‘ API Key Disclaimer

This extension requires an OpenAI account with a valid API key to function properly. OpenAI provides a small amount of free credits for all accounts, which is more than enough to use the Whisper API in ChatGPT and enjoy the extension's features.

๐Ÿ“ฃ Feedback and Contributions

Feel free to open issues, submit pull requests, or just reach out for any reason.

Enjoy dictating!

๐Ÿ› ๏ธ Development

This project was bootstrapped with Create React App.

Available Scripts

  1. npm start : Runs the app in development mode, open http://localhost:3000 to view it in your browser
  2. npm run build : Builds the app for production to the build folder For more information, refer to the Create React App documentation and React documentation.

whisper-paste's People

Contributors

ordinath avatar redocrepus avatar

Stargazers

Alireza Sheikholmolouki avatar AbuAbdhullah avatar  avatar Nick Dijkstra avatar

Forkers

alireza29675

whisper-paste's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.