Code Monkey home page Code Monkey logo

iphone-speech-to-text's Introduction

Speech-to-Text library used by Astrid for iPhone!

This library leverages the Google Voice API and the Speex audio codec to achieve speech-to-text on iOS devices where Siri is not available. The library includes version 1.2-rc1 of the Speex codec and three supporting classes to make speech-to-text on iOS as simple as possible. There are a few steps required to configure the project in Xcode:

  1. Clone the project

  2. Add the SpeechToText Xcode project to your project

  3. In your project's settings, update the target's build phases by adding SpeechToText to the "Target Dependencies" section and libSpeechToText.a under the "Link Binary with Libraries" section.

  4. Under the target's Build Settings tab, add the iPhone-Speech-to-Text directory (i.e. the path on your hard drive to wherever it exists) to the "Header Search Paths" field, and click the box marked "Recursive"

  5. Import "SpeechToTextModule.h" wherever you want to use it!

The SpeechToTextModule class is the core of the library, but is very simple. An instance of this class can start and stop audio recording, and will pass an NSData * object back to its delegate when it receives a response from the Google Voice API. This data should be interpreted as JSON and can be decoded by your favorite JSON parsing library. The resulting JSON object should contain an array under the key "hypotheses", and each entry in this array will be a possible interpretation of the audio with the key "utterance" for the text and the key "confidence" for the confidence.

Optional delegate methods for SpeechToTextModuleDelegate allow you to display a rolling sine wave view that indicates volume amplitude as recording takes place (showSineWaveView and dismissSineWaveView) or display an optional loading view while the module communicates with the voice API.

Contributors workflow

If you want to help make this library better, read this!

Setup:

git clone [email protected]:your-github-id/iPhone-Speech-to-Text.git (your-github-id should obviously be replaced)

git remote add upstream [email protected]:todoroo/iPhone-Speech-to-Text.git

Working on new features/fixes:

git checkout -b my-new-features upstream/master

work, work, work!

git commit (a separate commit for each bug fix, feature change, style or copy edit please!)

git fetch upstream

git rebase -i upstream/master (i like to rebase -i to verify what I'm committing and squish small commits)

git push origin HEAD

then go to github and submit a pull request!

For further information, read Tim's Collaborator Guide.

iphone-speech-to-text's People

Contributors

sbosley avatar bearpelican avatar jonparis avatar

Stargazers

Ravindranath Akila avatar

Watchers

James Cloos avatar Ravindranath Akila avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.