Code Monkey home page Code Monkey logo

mscogntivebs's Introduction

MSCogntiveBS

Inspiration

As useful as videos are, sometimes it would be much easier if there were a condensed version of it in text. Video or audio files can be lengthy and often contain superfluous information. Imagine being a student and recording video lectures for your classes. Wouldn't it be nice to have something automatically summarize the key points for you? SpeechToSummary attempts to achieve this goal.

What it does SpeechToSummary takes in speech or an audio file as input, converts it to text using Microsoft's Bing Speech API, then runs a ranking algorithm on the sentences to determine the most relevant content. It then presents a summary of the topics presented in the audio.

How we built it

We used Microsoft's cognitive services. We also implemented some basic heuristics to rank sentences by keyphrases and cues. The program is written in Javascript.

Challenges we ran into

Figuring out the API, getting the speech to text API to separate by sentences, implementing file upload of a .wav audio files.

Accomplishments that we're proud of Taking care of some optimizations in space and time complexity while figuring out API's, and...the fact that it actually somewhat works.

What we learned

It's actually really easy to use Microsoft's API's once you're set up.

What's next for SpeechToSummary

Future improvements on SpeechToSummary include increasing the maximum file upload size, improving summarization algorithm (look into Lexrank algorithm), and porting the app to different devices (mobile, especially). We would also look into other ways to interact with the information, such as with a bot, and ways to allow the user to customize key topics the program should search for.

mscogntivebs's People

Contributors

billgao0807 avatar reych avatar emmayang7 avatar

Watchers

 avatar Jeffrey Vaudrin-McLean avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.