Code Monkey home page Code Monkey logo

yolo-slave's Introduction

Machine Learning to stream and detect laptop GUI

Based on my recent lessons with Jeff Heaton on Deep Learning, I attempted to extend the power of Yolov3 from detecting pre-defined objects, from still images / videos to computer desktop display. This tool is more versatile as users can pull up different items on screen, browser various webpages, and see the object detection acting in live environment. My work is inspired and revised from this lesson.

I have to note that at my workstation setup, I have my laptop and x2 slave screens. Therefore it is easy enough for me to open up various items on one screen, while another OpenCV pop-up window displays on a second screen where Yolov3 object detection takes place.

Yolov3 in action when watching YouTube video...

ODS stickers

... or flicking through my 360 photos taken from Venice...

ODS stickers

... or Google search on the best pet

ODS stickers

It is only the first attempt of extending Yolov3's application, and the main intention comes from my workplace inspiration that when I need to explain an idea to my audience, it will help elaborate much easier with the use of Yolo's object detection functionality, as Yolo essentially supplement my presentation with on-screen annotations.

Major drawback so far is the serious display lag, guess it will be better with higher CPU PC. Regardless, it is still fun enough to put this tool into practice. Next step I envision is to train customized objects (e.g. by transfer learning) so that Yolo is able to pick up additional customized items other than the pre-defined ones.

Feel free to comment and let me know what you think. I have also linked this repository to my Medium blog, as I believe sharing of experience is vital to keep the ML community vibrant.

Feel free to fork and enjoy! :)

yolo-slave's People

Contributors

lafilletterevolution avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.