Code Monkey home page Code Monkey logo

maximzaika / imbfaceandagerecognition Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 1.36 MB

IBM VisualRecognition has been depreciated by IBM. Backend server that clients can connect to and upload imagesthat contain person's face. The server then sends it to IBM cloud to do the processing work to identify number of people, their age, and then send the X,Y coordiates back, where backend Highlights the face and states their age. Then it sends another request to create text-to-speech audio file. All this is then visual to a client.

License: MIT License

JavaScript 89.32% CSS 3.54% HTML 7.15%

imbfaceandagerecognition's Introduction

Depreciation Notice

  • IBM has depreciated Visual Recognition, hence there is no video showing the way this server functions. It is advised not to use this code, though, part of it can be reused. All the API keys are no longer functional.

IBM Cloud Watson Facial Image Analysis Using Cloud Services (Facial Recognition + Text to Speech)

In this project, server takes as input an image which is uploaded by a client through a user-friendly webpage. The server then uses a machine learning techniques (IBM Watson Cloud) to get data about the faces in the uploaded image and send it back to the client. Data received from IBM Cloud server includes number of faces detected on the image, an average age, and a gender of a person on the image. After that the system converts the data into a line of text, which then is passed back to the IBM Cloud's text-to-speech conversion algorithm. IBM Cloud returns the audio file back to the server allowing the user can play the audio from the webpage.

System Architecture

  • Added uploader to allow users upload images to our server
  • Added a user-friendly UI
  • Added UI animations
  • Enabled communication with IBM Cloud via API Keys
  • Server sends the image to IBM Cloud
  • Server receives processed data from IBM Cloud (number of faces, min/max age, gender)
  • Server converts min/max age to average age
  • Server converts number of faces, min/max age, and gender to a line of text
  • Server sends the line of text to the IBM Cloud's speech-to-text conversion algorithm
  • Server receives the audio file from the IBM Cloud
  • Server draws the rectangular box around all the faces
  • Server displays converted line of text with the information about the number of people, number of faces, average age, and a gender
  • Server displays the original image, and the image with rectangular shape around each face
  • Server displays the play/stop audio button

Languages & Libraries

  • Languages: HTML, CSS, JavaScript
  • ExpressJS, SocketIO, SocketIO-Stream, SocketIO-File-Upload, Watson-Developer-Cloud

Class Diagram

img.png

Examples

img.png

img.png

Installation

  1. Install node.js
  2. Run npm install
  3. Run node server.js
  4. Access via the url: http://localhost:3000

imbfaceandagerecognition's People

Contributors

maximzaika avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.