albanjoseph / signapse
Signapse is an open source software tool for helping everyday people learn sign language for free!
License: GNU General Public License v3.0
License: GNU General Public License v3.0
I've just realised that shooting a "launch video" for the product is actually in the mark scheme and seems to be worth a reasonable number of marks. We should definitely do this, since the product works and we can easily put something together quickly.
There are example videos on 4 github projects from last year:
Documentation has again fallen behind the developed source code. Another documentation pass is required to bring everything up to date.
Static analysers provide an easy way to detect memory leaks, lint code and find other issues in the codebase. Tools like clang-tidy and cppcheck can easily be added to the code-base and run as a GitHub Action. This would be a worthwhile addition.
It might be good to have a tests badge in the readme; we sometimes see them on other GitHub projects. A badge gives an immediate indication of whether the latest integration passed or failed our unit tests, and shows we are using a continuous integration process, which builds confidence in our code-base.
At the moment the hand signs are mirrored (flipped about the vertical axis) to allow the user to see themselves as they would in a mirror. This actually teaches users to make the WRONG SIGNS. Remove this feature.
We should build integration tests covering linting, compiling and unit tests. An integration machine could be set up to continuously evaluate our software and prepare for subsequent releases.
While we are confident the program handles memory leakage and multithreading responsibly, some further checks should be performed to ensure security.
Another check through the OOP design should be performed to make sure classes are SOLID and adhere to the Liskov substitution principle.
The GUI could be improved a bit more, some ideas:
To allow an object classifier to be used in place of a detection model, the localisation of the sign in the camera frame must be pre-determined. For this, an area within the frame should be selected, with the bounding box drawn on-screen. This is a precursor step to populating the CNNProcessor class with a classification network for sign identification.
A localisation bounding box should be selected and shown as feedback to the user (at present using OpenCV's imshow() function). CNNProcessor requires functionality to crop the frame around this bbox for further processing. These features should be built on the hand-crop branch and merged to dev before work on training the CNN may begin.
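A minimal sketch of the clamping step behind the crop, using a stand-in Rect type so it is self-contained (in the real code OpenCV's cv::Rect operator& performs the same clamping, and frame(bbox) then takes the region of interest; the clampToFrame name is illustrative, not from the codebase):

```cpp
#include <algorithm>

// Stand-in for cv::Rect; Signapse would use the real OpenCV type.
struct Rect {
    int x, y, width, height;
};

// Clamp a user-selected bounding box to the frame bounds so the crop
// never reads outside the image. With OpenCV this is
//   cv::Rect clamped = bbox & cv::Rect(0, 0, frame.cols, frame.rows);
//   cv::Mat cropped = frame(clamped);
Rect clampToFrame(const Rect& bbox, int frameWidth, int frameHeight) {
    int x0 = std::max(bbox.x, 0);
    int y0 = std::max(bbox.y, 0);
    int x1 = std::min(bbox.x + bbox.width, frameWidth);
    int y1 = std::min(bbox.y + bbox.height, frameHeight);
    return {x0, y0, std::max(x1 - x0, 0), std::max(y1 - y0, 0)};
}
```

The clamping matters because a user can drag a selection partly off-screen; cropping with an unclamped rectangle would read out of bounds.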
To perform sign detection, we require a CNN classifier network trained on images of various hand signs. This task comprises training a CNN and interfacing with the network via an executable runtime.
Data:
A diverse source of data is required. Sign Language MNIST on Kaggle seems like a good place to start. Rather than the full alphabet, it would perhaps be safer to begin by down-sampling the number of classes so that only 4 or 5 signs are distinguished.
https://www.kaggle.com/datamunge/sign-language-mnist
Network Architecture:
Some work could be carried out in selecting a suitable network architecture for our task. The main trade-off when selecting a classifier is execution latency (on RPi and x86 machines) versus accuracy at the output (top-1, top-5 or another metric). Possible architectures include MobileNet (v1, v2 or v3) or a larger network like InceptionV3.
Runtime/interface:
Various neural-network runtimes could facilitate execution within the app. The first port of call would be to try out OpenCV's DNN module, as OpenCV packages are already installed by our app. TensorFlow also offers the TFLite runtime, which may accelerate execution. Invariably, network execution should be built into the CNNProcessor object. Results from processed frames should be added to the Scene struct and passed along to output. Work may also be required to analyse the latency of this stage and scale down the number of network executions to achieve appropriate throughput through CNNProcessor.
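One simple way to scale down the number of network executions is a frame-skip throttle: run the CNN only on every Nth frame and reuse the last result in between. A minimal sketch (the InferenceThrottle name and the interval of 3 are illustrative assumptions, not taken from the codebase):

```cpp
// Decides which frames go through the (expensive) CNN so the rest of
// the pipeline keeps its frame rate; intermediate frames would reuse
// the last Scene result.
class InferenceThrottle {
public:
    explicit InferenceThrottle(int interval) : interval_(interval) {}

    // Returns true when the current frame should be run through the network.
    bool shouldRun() { return frameCount_++ % interval_ == 0; }

private:
    int interval_;
    int frameCount_ = 0;
};
```

The interval could later be tuned from measured per-frame latency rather than fixed at compile time.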
At present the app is accessible from the terminal only; in the future we would like a GUI to help users interact with the system. An initial task should be to make a wireframe of the GUI (on paper or otherwise) and then build this wireframe using QtDesigner.
Guides on how to use and install QtDesigner are available:
https://doc.qt.io/qt-5/qtdesigner-manual.html
Currently, the Git repo has no description, which looks a bit off. It would be good to add a description there and some tags for search engine optimisation.
As the code-base has matured, it's now time to refactor and clean up some of the mess we've left. This includes: gui/, images/ and models/ could all go into an assets/ directory, as is standard (I think).

We must get some wiki entries in to ensure we get the marks we deserve for the course! From reading the mark scheme, here are some suggested wiki entries:
At present, the Python code used to generate our deep learning model is kept offline on my machine. This code is still a part of Signapse and should be included in our application repo. Some effort is required to package up the code, including the training and evaluation scripts used, and add it to Signapse. The goal is that the network may be generated "from scratch" using only the materials provided in the repo.
Numerical graphs and results can also be added to our Wiki, we want to show off our work on network training!
To synchronise between threads, modifications to the underlying queue data structure are required. A blocking queue should be created which suspends the current thread while there is no element available to deliver. This can be implemented as shown in:
https://stackoverflow.com/questions/12805041/c-equivalent-to-javas-blockingqueue
The new-queue branch should implement our blocking queue and integrate it with the Reel object.
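A minimal sketch of such a blocking queue using a mutex and condition variable, along the lines of the Stack Overflow answer linked above (a sketch, not the final Signapse implementation):

```cpp
#include <condition_variable>
#include <mutex>
#include <queue>

// A queue whose pop() blocks the calling thread until an element is
// available, so a consumer thread can simply loop on pop().
template <typename T>
class BlockingQueue {
public:
    void push(T value) {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            queue_.push(std::move(value));
        }
        cond_.notify_one();  // wake one waiting consumer
    }

    T pop() {
        std::unique_lock<std::mutex> lock(mutex_);
        // Wait releases the lock while sleeping; the predicate guards
        // against spurious wake-ups.
        cond_.wait(lock, [this] { return !queue_.empty(); });
        T value = std::move(queue_.front());
        queue_.pop();
        return value;
    }

private:
    std::queue<T> queue_;
    std::mutex mutex_;
    std::condition_variable cond_;
};
```

The Reel object could then hold a BlockingQueue of Scene elements, with the producer thread pushing frames and the consumer popping them without busy-waiting.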
Building OpenCV from source is the only known way to access an up-to-date version of the library from Signapse. Unfortunately, this library is pretty huge, resulting in lengthy compilation times. This is a direct obstruction to getting users up and running with the application and should be remedied.
One idea is to pre-build binaries for realistic deployment scenarios and host them online. Deb packages could be used for this purpose, installed similarly to apt packages. To start, we should have x86 and ARM deb package variants for OpenCV 4.5.X+.
Advances in the code-base are currently undocumented. We should add Doxygen documentation to all created classes and functionality.
Documentation will lag advances in the code-base. For this reason, a docs branch should be made from dev at commit 74631ccd8b93da8b4ca26b5673dfbe0d41a393d5. Continuous review will see documentation built for subsequent advances in the code.
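A sketch of the Doxygen comment style we could adopt; the Scene fields and the hasDetection helper shown here are illustrative assumptions, not taken from the codebase:

```cpp
/// @brief Holds a single captured frame plus any results attached to it
///        as it moves through the processing pipeline.
///
/// Illustrative example of the proposed Doxygen style; the fields shown
/// here are hypothetical.
struct Scene {
    int frameId;   ///< Monotonic index of the captured frame.
    bool handSeen; ///< True when the detector located a hand sign.
};

/// @brief Returns true when the scene contains a usable detection.
/// @param scene The scene to inspect.
/// @return Whether a hand sign was found in this frame.
bool hasDetection(const Scene& scene) { return scene.handSeen; }
```

Running `doxygen` over headers commented like this would generate browsable class and function docs with no extra effort per release.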
Reel has a protected member variable frameQueue. This should be renamed sceneQueue for clarity, as it is actually a queue of scenes.
After some discussion with the course convenors, the existing video processing pipeline has been shown not to fit the assessment requirements for the course, as asynchronous callback routines have been neglected in favour of an execution-blocking pipeline.
The architecture can be made more elegant, and objects better encapsulated, if callback routines are devised for the camera and CNN processing elements. This should be completed for the next release of Signapse.
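A minimal sketch of the callback style, assuming a hypothetical onFrame registration on the camera element (the names Camera, onFrame and emitFrame are illustrative, not from the codebase):

```cpp
#include <functional>
#include <utility>
#include <vector>

// Camera element that pushes frames to subscribers instead of blocking
// the pipeline; the CNN processing element would register a callback here.
class Camera {
public:
    using FrameCallback = std::function<void(int /*frameId*/)>;

    // Subscribe to new frames; called once per interested component.
    void onFrame(FrameCallback cb) { callbacks_.push_back(std::move(cb)); }

    // In the real app this would be driven by the capture loop;
    // here it is invoked manually for illustration.
    void emitFrame(int frameId) {
        for (auto& cb : callbacks_) cb(frameId);
    }

private:
    std::vector<FrameCallback> callbacks_;
};
```

With this shape, the camera no longer needs to know about downstream consumers, which is the encapsulation gain the issue describes.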
A GUI for Signapse has now been wireframed and designed in QtDesigner. Remaining effort is required to integrate our GUI into Signapse, add the required functionality, test the user experience and evaluate the "real time responsiveness of the application" as per point 3 on the RTEP mark scheme.
Assignees are currently @albanjoseph and @charger4241, but all hands are likely to be required to complete this before milestone v2 (6th April).
The first release of Signapse is soon. Let's make some buzz about it on social media. The social channels linked from the readme can be updated with hype material before and after our release. Maybe some screencaps and a demo of our system would help drive home the message that we have a really great product so far!