Code Monkey home page Code Monkey logo

emg100's Introduction

A Benchmark Dataset for Electromyography-based Silent Speech Recognition

This repository contains dataset employed in A Simplified Adversarial Architecture for Cross-Subject Silent Speech Recognition Using Electromyography. The details and original provenance of the dataset come from another paper by our team, entitled A Benchmark Dataset for Audio-Visual-Electromyography-based Multi-modal Speech Recognition.

Dataset Description

Data Acquisition

A surface Electromyography (sEMG) acquisition system is constructed, through which a 6-channel sEMG signal is obtained by deploying six pairs of acquisition electrodes and a reference electrode on one side of the face and neck area.

These signals are acquired to demonstrate the physiological activities of six muscles, including mentalis, risorius, levator labii superioris, anterior belly of the digastric, mylohyoid, and platysma.

MusclesDistribution

Corpus

The corpus is mainly designed for elderly and disability assistance scenarios in the light of Maslow's hierarchy of needs, covering 5 kinds of basic needs in daily life.

  • It is a collection of 100 short sentences in Chinese Mandarin together with an empty one containing no articulation movement as a reference for active muscle activity.
  • Each sentence consists of 3 to 5 Chinese characters.

The corpus will be released synchronously when the dataset paper is officially published.

Data Size

The dataset contains 99,688 utterances from 100 speakers in 101 categories, lasting more than 55 hours.

Pre-released Data

We have released the sEMG silent speech dataset in our experiments here.

This dataset is for study and research use only, and any other commercial use is strictly prohibited. If you need to use it, please contact us to obtain the data extraction code. After signing the data usage application and providing the necessary applicant information, you will be able to access the complete data.

An official citation will be updated here subsequently.

Relevant Paper on the Dataset

Another work by our team entitled A Benchmark Dataset for Audio-Visual-Electromyography-based Multi-modal Speech Recognition is undergoing peer review. The non-peer-reviewed version is released here first, and citation and access will be updated once the paper is accepted.

emg100's People

Contributors

halofame avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.