AsoSoft Speech Corpus

Speech Recognition for Kurdish

AsoSoft is the first company to work in the field of speech recognition for the Kurdish language. We develop speech recognition, speaker recognition, and speech command software and tools for Kurdish using artificial intelligence and signal processing. Kurdish speech data and its related resources, such as tags, are among the most important language resources required for NLP research and for applications such as automatic speech recognition and speaker recognition. In this project, speech data for Central Kurdish was designed and collected so that it can be used in automatic speech recognition, speaker recognition, phonology research, dialect analysis, and similar tasks. So far, approximately 30 hours of speech have been recorded and transcribed to produce this corpus.

A subset of the AsoSoft Speech Corpus is available for download for research and non-commercial use. This subset can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition, gender identification, and phonetic analysis. It includes 45 speakers, each of whom uttered the same 72 sentences: the first 70 sentences of the AsoSoft Speech Corpus (sentence 1 to sentence 70) and the last two sentences (sentence 699 and sentence 700). Each of the last two sentences covers all Central Kurdish phonemes. The original version of the dataset contains 700 sentences for each speaker. The sentences were manually designed to represent the phonetic characteristics of Central Kurdish. The dataset was recorded during 2016.
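The composition of the subset described above can be written down as a quick sanity check when verifying a download. This is a minimal sketch: the function names are hypothetical, and the total assumes exactly one recording per speaker per sentence with none missing, which the description implies but does not state outright.

```python
NUM_SPEAKERS = 45  # speakers in the public subset, as stated above


def subset_sentence_ids() -> list[int]:
    """Sentence IDs in the subset: 1 to 70, plus 699 and 700."""
    return list(range(1, 71)) + [699, 700]


def expected_recording_count(num_speakers: int = NUM_SPEAKERS) -> int:
    # Assumption: one recording per speaker per sentence, none missing.
    return num_speakers * len(subset_sentence_ids())


if __name__ == "__main__":
    print(len(subset_sentence_ids()), "sentences per speaker")
    print(expected_recording_count(), "recordings expected in total")
```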

Metadata

Speaker information is given in a table with the following fields:

  • Microphone type: USB/Philips/Jack/Laptop
  • Noise level
  • Gender (this label can be used for gender identification tasks): Female/Male
  • Dialect/city (this label may be used for phonetic analysis of the various dialects of Kurdish and for dialect identification tasks)
  • Age
  • Education
  • Length (total and average)

Files

Each recording in the dataset comes with three files:

  • .wav: wave file recorded at 22.05 kHz, 16-bit, mono
  • .wrd: transcription in the Kurdish alphabet
  • .phn: phonetic transcription in ASCII format
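A downloaded recording can be checked against the audio format listed above, and its companion files located, with Python's standard library. This is only a sketch: the function names are hypothetical, and it assumes the .wrd and .phn files sit next to the .wav file under the same base name.

```python
import wave
from pathlib import Path

# Format stated in the file list above: 22.05 kHz, 16-bit, mono.
EXPECTED_RATE_HZ = 22050
EXPECTED_SAMPLE_WIDTH_BYTES = 2
EXPECTED_CHANNELS = 1


def wav_matches_spec(wav_path) -> bool:
    """Return True if the WAV header matches the documented format."""
    with wave.open(str(wav_path), "rb") as w:
        return (
            w.getframerate() == EXPECTED_RATE_HZ
            and w.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and w.getnchannels() == EXPECTED_CHANNELS
        )


def sibling_files(wav_path) -> dict:
    """Paths of the three files for one recording.

    Assumption: all three share a base name and directory.
    """
    p = Path(wav_path)
    return {"wav": p, "wrd": p.with_suffix(".wrd"), "phn": p.with_suffix(".phn")}
```

Only the header is inspected here; the audio samples themselves are not read or validated.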

The file name format is as follows:
SpeakerID(3digits) + Gender + RecordingDevice(Laptop/PC/Mobile) + Mic + SentenceID(3digits)
For example, 001MLU001:
SpeakerID=001, Gender=Male, RecordingDevice=Laptop, Mic=USB, and SentenceID=001
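The naming scheme above can be split into its fields with a small parser. This sketch assumes each of Gender, RecordingDevice, and Mic is a single letter, as in the example. Only the letter codes confirmed by that example (M, L, U) are expanded; any other letter is returned unchanged, since its meaning is not documented here. The names `RecordingName` and `parse_recording_name` are hypothetical.

```python
import os
import re
from typing import NamedTuple

# Assumption: one letter per middle field, as in the example 001MLU001.
_NAME_RE = re.compile(r"^(\d{3})([A-Za-z])([A-Za-z])([A-Za-z])(\d{3})$")

# Only codes confirmed by the example are expanded.
_GENDER = {"M": "Male"}
_DEVICE = {"L": "Laptop"}
_MIC = {"U": "USB"}


class RecordingName(NamedTuple):
    speaker_id: str
    gender: str
    device: str
    mic: str
    sentence_id: str


def parse_recording_name(name: str) -> RecordingName:
    """Parse a name such as '001MLU001' or 'path/to/001MLU001.wav'."""
    stem = os.path.splitext(os.path.basename(name))[0]
    match = _NAME_RE.match(stem)
    if match is None:
        raise ValueError(f"unexpected file name format: {name!r}")
    speaker, gender, device, mic, sentence = match.groups()
    return RecordingName(
        speaker_id=speaker,
        gender=_GENDER.get(gender, gender),  # unknown codes pass through
        device=_DEVICE.get(device, device),
        mic=_MIC.get(mic, mic),
        sentence_id=sentence,
    )


if __name__ == "__main__":
    print(parse_recording_name("001MLU001.wav"))
```

Speaker and sentence IDs are kept as strings so that leading zeros survive; convert with `int()` where numeric comparison is needed.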

Cite

If you are using this corpus, please cite the following reference:

@article{veisi2021Jira,
  title={Jira: a Kurdish Speech Recognition System Designing and Building Speech Corpus and Pronunciation Lexicon},
  author={Veisi, Hadi and Hosseini, Hawre and MohammadAmini, Mohammad and  Fathy, Wirya and Mahmudi, Aso},
  journal={arXiv preprint arXiv:2102.07412},
  year={2021}
}
