Contact Me

  • Keybase BTC
  • Keybase ZEC
-----BEGIN PGP PUBLIC KEY BLOCK-----
mDMEXxkkHBYJKwYBBAHaRw8BAQdAdeo+g+9bHyb7EIxR8krZ07oJWrxLhFDjxbS4
RdsMwYy0LERlbmRpIFN1aHViZHkgKEp1bHkyMDIwKSA8ZGVuZGlAYml0d3lyZS5j
b20+iJYEExYIAD4WIQTn2TSCfZagLRA+qnpfneJ/eb668wUCXxkkHAIbAwUJACeN
AAULCQgHAgYVCgkICwIEFgIDAQIeAQIXgAAKCRBfneJ/eb6686DjAQDBQGgOe/aF
LLmiiua3ysS40E2hGA62NQCJPBNDopN12wD/aunCeRy81ojMDA19OsSAHBGgSwzk
D7XEQqwYiMIcjwO4OARfGSQcEgorBgEEAZdVAQUBAQdAwfuAPhSh3N29/ph7Bx8b
I4a5s7BFMGMmixrtyYkZ2A4DAQgHiH4EGBYIACYWIQTn2TSCfZagLRA+qnpfneJ/
eb668wUCXxkkHAIbDAUJACeNAAAKCRBfneJ/eb6682DAAQCu4ifk/JdaAJC45BfV
ynd/yqa4BezPySNqFmeYunq53QEAiU9M+hZgwZpx5UKRTCf9ULiVK4tN+66GcSO0
2EvLFQk=
=zH/r
-----END PGP PUBLIC KEY BLOCK-----

Dendi Suhubdy's Projects

speech-to-text-wavenet

Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow

speech_signal_processing_and_classification

Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a prerequisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interested in voice disorder classification, that is, in developing two-class classifiers that can discriminate between utterances of a subject suffering from, say, vocal fold paralysis and utterances of a healthy subject. The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitude of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned, so to speak, traditional features will be tested against agnostic features extracted by convolutive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers, K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachusetts Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources, such as KALDI, will be used toward achieving our goal. Comparisons will be made against [6-8].
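
To make the all-pole (LPC) idea in the description above concrete, here is a minimal sketch of linear prediction for a single frame, using the autocorrelation method with the Levinson-Durbin recursion. It is not taken from this repository: the function name `lpc_autocorrelation`, the model order, and the synthetic test signal are assumptions chosen only for illustration.

```python
import numpy as np


def lpc_autocorrelation(frame, order):
    """Hypothetical helper: estimate A(z) = 1 + a1*z^-1 + ... + ap*z^-p
    for one frame with the autocorrelation method (Levinson-Durbin)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])

    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]  # prediction error energy of the order-0 model
    for i in range(1, order + 1):
        if err <= 0:  # degenerate (e.g. all-zero) frame: stop early
            break
        # Reflection coefficient for model order i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        # Update the lower-order coefficients, then append the new one.
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


# Synthetic all-pole signal (an assumption for the demo, not speech data):
# x[n] = 1.3*x[n-1] - 0.6*x[n-2] + e[n], i.e. A(z) = 1 - 1.3 z^-1 + 0.6 z^-2.
rng = np.random.default_rng(0)
excitation = rng.standard_normal(4000)
x = np.zeros_like(excitation)
for t in range(2, len(x)):
    x[t] = 1.3 * x[t - 1] - 0.6 * x[t - 2] + excitation[t]

a, err = lpc_autocorrelation(x, order=2)
print(a)  # should be close to [1, -1.3, 0.6], up to estimation noise
```

In a real front end the frame would typically be a windowed segment of a few tens of milliseconds of speech, and the coefficients would be converted to cepstral features before classification; those steps are left out of this sketch.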

speech_synthesis

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

speechsplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

spendkey

Recover Monero address using the private spend key

spleeter

Deezer source separation library including pretrained models.

split

Unopinionated utilities for resizeable split views

sploit-tools

My own tools for easing the task of pentesting / exploit writing

spsc_queue

A highly optimized single producer single consumer message queue C++ template

spscqueue

A bounded single-producer single-consumer wait-free and lock-free queue written in C++11

sptk

A modified version of Speech Signal Processing Toolkit (SPTK)

sqlite_orm

❤️ SQLite ORM light header only library for modern C++

srcnn-pytorch

PyTorch implementation of Image Super-Resolution Using Deep Convolutional Networks (ECCV 2014)

srec

PyTorch Implementation of "Lossless Image Compression through Super-Resolution"

srslte

Open source SDR LTE software suite from Software Radio Systems (SRS)

sse-popcount

SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
