Code Monkey home page Code Monkey logo

GitHub Stats Card

Top Languages Card

Repositories I would like to share

It is to introduce our model, Style-Restricted GAN.

It is to introduce our paper regarding the application of Activation Maximization for audio-domain data.

It contains the lessons I created for Gonsalves AI laboratory.

It is to introduce our model, Latent Conditional GAN.

It contains the implementation of the website for text-to-speech synthesis.

It contains 2D marker detection using convolutional layers and pooling layers.

It contains the program to determine how to split bill with your friends.

Sho Inoue's Projects

beaqlejs icon beaqlejs

*BeaqleJS* provides a framework to create browser based listening tests and is purely based on open web standards like HTML5 and Javascript.

dbviz icon dbviz

The official PyTorch implementation - Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective (CVPR'22).

emotion2vec icon emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

fastspeech2 icon fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

gst-tacotron icon gst-tacotron

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

hifi-gan icon hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

matcha-tts icon matcha-tts

[ICASSP 2024] ๐Ÿต Matcha-TTS: A fast TTS architecture with conditional flow matching

qmk_firmware icon qmk_firmware

Open-source keyboard firmware for Atmel AVR and Arm USB families

research_blog icon research_blog

ๅฃฐใƒ•ใ‚งใƒ้‡Ž้ƒŽใฎ้Ÿณๅฃฐ็”Ÿๆˆ้Œฒ(https://shinshoji01.hatenablog.com/) ใง็ดนไป‹ใ—ใฆใ‚‹ใ‚ฝใƒผใ‚นใ‚ณใƒผใƒ‰

seq2seq-evc icon seq2seq-evc

This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage sequence-to-sequence training.

speech-backbones icon speech-backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

speechgpt icon speechgpt

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.