Jaeyong Kang's Projects
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
Official implementation of Paper Future Frame Prediction for Anomaly Detection -- A New Baseline, CVPR 2018
List of articles related to deep learning applied to music
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Segmentation Guided Thoracic Classification
CNN architectures, training procedures, and evaluations for prediction of diseases on the ChestX-ray14 dataset.
Convolutional Neural Network (CNN) for modeling user interests
Simulation data generator for complex biosensor DB
The official code repository for examples in the O'Reilly book 'Generative Deep Learning'
a list of demo websites for automatic music generation research
Must-read papers on graph neural networks (GNN)
An exploration of how generative text-to-music AI models can be used for emotion guidance
Homepage
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
This repo is for Miccai monuseg challenge, by Jimmy from aetherAI
Simple collection of MIR datasets with metadata and links
A toolkit for symbolic music generation
test
News media articles crawler using python
Audio processing by using pytorch 1D convolution network
Noisy2Clean: Novel Speech Denoising Framework using Diffusion Model
Useful Toolbox for Anomaly Detection
Sentiment analysis of tweets
Semeval17
Modeling user interests using labeled dataset
User Interest modeling using Wikipedia and News Media