Hi there 👋

I'm AlexandaJerry, a postgraduate student studying phonetics.

  • 🔭 I’m currently working on speech-related research.
  • 🌱 I’m currently learning Python programming and deep learning.
  • 👯 I’m interested in Praat scripting and R visualization.

Alexanda's Projects

annotated_deep_learning_paper_implementations

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

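auto_labeling_for_bert_vits2

This project handles data preprocessing. The first step processes the collected audio, using Funasr timestamps to remove empty background segments. It also covers the labels prepared before feeding the data to BERT.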

automatic_speech_annotator

An automatic speech annotator that processes speech with voice activity detection, overlapping speech detection, speaker diarization, and automatic speech recognition.

charsiu

Charsiu: A neural phonetic aligner.

chatpaper

Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.

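chenyme-aavt-

A fully automatic (audio) video translation project. It uses Whisper to recognize speech, a large AI model to translate the subtitles, and then merges the subtitles with the video to produce the translated video.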

cross-modal-bert

CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis (MM2020)

dataset_generator_for_vits

A VITS dataset generation tool that converts video into short audio clips, based on Damo Academy video cutting technology.

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

emotivoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

ft-w2v2-ser

Official implementation for the paper "Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition"

gmkvextractguiflatpak

📦 Flatpak Package of gMKVExtractGUI, a small GUI utility to extract tracks, chapters and CUE sheets from mkv files

gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

label-studio

Label Studio is a multi-type data labeling and annotation tool with a standardized output format
