Code Monkey home page Code Monkey logo

Thodoris Kouzelis's Projects

ast icon ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

audioldm icon audioldm

AudioLDM: Generate speech, sound effects, music and beyond, with text.

diffusers icon diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

disfluentfa icon disfluentfa

A Weakly Supervised Forced Alignment for disluent speech

dreamsound icon dreamsound

Code for Investigating Personalization Methods in Text to Music Generation

kaldi icon kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

local_pnp icon local_pnp

This repo contains experiments for local editing in Diffusion Models

sail_align icon sail_align

SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends on freely available software, namely HTK, srilm and sclite.

secretsanta icon secretsanta

Host secret santa without leaking your guests' informations 🎄

seffcaps icon seffcaps

Automated Audio Captioning of Sound Effects in Movies and Videos

wavetransformer icon wavetransformer

Code base for WaveTransformer: A novel architecture for automated audio captioning

wsac icon wsac

This reporsitory code form Weakly Supervised Automaed Audio Captioning via Text Only Training

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.