Code Monkey home page Code Monkey logo

endimionzf's Projects

bark icon bark

πŸ”Š Text-Prompted Generative Audio Model

comfyui icon comfyui

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

deepvideoanalytics icon deepvideoanalytics

Analyze videos, perform detections, index frames & detected objects, search by examples

demucs icon demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

ffmprovisr icon ffmprovisr

Repository of useful FFmpeg commands for archivists!

flowframes icon flowframes

Flowframes Windows GUI for video interpolation using DAIN (NCNN) or RIFE (CUDA/NCNN)

ltu icon ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

machinevideoeditor icon machinevideoeditor

This repository does not contain code, its purpose it for issue tracking and wiki

photonix icon photonix

A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms.

pyannote-audio icon pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pyannote-core icon pyannote-core

Advanced data structures for handling temporal segments with attached labels.

pyscenedetect icon pyscenedetect

:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.

rvc-studio icon rvc-studio

The best looking and most functional webui for RVC related tasks. See website for UI demo:

segment-anything icon segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

stash-box icon stash-box

Stash App's own OpenSource video indexing and Perceptual Hashing MetaData API

storytoolkitai icon storytoolkitai

An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models

text-generation-webui icon text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

tinydiarize icon tinydiarize

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

tts icon tts

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

waas icon waas

Whisper as a Service (GUI and API with queuing for OpenAI Whisper)

whisper-at icon whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

whisper-timestamped icon whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

whisperx icon whisperx

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

willow-inference-server icon willow-inference-server

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.