Roger Condori's Projects
adetailer for Diffusers
A ready-to-use curated list of Spectral Indices for Remote Sensing applications.
Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (LLM) technology.
MLOps packaging: build and push to GitHub Container Registry
Next generation face swapper and enhancer
Generative Agents: Interactive Simulacra of Human Behavior
Python wrapper for fast inference with GPT-SoVITS
hf-demo
Select elements within an image and generate captions for those elements
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.
This repository serves as a collection of various models
Instant voice cloning by MyShell.
C++ library for converting text to phonemes for Piper
Portfolio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Riffusion is a project that enables audio style transfer using pre-trained models. This repository contains the code and resources needed to perform audio style transfer and generate impressive results.
A widgets-based interactive notebook for SD
Synchronized Translation for Videos. Video dubbing
A colab gradio web UI for running Large Language Models
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)