Kunal Suri's Projects
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
Awesome lists about all kinds of interesting topics
Curated list of AI-powered developer tools.
autoupdate paper list
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Awesome list for ChatGPT, an artificial intelligence chatbot developed by OpenAI
[CVPR2024] Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
[modelscope.cn] An initiative to replicate Sora
[CVPR 2024] Breathing Life Into Sketches Using Text-to-Video Priors
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Text-Prompted Generative Audio Model
a list of demo websites for automatic music generation research
A collection of awesome GPT4 vision use cases
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Awesome-LLM: a curated list of Large Language Models
This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Create Magic Story!
NEW - YOLOv8 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
A curated list of awesome open source workflow engines
An integrated modeling solution for BPMN, DMN and Forms based on bpmn.io.
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and powerful programmatic image processing application.
Pre-made chains/templates for chaiNNer. PyTorch, NCNN, and ONNX included.
The most powerful and modular stable diffusion GUI, API and backend with a graph/nodes interface.