tonghengcheng,Hay Kim,github

advancedliteratemachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.

animatediff

Official implementation of AnimateDiff.

animatelcm

AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!

aniportrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

anytext

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

anyv2v

A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

awesome-video-diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

brushnet

The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

cinemo

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

colossalai

Making large AI models cheaper, faster and more accessible

consistentid

Customized ID Consistent for human

consisti2v

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

controlnet_plus_plus

Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

ctrl-adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

diff-harmonization

A novel zero-shot image harmonization method based on Diffusion Model Prior.

diffsynth-studio

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

docscanner

The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”.

dough

Dough is a open source tool for steering AI animations with precision.

dragnuwa

图像编辑

dynamicrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

easyanimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

echomimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

facefusion

Next generation face swapper and enhancer

fresco

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

git_test

git 命令测试

gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

hunyuandit

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

tonghengcheng Goto Github PK

Hay Kim's Projects

Recommend Projects

Recommend Topics

Recommend Org