showlab / awesome-video-diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

awesome diffusion-models video-editing video-generation video-understanding video-restoration text-to-motion text-to-video

awesome-video-diffusion's Introduction

Awesome Video Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, NeRF, etc.

(Source: Make-A-Video, Tune-A-Video, and Fate/Zero.)

Open-source Toolboxes and Foundation Models
Evaluation Benchmarks and Metrics
Video Generation
Controllable Video Generation
Long Video/Film Generation
Video Generation with Physical Prior/3D
Video Editing
Long-form Video Generation and Completion
Human or Subject Motion
AI Safety for Video Generation
Video Enhancement and Restoration
Audio Synthesis for Video
Human Feedback for Video Generation
Policy Learning with Video Generation
3D / NeRF
World Model
Video Understanding
Healthcare and Biology

Open-source Toolboxes and Foundation Models


awesome-video-diffusion's Issues

Add ICCV 2023 paper

Hi! Could you please include our work StableVideo at ICCV 2023? Thank you so much!

StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai, Xun Guo, Gaoang Wang, Yan Lu
ICCV 2023
https://arxiv.org/abs/2308.09592

New drag papers

DragVideo: Interactive Drag-style Video Editing (arXiv:2312.02216)

Drag-A-Video: Non-rigid Video Editing with Point-based Interaction (arXiv:2312.02936)

Update information about DiffusionRet

Congrats on the impressive work.

Our DiffusionRet has been accepted by ICCV 2023. The code is available at Code.

Could you please update the information on our paper?

question

Hi, is there any open-source model that can take videos of the same theme as input and generate variations of them in the same style or theme?

Year 2024 now!

Happy new year!
Your dates for new papers are wrong XD

Could these 11 new papers be added?

https://arxiv.org/abs/2311.14294 - "Decouple Content and Motion for Conditional Image-to-Video Generation"

https://arxiv.org/abs/2311.15813 - "FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax"

https://arxiv.org/abs/2311.16635 - "MotionZero: Exploiting Motion Priors for Zero-shot Text-to-Video Generation"

https://arxiv.org/abs/2311.16933 - "SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models"

https://arxiv.org/abs/2311.17009 - "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer"

https://arxiv.org/abs/2311.17338 - "VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model"

https://arxiv.org/abs/2311.17536 - "Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning"

https://arxiv.org/abs/2311.18830 - "MotionEditor: Editing Video Motion via Content-Aware Diffusion"

https://arxiv.org/abs/2311.18834 - "ART⋅V: Auto-Regressive Text-to-Video Generation with Diffusion Models"

https://arxiv.org/abs/2311.18827 - "Motion-Conditioned Image Animation for Video Editing"

https://arxiv.org/abs/2311.18829 - "MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation"

Can you add these video diffusion papers?

https://arxiv.org/abs/2308.09710 - "SimDA: Simple Diffusion Adapter for Efficient Video Generation"
https://arxiv.org/abs/2309.03549 - "Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation"
https://arxiv.org/abs/2309.15103 - "LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models"

Why not add latest research?

It has been at least a month without any update...

Wrong arXiv link for [3D-VLA: A 3D Vision-Language-Action Generative World Model]

Hi! Would you like to build a dataset section?

FYI, we have open-sourced a text-to-video dataset named ECTV. You may take a look at:

Dataset: https://huggingface.co/datasets/RaphaelLiu/EvalCrafter_T2V_Dataset
Paper: https://arxiv.org/abs/2310.11440
GitHub: https://github.com/evalcrafter/EvalCrafter

Many thanks for your awesome list!!!

Would you consider building a benchmark/evaluation section?

Please refer to EvalCrafter for more information. Thanks!

Webpage: https://evalcrafter.github.io/
GitHub: https://github.com/evalcrafter/EvalCrafter

Many thanks for your awesome list!!!

Update information about rerender-a-video

old

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation (Jun., 2023)

updated

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation (SIGGRAPH Asia 2023)

Update information about TI2V-Zero

Hi, @zhangjiewu, thanks a lot for your impressive work!

Our TI2V-Zero has been accepted to CVPR 2024, and our code is also available at this link. Could you update the information on our paper?

Thanks again for your time!

What about adding this project repo?

Open-Sora: Democratizing Efficient Video Production for All

https://github.com/hpcaitech/Open-Sora?tab=readme-ov-file

It seems the community is pretty active and vibrant.

You have an AnimateAnyone paper listed under the title of the MagicAnimate paper.

Is the Align Your Gaussians paper allowed here?

https://arxiv.org/abs/2312.13763 - 4D generation

Are flow matching based papers allowed here?

Hi! Thanks for the great list!

Are these papers allowed to be added here? They are based on flow matching which is quite similar in nature to diffusion.

https://arxiv.org/abs/2211.14575 - Efficient Video Prediction via Sparsely Conditioned Flow Matching
https://arxiv.org/abs/2306.03988 - Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation

Website update

Hi Jay. Thanks for listing our work, DynamiCrafter, in the repo! Since we have released our code and launched a website for the project, could you please help to update them in the paper list? Many thanks for considering my request.

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Code: https://github.com/Doubiiu/DynamiCrafter
Website: https://doubiiu.github.io/projects/DynamiCrafter/

