Code Monkey home page Code Monkey logo

awesome-video-diffusion's Introduction

Awesome Video Diffusion Awesome

A curated list of recent diffusion models for video generation, editing, restoration, understanding, nerf, etc.

(Source: Make-A-Video, Tune-A-Video, and Fate/Zero.)

Table of Contents

Open-source Toolboxes and Foundation Models

Evaluation Benchmarks and Metrics

Video Generation

Controllable Video Generation

Video Generation with Physical Prior/3D

Video Editing

Long-form Video Generation and Completion

Human or Subject Motion

AI Safety for Video Generation

Video Enhancement and Restoration

Audio Synthesis for Video

Human Feedback for Video Generation

Policy Learning with Video Generation

3D / NeRF

World Model

Video Understanding

Healthcare and Biology

awesome-video-diffusion's People

Contributors

aulaywang avatar boximator avatar finspire13 avatar fradino avatar ground-a-video avatar hyeonho99 avatar jacobyuan7 avatar jeremycjm avatar jianhongbai avatar jinga-lala avatar junhaozhang98 avatar knightyxp avatar langmanbusi avatar leoooo333 avatar myniuuu avatar nihaomiao avatar ninjasaid2k avatar olliacc avatar ruizhaocv avatar scofield7419 avatar shoufachen avatar shyuanbest avatar vinesmsuic avatar weijiawu avatar xumingw avatar yrcong avatar zhangjiewu avatar zhuhz22 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-video-diffusion's Issues

Add ICCV 2023 paper

Hi! Could you please include our work StableVideo at ICCV 2023? Thank you so much!

StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai, Xun Guo, Gaoang Wang, Yan Lu
ICCV 2023
https://arxiv.org/abs/2308.09592

drag新论文

DragVideo- Interactive Drag-style Video Editing 2312.02216

Drag-A-Video- Non-rigid Video Editing with Point-based Interaction 2312.02936

question

hi, is there any open source model that can take videos of the same theme as input, and generates variations of it in the same style or theme?

Year 2024 now!

Hppay new year!
Your date for new papers are wrong XD

11 new Papers be added?

https://arxiv.org/abs/2311.14294 - "Decouple Content and Motion for Conditional Image-to-Video Generation"

https://arxiv.org/abs/2311.15813 - "FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax"

https://arxiv.org/abs/2311.16635 - "MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation"

https://arxiv.org/abs/2311.16933 - "SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models"

https://arxiv.org/abs/2311.17009 - "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer"

https://arxiv.org/abs/2311.17338 - "VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model"

https://arxiv.org/abs/2311.17536 - "Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning"

https://arxiv.org/abs/2311.18830 - "MotionEditor: Editing Video Motion via Content-Aware Diffusion"

https://arxiv.org/abs/2311.18834 - "ART⋅V: Auto-Regressive Text-to-Video Generation with Diffusion Models"

https://arxiv.org/abs/2311.18827 - "Motion-Conditioned Image Animation for Video Editing"

https://arxiv.org/abs/2311.18829 - "MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation"

Update information about TI2V-Zero

Hi, @zhangjiewu, thanks a lot for your impressive work!

Our TI2V-Zero has been accepted to CVPR 2024, and our code is also available at this link. Could you update the information on our paper?

Thanks again for your time!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.