mayuema's Projects
[ACM MM 2022 Oral] AKU: This repo is the official implementation of "Visual Knowledge Graph for Human Action Reasoning in Videos"
Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"
Official PyTorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
Deep Learning Interview Handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing, SLAM, and related areas)
PyTorch implementation of "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
[arXiv 2023] MagicStick: This repo is the official implementation of "MagicStick: Controllable Video Editing via Control Handle Transformations"
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos"
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
[AAAI 2024] Implementation of the paper "M-BEV: Masked BEV Perception for Robust Autonomous Driving"
Just instruction
My HomePage
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
[ACM MM 2022] Multi-Modal Experience Inspired AI Creation
PyCIL: A Python Toolbox for Class-Incremental Learning
Course guide for the Department of Computer Science and Technology, Tsinghua University
[ICLR 2022] Official implementation of UniFormer
A Toolkit for Text-to-Video Generation and Editing
[NeurIPS 2022] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training