Code Monkey home page Code Monkey logo

👋 Hi, I am Hangjie Yuan (袁杭杰 in Chinese). I am currently a Ph.D. candidate from Zhejiang University supervised by Prof. Dong Ni, and a long-term research intern at Alibaba DAMO Academy. I am undertaking a visiting Ph.D. program at MMLab@NTU, supervised by Prof. Ziwei Liu. Additionally, I am supervised by Prof. Samuel Albanie (the University of Cambridge) and Dr. Shiwei Zhang (Alibaba DAMO Academy).

While conducting research, I prioritize humanity above all else. Therefore, the ultimate goal of my research is to prioritize human well-being.

Check out some of my cool projects: InstructVideo, VideoComposer, and the RLIP series (RLIP & RLIPv2).

Wanna keep up with my adventures? Click on over to my personal page for all the latest and greatest.

Hangjie Yuan's Projects

-jacobyuan7.github.io icon -jacobyuan7.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

anydoor icon anydoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

awesome-video-diffusion icon awesome-video-diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

dab-detr icon dab-detr

[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

diffusers icon diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

din-group-activity-recognition-benchmark icon din-group-activity-recognition-benchmark

[ICCV 2021] A new codebase containing various methods for Group Activity Recognition. Paper title: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.

dn-detr icon dn-detr

[CVPR 2022 Oral]Official implementation of DN-DETR

erd icon erd

Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation

natspeech icon natspeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

ocn-hoi-benchmark icon ocn-hoi-benchmark

[AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.

prodiff icon prodiff

PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline

readme icon readme

README文件语法解读,即Github Flavored Markdown语法介绍

rlip icon rlip

[NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Graph Generation.

rlipv2 icon rlipv2

[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training

scene-graph-benchmark.pytorch icon scene-graph-benchmark.pytorch

A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”

vgen icon vgen

Official repo for I2VGen-XL: High-Quality Image-to-Video Synthesis Via Cascaded Diffusion Models

videocrafter icon videocrafter

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.