Code Monkey home page Code Monkey logo

Hi there 👋

  • 🔭 Yuhui Yuan is currently a senior researcher at the Visual Computing Group of Microsoft Research Asia. He completed his Ph.D., M.S., and B.S. degrees from the Institute of Computing Technology, CAS, Peking University, and Nanjing University in 2022, 2017, and 2014, respectively. Currently, he is leading efforts on (i) developing generative AI technologies to help ship multiple products to Microsoft Designer and (ii) developing the next-generation graphic design engine for high-quality business content generation (e.g., posters, flyers, infographics, diagram, chart, and slides). His recent representative works include LISA for reasoning segmentation (CVPR’2024), COLE for multi-layered and editable graphic design generation, Glyph-ByT5 for accurate visual text rendering, and SPO for human preference learning of diffusion models.

  • 🔭 He has rich experience in areas such as general visual semantic/instance/panoptic segmentation and object recognition since joining MSRA in 2017. His representative works on segmentation and object detection include OCRNet (ECCV’2020), OCNet (IJCV’2021), and H-DETR (CVPR’2023).

  • 🔭 Please send an email to [email protected] or [email protected] if you are interested in an internship position related to business content creation and editing or multimodal reasoning and planning.

Researcher.YuanYuhui's Projects

18303 icon 18303

18.303 - Linear PDEs course

18330 icon 18330

18.330 Introduction to Numerical Analysis

18335 icon 18335

18.335 - Introduction to Numerical Methods course

18s191 icon 18s191

Course 18.S191 at MIT, Spring 2021 - Introduction to computational thinking with Julia:

3detr icon 3detr

Code & Models for 3DETR - an End-to-end transformer model for 3D object detection

3dfuture_ins_seg icon 3dfuture_ins_seg

1st Place Solutions of 3D AI Challenge 2020(IJCAI-PRICAI 2020 Workshop) - Instance Segmentation Track

ab3dmot icon ab3dmot

Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics", IROS 2020, ECCVW 2020

absphreak icon absphreak

:octocat: Hello there! This repository is for the welcome message on my Github Profile.

adaptive-segmentation-mask-attack icon adaptive-segmentation-mask-attack

Pre-trained model, code, and materials from the paper "Impact of Adversarial Examples on Deep Learning Models for Biomedical Image Segmentation" (MICCAI 2019).

avatarme icon avatarme

Public repository for the CVPR paper AvatarMe

awesome-rnn icon awesome-rnn

Recurrent Neural Network - A curated list of resources dedicated to RNN

axial-deeplab icon axial-deeplab

This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight)

baike_scrapy icon baike_scrapy

implement a spider to crawl baidu baike based on scrapy ..

basicimagetool icon basicimagetool

implement operate on 2 or more images, including resize\ rotate\ move operation

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.