Light

huang-yh / gaussianformer Goto Github PK

View Code? Open in Web Editor NEW

176.0 20.0 7.0 273.93 MB

[ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

gaussianformer's Introduction

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

Paper | Project Page

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

Yuanhui Huang, Wenzhao Zheng$\dagger$, Yunpeng Zhang, Jie Zhou, Jiwen Lu$\ddagger$

$\dagger$ Project leader $\ddagger$ Corresponding author

GaussianFormer proposes the 3D semantic Gaussians as a more efficient object-centric representation for driving scenes compared with 3D occupancy.

News

[2024/05/28] Paper released on arXiv.
[2024/05/28] Demo release.

Demo

Overview

Considering the universal approximating ability of Gaussian mixture, we propose an object-centric 3D semantic Gaussian representation to describe the fine-grained structure of 3D scenes without the use of dense grids. We propose a GaussianFormer model consisting of sparse convolution and cross-attention to efficiently transform 2D images into 3D Gaussian representations. To generate dense 3D occupancy, we design a Gaussian-to-voxel splatting module that can be efficiently implemented with CUDA. With comparable performance, our GaussianFormer reduces memory consumption of existing 3D occupancy prediction methods by 75.2% - 82.2%.

Getting Started

Code coming soon~

Related Projects

Our work is inspired by these excellent open-sourced repos: TPVFormer PointOcc SelfOcc SurroundOcc OccFormer BEVFormer

Citation

If you find this project helpful, please consider citing the following paper:

@article{huang2024gaussian,
    title={GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction},
    author={Huang, Yuanhui and Zheng, Wenzhao and Zhang, Yunpeng and Zhou, Jie and Lu, Jiwen},
    journal={arXiv preprint arXiv:2405.17429},
    year={2024}
}

gaussianformer's People

Contributors

Stargazers

Watchers

Forkers

andrewjsong whuhxb 0iui0 huijie-liu zhangzw12319 jasper-sudo-sun lvchuandong

gaussianformer's Issues

Code Released

Hopefully the code will be released soon, thanks.

How long will it take for the code to be released?

How long will it take for the code to be released?

When will the code to be released?

Thanks for your excellent work. When will we get the code ?

Code release

Could you release your great code for the future??

Thank you for your excellent research!

Self-encoding

Hello. Thank you for your excellent work. I would like to know how you handle gaussians that fall on the same grid during sparse convolution.

Code Release Time

Hello, , thank you very much to your team for open-sourcing such an excellent project. May I ask when the code will be released?

when will code to be released?

Thanks for your greate work, and hope the code to be relseased for study.

Rendered RGB images and semantic maps

Hi, I noticed that the Tab.6 of the paper (arxiv version) mentioned that the photometric loss doesn't improve the performance. However, I am wondering how the rendered RGB and semantic maps look like. Would you provide some visualization results?

Thanks!

Gaussian-to-Voxel Splatting

Thanks for your good job! Can you release code related to part of Gaussian-to-Voxel Splatting?

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.