Code Monkey home page Code Monkey logo

zubair-irshad / neo-360 Goto Github PK

View Code? Open in Web Editor NEW
221.0 24.0 9.0 116.91 MB

Pytorch code for ICCV'23 paper. NEO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes

Home Page: https://zubair-irshad.github.io/projects/neo360.html

License: Other

Python 100.00%
3d 3d-vision artificial-intelligence autonomous-driving autonomous-vehicles computer-vision convolutional-neural-networks deep-learning differentiable-rendering implicit-neural-representation nerf neural-fields neural-networks neural-radiance-fields neural-rendering novel-view-synthesis reconstruction residual-networks

neo-360's Issues

Real world results from KITTI-360 evaluation

Hello,

Thank you for your work on the paper and your code publication. I wish to ask if there is already a pre-trained model for KITTI-360 or a pipeline setup for it. I have been currently working on setting up this dataset to see occupancy (density fields) results from the input view setup in the dataset. As I could see the experiment result for KITTI-360 in the paper Fig. 14 in Additional Qualitive results, I hope to see if the pre-trained model is available.

Would you share some insights regarding the training setup for KITTI-360? :)

Thank you!

Cameras in ground or objects

Hello !

Just to let you and future user know that sometimes (very rarely) a camera may be partially in the ground or an object (e.g., a trash bin).

Example with PDMultiObjv6/train/SF_VanNessAveAndTurkSt8/train/instance_segmentation_2d/midsize_sedan_04_emg_01-006.png

midsize_sedan_04_emg_01

Segmentation classes

Hey,

Where can I find the correspondency between semantic indices and classes ? It looks like 5 corresponds to 'cars', what about the others ?
Thank you in advance,

Code Release of Neo-360

Hi! This is a great work! I'm recently investigating efficient generalizable NeRF. My research is highly related to NeO-360. I really appreciate this work and would like to do my research based on it! When do you plan to release the whole code?

How to handle multi view images ?

Thanks for your jobs!,
i have a question, in the paper, a single image can generate three planes, and when infer, the fr(residual feat) can get from train image, but when i have multi view images, how to use multi view images generate the three planes , and when i want to genenrate a novel view, how to get fr ?

Why Tri-plane matters instead of 3D feature grid?

Hello, Neo-360 is a good paper. But I still have a question. If Tri-Plane is removed from your pipeline and the 3D sample point is projected into the 3D feature grid directly without 3DCNN and tri-plane, will the performance of Neo-360 degrade? In your ablation studies, I see the importance of your 3D feature grid but not the tri-plane. Could you explain why tri-plane matters?

Dataset License?

Congrats on the remarkable work. The repo mentions MIT License. Is the dataset also published under the same?

Intrinsics missing

Dear authors,

Thank you very much for sharing your dataset.
It seems to me that the intrinsics are missing ?

Axes convention

Hi again !

Could you share the axes convention you used for the world and the cameras ?

For example:
- the x-axis is pointing to the camera's right
- the y-axis is pointing to the camera's forward
- the z-axis is pointing to the camera's upward

Have a good day !

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.