This repository contains the PyTorch implementation of the CVPR 2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection.
Issue Description:
Hi there, thank you for the excellent code release. However, I've run into a problem: the multi_scale_deformable_attn_function module referenced by the middle encoders is missing from the repository. It should provide MultiScaleDeformableAttnFunction_fp32 and ms_deform_attn_core_pytorch, which are required to run the code and reproduce the results reported in the paper. Without this module, the codebase cannot be used as released.
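For anyone blocked on this: ms_deform_attn_core_pytorch is the well-known pure-PyTorch fallback for multi-scale deformable attention from the Deformable DETR codebase (MultiScaleDeformableAttnFunction_fp32 additionally needs the compiled CUDA extension, which this sketch does not replace). Below is my transcription of that fallback, not the authors' missing file, so treat it as a stopgap:

```python
import torch
import torch.nn.functional as F


def ms_deform_attn_core_pytorch(value, value_spatial_shapes,
                                sampling_locations, attention_weights):
    """Pure-PyTorch multi-scale deformable attention (slow debug/CPU fallback).

    value:                (N, S, M, D)  flattened multi-level features, S = sum(H*W)
    value_spatial_shapes: list of (H, W), one per feature level
    sampling_locations:   (N, Lq, M, L, P, 2) in [0, 1]
    attention_weights:    (N, Lq, M, L, P), normalized over the L*P samples
    returns:              (N, Lq, M*D)
    """
    N, S, M, D = value.shape
    _, Lq, M, L, P, _ = sampling_locations.shape
    # split the flattened value tensor back into per-level feature maps
    value_list = value.split([H * W for H, W in value_spatial_shapes], dim=1)
    # map [0, 1] sampling locations to grid_sample's [-1, 1] coordinate range
    sampling_grids = 2 * sampling_locations - 1
    sampling_value_list = []
    for lid, (H, W) in enumerate(value_spatial_shapes):
        # (N, H*W, M, D) -> (N*M, D, H, W)
        value_l = value_list[lid].flatten(2).transpose(1, 2).reshape(N * M, D, H, W)
        # (N, Lq, M, P, 2) -> (N*M, Lq, P, 2)
        grid_l = sampling_grids[:, :, :, lid].transpose(1, 2).flatten(0, 1)
        # bilinear sampling at the P points per query: (N*M, D, Lq, P)
        sampling_value_list.append(
            F.grid_sample(value_l, grid_l, mode='bilinear',
                          padding_mode='zeros', align_corners=False))
    # (N, Lq, M, L, P) -> (N*M, 1, Lq, L*P)
    attention_weights = attention_weights.transpose(1, 2).reshape(N * M, 1, Lq, L * P)
    # weighted sum over all levels and points, then restore (N, Lq, M*D)
    output = (torch.stack(sampling_value_list, dim=-2).flatten(-2)
              * attention_weights).sum(-1).view(N, M * D, Lq)
    return output.transpose(1, 2).contiguous()
```

Dropping this into a multi_scale_deformable_attn_function.py under the middle encoders should at least unblock CPU runs; matching the reported numbers still requires the fp32 CUDA kernel.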
Thank you very much for sharing your excellent work! I'd like to record some implementation details.
Some files appear to be missing from mmdet3d/models.
I've tried to supplement them with the corresponding files from the AutoAlignV2 project.
In mmdet3d/models/__init__.py, I had to delete line 20 (from .vtransforms import *) so the code would run.
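For clarity, the resulting edit looks like the excerpt below. The surrounding imports are typical mmdet3d entries shown only for context and may not match the actual file; the point is simply that the vtransforms line is removed:

```python
# mmdet3d/models/__init__.py (illustrative excerpt, not the verbatim file)
from .backbones import *  # noqa: F401,F403
from .builder import *  # noqa: F401,F403
from .detectors import *  # noqa: F401,F403
# from .vtransforms import *  # line 20: deleted, since the vtransforms
#                             # package is not present in this repository
```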
However, I still have some questions:
In the project, I can't find the Point-to-Grid Transformer module mentioned in your paper.
How did you obtain the baseline TransFusion checkpoint that achieves 65.1 mAP? Did you train it from scratch? The checkpoint you provided (IS-Fusion_epoch_10.pth) appears to contain final weights rather than a pre-trained model.
Sorry for taking up your time; I look forward to your reply.