Code Monkey home page Code Monkey logo

leod's Introduction

LEOD

This is the official Pytorch implementation for our CVPR 2024 paper:

LEOD: Label-Efficient Object Detection for Event Cameras
Ziyi Wu, Mathias Gehrig, Qing Lyu, Xudong Liu, Igor Gilitschenski
CVPR'24 | GitHub | arXiv

TL;DR

Event cameras are bio-inspired low-latency sensors, which hold great potentials for safety-critical applications such as object detection in self-driving. Due to the high temporal resolution (>1000 FPS) of event data, existing datasets are annotated at a low frame rate (e.g., 4 FPS). As a result, models are only trained on these annotated frames, leading to sub-optimal performance and slow convergence speed. In this paper, we tackle this problem from the perspective of weakly-/semi-supervised learning. We design a novel self-training framework that pseudo-labels unannotated events with reliable model predictions, which achieves SOTA performance on two largest detection benchmarks.

Install

This codebase builds upon RVT. Please refer to install.md for detailed instructions.

Experiments

This codebase is tailored to Slurm GPU clusters with preemption mechanism. There are some functions in the code (e.g. auto-detect and load previous checkpoints) which you might not need. Please go through all fields marked with TODO in train.py in case there is any conflict with your environment. To reproduce the results in the paper, please refer to benchmark.md.

Citation

Please cite our paper if you find it useful in your research:

@inproceedings{wu2024leod,
  title={LEOD: Label-Efficient Object Detection for Event Cameras},
  author={Wu, Ziyi and Gehrig, Mathias and Lyu, Qing and Liu, Xudong and Gilitschenski, Igor},
  booktitle={CVPR},
  year={2024}
}

Acknowledgement

We thank the authors of RVT, SORT, Soft Teacher, Unbiased Teacher and all the packages we use in this repo for opening source their wonderful works.

License

LEOD is released under the MIT License. See the LICENSE file for more details.

Contact

If you have any questions about the code, please contact Ziyi Wu [email protected]

leod's People

Contributors

wuziyi616 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

leod's Issues

The mp4 files is not getting generated or saved to ./vis/gen1_rnndet_small/pred/

Problem: The video .mp4 is not getting generated in given folder (/vis/gen1_rnndet_small/pred/) but code runs successfully.

after successful run of the visualisation code :

python vis_pred.py model=rnndet dataset=gen1 dataset.path=./datasets/gen1/ checkpoint="pretrained/Sec.4.2-WSOD_SSOD/gen1-WSOD/rvt-s-gen1x0.01_ss-final.ckpt" +experiment/gen1="small.yaml" model.postprocess.confidence_threshold=0.1 num_video=5 reverse=False

Using python-based detection evaluation
Set MaxViTRNN backbone (height, width) to (256, 320)
Set partition sizes: (8, 10)
Set num_classes=2 for detection head
Found 46 sequences in datasets/gen1/test
Using 1.25% of data --> 1 sequences
creating streaming test datasets: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 418.26it/s]
num_full_sequences=1
num_splits=0
num_split_sequences=0
Event sequence: 0it [00:00, ?it/s] [2024-04-02 20:47:06,734][py.warnings][WARNING] - /home/tushar/miniforge3/envs/leod/lib/python3.9/site-packages/torch/functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at /opt/conda/conda-bld/pytorch_1678402412426/work/aten/src/ATen/native/TensorShape.cpp:3483.)
return _VF.meshgrid(tensors, **kwargs) # type: ignore[attr-defined]

Predict: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00, 6.55it/s]
Plot: 2it [00:00, 258.37it/s]████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00, 6.38it/s]
Event sequence: 1it [00:01, 1.73s/it]
== Timing statistics ==

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.