
randbox's Introduction

Random Boxes Are Open-world Object Detectors (ICCV 2023)

RandBox is a novel and effective model for open-world object detection.

Random Boxes Are Open-world Object Detectors
Yanghao Wang, Zhongqi Yue, Xian-sheng Hua, Hanwang Zhang
arXiv 2307.08249

Updates

  • (07/2023) Code is released.

Models

| Task   | K-mAP | U-R  | WI     | A-OSE | Download |
|--------|-------|------|--------|-------|----------|
| Task 1 | 61.8  | 10.6 | 0.0240 | 4498  | model    |
| Task 2 | 45.3  | 6.3  | 0.0078 | 1880  | model    |
| Task 3 | 39.4  | 7.8  | 0.0054 | 1452  | model    |
| Task 4 | 35.4  | -    | -      | -     | model    |

Getting Started

Installation instructions and usage are in Getting Started with RandBox.

Citing RandBox

If you use RandBox in your research or wish to refer to the baseline results published here, please use the following BibTeX entry.

@InProceedings{Wang_2023_ICCV,
    author    = {Wang, Yanghao and Yue, Zhongqi and Hua, Xian-Sheng and Zhang, Hanwang},
    title     = {Random Boxes Are Open-world Object Detectors},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {6233-6243}
}

randbox's People

Contributors

scuwyh2000


randbox's Issues

Failure to cite existing OWOD work.

Dear authors,

Your paper overlooks several significant works in the Open World Object Detection (OWOD) field. Specifically, the following papers from this year's CVPR conference:

  1. "PROB: Probabilistic Objectness for Open World Object Detection"
  2. "Annealing-Based Label-Transfer Learning for Open World Object Detection"
  3. "CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection"

There are also other significant works published in 2022, such as "OPCL" (ICIP 2022) and "2B-OCD" (HCMA 2022).

It is essential to acknowledge and include existing OWOD work to ensure the accuracy and completeness of your research. Failure to do so undermines the credibility of your work and disregards the contributions of others in the field.

Sincerely,
Orr Zohar

infer

How long does inference take for a 512 × 512 image in real time? Can the model be converted to TensorRT?

Questions about re-implementation

Hello author, I ran experiments with your file configuration (completely consistent settings) and found that my results differ from those reported in the paper.

Task t1:
  • A-OSE is about 1500 higher
  • K-mAP is 4 points lower
  • U-R is 3 points lower

What could cause this?

Variables not found

Thanks for your work. However, when I ran your code, I found a problem:

Traceback (most recent call last):
  File "f:\documents_lry\detectron2\detectron2\engine\train_loop.py", line 155, in train
    self.run_step()
  File "f:\documents_lry\detectron2\detectron2\engine\defaults.py", line 496, in run_step
    self._trainer.run_step()
  File "f:\documents_lry\detectron2\detectron2\engine\train_loop.py", line 310, in run_step
    loss_dict = self.model(data)
  File "D:\Anaconda3\envs\detectron\lib\site-packages\torch\nn\modules\module.py", line 1102, in _call_impl
    return forward_call(*input, **kwargs)
  File "F:\Documents_lry\RandBox\randbox\detector.py", line 296, in forward
    targets, x_boxes = self.prepare_targets(gt_instances)
  File "F:\Documents_lry\RandBox\randbox\detector.py", line 350, in prepare_targets
    d_boxes = self.prepare_concat(gt_boxes)
  File "F:\Documents_lry\RandBox\randbox\detector.py", line 330, in prepare_concat
    x = (x_start * 2. - 1.) * self.scale
NameError: name 'x_start' is not defined

There is no 'x_start' in the function self.prepare_concat(gt_boxes); I could only find 'x_start' in the function model_predictions. Could you please fix this problem?
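Until the repository is patched, a guess at the intended shape of `prepare_concat` may help: bind the boxes to `x_start` before the scaling line from the traceback runs. Everything here except that scaling line (the padding scheme, the `[0, 1]` box convention, the `num_proposals` and `scale` defaults) is an assumption for illustration, not RandBox's actual code:

```python
import random

# Hypothetical reconstruction of prepare_concat: pad ground-truth boxes
# (assumed cx, cy, w, h in [0, 1]) to a fixed number of proposals, bind the
# result to x_start (the name the traceback reports as undefined), then apply
# the scaling from detector.py line 330.
def prepare_concat(gt_boxes, num_proposals=500, scale=2.0):
    boxes = [list(b) for b in gt_boxes]
    # Pad with random boxes until num_proposals entries exist.
    while len(boxes) < num_proposals:
        boxes.append([random.random() for _ in range(4)])
    x_start = boxes[:num_proposals]  # the assignment the traceback is missing
    # Map [0, 1] coordinates to [-scale, scale].
    return [[(v * 2.0 - 1.0) * scale for v in box] for box in x_start]
```

The only grounded part is the final transform, `x = (x_start * 2. - 1.) * self.scale`, copied from the traceback; the rest sketches one plausible way `x_start` could be produced.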

t2 t2_ft

What is the difference between t2 and t2_ft?

Replicating implementation doubts. Wilderness Impact metrics, training protocol...

Hello. First of all, thank you for your work and for sharing it!

I am implementing the evaluation process of OWOD (defined in PascalVOCDetectionEvaluator.evaluate). I have encountered several problems in correctly applying the benchmark to my own work. Here they are:

  • For evaluating every task and every metric (even WI), I assume you always evaluate against all_task_test.txt. Is that right?
  • For the Wilderness Impact, WI values are reported at different recall levels. Which one do you choose?
  • Are instances marked as "difficult" used in the metrics?
  • Does the number of T1 test instances reported in the image below from your paper refer only to the KNOWN classes, or does it account for both KNOWN and UNKNOWN objects?
    [image]
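For context on the second point, Wilderness Impact as defined in the ORE paper is the relative drop in known-class precision when unknown objects enter the test set, measured at a fixed recall level (0.8 is the commonly reported choice; whether RandBox uses the same level is the question asked above). A sketch of the arithmetic, using counter names in the spirit of the `tp_plus_fp_closed_set` / `fp_open_set` values that pascal_voc_evaluation.py's voc_eval returns:

```python
# Wilderness Impact (ORE definition):
#   WI = P_closed / P_open - 1
# where P_closed is known-class precision in a closed-set evaluation and
# P_open the precision when unknown objects are also present, both taken at
# the same recall level.
def wilderness_impact(tp, fp_closed, fp_open):
    p_closed = tp / (tp + fp_closed)          # precision, closed set
    p_open = tp / (tp + fp_closed + fp_open)  # precision, open set
    return p_closed / p_open - 1.0
```

Algebraically this reduces to `fp_open / (tp + fp_closed)`, i.e. unknowns detected as knowns divided by all known-class detections, which is why the evaluator only needs those two counters.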

Data Preparation

Hi, thanks for your great work!

I am confused about the data preparation. According to your Getting Started, the data come from MS COCO. However, in the ./split files there are some image ids that do not seem to come from COCO, such as 001299 or 2008_008613.

May I ask where they are from?

How to draw the Figure 2 in the paper?

Thank you for your great work. I am wondering how to draw Figure 2 of the paper, shown below:

[image]

I have the following questions about the implementation of Figure 2:

  1. Are all proposals shown in the figure, or only a subset?
  2. Are the proposals shown before or after NMS?

I hope the authors can give me some advice on reproducing this figure. Thanks a lot!

evaluation error

Hi, I got an error when running run_eval.sh. Can you give me some help?

Traceback (most recent call last):
  File "train_net.py", line 376, in <module>
    launch(
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/detectron2/engine/launch.py", line 67, in launch
    mp.spawn(
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 230, in spawn
    return start_processes(fn, args, nprocs, join, daemon, start_method='spawn')
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 188, in start_processes
    while not context.join():
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 150, in join
    raise ProcessRaisedException(msg, error_index, failed_process.pid)
torch.multiprocessing.spawn.ProcessRaisedException: 

-- Process 0 terminated with the following error:
Traceback (most recent call last):
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 59, in _wrap
    fn(i, *args)
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/detectron2/engine/launch.py", line 126, in _distributed_worker
    main_func(*args)
  File "/mnt/home/wls/RandBox/train_net.py", line 359, in main
    res = Trainer.ema_test(cfg, model)
  File "/mnt/home/wls/RandBox/train_net.py", line 261, in ema_test
    results = cls.test(cfg, model, evaluators=evaluators)
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/detectron2/engine/defaults.py", line 608, in test
    results_i = inference_on_dataset(model, data_loader, evaluator)
  File "/mnt/home/wls/anaconda3/envs/randbox/lib/python3.8/site-packages/detectron2/evaluation/evaluator.py", line 204, in inference_on_dataset
    results = evaluator.evaluate()
  File "/mnt/home/wls/RandBox/randbox/pascal_voc_evaluation.py", line 189, in evaluate
    rec, prec, ap, unk_det_as_known, num_unk, tp_plus_fp_closed_set, fp_open_set = voc_eval(
  File "/mnt/home/wls/RandBox/randbox/pascal_voc_evaluation.py", line 456, in voc_eval
    R = class_recs[str(mapping[int(image_ids[d])])]
KeyError: '009122'
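As a stopgap while the split files and the id mapping are reconciled, the failing lookup can be wrapped so that missing ids are collected and reported instead of aborting the whole evaluation. `class_recs`, `mapping`, and the zero-padded image-id format mirror the traceback; the helper itself is only a suggested sketch, not code from the repository:

```python
# Hypothetical guard around the failing line in voc_eval
# (R = class_recs[str(mapping[int(image_ids[d])])]): collect image ids that
# are absent from the mapping or the records instead of raising KeyError.
def lookup_record(class_recs, mapping, image_id, missing):
    key = mapping.get(int(image_id))
    if key is None or str(key) not in class_recs:
        missing.append(image_id)  # gather ids for one summary warning later
        return None
    return class_recs[str(key)]
```

A caller would skip detections where this returns `None` and print `missing` once at the end, which turns the hard crash into a diagnosable list of ids such as '009122'.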

Code

Hello, is the code update complete?

test.json

@scuwyh2000
Hi, is the test.json under each task the same? If not, which file is each task's test.json generated from? Currently there is only all_task_test.txt; how do I generate the json needed for each task?

Pre-defined inference bboxes

Thank you for your excellent work.
In your supplementary material, you described the pre-defined inference bboxes.
Are the pre-defined bboxes in the code the same as the ones described in the paper?

Datasets

Are the datasets used in the paper the same as those used in ORE (Towards Open World Object Detection)? Can I directly use the dataset links provided by ORE?
