Code Monkey home page Code Monkey logo

motsynth-baselines's People

Contributors

guillembraso avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

motsynth-baselines's Issues

Possible typo in the paper

Thanks for sharing MOTSynth dataset and the corresponding code!

I have one minor comment for the paper. On paper P7 section 4.4. Multi-object Tracking left-column Results paragraph, the paper says "we obtain 45.0 MOTA and 51.2 IDF1 with our MOTSynth trained model, yielding +3.5 MOTA and +1.6 IDF1 improvement over the COCO trained model (43.5 MOTA and 49.6 IDF1).". Does it seem the improvement on MOTA should be +1.5 MOTA?

Thank you!

Inconsistent bbox annotations in COCO format annotation file and gt.txt

Hi, I found that some bounding box annotations are labeled as area=0 but still appear in the gt.txt file.

For example, in the 0089 video, frame 21:

{
"id": 890020000466,
"image_id": 890020,
"category_id": 1,
"segmentation":
{
"size":
[
1080,
1920
],
"counts": "PPYo1"
},
"area": 0.0,
"bbox":
[
854,
562,
4,
27
]
}

I am wondering do you have a threshold to determine if the area should be set as 0.0? Shall we modify this area value or remove this object from the gt.txt?

Thank you!

About the challenge

Hi author of MOTSYNTH
Can we use real data (e.ge. MOT17) with un-labeled and use unsupervised learning?

Camera coordinates in the MOTSynth dataset

In the annotation file, I noticed that there are two fields named "cam_world_pos" and "cam_world_rot", what are the typical use cases of these two parameters? If there are any examples or explanations, that would be greatly appreciated. Thank you!

why to frames need to substract 3

Why to frames need to substract 3?
I visulize the image finding the annotations is inconsistent with the generated images. In addition, when I remove the code of 'substract 3', annotations becom consistent.

What causes the target bounding box to be annotated incorrectly?

I encountered something strange while visualizing the dataset. As shown below, seq 012, 0110.jpg:
0110
image
The text upon the bbox represents the identity and visibility.

The 65 and 38 identities seem to be annotated incorrectly. I think MOTSynth is automatically annotated through some code. I wonder to know what causes this incorrectness.

BTW, I also want to ask about the meaning of visibility in MOT annotation. Is it calculated by the bounding box area or the mask area?

THANKS A LOT.

YOLOv3 settings

Could you please provide more details regarding the YOLOv3 baseline?

Since most of the common configurations (e.g., YOLOv3-tiny, YOLOv3, YOLOv3-SPP, YOLOv3-SPP-ultralytics) all accept the input size of 608.

Thank you in advance!

Question about the data and submission info

Hi, I have two questions about this MOTSynth challenge

  1. After following data preparing process as following site mot annotations has starting from repo '000'.
    image
    However, extracted frames from original zip file is starting from '003', so what am I missing in this stage? It looks like wget code suppose to get zip file from sequence including '000'.
    image

https://github.com/dvl-tum/motsynth-baselines/blob/main/docs/DATA_PREPARATION.md

  1. According to evaluation server, it suppose to add not only MOT17-test.txt but also MOTSynth.txt. Can't we just submit MOT17-test.txt for saving time? (because evaluation also takes one-hour or more for only MOT17)
    image

Thanks

When will the image depth information be uploaded?

Hi.
I want to try to integrate depth information to train my multi-object tracker, but I didn't find them in the downloaded MOTSynth dataset. When will you upload the depth information of the dataset?
Looking forward to your reply.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.