
Comments (9)

MukhlasAdib commented on September 28, 2024

Hi!

Are you using the semantic LIDAR-based method? If so, could you make a visualization like the one you did here?

Maybe it's because the vehicle's position is outside the LIDAR's upper FoV range.

from carla-2dbbox.

rak7045 commented on September 28, 2024

Hello,
Thanks for your reply. I am using the depth camera information. I would rather use the depth information than the semantic LiDAR.


MukhlasAdib commented on September 28, 2024

Sorry for my late reply. Right, the current version of the depth-based annotation still suffers from a false-negative problem. Several things can cause it:

  • The occlusion filter failed to detect the car. If this is the problem, you need to tune the occlusion filter parameters again, i.e. depth_margin, patch_ratio, and resize_ratio. Yes, it requires more work.
  • The center point of the car is outside the camera FoV. The algorithm uses the center point of the car to determine whether the car is inside the camera FoV. It can therefore miss a car when only a small portion of its body appears in the image. I think this can be solved with a more robust FoV filter, but currently I don't have time to upgrade the algorithm. Moreover, I don't think this is the problem in your case, since the center of the car is clearly visible in the image.

In case you haven't read the details of the algorithm, you can find it here. Thank you.
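
As an illustration of what a more robust FoV filter might look like, here is a minimal sketch that projects the corners of a box instead of only its center and accepts the object if at least one corner lands inside the image. This is not the repository's implementation: the function names, the camera-coordinate convention (x right, y down, z forward) and the `min_corners` threshold are assumptions made for the example. The focal length comes from the usual pinhole relation with the horizontal FoV.

```python
import numpy as np


def camera_intrinsics(width, height, fov_deg):
    """Pinhole intrinsic matrix for a camera with the given horizontal FoV."""
    focal = width / (2.0 * np.tan(np.radians(fov_deg) / 2.0))
    return np.array([[focal, 0.0, width / 2.0],
                     [0.0, focal, height / 2.0],
                     [0.0, 0.0, 1.0]])


def visible_corner_count(corners_cam, K, width, height):
    """Count the points that project inside the image.

    corners_cam: (N, 3) points in camera coordinates, assumed here to be
    x right, y down, z forward (hypothetical convention for this sketch).
    """
    corners_cam = np.asarray(corners_cam, dtype=float)
    # Points behind the camera can never be visible.
    pts = corners_cam[corners_cam[:, 2] > 0.0]
    if pts.shape[0] == 0:
        return 0
    proj = (K @ pts.T).T
    u = proj[:, 0] / proj[:, 2]
    v = proj[:, 1] / proj[:, 2]
    inside = (u >= 0) & (u < width) & (v >= 0) & (v < height)
    return int(inside.sum())


def in_fov(corners_cam, K, width, height, min_corners=1):
    """Accept the object if enough of its corners fall inside the image."""
    return visible_corner_count(corners_cam, K, width, height) >= min_corners
```

With a check like this, a car whose center projects just outside the image can still be accepted when one of its corners is visible. Raising `min_corners` makes the filter stricter.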


rak7045 commented on September 28, 2024

Hello again,

I tried to collect data at 60 fps with the following camera orientation:
carla.Transform(carla.Location(x=23, y=0, z=25), carla.Rotation(roll=0.0, pitch=-90, yaw=0.0)).
The FOV is set to 69°. I see a problem with the bounding box text information.

The problem is:
The image contains only one object, but the bbox txt file lists two objects, one of which lies outside the image resolution. Why does this happen, and how can we eliminate it?
This problem arises with a 90° pitch angle irrespective of altitude, but the amount of spurious information decreases as the altitude increases. Do you have any idea about this?

I think the extra object is picked up because of the FOV, but I couldn't work out how to eliminate it. Could you please help me with this?

Below I am attaching the picture along with the bbox txt file for reference.

002470

002470.txt

TIA


rak7045 commented on September 28, 2024

Hello @MukhlasAdib
Sorry to disturb you. Excuse me. Could you please help me with the above question?


MukhlasAdib commented on September 28, 2024

Ah right, sorry. It looks like a bug in the algorithm, so I need to look deeper into the problem. I will let you know if I find something, but I cannot promise it will be fast.


MukhlasAdib commented on September 28, 2024

@rak7045 Are you using semantic LIDAR or depth camera?


rak7045 commented on September 28, 2024

Hello,
I am using depth camera information.


MukhlasAdib commented on September 28, 2024

Yeah, it looks like it has something to do with the angle filter and your specific rotation setting. I need some time to check it. As a temporary workaround, you can add a simple pixel coordinate filter yourself before saving the results. Something like

# bbox is assumed to be a NumPy array of shape (N, 2) holding (x, y) pixel corners
if any(bbox[:, 1] < 0) or any(bbox[:, 1] > IMAGE_HEIGHT):
    keep = False  # remove the bbox: it extends past the top or bottom edge
elif any(bbox[:, 0] < 0) or any(bbox[:, 0] > IMAGE_WIDTH):
    keep = False  # remove the bbox: it extends past the left or right edge
else:
    keep = True  # keep the bbox
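
To apply the same check to all boxes of a frame before writing the txt file, a self-contained version could look like the sketch below. It assumes each bbox is an array of shape (N, 2) with (x, y) pixel coordinates, as the indexing above suggests; `bbox_inside_image` and `filter_bboxes` are hypothetical helper names, not functions from the repository.

```python
import numpy as np


def bbox_inside_image(bbox, image_width, image_height):
    """Return True if every (x, y) corner lies within the image bounds."""
    bbox = np.asarray(bbox, dtype=float)
    x_ok = np.all((bbox[:, 0] >= 0) & (bbox[:, 0] <= image_width))
    y_ok = np.all((bbox[:, 1] >= 0) & (bbox[:, 1] <= image_height))
    return bool(x_ok and y_ok)


def filter_bboxes(bboxes, image_width, image_height):
    """Keep only the boxes that lie entirely inside the image."""
    return [b for b in bboxes
            if bbox_inside_image(b, image_width, image_height)]


# Example: the second box has a negative x coordinate and is dropped.
boxes = [np.array([[10, 20], [50, 60]]), np.array([[-30, 20], [5, 60]])]
kept = filter_bboxes(boxes, image_width=100, image_height=80)
```

Note that this drops any box that crosses the image border. If partly visible objects should be kept, clipping the coordinates with `np.clip` is an alternative.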

