Comments (6)

corkillj commented on May 27, 2024

"id+x1,y1+x2,y2+...+x4,y4"
I have no idea what this is supposed to be, but you just need to give it the format "x_min,y_min,x_max,y_max", i.e. the top-left and bottom-right corners of the bounding box. Maybe draw it on a piece of paper and you will understand.

from retinaface.

Hezhexi2002 commented on May 27, 2024

Thank you for your reply. Maybe the format I described here is not clear. I will take a picture of it this afternoon when I go to the school lab, because my computer has been left there :-)


Hezhexi2002 commented on May 27, 2024

I know the format you described above. It's the original format of the YOLO series' label.txt. However, the question is that I now want to use the four corner keypoints of the bbox to replace it, and RetinaFace was initially trained on WIDER FACE, which has 5 keypoints and the xywh of the bbox. So I wonder what I should do to make it possible to train RetinaFace on my dataset. Maybe it needs a conversion script or something else?


Hezhexi2002 commented on May 27, 2024

(1).txt
My label.txt is posted here. The first number is the class id, and the remaining 8 numbers are the normalized coordinates of the four corner points of the bbox. So could I use the script the author provided in the repo to convert the txt into json?
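For illustration, a label line laid out as described above (class id followed by 8 normalized coordinates) could be parsed roughly like this. This is only a sketch: the function name `parse_label_line` is hypothetical, and it assumes the 9 values are separated by whitespace, which the thread does not confirm.

```python
from typing import List, Tuple


def parse_label_line(line: str) -> Tuple[int, List[Tuple[float, float]]]:
    """Parse 'class_id x1 y1 x2 y2 x3 y3 x4 y4' into (class_id, four (x, y) points).

    Assumption: fields are whitespace-separated and coordinates are normalized to [0, 1].
    """
    fields = line.split()
    if len(fields) != 9:
        raise ValueError(f"expected 9 fields, got {len(fields)}")
    class_id = int(fields[0])
    coords = [float(v) for v in fields[1:]]
    # Pair up alternating x and y values into (x, y) tuples.
    points = list(zip(coords[0::2], coords[1::2]))
    return class_id, points
```

If the file uses commas or another separator instead, only the `split()` call would need to change.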


corkillj commented on May 27, 2024

This RetinaFace repo expects "x_min,y_min,x_max,y_max" for the bounding box, not xywh. It also expects the input in pixel coordinates. You will have to write a converter that denormalizes the coordinates according to the image size, or adjust the RetinaFace code. To train without landmarks, I guess you could set the weight of the landmark loss to 0 and just give it dummy landmarks. (RetinaFace will give worse results when trained without landmarks.)
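A minimal sketch of such a converter might look like the following. The helper names `corners_to_pixels` and `bbox_from_corners` are hypothetical and not part of this repo; the sketch assumes the four corner points are normalized to [0, 1] and that the image width and height are known. How the result is then written into the annotation file the repo reads is not covered here.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def corners_to_pixels(points: Sequence[Point], img_w: int, img_h: int) -> List[Point]:
    """Scale normalized (x, y) points to pixel coordinates."""
    return [(x * img_w, y * img_h) for x, y in points]


def bbox_from_corners(pixel_points: Sequence[Point]) -> Tuple[float, float, float, float]:
    """Return the axis-aligned box (x_min, y_min, x_max, y_max) enclosing the points."""
    xs = [x for x, _ in pixel_points]
    ys = [y for _, y in pixel_points]
    return min(xs), min(ys), max(xs), max(ys)


# Example: four normalized corners on a 640x480 image.
corners = [(0.25, 0.5), (0.75, 0.5), (0.75, 0.75), (0.25, 0.75)]
pixels = corners_to_pixels(corners, img_w=640, img_h=480)
print(bbox_from_corners(pixels))  # (160.0, 240.0, 480.0, 360.0)
```

Taking the min and max over all four points means the box is still valid when the corners are not axis-aligned, for example for a rotated object.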


Hezhexi2002 commented on May 27, 2024

Thank you for your reply again. I think I get your point now: the labels should contain unnormalized (pixel) coordinates instead of normalized ones, unless you adjust the code. However, I need the landmarks, which are the four corner points of the armor, or I won't use RetinaFace. I still have a question: what dataset format does this repo need, txt or json? The README.md says you need to convert the txt into json first and then you can train. I am now considering using CVAT to label my images, which can export the dataset in the same format as WIDER FACE. Does that mean I can use it to train directly? I would appreciate it if you could give me some instructions again :-)
