Comments (6)
"id+x1,y1+x2,y2+...+x4,y4"
I have no idea what this is supposed to be, but you just need to give it format "x_min,y_min,x_max,y_max" -> the top left and the bottom right of the corner of the bounding box. Maybe draw it on a piece of paper and you will understand.
from retinaface.
"id+x1,y1+x2,y2+...+x4,y4"
I have no idea what this is supposed to be, but you just need to give it format "x_min,y_min,x_max,y_max" -> the top left and the bottom right of the corner of the bounding box. Maybe draw it on a piece of paper and you will understand.
Thank you for your reply,maybe the format I described here is not clear,I will take a picture of it this afternoon when I go to the shool lab because my computer has been left there:-)
from retinaface.
"id+x1,y1+x2,y2+...+x4,y4"
I have no idea what this is supposed to be, but you just need to give it format "x_min,y_min,x_max,y_max" -> the top left and the bottom right of the corner of the bounding box. Maybe draw it on a piece of paper and you will understand.
I know the format you described above.It's the original format of yolo series' label.txt,However,the question is that now I want to use the four keypoints of the corner of the bbox to replace it,and the retinaface is initially trained on wider face which have 5 keypoints and the xywh of the bbox,so I wonder what should I do to make it possible to train retinaface on my datasets.Maybe it need a script to convert or something else?
from retinaface.
"id+x1,y1+x2,y2+...+x4,y4" I have no idea what this is supposed to be, but you just need to give it format "x_min,y_min,x_max,y_max" -> the top left and the bottom right of the corner of the bounding box. Maybe draw it on a piece of paper and you will understand.
(1).txt
My label.txt is I post here,the first number is the class id and left 8 numbers is the normalized coordinates of the four points of the corner of the bbox so could I use the script the author provided in the repo to convert the txt into json?
from retinaface.
"id+x1,y1+x2,y2+...+x4,y4"
I have no idea what this is supposed to be, but you just need to give it format "x_min,y_min,x_max,y_max" -> the top left and the bottom right of the corner of the bounding box. Maybe draw it on a piece of paper and you will understand.I know the format you described above.It's the original format of yolo series' label.txt,However,the question is that now I want to use the four keypoints of the corner of the bbox to replace it,and the retinaface is initially trained on wider face which have 5 keypoints and the xywh of the bbox,so I wonder what should I do to make it possible to train retinaface on my datasets.Maybe it need a script to convert or something else?
This Retinaface Repo expects "x_min,y_min,x_max,y_max" for the bounding box, not xywh. Also it expects the input in pixel coordinates. You will have to write a converter that unnormalizes the coordinates according to the image size, or adjust the retinaface code. To train without landmarks I guess you could set the weight of the landmark loss to 0 and just give it dummy landmarks. (Retinaface will have worse results when training without landmarks)
from retinaface.
"id+x1,y1+x2,y2+...+x4,y4"
I have no idea what this is supposed to be, but you just need to give it format "x_min,y_min,x_max,y_max" -> the top left and the bottom right of the corner of the bounding box. Maybe draw it on a piece of paper and you will understand.I know the format you described above.It's the original format of yolo series' label.txt,However,the question is that now I want to use the four keypoints of the corner of the bbox to replace it,and the retinaface is initially trained on wider face which have 5 keypoints and the xywh of the bbox,so I wonder what should I do to make it possible to train retinaface on my datasets.Maybe it need a script to convert or something else?
This Retinaface Repo expects "x_min,y_min,x_max,y_max" for the bounding box, not xywh. Also it expects the input in pixel coordinates. You will have to write a converter that unnormalizes the coordinates according to the image size, or adjust the retinaface code. To train without landmarks I guess you could set the weight of the landmark loss to 0 and just give it dummy landmarks. (Retinaface will have worse results when training without landmarks)
Thank you for your reply again,I think I maybe get your point now.So the labels should contains the unnormalized coordinates instead of the normalized ones unless you adjust the code,However,I just need the landmarks which are the four points of the corners of the armor or I won't use the retinaface .Still I have a question what is format of the datasets this repo needs,txt or json,I see the README.md which says you need to convert the txt into json first then you can train,and now I consider to use cvat to label my image which can export the datasets as the same format as WIDER FACE,so is that mean I can use it to train directly?I would appreciate it if you can give me some instructions again:-)
from retinaface.
Related Issues (19)
- RuntimeError during get_model HOT 1
- Validation accuracy & Mobilenet HOT 9
- Inference using GPU HOT 1
- Default parameter values HOT 1
- RGB vs. BGR HOT 2
- google-colab 1.0.0 has requirement ipykernel~=4.10 HOT 1
- Unintuitive output of predict_jsons()
- map
- Spellings mistake in repository README and PyPi description page
- Negative values in the predicted annotations HOT 2
- How to predict with batch size ? HOT 1
- Fixed bug in the training code! HOT 4
- Bug: Inference on cuda results in fixed bounding box offset! HOT 2
- Does this project use focal loss?
- Suggest to loosen the dependency on albumentations
- How to train with custom dataset by using the pretrained model?
- Why def _pad_to_square do not to adjust the landmark? HOT 3
- How Syncronized BatchNorm implemented HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from retinaface.