Comments (5)
Hi, you can find the preprocessed datasets from the README.md
However, I cannot provide the preprocessing code as it was on my old computer which was broken :(
The basic idea is very simple, and you can try to implement it by yourself:
Firstly, get the coordinates of key points in the whole body image, and then get the bounding box value, and then crop the image, and then adjust the coordinates.
For the heatmap generation, you can check this function:
https://github.com/HowieMa/NSRMhand/blob/master/dataset/hand_ldm.py#L91
from nsrmhand.
Hey, Thanks for the quick response, it worked for me!
I have some doubts about the evaluation PCK metric in utils.py file, can you tell the size of the box [ variable - bb_size] what is the value of that?
from nsrmhand.
Hey, Thanks for the quick response, it worked for me!
I have some doubts about the evaluation PCK metric in utils.py file, can you tell the size of the box [ variable - bb_size] what is the value of that?
Hi, this is just the tightest bounding box of the hand.
from nsrmhand.
Also, in your paper, it is mentioned: "All hand image patches are resized to 368×368 before fed into our model, yielding 46×46 representation maps". So is the final evaluation done on keypoints extracted from a 46x46 size heatmaps while the PCK threshold is [σ * (im_size/2.2)]. where σ ranges from 0 to 1 and im_size is the width of the cropped image?
from nsrmhand.
Also, in your paper, it is mentioned: "All hand image patches are resized to 368×368 before fed into our model, yielding 46×46 representation maps". So is the final evaluation done on keypoints extracted from a 46x46 size heatmaps while the PCK threshold is [σ * (im_size/2.2)]. where σ ranges from 0 to 1 and im_size is the width of the cropped image?
Hi, actually, I perform the evaluation on the original image scale, but it should be nearly the same as the evaluation on the 46 * 46 heatmap.
For example, the original image for cropped hand has size 256 * 256, I will firstly resize it to 368 * 368, and send it to the model. The output heatmap has size 46 * 46, then I can get the location of the peak value. For example, the peak's coordinate in the 46 * 46 heatmap is (23,23). Then its location in the original image is 23 * 256 / 46 = 128. If the ground truth location is (130, 130). Then I can calculate the PCK by
sqrt ((130 - 128)**2 + (130 - 128)^^2 ) < sigma * 256 / 2.2
You can easily find all solutions to your questions from here. Thanks!
Line 48 in f4640d1
Line 77 in f4640d1
from nsrmhand.
Related Issues (20)
- demo output HOT 10
- mediapipe hand landmark detect HOT 1
- loss function HOT 1
- How to use NSRM in HR-Net HOT 6
- How to get 21 keypoints annotations after cropping the original image? HOT 2
- missing limb structure problems HOT 1
- hand tightest bounding box? HOT 1
- What do mask1, mask2 and mask3 mean? HOT 1
- predict two hands from single picture HOT 5
- The results of other test images is error HOT 1
- Encounter segfault when running inference.py HOT 2
- ./CPM used in training? HOT 1
- src.augmentation missing
- model structure in code incompatible with in paper HOT 5
- Required dataset HOT 1
- Question about LM loss HOT 1
- Can the model predict hand boundary boxes on its own? HOT 2
- Can you provide the OneHand 10k dataset? HOT 1
- How to run the OneHand10K dataset
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nsrmhand.