Comments (6)
- It is the cyclopean eye (the mean of both eye positions). We do not detect the eyes explicitly, so the point is estimated from the AlphaPose skeleton.
- The tag has a known (and relatively large) physical size, so we can recover its position from its size in the image. This assumes we know the camera intrinsics, the pixel coordinates of the marker's corners (and therefore the 3D view rays), and the physical size of the marker. From there we find the 3D rotation and translation of the marker that satisfies the size and shape constraints.
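The size-based localisation above can be sketched roughly as follows. This is an illustrative approximation, not the actual code: it assumes a roughly fronto-parallel marker and a pinhole camera, and `marker_position_from_size` with its arguments are hypothetical names. A full solution would also fit the marker's rotation, e.g. with a PnP solver such as OpenCV's `solvePnP`.

```python
import numpy as np

def marker_position_from_size(corners_px, marker_size_m, K):
    """Estimate the 3D camera-frame position of a square marker of known
    physical size (illustrative sketch, fronto-parallel assumption).

    corners_px    -- 4x2 pixel coordinates of the marker corners, in order
    marker_size_m -- physical edge length of the marker in metres
    K             -- 3x3 camera intrinsic matrix
    """
    corners_px = np.asarray(corners_px, dtype=float)
    f = (K[0, 0] + K[1, 1]) / 2.0  # average focal length in pixels

    # Mean edge length of the marker quad in pixels (consecutive corners,
    # including the closing edge).
    edges = np.linalg.norm(np.roll(corners_px, -1, axis=0) - corners_px, axis=1)
    side_px = edges.mean()

    # Similar triangles: depth = focal_length * real_size / pixel_size.
    z = f * marker_size_m / side_px

    # Back-project the marker centre through the pinhole model.
    cx, cy = K[0, 2], K[1, 2]
    u, v = corners_px.mean(axis=0)
    return np.array([(u - cx) * z / f, (v - cy) * z / f, z])
```

For example, with a 1000-pixel focal length, a 0.1 m marker imaged as a 100-pixel square centred on the principal point comes out at 1 m straight ahead.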
- We use 7 frames. I believe we also tested fewer, but I do not see the results anywhere; presumably they were at least slightly worse. The MSE Static in Table 2 should show the 1-frame case.
- It seems they simply ask people to look at a target chosen for them, so they know where the participants look: https://ait.ethz.ch/projects/2020/ETH-XGaze/ . They also know the size of the screen, and judging by the head rest, they can also control the participant's position. So they have all the 3D points under control.
from gaze360.
@erkil1452 Thank you for your reply.
- Regarding the fourth question, I want to confirm my understanding of how the ground truth is obtained. Since the data are collected in a lab, if we set a point (say, the top-left corner of the screen) as the origin of the world coordinate system, then every 3D world coordinate in the lab (including every point on the screen) is known. They can then use Zhang's calibration method to obtain the rotation and translation matrices for every camera, use those matrices to convert the 3D world coordinates to camera coordinates, and use the camera coordinates to compute the ground truth. Is my understanding correct?
- Another question: once a camera is calibrated (say, using Zhang's method), do the rotation and translation matrices stay the same in any environment?
- Why do you use head images instead of face images to train the model? And did you evaluate the performance of using face images?
- You can still use the known camera position (from a previous calibration) as the origin. Subtracting the head position from the position of the gaze target on the screen gives the gaze vector.
- The intrinsic matrix does not change when you move the camera. The extrinsic matrix (rotation and translation) generally does change.
- In a 360-degree setup, the face is often not visible, and we only tested the method with head crops.
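The gaze-vector computation described in the first point can be sketched as follows; the function and argument names are illustrative, and both 3D points are assumed to already be expressed in the same (camera) frame:

```python
import numpy as np

def gaze_vector(target_cam, head_cam):
    """Unit gaze direction: from the head towards the gaze target,
    with both 3D points given in camera coordinates (metres)."""
    g = np.asarray(target_cam, dtype=float) - np.asarray(head_cam, dtype=float)
    return g / np.linalg.norm(g)
```

For instance, a head 1 m in front of the camera looking at a target at the camera origin yields the direction (0, 0, -1).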
@erkil1452 I'm very sorry, I still have not fully understood what you said. Since I am new to this, could you describe the process in detail? My guess: first we fix the camera and the screen, then we calibrate the camera to get the rotation and translation matrices. Do you mean we use the camera position as the origin of the world coordinate system, use the distance from the camera to the gaze target to compute the gaze target's world coordinates, compute the head position's world coordinates the same way, subtract the head position from the gaze target to get the gaze vector, and then use the rotation and translation matrices to convert the gaze vector to the camera coordinate system? Is that right? If not, could you describe the process in detail?
Yes, it is as you say.
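The confirmed pipeline is a few lines of linear algebra. In this sketch `gaze_in_camera`, `R`, and `t` are illustrative names for the extrinsics obtained from a prior calibration (e.g. Zhang's method), with the convention p_cam = R · p_world + t; this is not the actual Gaze360 code. Note that the translation cancels when two points are differenced, so only the rotation affects the gaze direction.

```python
import numpy as np

def gaze_in_camera(target_world, head_world, R, t):
    """Unit gaze vector in the camera frame from world-frame 3D points.

    R (3x3 rotation) and t (3-vector translation) are the camera
    extrinsics, mapping world points as p_cam = R @ p_world + t.
    """
    to_cam = lambda p: R @ np.asarray(p, dtype=float) + t
    g = to_cam(target_world) - to_cam(head_world)  # t cancels here
    return g / np.linalg.norm(g)
```

With an identity rotation, any translation leaves the direction unchanged, which matches the intuition that a gaze vector is a difference of points, not a point.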
thank you very much