Code Monkey home page Code Monkey logo

handtailor's People

Contributors

lyuj1998 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

handtailor's Issues

Depth and scale ambiguity

Hi, I try to estimate the 3D hand in the camera coordinate. I use demo.py with known intrinsic, the render result seems to be OK, but the 3D hand is not aligned with the GT.
image
image
image

root depth prediction question

I'm trying to understand where you got common.DEPTH_RANGE and common. DEPTH_MIN that are used here. In your paper you specify that you scale normalized the predicted root depth when using the perspective projection. Are these values what are used to unnormalized the predicted root depth? I may have incorrectly thought that normalization had to do with the length from the root to the MCP joint (or whatever the chosen joint was), is that not the case when predicting root depth

Can I only use Hand Module without IkNet to 3D hand Pose Estimation

你好,请问可以用您论文中去除 IkNet 的Hand Module去做三维姿态的估计吗,我尝试使用您论文中提到 公式(6)损失去训练网络,用热力图监督二维坐标,用21点3D pose监督uvd2xyz后的3D坐标,用根关节点的值监督root,最终效果不好,请问我这样的作法对吗?
image

Running with webcam

Is it possible to run app_in_the_wild.py without using a Real Sense camera? As in is it possible to simply connect the webcam of the laptop?

No response

image
我想请问为什么我在运行demo.py 之后没有任何反应呢

In the wild model scale error/typo

I believe this is a typo

s = torch.sqrt((deltaj[0,0]**2 + deltaj[0,0]**2)/(1 - deltaj[0,2]**2))

where you're using the u coordinate twice instead of u-v according to this

image

it should be

s = torch.sqrt((deltaj[0,0]**2 + deltaj[0,1]**2)/(1 - deltaj[0,2]**2))

jaxlib---Windows

Hello, how to solve the installation problem of jaxlib on Windows?

align_idx

Hi, I was wondering that why we use align_idx=9, instead of root_idx 0. Is there any reason? If we will align by root before computing loss anyway, why don't we align by root in the beginning?

Questions on optimization/tailor module

Hiya, I had a few questions about the tailor module

Just as a high-level overview (just to make sure I correctly understand the approach). It seems that you want to refine the predicted pose parameters from the inverse kinematics network with the insight that the 2d keypoints will generally have higher accuracy/ reliability allowing for not only a more accurate prediction but also helping with bad initializations and occlusion problems in typical optimization-based techniques face (along with fewer iterations due to a good initial condition)

  1. Did you experiment with different optimizers in jax other than Adam? Such as the Levenberg-Marquat solver (not sure if this is implemented in jax) used in papers like MegaTrack/RGB2Hands/Online Optical Marker-based Hand Tracking with Deep Labels
  2. For the in-the-wild examples, I've noticed certain poses lead to failure cases for estimating the scale camera parameter, is this something you've noticed as well? usually when the hand is horizontal rather than vertical

I enjoyed reading your paper greatly! Thank you so much for the contribution and I look forward to your response

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.