srijandas07 / vpn Goto Github PK
View Code? Open in Web Editor NEWPose driven attention mechanism
Pose driven attention mechanism
Can you please provide requirements file for the packages and the versions used in this project?
I can not import layer 'Merge' from 'keras.layers' like in your code. I have tried some different versions of tensorflow (v1, v2,...), but it still doesn't work.
Hello,
Can you please elaborate on what should be the format of the input data? Specifically what should be the names of the cropped images and how should they be stored?
Thanks!
First of all, thank you for sharing your good research.
For the spatial embedding, the Euclidean distance of the L2 normalized feature is used in the paper.
But the code on repository uses Manhatten distance.
I'd like to know if these changes are intended and what difference the changed metrics make.
Thank you in advance.
Could you add the requirements.txt to the project? The dependencies are quite complicated for me to handle
Hi, thank you for your good job. I am using the Smarthome dataset now. Could you tell me the specific 19 activities in CV1 and CV2?
Hello, the pretrained models links in the readme are incorrect. Can you please fix them.
Thank you for your great work! I have two qusetions as follows:
Q1: According to the code blew, I am not aware of why As or z1
and At or z2
denote sptioal features
and temporal features
, respectively. From where I stand, sptioal features derive from features through the convolution layer along the channel axis, while temporal features derive from features through the pooling layer. Do I misunderstand it or miss some details?
z1 = Dense(256, activation='tanh', name='z1_layer', trainable=True)(model_gcnn.get_layer('gcnn_out').output)
z2 = Dense(128, activation='tanh', name='z2_layer', trainable=True)(model_gcnn.get_layer('gcnn_out').output)
fc_main_spatial = Dense(49, activity_regularizer=attention_reg, kernel_initializer='zeros', bias_initializer='zeros',
activation='sigmoid', trainable=True, name='dense_spatial')(z1)
fc_main_temporal = Dense(2, activity_regularizer=attention_reg, kernel_initializer='zeros',
bias_initializer='zeros',
activation='softmax', trainable=True, name='dense_temporal')(z2)
Q2: What's the contribution about 2D CNN layer
after the Pose Backbone
? How dose it help with the spatioal-temporal coupler?
Looking for your reply! Thank you!
Hi, I am wondering how to test the pre-trained models, especially on the smarthome dataset?
Hi, I am trying to train NTU with your model but your code does not mention how the poses' x, y, z are normalized. Could you give me your normalization method.
Hi, could you please give me the list of videos for the training set and validation set (cross-view method)? I found that the original NUCLA dataset has 1463 videos, but in your paper, there are only 1194.
Thank you for your help!
Hello! Thanks for your job! I want to use NW-UCLA dataset, but the downloaded data from [https://wangjiangb.github.io/my_data.html] can not be unzipped. Could you please tell me how to get this dataset? Thank you a lot!
Hello there!
Thank you for sharing this great piece of research! I was trying to train the model from scratch, so I downloaded the NTU dataset from the official website. Your code requires the dataset to be in the form of images cropped around the human patches (in the folder "patches_full_body"), could you please share the script that generates these cropped images to make sure that I'm doing everything in the exact same way as the paper? How are the patches generated and how do you deal with multiple people in the scene?
Thank you
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.