Code Monkey home page Code Monkey logo

3d-cnn-action-recognition's Introduction

Hey I'm Deepak

I am software engineer, indie maker and tech blogger from India. I love building software, contributing to open source projects and tinkering with new ideads.

๐Ÿ™‹โ€โ™‚๏ธ About Me

๐Ÿš€ Languages and Tools:

mongodb postman express


Subham Raoniar's streak

๐Ÿ“Š My Github Stats


Subham Raoniar's Github Stats
Note: Top languages is only a metric of the languages my public code consists of and doesn't reflect experience or skill level.

Subham Raoniar's Activity Graph



Connect with me:

โค Views and Followers

GitHub Badge

3d-cnn-action-recognition's People

Contributors

dipakkr avatar venkateshgupta5 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

3d-cnn-action-recognition's Issues

only 16 frames of each video

If i am not wrong then in your code you are taking only first 16 frames of each video and ignore remaining....Is there any reason for this?

extract frames for multiple extentions

Hi

I have made a python script to extract frames for multiple extenstions, but i dont delete the files once i convert the videos, but that can be done with an additional line of code.

frames =5 
video_folder = '/media/ryan/shakira/3d-cnn-action-recognition/some_folder'
for file in os.listdir(video_folder):
   if file.endswith('.webm'):
       os.mkdir(os.path.join(video_folder,file.split('.')[0]))
       folder_name = file.split('.')[0]
       os.system('ffmpeg -t 4 -i {} -vf fps={} {}/%05d.jpg -t 10'.format(os.path.join(video_folder,file), frames,os.path.join(video_folder,folder_name )))
   elif file.endswith('.mp4'):
       os.mkdir(os.path.join(video_folder,file.split('.')[0]))
       folder_name = file.split('.')[0]
       os.system('ffmpeg -t 4 -i {} -vf fps={} {}/%05d.jpg -t 10'.format(os.path.join(video_folder,file), frames,os.path.join(video_folder,folder_name )))
   elif file.endswith('.blob'):
       os.mkdir(os.path.join(video_folder,file.split('.')[0]))
       folder_name = file.split('.')[0]
       os.system('ffmpeg -i {} -vf fps={} {}/%05d.jpg'.format(os.path.join(video_folder,file), frames,os.path.join(video_folder,folder_name )))
   else:
print('*********************************************************', file)

Brief on V2

Hi

Thanks a lot for sharing your work!

Could you please tell me what is the difference between the main 3dcnn.py and the one in 3dcnnv2.

Thanks

cnn-v2: axes don't match array Error

Hello,
I tried working on v2 of your 3dcnn but it gives me this error:

    main()
  File "3dcnn2_M.py", line 187, in main
    args.output, args.color, args.skip)
  File "3dcnn2_M.py", line 120, in loaddata
    return np.array(X).transpose((0, 2, 3, 4, 1)), labels
ValueError: axes don't match array

Any solution to that?

thanks

I am getting this error, what might be the issue?

amax@amax:/media/amax/Masters/Hairen/ActionRecognition/3d-cnn-action-recognition/3d-cnn-action-recognition-master$ python3 3dcnn.py --batch 32 --epoch 50 --videos dataset/ --nclass 10 --output 3dcnnresult/ --color True --skip False --depth 10
/home/amax/anaconda3/lib/python3.6/site-packages/h5py/init.py:34: FutureWarning: Conversion of the second argument of issubdtype from float to np.floating is deprecated. In future, it will be treated as np.float64 == np.dtype(float).type.
from ._conv import register_converters as _register_converters
Using TensorFlow backend.
/home/amax/anaconda3/lib/python3.6/importlib/_bootstrap.py:219: RuntimeWarning: compiletime version 3.5 of module 'tensorflow.python.framework.fast_tensor_util' does not match runtime version 3.6
return f(*args, **kwds)
0%| | 0/1 [00:00<?, ?it/s]UCF-10
Traceback (most recent call last):
File "3dcnn.py", line 237, in
main()
File "3dcnn.py", line 145, in main
args.output, args.color, args.skip)
File "3dcnn.py", line 101, in loaddata
X.append(vid3d.video3d(v_file_path, color=color, skip=skip))
File "/media/amax/Masters/Hairen/ActionRecognition/3d-cnn-action-recognition/3d-cnn-action-recognition-master/videoto3d.py", line 29, in video3d
frame = cv2.resize(frame, (self.height, self.width))
cv2.error: OpenCV(3.4.3) /io/opencv/modules/imgproc/src/resize.cpp:4044: error: (-215:Assertion failed) !ssize.empty() in function 'resize'

100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 1/1 [00:00<00:00, 11.82it/s]

GPU required

Can you please tell how many numbers of GPU are required for the training purpose??
Or Any specification of GPU??

extract class names from my data set ?

I want to use the same work that you have done but on another dataset (the MSR Daily 3d dataset)
the problem is I did'nt know how to extract the classe name and save them into a file text because de dataset is organised as this :

-video
-MSRDailyAct3D_pack1
-a01_s01_e01_rgb.avi
-a01_s01_e02_rgb.avi ........
.
.
-MSRDailyAct3D_pack8
so for every pack two actions are performed exemple pack1 (eat and drink)
I really need some help if you could sugget what should I do to generate class names for my dataset ?

getting error

python 3dcnn.py --batch 4 --epoch 100 --videos UT Interaction/ --nclass 6 --output 3dcnnresult/ --color True --skip False
File "", line 1
python 3dcnn.py --batch 4 --epoch 100 --videos UT Interaction/ --nclass 6 --output 3dcnnresult/ --color True --skip False
^
SyntaxError: invalid syntax

what i'm doing wrong please help

Killed during loaddata

I tried to run 3dCNN-v2/3dcnn.py.
python3 3dcnn.py --batch 4 --epoch 100 --videos /home/user/3d-cnn-action-recognition/data2/ --nclass 101 --output 3dcnnresult/ --color False --skip False
In the middle of loading data, it suddenly Killed.
(Only "Killed" showed)
I used UCF101 dataset.
Why this happens?

inputting videos

When I input my dataset and I print X in line 102 I get arrays of zeros. it seems that what VideoCapture returns in line 101 is arrays of zero. Why this happens? Does the code able to read my videos or not when arrays are zero?How can I see the input video frames to make sure? I am running 3dcnn.py

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.