
Deep Human Action Recognition

Multi-task framework for jointly estimating 2D or 3D human poses from monocular color images and classifying human actions from video sequences.

[Figure] Predicted pose

[Figure] Predicted action (NTU dataset)

Limitations

Python 3 is required and is used unless otherwise specified.

In Keras and TensorFlow, only the 'channels_last' data format is supported.
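As a minimal sketch of how to enforce this, the data format can be set and verified explicitly through the Keras backend (the project may already configure this via keras.json; this snippet is illustrative, not part of the codebase):

  from tensorflow import keras

  # Ensure Keras uses the 'channels_last' layout: (batch, height, width, channels).
  keras.backend.set_image_data_format('channels_last')
  assert keras.backend.image_data_format() == 'channels_last'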

Dependencies

Install the required Python packages before continuing:

  pip3 install -r requirements.txt

Benset

Benset is the name of the dataset I created. The Benset directory contains everything needed to work with it: functions for recording video sequences with Kinect v2 cameras, the required preprocessing of those videos, automatic loading of the videos for Keras training, and all training and evaluation scripts for the Deephar network. A sketch of such a loader is shown below.
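For illustration, a clip loader for Keras training could follow the keras.utils.Sequence pattern sketched here. The directory layout (one subdirectory of JPEG frames per clip), the clip length, and the 256x256 target size are assumptions for the sketch, not the actual Benset format:

  import glob
  import os
  import numpy as np
  from tensorflow import keras

  class ClipSequence(keras.utils.Sequence):
      """Yields batches of fixed-length frame clips (hypothetical layout)."""

      def __init__(self, root, clip_len=8, batch_size=4):
          self.clip_dirs = sorted(glob.glob(os.path.join(root, '*')))
          self.clip_len = clip_len
          self.batch_size = batch_size

      def __len__(self):
          return int(np.ceil(len(self.clip_dirs) / self.batch_size))

      def __getitem__(self, idx):
          batch = self.clip_dirs[idx * self.batch_size:(idx + 1) * self.batch_size]
          clips = []
          for d in batch:
              frames = sorted(glob.glob(os.path.join(d, '*.jpg')))[:self.clip_len]
              imgs = [keras.preprocessing.image.img_to_array(
                          keras.preprocessing.image.load_img(f, target_size=(256, 256)))
                      for f in frames]
              # Shape (clip_len, H, W, 3), channels_last; labels omitted for brevity.
              clips.append(np.stack(imgs) / 255.0)
          return np.stack(clips)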

MPII

Images from MPII should be manually downloaded and placed in datasets/MPII/images.

Human3.6M

Videos from Human3.6M should be manually downloaded and placed in datasets/Human3.6M/S* (e.g., S1, S2, S3 for each subject). After that, extract the frames with:

  cd datasets/Human3.6M
  python2 vid2jpeg.py vid2jpeg.txt

Python 2 is used here due to a dependency on the cv2 package.
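If a Python 2 environment is unavailable, frame extraction can in principle be done with a short Python 3 script like the sketch below. The output naming scheme is an assumption; vid2jpeg.py remains the authoritative tool for reproducing the expected layout:

  import os
  import sys
  import cv2  # opencv-python

  def extract_frames(video_path, out_dir):
      """Dump every frame of a video to numbered JPEG files."""
      os.makedirs(out_dir, exist_ok=True)
      cap = cv2.VideoCapture(video_path)
      idx = 0
      while True:
          ok, frame = cap.read()
          if not ok:
              break
          cv2.imwrite(os.path.join(out_dir, '%05d.jpg' % idx), frame)
          idx += 1
      cap.release()

  if __name__ == '__main__':
      extract_frames(sys.argv[1], sys.argv[2])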

PennAction

Video frames from PennAction should be manually downloaded and extracted to datasets/PennAction/frames. The pose annotations and predicted bounding boxes are downloaded automatically by this software.

NTU

Video frames from NTU should also be extracted manually. A Python script is provided to help with this task; Python 2 is required.

Additional pose annotation is provided for NTU and is used to train the pose estimation part on this dataset. It differs from the original Kinect poses: it is a composition of 2D coordinates in the RGB frames plus depth. This additional annotation can be downloaded here (2 GB from Google Drive).
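Since each annotated joint is a 2D pixel location plus a depth value, it can be lifted to 3D camera coordinates with a standard pinhole back-projection, as sketched below. The intrinsics here are placeholder values, not the actual NTU/Kinect camera calibration:

  import numpy as np

  # Placeholder pinhole intrinsics -- NOT the real NTU/Kinect calibration.
  FX, FY = 365.0, 365.0   # focal lengths in pixels (assumed)
  CX, CY = 256.0, 212.0   # principal point (assumed)

  def backproject(u, v, depth):
      """Lift pixel (u, v) with depth (same unit as the output) to 3D camera coords."""
      x = (u - CX) * depth / FX
      y = (v - CY) * depth / FY
      return np.array([x, y, depth])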

Citing

  @InProceedings{Luvizon_2018_CVPR,
    author    = {Luvizon, Diogo C. and Picard, David and Tabia, Hedi},
    title     = {2D/3D Pose Estimation and Action Recognition Using Multitask Deep Learning},
    booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2018}
  }

License

MIT License
