Human action recognition (HAR) is an important task in computer vision, with applications such as video surveillance, video content analysis, and video security control. However, it is a challenging task due to background clutter, lighting variation, and the fact that human actions vary over time, are seen from different viewpoints, and may be occluded by other objects in the environment. In this Capstone Project, we focus on action recognition with multiple modalities. We examine and evaluate three different approaches on two main datasets.
- Python 3.6 or higher
- PyTorch and torchvision
- OpenCV with GPU support
- timm==0.4.8/0.4.12
- TensorboardX
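Before training, it can save time to verify that the listed dependencies are actually importable. The sketch below is not part of the repo; it is a minimal, hypothetical check using only the standard library, assuming the import names `torch`, `torchvision`, `cv2`, `timm`, and `tensorboardX` for the packages above.

```python
import importlib.util

# Import names assumed for the requirements listed above.
REQUIRED = ["torch", "torchvision", "cv2", "timm", "tensorboardX"]

def missing_packages(names):
    """Return the subset of `names` that cannot be imported in this environment."""
    return [n for n in names if importlib.util.find_spec(n) is None]

# Example: print any dependencies that still need to be installed.
# print(missing_packages(REQUIRED))
```

Running `missing_packages(REQUIRED)` returns an empty list when the environment is fully set up.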
We use two datasets, HMDB51 and UCF101, restricted to the first 10 classes of each. The full datasets and splits can be downloaded from:
-
For Temporal Segment Network and Motion-Augmented RGB Stream, download the dataset here, together with the annotation file.
Three models are implemented and evaluated on the above datasets: Temporal Segment Network (TSN), Motion-Augmented RGB Stream (MARS), and Video Masked Autoencoders (VideoMAE). Detailed implementations, training instructions, and our evaluation results are included in each corresponding folder.
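To illustrate the core idea behind TSN, the sketch below shows its sparse temporal sampling scheme: the video is divided into equal segments and one frame index is drawn from each (randomly during training, the segment center at test time). This is a minimal illustration of the published method, not the code in this repo.

```python
import random

def sample_segment_indices(num_frames, num_segments=3, train=True):
    """TSN-style sparse sampling: split a video of `num_frames` frames
    into `num_segments` equal segments and pick one frame index from each."""
    seg_len = num_frames // num_segments
    if seg_len == 0:
        # Video shorter than the number of segments: clamp to the last frame.
        return [min(i, num_frames - 1) for i in range(num_segments)]
    indices = []
    for i in range(num_segments):
        start = i * seg_len
        if train:
            # Random frame within the segment (data augmentation).
            indices.append(start + random.randrange(seg_len))
        else:
            # Deterministic center frame for evaluation.
            indices.append(start + seg_len // 2)
    return indices
```

The sampled frames are then passed through a shared 2D CNN and the per-segment predictions are aggregated (e.g. averaged) into a video-level score.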