Human action recognition using 3D joint information and pyramidal HOOFD features

Üstündağ, Barış Can (2014) Human action recognition using 3D joint information and pyramidal HOOFD features. [Thesis]

[thumbnail of BarisCanUstundag_10048318.pdf] PDF
BarisCanUstundag_10048318.pdf

Download (7MB)

Abstract

With the recent release of low-cost depth acquisition devices, there is an increasing trend towards investigation of depth data in a number of important computer vision problems, such as detection, tracking and recognition. Much work has focused on human action recognition using depth data from Kinect type 3D cameras since depth data has proven to be more effective than 2D intensity images. In this thesis, we develop a new method for recognizing human actions using depth data. It utilizes both skeletal joint information and optical flows computed from depth images. By drawing an analogy between depth and intensity images, 2D optical flows are calculated from depth images for the entire action instance. From the resulting optical flow vectors, patches are extracted around each joint location to learn local motion variations. These patches are grouped in terms of their joints and used to calculate a new feature called 'HOOFD' (Histogram of Oriented Optical Flows from Depth). In order to encode temporal variations, these HOOFD features are calculated in a pyramidal fashion. At each level of the pyramid, action instance is partitioned equally into two parts and each part is employed separately to form the histograms. Oriented optical flow histograms are utilized due to their invariance to scale and direction of motion. Naive Bayes and SVM classifiers are then trained using HOOFD features to recognize various human actions. We performed several experiments on publicly available databases and compared our approach with state-of-the-art methods. Results are quite promising and our approach outperforms some of the existing techniques.
Item Type: Thesis
Uncontrolled Keywords: Action recognition. -- Classiffication. -- RGBD images. -- Depth data. -- HOOFD. -- Aktivite tanıma. -- Sınıflandırma. -- RGBD imgeler. -- Derinlik verisi. -- HOOFD.
Subjects: T Technology > TJ Mechanical engineering and machinery > TJ163.12 Mechatronics
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Mechatronics
Faculty of Engineering and Natural Sciences
Depositing User: IC-Cataloging
Date Deposited: 16 May 2017 11:03
Last Modified: 26 Apr 2022 10:09
URI: https://research.sabanciuniv.edu/id/eprint/31354

Actions (login required)

View Item
View Item