Human action recognition using 3D joint information and pyramidal HOOFD features

Üstündağ, Barış Can (2014) Human action recognition using 3D joint information and pyramidal HOOFD features. [Thesis]


Official URL: http://risc01.sabanciuniv.edu/record=b1589541 (Table of Contents)


With the recent release of low-cost depth acquisition devices, there is an increasing trend towards investigating depth data in a number of important computer vision problems, such as detection, tracking and recognition. Much work has focused on human action recognition using depth data from Kinect-type 3D cameras, since depth data has proven to be more effective than 2D intensity images. In this thesis, we develop a new method for recognizing human actions using depth data. It utilizes both skeletal joint information and optical flows computed from depth images. By drawing an analogy between depth and intensity images, 2D optical flows are calculated from depth images for the entire action instance. From the resulting optical flow vectors, patches are extracted around each joint location to learn local motion variations. These patches are grouped by joint and used to calculate a new feature called 'HOOFD' (Histogram of Oriented Optical Flows from Depth). In order to encode temporal variations, these HOOFD features are calculated in a pyramidal fashion. At each level of the pyramid, the action instance is partitioned equally into two parts and each part is used separately to form the histograms. Oriented optical flow histograms are utilized due to their invariance to scale and direction of motion. Naive Bayes and SVM classifiers are then trained using HOOFD features to recognize various human actions. We performed several experiments on publicly available databases and compared our approach with state-of-the-art methods. The results are promising, and our approach outperforms some of the existing techniques.
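The pyramidal histogram construction described in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation: the abstract does not give the binning rule, the number of bins, or the number of pyramid levels, so the code below assumes the commonly used HOOF formulation (magnitude-weighted orientation bins, with left and right motion folded together, normalized to sum to 1) and assumes that each pyramid level halves every segment of the previous level. The function names `hoof` and `pyramidal_hoof`, and the parameters `n_bins` and `levels`, are hypothetical.

```python
import math


def hoof(flows, n_bins=8):
    """Magnitude-weighted orientation histogram of (dx, dy) flow vectors.

    Assumed binning: the angle of (|dx|, dy) lies in [-pi/2, pi/2], so a
    vector and its left/right mirror image fall into the same bin.
    Normalizing the histogram to sum to 1 removes the overall flow scale.
    """
    hist = [0.0] * n_bins
    for dx, dy in flows:
        mag = math.hypot(dx, dy)
        if mag == 0.0:
            continue  # zero flow carries no orientation
        theta = math.atan2(dy, abs(dx))
        b = min(int((theta + math.pi / 2) / math.pi * n_bins), n_bins - 1)
        hist[b] += mag
    total = sum(hist)
    return [h / total for h in hist] if total > 0 else hist


def pyramidal_hoof(frames, levels=3, n_bins=8):
    """Concatenate histograms over a temporal pyramid.

    frames: one list of (dx, dy) flow vectors per frame, taken from the
    patches around a single joint (assumed input layout).
    Level 0 covers the whole sequence, level 1 its two halves, and so on.
    """
    feature = []
    n = len(frames)
    for level in range(levels):
        parts = 2 ** level
        for p in range(parts):
            start = p * n // parts
            end = (p + 1) * n // parts
            flows = [v for frame in frames[start:end] for v in frame]
            feature.extend(hoof(flows, n_bins))
    return feature


if __name__ == "__main__":
    # Four toy frames with one flow vector each (made-up values).
    frames = [[(1, 0)], [(-2, 0)], [(0, 1)], [(0, -3)]]
    print(pyramidal_hoof(frames, levels=2, n_bins=4))
```

Under these assumptions, the feature for one joint has `n_bins * (2**levels - 1)` entries; features from all joints would then be combined and passed to a classifier such as Naive Bayes or an SVM, as the abstract describes.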

Item Type:Thesis
Uncontrolled Keywords:Action recognition. -- Classification. -- RGBD images. -- Depth data. -- HOOFD. -- Aktivite tanıma. -- Sınıflandırma. -- RGBD imgeler. -- Derinlik verisi. -- HOOFD.
Subjects:T Technology > TJ Mechanical engineering and machinery > TJ163.12 Mechatronics
ID Code:31354
Deposited By:IC-Cataloging
Deposited On:16 May 2017 11:03
Last Modified:16 May 2017 11:03
