Demirel, Berker and Özkan, Hüseyin (2024) Decompl: decompositional learning with attention pooling for group activity recognition from a single volleyball image. In: IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates
Full text not available from this repository. (Request a copy)
Official URL: https://dx.doi.org/10.1109/ICIP51287.2024.10647499
Abstract
Group Activity Recognition (GAR) aims to detect the activity performed by multiple actors in a scene. Prior works model the spatio-temporal features based on the RGB, optical flow or keypoint data types. On the contrary, our hypothesis is that by only using the RGB data without temporality, the performance can be maintained with a negligible loss in accuracy. To that end, we propose a novel GAR technique for volleyball videos, DECOMPL, which consists of two complementary branches. In the visual branch, it extracts the features using attention pooling. In the coordinate branch, it considers the configuration of the players and extracts the spatial information from the box coordinates. Moreover, we analyzed the Volleyball dataset that the recent literature is mostly based on, and systematically reannotated it to emphasize the group concept. Experimental results demonstrated the effectiveness of the proposed model DECOMPL, which delivered the best/second best GAR performance with the reannotations/original annotations among the comparable state-of-the-art methods. Code and new annotations are available at GitHub: https://github.com/berkerdemirel/decompl.
Item Type: | Papers in Conference Proceedings |
---|---|
Uncontrolled Keywords: | Attention pooling; Group activity recognition; Image classification; Volleyball |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Electronics Faculty of Engineering and Natural Sciences |
Depositing User: | Hüseyin Özkan |
Date Deposited: | 22 Apr 2025 12:13 |
Last Modified: | 22 Apr 2025 12:13 |
URI: | https://research.sabanciuniv.edu/id/eprint/51382 |