A new pose-based representation for recognizing actions from multiple cameras

Pehlivan Tort, Selen; DUYGULU ŞAHİN, PINAR

doi:10.1016/j.cviu.2010.11.004

A new pose-based representation for recognizing actions from multiple cameras

Pehlivan Tort S., DUYGULU ŞAHİN P.

Computer Vision and Image Understanding, cilt.115, sa.2, ss.140-151, 2011 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 115 Sayı: 2
Basım Tarihi: 2011
Doi Numarası: 10.1016/j.cviu.2010.11.004
Dergi Adı: Computer Vision and Image Understanding
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.140-151
Anahtar Kelimeler: Pose representation, Action recognition, Arbitrary view, Multi-camera
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
TED Üniversitesi Adresli: Hayır

Özet

We address the problem of recognizing actions from arbitrary views for a multi-camera system. We argue that poses are important for understanding human actions and the strength of the pose representation affects the overall performance of the action recognition system. Based on this idea, we present a new view-independent representation for human poses. Assuming that the data is initially provided in the form of volumetric data, the volume of the human body is first divided into a sequence of horizontal layers, and then the intersections of the body segments with each layer are coded with enclosing circles. The circular features in all layers (i) the number of circles, (ii) the area of the outer circle, and (iii) the area of the inner circle are then used to generate a pose descriptor. The pose descriptors of all frames in an action sequence are further combined to generate corresponding motion descriptors. Action recognition is then performed with a simple nearest neighbor classifier. Experiments performed on the benchmark IXMAS multi-view dataset demonstrate that the performance of our method is comparable to the other methods in the literature. © 2010 Elsevier Inc. All rights reserved.