Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis
Previous work on action recognition has focused on adapting hand-designed local features, such as SIFT or HOG, from static images to the video domain. In this paper, we propose ...