Abstract
Action Recognition in videos is an active research field that is fueled by an acute need, spanning several application domains. Still, existing systems fall short of the applications' needs in real-world scenarios, where the quality of the video is less than optimal and the viewpoint is uncontrolled and often not static. In this paper, we extend the Motion Interchange Patterns (MIP) framework for action recognition. This effective framework encodes motion by capturing local changes in motion directions and additionally uses mechanisms to suppress static edges and compensate for global camera motion. Here, we suggest to apply the MIP encoding on gradient-based descriptors to enhance invariance to light changes and achieve a better description of the motion's structure. We compare our method using Patterns of Oriented Edge Magnitudes (POEM) and Difference of Gaussians (DoG) as gradient-based descriptors to the original MIP on two challenging large-scale datasets.
Keywords
Affiliated Institutions
Related Publications
PCA-SIFT: a more distinctive representation for local image descriptors
Stable local feature detection and representation is a fundamental component of many image registration and object recognition algorithms. Mikolajczyk and Schmid (June 2003) rec...
Video Google: a text retrieval approach to object matching in videos
We describe an approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video. The object is represented by a s...
Natural Feature Detection on Mobile Phones with 3D FAST
In this paper, we present a novel feature detection approach designed for mobile devices, showing optimized solutions for both detection and description. It is based on FAST (Fe...
A general framework for object detection
This paper presents a general trainable framework for object detection in static images of cluttered scenes. The detection technique we develop is based on a wavelet representat...
ORB: An efficient alternative to SIFT or SURF
Feature matching is at the base of many computer vision problems, such as object recognition or structure from motion. Current methods rely on costly descriptors for detection a...
Publication Info
- Year
- 2013
- Type
- article
- Volume
- 2
- Pages
- 263-268
- Citations
- 12
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1109/cvprw.2013.46