Abstract

The latency of a perception system is crucial for a robot performing interactive tasks in dynamic human environments. We present MOPED, a fast and scalable perception system for object recognition and pose estimation. MOPED builds on POSESEQ, a state of the art object recognition algorithm, demonstrating a massive improvement in scalability and latency without sacrificing robustness. We achieve this with both algorithmic and architecture improvements, with a novel feature matching algorithm, a hybrid GPU/CPU architecture that exploits parallelism at all levels, and an optimized resource scheduler. Using the same standard hardware, we achieve up to 30× improvement on real-world scenes.

Keywords

Computer scienceScalabilityRobustness (evolution)Latency (audio)Artificial intelligencePoseCognitive neuroscience of visual object recognitionComputer visionExploitLow latency (capital markets)Feature extractionReal-time computing

Affiliated Institutions

Related Publications

Publication Info

Year
2010
Type
article
Citations
100
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

100
OpenAlex

Cite This

Manuel Martínez, Alvaro Collet, Siddhartha S Srinivasa (2010). MOPED: A scalable and low latency object recognition and pose estimation system. . https://doi.org/10.1109/robot.2010.5509801

Identifiers

DOI
10.1109/robot.2010.5509801