Sections
You are here: Home Members Martin Winter PhD-Thesis

Spatial Relations of Features and Descriptors for Appearance Based Object Recognition

Authors Martin Winter
School Institute for Computer Graphics and Vision, Graz University of Technology
Address

Inffeldgasse 16a/2nd floor, A-8010 Graz, Austria

Date November 2007
Abstract

Appearance based object recognition using local features is a very wide area of research. Up to now, a lot of promising approaches have been proposed, still some key issues remain open. There is a requirement for substantial improvements with respect to algorithm runtime, necessary storage space or recognition performance on more comprehensive databases. In this thesis, we investigate simple spatial relations on different levels of the recognition pipeline and propose four novel algorithms to improve the recognition performance of specific object recognition tasks. First we develop a novel distinguished region detector which is complementary to existing approaches. The basic idea is to find distinguished regions by spatially related interest points and we introduce the concept of ‘maximal stableness across scale’. Therefore, the detected regions are called ‘Maximally Stable Corner Clusters (MSCC)’. Second, we design a new descriptor based on the two dimensional joint occurrence histogram of local orientations, which is ideally suited for the regions detected by the MSCCs. In a set of experiments, we demonstrate competitive performance of the developed detector and descriptor approach with respect to other state of the art approaches. Our detector consistently detects regions different from those found by other detectors. Subsequently we present a completely new object representation based only on the binary joint occurrences of quantized descriptors. This representation, despite being high dimensional, is very sparse and efficient. Moreover we can verify, that a high recognition performance is obtained even if we work with very weak, non discriminative descriptors. Finally we propose an efficient method to substantially increase the recognition performance of a standard vocabulary tree based recognition system with our novel binary joint occurrences representation. Extensive experimental evaluations on rather challenging recognition tasks show the robustness of the proposed algorithms to partial occlusions and substantial background clutter. Investigation of the scaling behavior and some profiling experiments finally demonstrate the efficiency and appropriateness of the developed methods to solve specific object recognition tasks.

Link

For a PDF-version of the thesis please contact the author at JOANNEUM RESEARCH.

[Powered by Plone]