AIR: An Agent for Robust Image Matching and Retrieval

Authors

  • Jimmy Addison Lee Institute for Infocomm Research
  • Attila Szabo Institute for Infocomm Research
  • Yiqun Li Institute for Infocomm Research

Keywords:

Image recognition, Image matching, Image retrieval, Spatial relation approach, longest increasing subsequence

Abstract

This paper presents a novel scheme coined AIR (Agent for Image Recognition), acting as an agent, to oversee the image matching and retrieval processes. Firstly, neighboring keypoints within close spatial proximity are examined and used to hypothesize true keypoint matches. While this approach is robust to noise (e.g. a tree) since spatial relation is considered, missing (undetected) keypoints in one image can also be recovered resulting in more keypoint matches. Secondly, the agent is able to recognize instability of projective transformations in certain cases (e.g. non-planar scenes). The geometric approach is substituted with LIS (Longest Increasing Subsequence) approach which does not require any complex geometric transformations. The effectiveness of AIR is substantiated by an image retrieval experiment which demonstrates that it achieves a twofold increase in true matches and higher matching accuracy when compared to RANSAC homography approach.

Downloads

Download data is not yet available.

References

Agarwala A, Agrawala M, Cohen M, Salesin D, Szeliski R (2006). Photograping long scenes with multi-viewpoint panoramas. ACM Trans. on Graphics (SIGGRAPH). 25(3): 853-861.

Brown M, Lowe DG (2003). Recognizing panoramas. Proc. 9th IEEE Int’l Conference on Computer Vision (ICCV). 2: 1218-1225.

Brown M, Lowe DG (2007). Automatic panoramic image stitching using invariant features. Int’l Journal of Computer Vision (IJCV). 74(1): 59-73.

Baumberg A (2000). Reliable feature matching across widely separated views. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 1: 774-781.

Goedeme T, Tuytelaars T, Van-Gool L (2004). Fast wide baseline matching for visual navigation. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 1: 24-29.

Kannala J, Brandt SS (2007). Quasi-dense wide baseline matching using match propagation. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 1-8.

Lee JA, Yow KC, Chia YS (2009). Robust matching of building facades under large viewpoint changes. Proc. 12th IEEE Int’l Conference on Computer Vision (ICCV). 1258-1264

Katare A, Mitra SK, Banerjee A (2007). Content based image retrieval system for multi object images using combined features. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 595-599

Wang J, Zha H, Cipolla R (2005). Combining interest points and edges for content-based image retrieval. Proc. 12th IEEE Int’l Conference on Image Processing (ICIP). 3: 1256-1259

Belongie S, Malik J, Puzicha J (2002). Shape matching and object recognition using shape contexts. IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI). 23(4): 509-522

Frome A, Huber D, Kolluri R, Billow T, Malik J (2004). Recognizing objects in range data using regional point descriptors. Proc. 8th European Conference on Computer Vision (ECCV). 3: 224-237

Lowe DG (2004). Distinctive image features from scale-invariant keypoints. Int’l Journal of Computer Vision (IJCV). 60(2): 91-110

Leordeanu M, Hebert M, Sukthankar R (2007). Beyond local appearance: Category recognition from pairwise interactions of simple features. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 1-8

Mikolajczyk K, Leibe B, Schiele B (2006). Multiple object class detection with a generative model. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 1: 26-36

Mutch J, Lowe DG (2006). Multiclass object recognition with sparse, localized features. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 1: 11-18.

Mikolajczyk K, Schmid C (2005). A performance evaluation of local descriptors. IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI). 27(10): 1615-1630

Bay H, Tuytelaars T, Van-Gool L (2006). SURF: Speeded Up Robust Features. Proc. 9th European Conference on Computer Vision (ECCV). 1: 404-417

Lowe DG (2001). Local feature view clustering for 3D object recognition. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). 1: 682-688

Se S, Lowe DG, Little J (2002). Global localization using distinctive visual features. Proc. IEEE/RSJ Int’l Conference on Intelligent Robots and Systems (IROS). 226-231

Fischler MA., Bolles RC. (1981). Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Comm. of the ACM. 24 (6). 381-395

Hartley RI, Zisserman A (2000). Multiple View Geometry in Computer Vision. Cambridge University Press UK

Nister D., Stewenius H (2006). Scale recognition with a vocabulary tree. Proc. IEEE Computer Society Int’l Conference on Computer Vision and Pattern Recognition (CVPR). Vol. 2. pp. 2161-2168

Ballard DH (1981). Generalizing the hough transform to detect arbitrary shapes. Pattern Recognition (PR). 13 (2). 111-122

Grimson WEL (1990). Object Recognition by Computer: The Role of Geometric Constraints. The MIT Press Cambridge. 263-284

Hough PVC (1962). Method and means for recognizing complex patterns. U.S. Patent 3069654

Fredman M (1975). On computing the length of longest increasing subsequences. Discrete Mathematics. 11 (1). 29-35

Faugeras O (1993). Three-Dimensional Computer Vision: A Geometric Viewpoint. The MIT Press Cambridge

Chandrasekhar V, Chen DM, Tsai SS, Cheung NM, Chen H, Takacs G, Reznik Y, Vedantham R, Grzeszczuk R, Bach J, Girod B (2011). The stanford mobile visual search data set. Proc. 2nd Annual ACM SIGMM Conference on Multimedia Systems (MMSys). 117-122

Downloads

Published

19.06.2013

How to Cite

Lee, J. A., Szabo, A., & Li, Y. (2013). AIR: An Agent for Robust Image Matching and Retrieval. International Journal of Intelligent Systems and Applications in Engineering, 1(2), 34–39. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/10

Issue

Section

Research Article