16 January 2006 Using context and similarity for face and location identification
Marc Davis, Michael Smith, Fred Stentiford, Adetokunbo Bamidele, John Canny, Nathan Good, Simon King, Rajkumar Janakiraman
Proceedings Volume 6061, Internet Imaging VII; 60610E (2006) https://doi.org/10.1117/12.650981
Event: Electronic Imaging 2006, San Jose, California, United States
Abstract
This paper describes a new approach to the automatic detection of human faces and places depicted in photographs taken on cameraphones. Cameraphones offer a unique opportunity to pursue new approaches to media analysis and management: namely, to combine the analysis of automatically gathered contextual metadata with media content analysis to fundamentally improve image content recognition and retrieval. Current approaches to content-based image analysis are not sufficient to enable retrieval of cameraphone photos by high-level semantic concepts, such as who is in the photo or what the photo actually depicts. In this paper, new methods for determining image similarity are combined with analysis of automatically acquired contextual metadata to substantially improve the performance of face and place recognition algorithms. For faces, we apply Sparse-Factor Analysis (SFA) to both the automatically captured contextual metadata and the results of Principal Components Analysis (PCA) of the photo content to achieve 60% recognition accuracy for people depicted in our database of photos, which is 40% better than media analysis alone. For location, grouping visually similar photos using a model of Cognitive Visual Attention (CVA) in conjunction with contextual metadata analysis yields a significant improvement over color histogram and CVA methods alone. Location retrieval precision improves from 30% for color histogram and CVA image analysis, to 55% using contextual metadata alone, to 67% when contextual metadata is combined with CVA image analysis. The combination of context and content analysis indicates the faces and places depicted in cameraphone photos significantly better than image analysis or context analysis alone. We believe these results indicate the possibilities of a new context-aware paradigm for image analysis.
© (2006) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
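The general idea of combining a content-based score with a context-based score can be illustrated with a small sketch. This is not the paper's method: the abstract uses Sparse-Factor Analysis, whereas the example below swaps in a plain weighted sum purely for illustration. The function names (`pca_project`, `rank_candidates`), the `weight` parameter, the `1 / (1 + distance)` similarity, and the toy data are all assumptions introduced here.

```python
import numpy as np


def pca_project(gallery, k):
    """Fit PCA on the gallery rows and return (projections, mean, components)."""
    mean = gallery.mean(axis=0)
    centered = gallery - mean
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:k]
    return centered @ components.T, mean, components


def rank_candidates(query, gallery, context_prior, k=2, weight=0.5):
    """Rank gallery entries by a weighted sum of content and context scores.

    Hypothetical stand-in for the fusion step: a linear combination,
    NOT the Sparse-Factor Analysis described in the abstract.
    """
    projected, mean, components = pca_project(gallery, k)
    query_proj = (query - mean) @ components.T
    # Content score: closer in PCA space means a higher score in (0, 1].
    distances = np.linalg.norm(projected - query_proj, axis=1)
    content = 1.0 / (1.0 + distances)
    context = np.asarray(context_prior, dtype=float)
    fused = weight * content + (1.0 - weight) * context
    # Indices of gallery entries, best match first.
    return np.argsort(-fused)


# Toy data: three "face" vectors and a query identical to the first one.
gallery = np.array([[1.0, 0.0, 0.0, 0.0],
                    [0.0, 1.0, 0.0, 0.0],
                    [0.0, 0.0, 1.0, 0.0]])
query = gallery[0].copy()

# With an uninformative context prior, content similarity decides the ranking.
uniform_rank = rank_candidates(query, gallery, [1 / 3, 1 / 3, 1 / 3])

# With weight=0.0 only the context prior is used (e.g. who is usually nearby).
context_rank = rank_candidates(query, gallery, [0.1, 0.2, 0.7], weight=0.0)
```

In this sketch, `weight` controls how much the image content counts relative to the contextual metadata. Setting it to 1.0 or 0.0 gives content-only or context-only ranking, which loosely mirrors the comparisons reported in the abstract.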
Marc Davis, Michael Smith, Fred Stentiford, Adetokunbo Bamidele, John Canny, Nathan Good, Simon King, and Rajkumar Janakiraman "Using context and similarity for face and location identification", Proc. SPIE 6061, Internet Imaging VII, 60610E (16 January 2006); https://doi.org/10.1117/12.650981
CITATIONS
Cited by 25 scholarly publications and 11 patents.
KEYWORDS
Image analysis
Facial recognition systems
Cameras
Visualization
Machine vision
Analytical research
Computer vision technology