Computational Vision | ICS

Where's Waldo: Matching People in Images of Crowds

Rahul Garg, Deva Ramanan, Steve Seitz, Noah Snavely

Given a community-contributed set of photos of a crowded public event, this paper addresses the problem of finding all images of each person in the scene. The problem is challenging because of large changes in camera viewpoint, severe occlusions, low resolution, and photos that come from tens or hundreds of different photographers. Despite these challenges, the problem is made tractable by exploiting a variety of visual and contextual cues: appearance, timestamps, camera pose, and co-occurrence of people. The paper demonstrates an approach that integrates these cues to enable high-quality person matching in community photo collections downloaded from Flickr.com.

Download: pdf

Text Reference

Rahul Garg, Deva Ramanan, Steve Seitz, and Noah Snavely. Where's Waldo: Matching People in Images of Crowds. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.

BibTeX Reference

@inproceedings{GargRSS_CVPR_2011,
    author = "Garg, Rahul and Ramanan, Deva and Seitz, Steve and Snavely, Noah",
    booktitle = "IEEE Conference on Computer Vision and Pattern Recognition (CVPR)",
    title = "Where's Waldo: Matching People in Images of Crowds",
    year = "2011",
    tag = "object_recognition,people"
}

Computational Vision at UC Irvine