Why Did the Person Cross the Road (There)? Scene Understanding Using Probabilistic Logic Models and Common Sense Reasoning

TitleWhy Did the Person Cross the Road (There)? Scene Understanding Using Probabilistic Logic Models and Common Sense Reasoning
Publication TypeBook Chapters
Year of Publication2010
AuthorsKembhavi A, Tom Yeh, Davis LS
EditorDaniilidis K, Maragos P, Paragios N
Book TitleComputer Vision – ECCV 2010
Series TitleLecture Notes in Computer Science
Volume6312
Pagination693 - 706
PublisherSpringer Berlin / Heidelberg
ISBN Number978-3-642-15551-2
Abstract

We develop a video understanding system for scene elements, such as bus stops, crosswalks, and intersections, that are characterized more by qualitative activities and geometry than by intrinsic appearance. The domain models for scene elements are not learned from a corpus of video, but instead, naturally elicited by humans, and represented as probabilistic logic rules within a Markov Logic Network framework. Human elicited models, however, represent object interactions as they occur in the 3D world rather than describing their appearance projection in some specific 2D image plane. We bridge this gap by recovering qualitative scene geometry to analyze object interactions in the 3D world and then reasoning about scene geometry, occlusions and common sense domain knowledge using a set of meta-rules. The effectiveness of this approach is demonstrated on a set of videos of public spaces.

URLhttp://dx.doi.org/10.1007/978-3-642-15552-9_50