Machine Learning and Friends Lunch |
||||
|
Computer Vision and Machine Learning - Representations and Algorithms for Creating Interactive Visual MediaChris Pal UMass AbstractStandard video is a simple stack of rectangular images of fixed resolution, displayed linearly in time. Can we create a representation of a scene that places dynamic, high resolution content in the directions we are most likely to look? I tell you how to do this.
I start with a top down view of how to decompose a complex scene
using
probabilistic generative models. I contrast the computational
efficiency and
representational power of different models. I then focus on the
goal of
efficiently constructing a high quality representation and
re-examine the tasks
involved. I outline the critical computations required for
registering
high-definition quality video frames into huge, high quality
panoramic
representations. I show how performing this task effectively and
efficiently then
facilitates the construction of rich, animated and interactive
representations.
Finally, I show how effective video registration also empowers an
interactive
system for real-time panoramic photography. This talk represents a
summary of my contributions described in a series of papers at ECCV, ICCV, CVPR,
SIGGRAPH, OZCHI and other venues.
|