March 19th, 2012 @ 3:30pm Tamara Berg (Stony Brook)

Learning from Descriptive Text


People communicate using language, whether spoken, written, or typed. A significant amount of this language describes the world around us, especially the visual world in an environment, or depicted in images or video. Such visually descriptive language is potentially a rich source of 1) information about the world, especially the visual world, and 2) training data for how people construct natural language to describe imagery. In addition there exist billions of photographs with associated text available on the web; examples include web pages, captioned or tagged photographs, and video with speech or closed captioning. In this talk I will describe several projects related to images, descriptive text, and depiction, including recent work on generating simple natural language to describe images. All papers, created datasets, and demos are available on my webpage at:


Tamara Berg received her B.S. in Mathematics and Computer Science from the University of Wisconsin, Madison in 2001. She then completed a PhD from the University of California, Berkeley in 2007 and spent 1 year as a research scientist at Yahoo! Research. She is currently an Assistant Professor in the computer science department at Stony Brook University and a core member of the consortium for Digital Art, Culture, and Technology (cDACT). Her research straddles the boundary between Computer Vision and Natural Language Processing with applications to large scale recognition and multimedia retrieval.

seminars/seminaritems/2012-03-19.txt · Last modified: 2012/03/14 14:17 by silberman