Recent Changes - Search:

MLFL Home

University of Massachusetts

MLFL Wiki

The Structural Topic Model And Applied Social Science

Abstract: Statistical models of text have become increasingly popular in statistics and computer science as a method of exploring large document collections. Social scientists often want to move beyond exploration, to measurement and experimentation, and make inference about social and political processes that drive discourse and content. In this talk, I will overview our recent work developing a novel topic model which supports this type of substantive research by modeling the relationships between observed covariates and latent topics. Our approach uses a simple generalized linear model framework to allow the analyst to condition on arbitrary structure affecting topic prevalence and content. I discuss applications from across the social sciences: the analysis of survey experiments, end of course surveys in MOOCs and media reporting in China. In each case we show how to leverage problem specific structure and perform inference on the resulting covariate relationships. All the methods described are available as part of the open source R package, stm, available at structuraltopicmodel.com.

Bio: Brandon Stewart is a Ph.D. Candidate in the department of Government at Harvard University. His work centers on methods for automated content analysis in the social sciences with a particular focus on international relations.

Edit - History - Print - Recent Changes - Search
Page last modified on March 23, 2014, at 08:04 PM