I'm a fourth-year Ph.D. student at UMass Amherst working in the Information Extraction and Synthesis Laboratory with Professor Andrew McCallum. Previously, I received a B.S. in Computer Science from Tufts University and worked as a researcher at MIT Lincoln Laboratory building intelligent decision support systems for US intelligence analysts. I spent the summer of 2014 at Google training models of worker-task interaction that I used to optimize a new crowdsourcing platform.
I'm primarily interested in algorithms for data integration--or combining multiple, heterogeneous sources of data into a single, consistent knowledge base. My work draws on and develops methods for information extraction, entity resolution and other forms of scalable statistical inference. I'm also broadly interested in interactive machine learning, (integer) linear programming and crowdsourcing as a tool for targeted data acquisition.
I am also the chief designer and maintainer of ReSearcher.cc, a database of scientific affiliations and collaborations. Our backend integration engine dynamically (on page-load) produces ReSearcher's content by synthesizing multiple sources of data including edits from logged in users. So far, ReSearcher has been used by ICCV and CVPR to help identify conflicts of interest in peer review. (If you're interested in using ReSearcher to support your conference, contact me!)
- Getting More for Less: Optimized Crowdsourcing with Dynamic Tasks and Goals. Kobren, A., Tan, C.H., Ipeirotis, P., Gabrilovich, E.The International Conference on the World Wide Web (WWW). Florence, Italy. May 2015. [bibtex]
- Domain Specific Knowledge Base Construction via Crowdsourcing. Kobren, A., Logan, T., Sampangi, S., McCallum, A.NIPS '14 Workshop on Automated Knowledge Base Construction (AKBC). Montreal, Canada. December 2013. (Outstanding Paper Award) [bibtex]
- Universal Schema for Slot Filling and Cold Start: UMass IESL at TACKBP 2013. Singh, S., Yao, L., Belanger, D., Kobren, A., Anzaroot, S., Wick, M., Passos, A., Pandya, H., Choi, J., Martin, B., and McCallum, A. Text Analysis Conference (Knowledge Base Population Track) '13 Workshop (TAC KBP). Gaithersburg, Maryland, USA. November 2013. [bibtex]
- Assessing Confidence of Knowledge Base Content With an Experimental Study in Entity Resolution. Wick, M.L., Singh, S., Kobren, A. and McCallum, A. CIKM '13 Workshop on Automated Knowledge Base Construction (AKBC). San Francisco, California, USA. October 2013. [bibtex]
- Large-scale Author Coreference via Hierarchical Entity Representations. Wick, M.L., Kobren, A. and McCallum, A. ICML '13 Workshop on Peer Reviewing and Publishing Models (PEER). Atlanta, Georgia, USA. June 2013. [bibtex]
- Probabilistic Reasoning about Human Edits in Information Integration. Wick, M.L., Kobren, A. and McCallum, A. ICML '13 Workshop on Machine Learning Meets Crowdsourcing. Atlanta, Georgia, USA. June 2013. [bibtex]
Other Fun Stuff
In my spare time, I enjoy cooking (especially ethnic foods that involve uncommon spices), learning for free, climbing, playing soccer and other sports, composing/playing music, snowboarding, playing German-style board games, traveling the world, listening to NPR (e.g. The Moth Radio Hour and wNYC's RadioLab), eating gummy candy, and learning about philosophy, religion, and nutrition.
In a previous life, I co-found a start-up and was a junior olympic Tae Kwon Do champ.
My favorite book is Zen and the Art of Motorcycle Maintenance.
Amherst, Massachusetts, USA
akobren (at) cs (dot) umass (dot) edu