My name is Simeng Sun (), I am a 2nd year Ph.D. student in Computer Science at UMass Amherst advised by Prof. Mohit Iyyer. I was very fortunate to have worked with Prof. Ani Nenkova while I was a master's student at UPenn, Nitin Madnani, Aoife Cahill and Martin Chodorow while I was an intern at ETS, and Prof. Hui Zhang while I studied at Beihang University. (Twitter), (Linkedin), (Google Scholar)


Ph.D. student in Computer Science, UMass Amherst Aug. 2019 - now
M.S.E in Computer and Information Science, UPenn Aug. 2017 - May 2019
B.E. in Computer Science and Technology, Beihang University Sep. 2013 - Jun. 2017
Exchange student, Trinity College Dublin Sep. 2015 - Jan. 2016


Research Interests

I am generally interested in natural language processing and machine learning. In the past, I did research on author name disambiguation, text summarization and named entity recognition. More recently, I focus on language modeling and machine translation.


Hard-Coded Gaussian Attention for Neural Machine Translation
Weiqiu You*, Simeng Sun*, Mohit Iyyer
Annual Meeting of the Association for Computational Linguistics (ACL), long, 2020

The Feasibility of Embedding Based Automatic Evaluation for Single Document Summarization
Simeng Sun, Ani Nenkova
Empirical Methods in Natural Language Processing (EMNLP), short, 2019

How to Compare Summarizers without Target Length? Pitfalls, Solutions and Re-Examination of the Neural Summarization Literature
Simeng Sun, Ori Shapira, Ido Dagan, Ani Nenkova
North American Chapter of the Association for Computational Linguistics (NAACL-HLT), NeuralGen Workshop, 2019

Name Disambiguation for Chinese Scientific Authors with Multi-Level Clustering
Simeng Sun, Hui Zhang, Ning Li, Yong Chen
IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC), 2017


Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models
Subhajit Naskar, Amirmohammad Rooshenas, Simeng Sun, Mohit Iyyer, Andrew McCallum


CS685: Advanced natural language processing at UMass Amherst, Fall 2020 (TA)
CS585: Introduction to Natural Language Processingat UMass Amherst, Fall 2019 (TA)
CIS520: Machine Learning at UPenn (TA)
CIS421/521: Artificial Intelligence at UPenn, Fall 2018 (TA)


Email: simengsun AT umass DOT edu
Address: (To Be Added)


I enjoy listening to Lexicon Valley in my spare time.