I’m a sixth-year PhD student from Computer Vision Lab at CICS UMass Amherst, advised by Prof. Subhransu Maji. Before joining UMass, I obtained my bachelor’s degree from Peking University in 2015 with double majors in physics and computer softwares.
I’m interested in broad topics in computer vision, especially the combination of vision and natural language. I study the joint modeling of visual and language signals, and leverage the supervision of language to further understand various visual domains including fine-grained categories, objects/stuff in images, visual textures, and videos.
I’m expected to graduate in Summer 2021. I’m looking for research positions in the industry.
I worked with Xiaohui Shen, Xiaojie Jin, and Longyin Wen on localizing clips in videos with natural language descriptions.
I worked with Nick Johnston, George Toderici, David Minnen, and Michele Covell on deep image compression.