I am a Masters PhD candidate in the College of Information and Computer Sciences at the University of Massachusetts Amherst. I work with Prof. Yanlei Diao in the Database and Information Management Lab.
I completed my undergraduate education in Computer Science and Engineering from the Indian Institute of Technology Guwahati. After graduation from IIT Guwahati, I worked at Strand Life Sciences where I was part of the team developing Strand NGS.
My research is supported by NSF, Google Research, IBM Research, and NEC Laboratories.
My Google Scholar Citations page.
Yiwen Zhu, Matteo Interlandi, Abhishek Roy, Krishnadhan Das, Hiren Patel, Malay Bag, Hitesh Sharma, Alekh Jindal. Phoebe: A Learning-based Checkpoint Optimizer. VLDB 2021.
Rathijit Sen, Abhishek Roy, Alekh Jindal, Rui Fang, Jeff Zheng, Xiaolei Liu, Ruiping Li. AutoExecutor: Predictive Parallelism for Spark SQL Queries. VLDB 2021 (Demo).
Abhishek Roy, Alekh Jindal, Priyanka Gomatam, Xiating Ouyang, Ashit Gosalia, Nishkam Ravi, Swinky Mann, Prakhar Jain. SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft. VLDB 2021 (Industry).
Alekh Jindal, Shi Qiao, Hiren Patel, Abhishek Roy, Jyoti Leeka, Brandon Haynes. Production Experiences from Computation Reuse at Microsoft. EDBT 2021 (Industry).
Gray Systems Lab. Cloudy with high chance of DBMS: A 10-year prediction for Enterprise-Grade ML. CIDR 2020.
Alekh Jindal, Hiren Patel, Abhishek Roy, Shi Qiao, Jarod Yin, Rathijit Sen, and Subru Krishnan. Peregrine: Workload Optimization for Cloud Query Engines. SoCC 2019. PDF
Abhishek Roy, Alekh Jindal, Hiren Patel, Ashit Gosalia, Subru Krishnan, and Carlo Curino. SparkCruise: Handsfree Computation Reuse in Spark. VLDB 2019 (Demo). PDF
Abhishek Roy, Yanlei Diao, Uday Evani, Avinash Abhyankar, Clinton Howarth, RĂ©mi Le Priol, Toby Bloom. Massively Parallel Processing of Whole Genome Sequence Data: An In-Depth Performance Study. SIGMOD 2017. PDF
Yanlei Diao, Abhishek Roy, Toby Bloom. Building Highly-Optimized, Low-Latency Pipelines for Genomic Data Analysis. CIDR 2015. PDF
Abhishek Roy, Yanlei Diao, Evan Mauceli, Yiping Shen, Bai-Lin Wu. Massive Genomic Data Processing and Deep Analysis. VLDB 2012 (Demo). PDF
Ling Chen, Abhishek Roy. Event Detection from Flickr Data through Wavelet-based Spatial Analysis. CIKM 2009. PDF