Sudipta Saha Shubha
Endowed Fellow, University of Virginia | Research Intern @ Microsoft Systems for Generative AI
Contact: ss7krd@virginia.edu
Welcome to my home page!
Hi, I am Sudipta, a Computer Science PhD student at the University of Virginia (UVA). I am working with Dr. Haiying Shen from UVA and Dr. Anand Iyer from Georgia Tech. I am broadly interested in research opportunities at the intersection of networked systems and machine learning. My research involves cloud/high-performance/edge computing, computer networks, operating systems, GPU/TPU architecture, and deep learning (DL) models. I am currently developing cost-efficient, energy-efficient, and scalable infrastructures for distributed inference serving of generative AI (e.g., LLM) models.
I have published my research in top systems and ML conferences including OSDI, SIGCOMM, EuroSys, SoCC, IPDPS, NAACL, and ICASSP, and have a track record of impactful research transitioned to production through internships at Microsoft and Hewlett Packard Enterprise (HPE) Labs.
Before starting my PhD journey, I completed my B.Sc. in Computer Science and Engineering (CSE) at Bangladesh University of Engineering and Technology (BUET) in September 2017. After graduating, I worked as a software engineer at Works Applications, Singapore, and then as a full-time research assistant at the Department of CSE, BUET, in collaboration with Samsung Research.
News:
[May, 2024] Was awarded the John A. Stankovic Graduate Research Award, which is presented to a UVA CS PhD student with outstanding research productivity in an academic year.
[March, 2024] Conference paper on a cost-efficient, model interference-aware, and scalable deep learning (both CNNs and LLMs) inference serving system was accepted at OSDI 2024 with all-accept reviews on the very first submission attempt!
[September, 2023] Was awarded the University of Virginia Endowed Fellowship, which is awarded to a graduate student with outstanding research productivity (proposal, publications, and presentations) and awards/honors.
[May, 2023] Conference paper on an efficient end-to-end DNN inference serving system was accepted at SIGCOMM 2023! This was a dream project for me! Really happy it ended so well!
[May, 2023] Will be working on the DeepSpeed project at Microsoft Research during this summer!
[April, 2023] Was awarded an NSF travel grant to present a paper at IPDPS 2023!
[January, 2023] Conference paper on a large-scale distributed COVID-19 spread prediction system was accepted at IPDPS 2023!
[October, 2022] Conference paper was accepted at IEEE BigData 2022!
[September, 2021] Conference paper was published at IEEE CLOUD 2021!
[April, 2021] Conference paper was accepted at IEEE SECON 2021!
[September, 2020] Journal paper was accepted at Elsevier Smart Health 2021!
[February, 2020] Conference paper was accepted at LREC 2020!
[January, 2020] Conference paper was accepted at ICASSP 2020!
[June, 2019] Was awarded an ACL travel grant to present a paper at NAACL!
[February, 2019] Conference paper was accepted at NAACL 2019!
[September, 2018] Journal paper was published in the Journal of Network and Computer Applications (JNCA), Elsevier (Volume 124, 15 December 2018, Pages 44-62) (Journal Impact Factor: 6.281). Find it here: Paper Link.
[August, 2018] Started working as a Research Assistant at the Department of Computer Science and Engineering (CSE) of Bangladesh University of Engineering and Technology (BUET). Our research project is funded by Samsung Research, South Korea.