Research Interests: Foundation Models | Efficient Pre-training | Efficient Inference | Knowledge Distillation

Overview

I am a PhD student at the University of Texas at Austin, advised by Prof. Sujay Sanghavi in the Department of Electrical and Computer Engineering. For my research, I work on simple things, and simple things work for me. I am currently working on efficient training strategies for large models (mostly LLMs). Some of my recent work has been featured in Ahead of AI magazine, Marktechpost, and the Interconnects newsletter.

Before moving to Austin, I graduated with an M.Eng. degree in Information and Communication Engineering from Chongqing University of Posts and Telecommunications, Chongqing, China, in 2019, and received a B.Tech. degree in Electronics and Communication Engineering from the Maulana Abul Kalam Azad University of Technology (formerly known as West Bengal University of Technology), Kolkata, India. During my undergrad, I gloriously failed to scale up my startup, Tronix India, and later worked at Tech Mahindra, an Indian multinational IT firm.

I am a person who stutters; some info about stuttering is here.

Internships


Preprints

Selected Publications 


Selected Demos, Posters and Talks


Academic Services 

Recent Updates