Hi, I am Mostofa Patwary, Sr. Manager and Principal Research Scientist on the Applied Deep Learning Research team at NVIDIA. My research interests span Natural Language Processing, Large Scale Deep Learning, and High Performance Computing. I lead the large foundation language model training team, which is responsible for data curation, model training, evaluation, and the use of these models in real-world applications.


Previously, I worked as a senior researcher at the Silicon Valley AI Lab at Baidu Research, the Parallel Computing Lab at Intel Research, and Northwestern University in Illinois.


I received my PhD from the Department of Informatics at the University of Bergen, Norway. As part of the PhD program, I also studied as a research scholar at Purdue University, USA.


My bachelor's and master's degrees are from the Department of Computer Science and Engineering at Bangladesh University of Engineering and Technology (BUET), Bangladesh.

Email: mostofa dot patwary at gmail dot com 

Recent Selected Contributions:

Publications: