Current Openings
Research Associates/Assistants
Vaibhav Singh: DTU, India → Flipkart → IIITD, India (RA) → NYU, USA (Masters) → MILA, Canada (Ph.D)
Categorizing loss landscapes in low-rank acoustic models
Yash Thakran: IIITD, India (B.Tech) → Oxford Wave Research
Abuse detection from multilingual audio
Devansh Gupta: IIITD, India (B.Tech) → USC, USA (Ph.D)
Time-Frequency visualization for acoustic models
Siddhant Rai Viksit: IIITD, India (B.Tech)
Atypical speech analytics
Siya Garg: IIITD, India (B.Tech) → Google, India
Emotion-aware text-to-speech synthesis
Ritoma Sen: IIITD, India (B.Tech) → Microsoft, India → TU Munich (Masters)
Topological data analysis
Postdocs (Collaborative Work)
-
Doctoral Students
IIIT Delhi
Vishal Kumar [SERB JRF → TBO Mahalanobis Fellow]
Speaker verification and spoofing for smart devices
Jointly supervised by Dr. Mathew, IDIAP Switzerland
Suryaka Suresh [SERB Senior Research Fellow]
Topology of learning in deep neural networks
Puneet Singh [Institute Fellow]
Generative speech technology
Dibyajoyti [External - IIT Delhi]
Information flow in deep networks
Masters Students
IIITD, India
Aryan Chaudhary - (M.Tech CS-AI) → Dell R&D Center, India
Quaternionic neural networks for speech processing
Akash Verma - (M.Tech CS-AI) → Mercedes Benz
Continual learning
Barneet Singh - (M.Tech CS-AI) → Chegg
Audio deepfake detection
Akshat Patyal - (M.Tech CS-AI) → Keysight Technologies
Speech translation in articulatory space
Yash Agarwal - (M.Tech CS-AI) → Renesas Electronics
Neural style transfer in video diffusion models
Deepika - (M.Tech CS-AI) → Nvidia
Image-to-Music generation
IITD, India
Dibyajoyti Jena
Information Flow Across Neural Networks
Project Staff
Bishshoy Das (2022-23): IIT Delhi (IIITD-IITD MIFR Project)
Akanksha Singh (2024): JPMC Project
Interns
Jay Kaoshik: VIT, India (2022)
T. Pranav: TIET Patiala, Punjab (2022)
Utkarsh Garg: Graphic Era Dehradun, Uttarakhand (2023)
Ritul Agarwal: NSUT, India (2023)
Ranjit Patro: IISER Berhampur, India (2023)
Shruti Singh: UPES, India (2023)
Ujali Sharma: IGDTUW, India (2024)
Samarth Kukreja: NITK, India (2024)
Shambhavi: IIT Patna (2024)
Collaborators
Prof. David Clifton [CHI Lab, University of Oxford] - Healthcare informatics
Dr. Anshul Thakur [CHI Lab, University of Oxford] - Clinical Machine Learning
Prof. Jared Tanner [Mathematical Institute, University of Oxford] - Harmonic analysis and function approximation theory
Prof. Vidit Nanda [Mathematical Institute, University of Oxford] - Topological data analysis
Dr. Estelle Massart [UCLouvain, Belgium] - Optimization
Dr. Mathew M. Doss [Speech and Audio Group, IDIAP, Switzerland] - Speech/Audio processing
Prof. Sumantra Dutta Roy [Department of Electrical Engineering, IIT Delhi, India] - Computer vision
Dr. Karan Nathwani [Department of Electrical Engineering, IIT Jammu, India] - Speech processing
Dr. Aanchan Mohan [Northeastern University, Vancouver Campus, Canada] - Atypical speech processing