Profile Links:
I am currently working as an Institute Postdoctoral Fellow at IIT-Bombay Digital Audio Processing Lab. Prior to that I worked as a Speech Generation Research Intern at the Sony Research India (Sony Research & Development). I have obtained my PhD from the Computer Science and Engineering (CSE) Department of the National Institute of Technology Durgapur (NIT-DGP, West Bengal, India) (batch 2019-2024). My PhD research was based on "Speech Synthesis using Generative Adversarial Network (GAN)". Apart from that, I am profoundly interested in specific areas of applied AI research, namely, speaker identification and verification, speech command recognition, speaker anonymization, text-to-speech synthesis, singing voice synthesis, neural architecture search, medical image analysis, AI in agriculture, etc. I have obtained my M. Tech in CSE from NIT-DGP (batch 2017-2019). I have obtained my B.E (B.E/B.Tech) degree from Jorhat Engineering College, a state Government Engineering College of Assam (batch 2012-2016). I am also the co-initiator of the Soft Computing and Machine Learning Group (SCML Group: https://www.linkedin.com/company/scml-group). Several of my recent research works have been published in esteemed international journals and conferences such as IEEE TNNLS, IEEE TAI, ICASSP, APSIPA, SPECOM, IJCCI, etc. In addition to my research, I also have a deep interest in music, literature, painting and karate (martial arts).
Resume link: shorturl.at/SB3dg
Ph.D. Thesis link: https://tinyurl.com/yycn4b48
Total Publications (Journals, Conferences, Book Chapters): 20
Citation score: 156 H-index: 9 i10-index: 7
Contact Details:
Mail-id: sandipandhartsk03@gmail.com
Phone: +91-8638940660, +91-8876816368 (WhatsApp)
Postdoc Experience
Indian Institute of Technology Bombay (IIT-B)
Department: Department of Electrical Engineering (EE Dept.)
Duration: Dec 2024 - Nov 2025
Supervisor: Prof. (Dr.) Preeti Rao (Professor at the EE Dept, IIT-B)
Domain of research: Singing Voice Synthesis, Speaker Anonymization, Text-to-Speech Synthesis
Fellowship: IIT-Bombay Institute Postdoctoral Fellowship
Education Details (Ph.D., M.Tech, B.E)
With my Ph.D. supervisor Dr. Nanda Dulal Jana Sir (first from right), Assistant Prof of NIT Durgapur, CSE Dept, and co-supervisor Prof. (Dr.) Swagatam Das Sir (first from left), Professor and former HOD of ISI Kolkata, ECS-Unit
PhD: Computer Science and Engineering (CSE), National Institute of Technology (NIT) Durgapur, India (Institute of National Importance)
PhD supervisors: Dr. Nanda Dulal Jana (Assistant Prof of NIT Durgapur, CSE Dept) and Prof. (Dr.) Swagatam Das (Professor and former HOD of Indian Statistical Institute Kolkata, Electronics and Communication Sciences Unit).
Batch: 2019-2024
Domain of research : Speech Synthesis Using Deep Generative Models.
UGC-NET-2019 and GATE-2017 Qualified
Received NIT-Durgapur Institute Ph.D. Fellowship (2019-2024)
With my M.Tech project supervisor Prof. (Dr.) Gautam Sanyal Sir (Retired Professor and former HOD of NIT Durgapur CSE Dept)
M.Tech
M.Tech : Computer Science and Engineering (CSE), NIT Durgapur, India (Institute of National Importance)
Batch: 2017-2019
M.Tech final year thesis : Aesthetic Analysis of Images Containing Human Faces Using Deeplearning Approaches.
Final year project guide : Prof. (Dr.) Gautam Sanyal. (Retired professor and former HOD of NIT Durgapur CSE Dept)
Received GATE Scholarship (2017-2019)
JEC B.Tech CSE Batch of 2012-2016
B.E
B.E/B.Tech : Computer Science and Engineering (CSE), Jorhat Engineering College (JEC) (State Government Engineering College of Assam, India)
Batch: 2012-2016
Schooling
Class 12: Tinsukia College, (Assam, India) Science Stream. (Year:2010-2012)
Class 10: Tinsukia Railway High School (Assam, India). (Year: 2000-2010)
Experience
Working as an Institute Postdoctoral Fellow (IPDF) at the Digital Audio Processing (DAP) Lab of the Indian Institute of Technology Bombay (IIT-B) Electrical Engineering Department (From Dec 23, 2024) under the supervision of Prof. (Dr.) Preeti Rao. (https://shorturl.at/nI91H)
Speech Generation Research Intern at the Sony Research India: Sony Research & Development (April 15, 2024 - Dec 20, 2024) . (https://shorturl.at/LXLcj)
Lead Research Project Coordinator of a consultancy project titled "Development of AI Based Voice to Voice Synthesis", sponsored by Stone Media (Mumbai) in collaboration with NIT-DGP (Dr. Nanda Dulal Jana's Lab, CSE Dept) (March 1, 2024 - Aug 14, 2024). (https://shorturl.at/xUDUl)
Co-initiator of the Soft Computing and Machine Learning Group (SCML Group: https://www.linkedin.com/company/scml-group) since 2019 (Aug).
Recent Achievements
Recipient of the ERCIM Alain Bensoussan Postdoctoral Fellowship 2025 (link: https://fellowship.ercim.eu/)
Received invitation to deliver a talk on "The Rise of Generative AI: Redefining Image, Speech, and Music" by Sikim Manipal Institute of Technology (SMIT), India
Received Invitation to deliver a speech on "Speech Synthesis Reimagined: AI Models Mimicking Natural Human Speech" at SOA Deemed to be University, Bhubaneswar, Odisha, India
Received the best Ph.D. thesis award (6th Universal Innovators Leadership Awards 2025) at the 8th International Conference on Innovative Computing and Communication (ICICC) 2025 held in Delhi.
Received invitation to deliver a talk on "Recent Advancements in Speech Synthesis Technology" organized by Sikim Manipal Institute of Technology (SMIT), India
Joined IIT-Bombay as an Institute Postdoctoral Fellow (IPDF) at the Electrical Engineering Department (Dec, 2024).
Received invitation to give a talk on "Generative AI-based speech and music synthesis" in the Winter School on Deep Learning (WSDL) 2025 organized by ISI-Kolkata.
Successfully defended my PhD thesis on November 25, 2024, at the Department of Computer Science and Engineering, NIT Durgapur, India.
Received Invitation to deliver a speech on "Conversational-AI using Generative AI Models" at Sikim Manipal Institute of Technology (SMIT), Sikim
Received Invitation to give a talk on "Generative-AI based Speech Synthesis" at the Institute of Engineering and Management (IEM) Kolkata
Received invitation as a guest speaker to give a talk on "Revolutionizing Voice Assistants : Advances in Generative AI-based Speech Synthesis" at Sri Sathya Sai University for Human Excellence, Karnataka, Department of Mathematical and Communication Sciences
Received invitation to give a talk on "Speech Synthesis Using Deep Generative Models : Some Recent Approaches" at Manipal University Jaipur, Department of Computer and Communication Engineering
Presented my PhD Research works at the PhD thesis pre-submission seminar held in April 15, 2024, NIT Durgapur CSE Department
Received Research Internship offer from Sony Research India: Sony Research & Development (April 2024)
Academic Institutions and R&D Labs I Have Worked (will be working) With as a Student and Researcher