Present Designation:
R&D Staff Engineer, Infineon Technologies AG, Irvine, California | R&D Consultant, TCS Research - Mumbai, India | Post-doctoral Research Associate, University of Southern California (USC) Viterbi, ECE, Signal Analysis and Interpretation Laboratory (SAIL), USA.
B.Tech, M. E., Ph. D., Postdoc.
Phone: (+1)9495919225, (+91)7501774160
Email: monisankha.pal@gmail.com, monisankha86.pal@gmail.com
Research Interests:
Speech Signal Processing, Machine Learning, Deep Learning, Speaker Diarization, Speaker Recognition, Speech Enhancement, Spoken Language Understanding, Voice Spoofing and Anti-spoofing, Adversarial Attack and Defense in Deep Speaker Recognition, Voice Conversion, Speech Fluency.
Education:
Ph. D. in speech signal processing from the Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur, India (2018)
Thesis title: Voice Conversion, Its Impact on Voice Biometrics, and Development of Countermeasure.
M. E. in Communication Engineering from the Department of Electronics & Telecommunication Engineering, Jadavpur University, India (2011)
B. Tech in Electronics and Communication Engineering from Institute of Engineering & Management, West Bengal University of Technology, India (2008)
Publications:
Refereed Journal Publications:
Pal, M., Kumar, M., Peri, R., Park, T. J., Kim, S. H., Lord, C., Bishop, S., Narayanan, S., "Meta-learning with latent space clustering in generative adversarial network for speaker diarization." IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, 1204-1219, 2021. (Impact Factor: 3.398) [Cited by 9]
Jati, A., Hsu, C. C., Pal, M., Peri, R., AbdAlmageed, W., Narayanan, S., "Adversarial attack and defense strategies for deep speaker recognition systems." Computer Speech & Language, 68, 101199, 1-14, 2021. (Impact Factor: 2.116) [Cited by 15]
Kumar, K., Paul, D., Pal, M., Sahidullah, M., Saha, G., "Speech Frame Selection for Spoofing Detection with an Application to Partially Spoofed Audio-Data." International Journal of Speech Technology, Springer, 24, 193-203, 2021. (Impact Factor: 1.220) [Cited by 3]
Pal, M., Paul, D., Saha, G., "Synthetic speech detection using fundamental frequency variation and spectral features." Computer Speech & Language, 48, 31-50, 2018. (Impact Factor: 2.116) [Cited by 28]
Pal, M., and Saha, G., "Spectral Mapping Using Prior Re-estimation of i-vectors and System Fusion for Voice Conversion." IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(11), 2071-2084, 2017. (Impact Factor: 3.398) [Cited by 4]
Paul, D., Pal, M., Saha, G., "Spectral Features for Synthetic Speech Detection." IEEE Journal of Selected Topics in Signal Processing, 11, 605-617, 2017. (Impact Factor: 4.981) [Cited by 47]
Pal, M., and Saha, G., "On robustness of speech based biometric systems against voice conversion attack." Applied Soft Computing, 30, 214-228, 2015. (Impact Factor: 5.472) [Cited by 31]
Refereed Conference Publications:
Pal, M., Raikar, A., Panda, A., Kopparapu, K. S., "Synthetic speech detection using meta-learning with prototypical loss." arXiv preprint arXiv:2201.09470 (2022).
Pal, M., Jati, A., Peri, R., Hsu, C. C., AbdAlmageed, W., Narayanan, S., "Adversarial defense for deep speaker recognition using hybrid adversarial training." Proc. IEEE, ICASSP, Toronto, Canada, 6164-6168, 2021. [Cited by 6]
Pal, M., Kumar, M., Peri, R., Park, T. J., Kim, S. H., Lord, C., Bishop, S., Narayanan, S., "Speaker Diarization using Latent Space Clustering in Generative Adversarial Network." Proc. IEEE, ICASSP, Barcelona, Spain, 6504-6508, 2020. [Cited by 13]
Peri, R., Pal, M., Jati, A., Somandepalli, K., Narayanan, S., "Robust Speaker Recognition using Unsupervised Adversarial Invariance." Proc. IEEE, ICASSP, Barcelona, Spain, 6614-6618, 2020. [Cited by 14]
Park, T. J., Kumar, M., Flemotomos, N., Pal, M., Peri, R., Lahiri, R., Georgiou, P., Narayanan, S., "The Second DIHARD challenge: System Description for USC-SAIL Team." Proc. Interspeech, Austria, 998-1002, 2019. [Cited by 8]
Jati, A., Peri, R., Pal, M., Park, T. J., Kumar, N., Travadi, R., Georgiou, P., Narayanan, S., "Multi-task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech." Proc. Interspeech, Austria, 2463-2467, 2019. [Cited by 9]
Pal, M., Kumar, M., Peri, R., Narayanan, S., "A study of semi-supervised speaker diarization system using GAN mixture model." arXiv preprint arXiv:1910.11416, 2019. [Cited by 3]
Pal, M., Paul, D., Sahidullah, M., Saha, G., "Robustness of Voice Conversion Techniques Under Mismatched Conditions." arXiv preprint arXiv:1612.07523, 2016. [Cited by 2]
Paul, D., Pal, M., Saha, G., "Novel Speech Features for Improved Detection of Spoofing Attacks." Proceedings of IEEE India Conference (INDICON), New Delhi, 2015. [Cited by 15]
Pal, M., and Chattopadhyay, S., "A novel orthogonal minimum cross-correlation spreading code in CDMA system." Proceedings of IEEE International Conference on Emerging Trends in Robotics and Communication Technologies (INTERACT), 80-84, 2010. [Cited by 23]
Conferences to be communicated:
Pal, M., Raikar, A., Panda, A., Kopparapu, K. S., "Synthetic speech detection using meta-learning with prototypical loss." To be communicated.
Patent filed:
1. Saha, G., Paul, D., Pal, M., "System and method for automatic synthetic speech detection for speech based biometric authentication." Status: Filed (Ref: 1241/KOL/2015 dated December 3, 2015).
Others (Unsubmitted draft):
1. Pal, M., Paul, D., Sahidullah, M., Saha, G., "Robustness of Voice Conversion Techniques Under Mismatched Conditions." arXiv preprint arXiv:1612.07523, 2016.
Reviewer for:
Computer Speech & Language (Elsevier), Interspeech, ICASSP.
Society Membership:
IEEE (Student Member: 2010-2011)
IEEE Signal Processing Society (2010-2011)
International Speech Communication Association (ISCA) (2020-2021)
Skills:
Tools: Kaldi, scikit-learn, Librosa, PyTorch, TensorFlow, Keras, VOICEBOX, MSR Identity Toolbox, HSM, STRAIGHT, Audacity, Wavesurfer, Praat.
Programming: C, Python, Bash Scripting, MATLAB.
Languages: English, Bengali, and Hindi.