Publications

Journals (Submitted)

Bhattacharjee, S., Mishra, J., Shekhawat, H. S., & Prasanna, S. R. M. (2025). Improving ASR fairness for cleft lip and palate speech: A study on severity-aware augmentation. Speech Communication. (Manuscript under review) (Q1)

Journals

Kurnaz, O., Mishra, J., Kinnunen, T. H., & Hanilci, C. (2025). Joint optimization of speaker and spoof detectors for spoofing-robust automatic speaker verification. IEEE/ACM Transactions on Audio, Speech, and Language Processing. (IF: 5.1) 10.1109/TASLPRO.2026.3688932 (Q1)
Jyothi, M. V. S., Banerjee, O., Govind, D., Samudravijaya, K., Dubey, A. K., Gangashetty, S. V., Rakesh, U. K., Rajeev, A., & Mishra, J. (2025). Characterizing stroke-affected speech using F0 and duration-based features. Scientific Reports. (IF: 3.9) https://doi.org/10.1038/s41598-026-40155-9 (Q1)
Mishra, J., Chhibber, M., Shim, H. J., & Kinnunen, T. H. (2025). Towards Explainable Spoofed Speech Attribution and Detection: A Probabilistic Approach for Characterizing Speech Synthesizer Components. Computer Speech & Language. (IF: 3.4), https://doi.org/10.1016/j.csl.2025.101840 (Q1)
Kurnaz, O., Mishra, J., Kinnunen, T. H., & Hanilçi, C. (2025). Optimizing a-dcf for spoofing-robust speaker verification. IEEE Signal Processing Letters. (IF: 3.9), 10.1109/LSP.2025.3545290 (Q1)
Mishra, J., & Prasanna, S. R. M. (2024). Implicit self-supervised language representation for spoken language diarization. IEEE/ACM Transactions on Audio, Speech, and Language Processing. (IF: 5.1), 10.1109/TASLP.2024.3426978 (Q1)
Mishra, J., & Prasanna, S. R. M. (2024). Spoken language change detection inspired by speaker change detection. Circuits, Systems, and Signal Processing, 43, 1–26. Springer. (IF: 2.0 ), https://doi.org/10.1007/s00034-024-02743-w (Q2)
Mishra, J., & Prasanna, S. R. M. (2024). Generative attention based framework for implicit language change detection. Digital Signal Processing, 154, 104678. Academic Press. (IF: 3.0 ), https://doi.org/10.1016/j.dsp.2024.104678 (Q2)
Sharma, R., Govind, D., Mishra, J., Dubey, A. K., Deepak, K. T., & Prasanna, S. R. M. (2024). Milestones in speaker recognition. Artificial Intelligence Review, 57(3), 58. Springer Netherlands Dordrecht. (IF: 13.9), https://doi.org/10.1007/s10462-023-10688-w (Q1)
Dutta, K., Mishra, J., & Pati, D. (2018). Effective use of combined excitation source and vocal-tract information for speaker recognition tasks. International Journal of Speech Technology, 21(4), 1057–1070. Springer, https://doi.org/10.1007/s10772-018-09568-4 (Q1)

Conferences

International Conferences:

Chhibber, M., Mishra, J., Shim, H., & Kinnunen, T. H. (2026). Advancing zero-shot open-set speech deepfake source tracing. Proceedings of The Odyssey 2026. (Accepted) (Core Rank: B)
Colibro, D., Vair, C., Tu, Y., Li, J., Huang, Z., Chen, Y., Lee, K. A., Mak, M.-W., Mishra, J., Singh, V., Xuan, X., Chhibber, M., Kurnaz, O., Kinnunen, T., Lee, S., Jung, C., Nam, K., Chung, J. S., Wang, S., & (2026). Advancing speaker recognition: I4U’s audio systems for NIST SRE24. Proceedings of The Odyssey 2026. (Accepted) (Core Rank: B)
Swain, M., Maji, B., Mishra, J., Schedl, M., Søgaard, A., & Jensen, J. R. (2026). Towards fair ASR for second language speakers using fairness-prompted finetuning. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2026). (Core Rank: B)
Kurnaz, O., Mishra, J., Kinnunen, T. H., & Hanilci, C. (2026). Joint optimization of ASV and CM tasks: BTUEF team’s submission for the WildSpoof challenge. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2026). (Core Rank: B)
Bhattacharjee, S., Mishra, J., Shekhawat, H. S., & Prasanna, S. R. (2025). Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification. The 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Singapore.
Sadashiv, T. N. R., Bedge, A., Bore, S. S., Mishra, J., Bhattacharjee, M., & Prasanna, S. R. (2025). Fusion of Modulation Spectrogram and SSL with Multi-head Attention for Fake Speech Detection. The 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Singapore.
Firc, A., Chibber, M., Mishra, J., Singh, V. P., Kinnunen, T., & Malinka, K. (2025). STOPA: A Database of Systematic VariaTion Of DeePfake Audio for Open-Set Source Tracing and Attribution, Interspeech 2025, 10.21437/Interspeech.2025-2065, Rotterdam, Netherlands. (Core Rank: A)
Gogoi, P., Singh, V. P., Khadirnaikar, S., Siddhartha, S., Kalita, S., Mishra, J., ... & Prasanna, S. R. M. (2025). Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment. Interspeech 2025, 10.21437/Interspeech.2025-2097, Rotterdam, Netherlands. (Core Rank: A)
Chhibber, M., Mishra, J., Shim, H., & Kinnunen, T. H. (2025). An Explainable Probabilistic Attribute Embedding Approach for Spoofed Speech Characterization. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India (Core Rank: B)
Kurnaz, O., Demirtaş, S. C., Büker, A., Mishra, J., & Hanilçi, C. (2024). Spoofing-robust speaker verification using parallel embedding fusion: BTU speech group's approach for ASVspoof5 Challenge. The Automatic Speaker Verification Spoofing Countermeasures Workshop (ASVspoof 2024), 138–143, Kos, Greece.
Nandyala, S., Manche, P., Mishra, J., & Prasanna, S. R. M. (2024). Speaker and Digit Representation based Voice OTP System. In 2024 International Conference on Signal Processing and Communications (SPCOM) (pp. 1–5). IEEE, Bangalore, India
Mishra, J., Bhattacharjee, M., & Prasanna, S. R. M. (2023). I-MSV 2022: Indic-Multilingual and Multi-sensor Speaker Verification Challenge. In International Conference on Speech and Computer (pp. 437–445). Springer, Dharwad, India
Mishra, J., Patil, J., Chowdhury, A., & Prasanna, S. R. M. (2023). End-to-end spoken language diarization with Wav2vec embeddings. INTERSPEECH 2023 (pp. 501–505), Dublin, Ireland. (Core Rank: A)
Sadashiv, T. N. R., Kumar, D., Agarwal, A., Tzudir, M., Mishra, J., & Prasanna, S. R. M. (2023). Source and system-based modulation approach for fake speech detection. In International Conference on Speech and Computer (pp. 142–155). Springer Nature Switzerland Cham, Dharwad, India
Mukherjee, S., Mishra, J., & Prasanna, S. R. M. (2023). Significance of Indic Self-supervised Speech Representations for Indic Under-Resourced ASR. In International Conference on Speech and Computer (pp. 100–113). Springer Nature Switzerland Cham., Dharwad, India
Manche, P., Nandyala, S., Mishra, J., Ananthanarayanan, G., & Prasanna, S. R. M. (2023). Design and Development of Voice OTP Authentication System. In International Conference on Speech and Computer (pp. 513–528). Springer Nature Switzerland Cham, Dharwad, India
Agarwal, A., Mishra, J., & Prasanna, S. R. M. (2022). Significance of excitation source sequence information for speaker verification. In 2022 IEEE International Conference on Signal Processing and Communications (SPCOM) (pp. 1–5). IEEE.
Mishra, J., Gandra, J., Patil, V., & Prasanna, S. R. M. (2022). Issues in sub-utterance level language identification in a code-switched bilingual scenario. In 2022 IEEE International Conference on Signal Processing and Communications (SPCOM) (pp. 1–5). IEEE.
Mishra, J., & Prasanna, S. R. M. (2022). Importance of supra-segmental information and self-supervised framework for spoken language diarization task. In International Conference on Speech and Computer (pp. 494–507). Springer.
Arya, L., Agarwal, A., Mishra, J., & Prasanna, S. R. M. (2022). Analysis of layer-wise training in direct speech-to-speech translation using BI-LSTM. In 2022 25th Conference of the Oriental COCOSDA (pp. 1–6). IEEE.
Agarwal, A., Mishra, J., & Prasanna, S. R. M. (2020). VOP detection in variable speech rate condition. In INTERSPEECH 2020 (pp. 3690–3694). (Core Rank: A)
Mishra, J., Singh, M., & Pati, D. (2018). LP residual features to counter replay attacks. In 2018 International Conference on Signals and Systems (ICSigSys) (pp. 261–266). IEEE.
Mishra, J., Singh, M., & Pati, D. (2018). Exploring linear prediction residual signal for developing countermeasures to playback attacks. In 2018 IEEE International Students’ Conference on Electrical, Electronics and Computer Science (SCEECS) (pp. 1–6). IEEE.
Mishra, J., Singh, M., & Pati, D. (2018). Processing linear prediction residual signal to counter replay attacks. In 2018 International Conference on Signal Processing and Communications (SPCOM) (pp. 95–99). IEEE.
Dutta, K., Mishra, J., & Pati, D. (2017). An effective combination scheme for improving speaker verification performance. In TENCON 2017 - 2017 IEEE Region 10 Conference (pp. 1296–1299). IEEE.
Singh, M., Mishra, J., & Pati, D. (2017). Development of playback attacks detection system. In TENCON 2017 - 2017 IEEE Region 10 Conference (pp. 1415–1420). IEEE.
Dutta, K., Mishra, J., & Pati, D. (2017). Improvement in speaker verification performance using an innovative combination scheme. In 2017 14th IEEE India Council International Conference (INDICON) (pp. 1–5). IEEE.
Singh, M., Mishra, J., & Pati, D. (2016). Replay attack: Its effect on GMM-UBM based text-independent speaker verification system. In 2016 IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics Engineering (UPCON) (pp. 619–623). IEEE.

National Conferences:

Mishra, J., & Prasanna, S. R. M. (2023). Challenges in spoken language diarization in code-switched scenario. In 2023 National Conference on Communications (NCC) (pp. 1–6). IEEE.
Mishra, J., Siddhartha, S., & Prasanna, S. R. M. (2022). Importance of excitation source and sequence learning towards spoken language identification task. In 2022 National Conference on Communications (NCC) (pp. 190–194). IEEE.
Agarwal, A., Swain, A., Mishra, J., & Prasanna, S. R. M. (2022). Significance of prosody modification in privacy preservation on speaker verification. In 2022 National Conference on Communications (NCC) (pp. 245–249). IEEE.
Mishra, J., Agarwal, A., & Prasanna, S. R. M. (2021). Spoken language diarization using an attention-based neural network. In 2021 National Conference on Communications (NCC) (pp. 1–6). IEEE.
Siddhartha, S., Mishra, J., & Prasanna, S. R. M. (2020). Language specific information from LP residual signal using linear sub-band filters. In 2020 National Conference on Communications (NCC) (pp. 1–5). IEEE.
Mishra, J., Pati, D., & Prasanna, S. R. M. (2019). Modelling glottal flow derivative signal for detection of replay speech samples. In 2019 National Conference on Communications (NCC) (pp. 1–5). IEEE.
Singh, M., Mishra, J., & Pati, D. (2017). Usefulness of linear prediction residual signal for development of replay attacks detection system. In 2017 Twenty-third National Conference on Communications (NCC) (pp. 1–4). IEEE.

Page updated

Report abuse