Publications

Conference and Journal Papers

2025

Zhengjun Yue, Mara Barberis, Tanvina Patel, Judith Dineley, Willemijn Doedens, Lottie Stipdonk, Yuanyuan Zhang, Elke de Witte, Erfan Loweimi, Hugo Van hamme, Djaina Satoer, Marina Ruiter, Laureano Moro Velazquez, Nicholas Cummins, Odette Scharenborg, "Challenges and practical guidelines for atypical speech data collection, annotation, usage and sharing: A multi-project perspective", in Interspeech 2025.
Dimme de Groot, Tanvina Patel, Devendra Kayande, Odette Scharenborg, Zhengjun Yue, "Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech", in Interspeech 2025.

2024

Tanvina Patel and Odette Scharenborg, "Improving End-to-End Models for Children’s Speech Recognition" Applied Sciences 14, no. 6: 2353, March 2024.
Yuanyuan Zhang, Zhengjun Yue, Tanvina Patel and Odette Scharenborg, "Improving child speech recognition with augmented child-like speech". Accepted for publication at Interspeech 2024, Kos, Greece.
Wiebke Hutiri, Tanvina Patel, Aaron Ding, and Odette Scharenborg, "As biased as you measure: Methodological pitfalls of bias evaluations in speaker verification research". Accepted for publication at Interspeech 2024, Kos, Greece.
Chris Bras, Tanvina Patel and Odette Scharenborg, "Using articulated speech EEG signals for imagined speech decoding". Accepted for publication at Interspeech 2024, Kos, Greece.

2023

Yuanyuan Zhang, Aaricia Herygers, Tanvina Patel, Zhenjun Yue, Z., Odette Scharenborg , "Exploring data augmentation in bias mitigation against non-native-accented speech", in IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2023, Taipei, Taiwan.
Chaufang Lin, Tanvina Patel, Odette Scharenborg, "Improving whispered speech recognition performance using pseudo-whispered based data augmentation", in IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2023, Taipei, Taiwan.

2022

Tanvina Patel and Odette Scharenborg, "Using cross-model learnings for the Gram Vaani ASR Challenge 2022", in Proceedings of ISCA INTERSPEECH 2022, Incheon, Korea
Yuanyuan Zhang, Yixuan Zhang, Bence Halpern, Tanvina Patel and Odette Scharenborg, "Mitigating bias against non-native accents", in Proceedings of ISCA INTERSPEECH 2022, Incheon, Korea.
Yixuan Zhang, Yuanyuan Zhang., Tanvina Patel and Odette Scharenborg, "Comparing data augmentation and training techniques to reduce bias against non-native accents in hybrid speech recognition systems" in Proceedings of Speech for Social Good workshop, satellite event of INTERSPEECH, Incheon, Korea.

2020

Tanvina Patel, “Semi-Supervised Learning for speech recognition in Indian languages,” in NVIDIA GPU Technology Conference (GTC), October 2020, p. [A21560]

2018

Tanvina Patel, Krishna DN, Noor Fathima, Nisar Shah, Mahima C, Deepak Kumar and Anuroop Iyengar, "Development of Large Vocabulary Speech Recognition System with Keyword Search for Manipuri", in INTERSPEECH 2018, Hyderabad, pp. 1031-1035. [Link].
Tanvina Patel, Krishna D N, Noor Fathima, Nisar Shah, Mahima C, Deepak Kumar and Anuroop Iyengar, "An Automatic Speech Transcription System for Manipuri Language", in INTERSPEECH 2018, Hyderabad, pp. 2388-2389. [Link].
Noor Fathima, Tanvina Patel, Mahima C and Anuroop Iyengar, “TDNN-based Multilingual Speech Recognition System for Low Resource Indian Languages", in INTERSPEECH 2018, Hyderabad, pp. 3197-3201. [Link].
Krishna D N, Noor Fathima, Tanvina B. Patel, Mahima C, Nisar Shah, and Anuroop Iyengar, “Automatic Speech Recognition for Low-resource Manipuri Language” in the NVIDIA GPU Technology Conference (GTC), 26-29 March, San Jose McEnery Convention Center, San Jose, CA. [Link].
Hemant A. Patil, and Tanvina B. Patel, "Analysis of normal and pathological voices by novel chaotic titration method," in Signal and Acoustic Modelling for Speech and Communication Disorders, Hemant A. Patil, Amy Neustein, and Manisha Kulshetra, Eds., Berlin: De Gruyter, Dec. 2018, pp. 87-120. doi: 10.1515/9781501502415 [Link].

2017

Tanvina B. Patel and Hemant A. Patil, “Significance of source-filter interaction for classification of natural vs. spoofed speech,” in IEEE Journal of Selected Topics in Signal Processing (JSTSP), Special Issue on Spoofing and Countermeasures for Automatic Speaker Verification, vol. 11, no. 4, pp. 644-659, June 2017 (Available Online on 15th March 2017 [Link]).
Tanvina B. Patel and Hemant A. Patil, “Cochlear filter and instantaneous frequency based features for spoofed speech detection,” in IEEE Journal of Selected Topics in Signal Processing (JSTSP), Special Issue on Spoofing and Countermeasures for Automatic Speaker Verification, vol. 11, no. 4, pp. 618-631, June 2017 (Available Online on 30th December 2016 [Link]).
Hemant Patil, Madhu Kamble, Tanvina Patel, Meet Soni, "Novel Variable Length Teager Energy Separation Based Instantaneous Frequency Features for Replay Detection" in Proc. 17th Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH, Stockholm, Sweden. 2017, 20-24 Aug., pp. 12-16. [Link]

2016

Himanshu Bhavsar, Tanvina B. Patel and Hemant A. Patil, “Novel Nonlinear Prediction Based Features for Spoofed Speech Detection”, in Proc. 17th Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH, San Francisco, USA, 8-12 Sept., 2016, pp. 155-159. [Link]
Meet Soni, Tanvina B. Patel and Hemant A. Patil, “Novel Subband Autoencoder Features for Detection of Spoofed Speech”, in Proc.17th Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH, San Francisco, USA, 8-12 Sept., 2016, pp. 1820-1824. [Link]
Avni Rajpal, Tanvina B. Patel, Hardik B. Sailor, Maulik C. Madhavi, Hemant A. Patil and Hiroya Fujisaki, “Native Language Identification Using Spectral and Source-Based Features”, in Proc. 17th Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH, San Francisco, USA, 8-12 Sept., 2016, pp. 2383-2387. [Link]
Deep Gandhi, Tanvina B. Patel and Hemant A. Patil, "A Novel Lowpass Filtering-Based Approach for Estimating Strength of Excitation From Speech Signal" in International Conference on Signal Processing and Communications (SPCOM), IISc, Bangalore, India, 12-15 June, 2012 . [Link]
Tanvina B. Patel and Hemant Patil, "Effectiveness of Fundamental Frequency (F0) And Strength Of Excitation (SoE) For Spoofed Speech Detection" in Proc. 41st IEEE Int. Conf. Acoust., Speech and Signal Process., (ICASSP), Shanghai, China, 20-25 March, 2016, pp. 5105-5109. [IEEE Xplore link].
Tanvina B. Patel and Hemant Patil, "Analysis of Natural And Synthetic Speech Using Fujisaki Model" in Proc. 41st IEEE Int. Conf. Acoust., Speech and Signal Process., (ICASSP), Shanghai, China, March 20-25, 2016, pp. 5250-5254. [IEEE Xplore link].

2015

Tanvina B. Patel and Hemant A. Patil, “Combining Evidences from Mel Cepstral, Cochlear Filter Cepstral and Instantaneous Frequency Features for Detection of Natural vs. Spoofed Speech,” in Proc. 16th Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH, Dresden, Germany, 6-10 September, 2015, pp. 2062-2066. [Link]

This work was submitted as a part of the special session on ASV Spoof 2015 Challenge (Link). The proposed system performed relatively best among all the 16 submissions of the challenge (Link).

Pramod Bachhav, Hemant Patil and Tanvina B. Patel, “A novel filtering based approach for epoch extraction,” in Proc. 40th IEEE Int. Conf. Acoust., Speech and Signal Process., (ICASSP), 19–24 April 2015, Brisbane, Australia, pp. 4784-4788. [IEEE Xplore link].

2014

Hemant A. Patil and Tanvina B. Patel, "Chaotic mixed excitation source for synthesis of speech signal," in Proc. 15th Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH, Singapore, September 14-18, 2014, pp. 785-789. [Link]
. Tanvina B. Patel and Hemant A. Patil, "Novel approach for estimating length of the vocal folds using Fujisaki model," in Proc. 9thInternational Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 308-312, Singapore, September 12-14, 2014, pp. 308-312. [IEEE Xplore link].
Nirmesh J. Shah, Hemant Patil, Maulik Madhvi, Hardik Sailor and Tanvina Patel, "Deterministic annealing EM algorithm for developing Gujarati TTS system," in Proc. 9th International Symposium on Chinese Spoken Language Processing (ISCSLP), Singapore, September 12-14, 2014, pp. 526-530. [IEEE Xplore link].

2013

Hemant A. Patil and Tanvina B. Patel, “Nonlinear prediction of speech using Volterra-Wiener Series,” in 14th Proc. Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH, Lyon, France, August 25-29, 2013, pp. 1687-1691. [Link]
Hemant A Patil, Tanvina B. Patel, Nirmesh J Shah, Hardik B Sailor, Raghava Krishnan, G R Kasthuri, T. Nagarajan, Lilly Christina, Naresh Kumar, Veera Raghavendra, S P Kishore, S R M Prasanna, Nagaraj Adiga, Sanasam Ranbir Singh, Konjengbam Anand, Pranaw Kumar, Bira Chandra Singh, S L Binil Kumar, T G Bhadran, T Sajini, Arup Saha, Tulika Basu, K Sreenivasa Rao, N P Narendra, Anil Kumar Sao, Rakesh Kumar, Pranhari Talukdar, Purnendu Acharyaa, Somnath Chandra, Swaran Lata and Hema Murthy, “A Syllable-Based framework for Unit Selection Synthesis in 13 Indian Languages”, in the Oriental International Committee for the Co-Ordination and Standardization of Speech Databases and Assessment Techniques (O'COCOSDA) Conference, Gurgaon, India, November 25-27, 2013, pp. 1 - 8. [IEEE Xplore link].
Hemant Patil, Tanvina Patel, Swati Talesara, Nirmesh Shah, Hardik Sailor, Bhavik Vachhani, Janaki Akhani, Bhargav Kankariya, Yashesh Gaur and Vibha Prajapati, “Algorithm for Speech Segmentation at Syllable-Level for Tex-to-Speech Synthesis System in Gujarati”, in the Oriental International Committee for the Co-Ordination and Standardization of Speech Databases and Assessment Techniques (O'COCOSDA) Conference, Gurgaon, India, November 25-27, 2013, pp. 1 - 7. [IEEE Xplore link]
Swati Talesara, Hemant A. Patil, Tanvina Patel, Hardik Sailor and Nirmesh Shah, ‘A Novel Gaussian Filter-based Automatic Labeling of Speech Data for TTS System in Gujarati Language’ in Proc. International Conference on Asian Language Processing (IALP), Urumqi, China, 17-19 August, 2013, pp. 139 - 142. [IEEE Xplore link].

2012

Tanvina B. Patel, Hemant A. Patil, Kunal P. Acharya, “Analysis of Normal and Pathological Voices Based on Nonlinear dynamics” in International Conference on Electrical, Electronics and Computer Engineering., (ICEECE), IRNet, Ahmedabad, India, 12 February, 2012. [Link]
. Hemant A. Patil and Tanvina B. Patel, ‘Novel Chaotic Titration Method for analysis of Normal and Pathological Voices’, in International Conference on Signal Processing and Communications (SPCOM), IISc, Bangalore, India, 22-25 July, 2012, pp:1-5. [IEEE Xplore link].

Digital Object Identifier: 10.1109/SPCOM.2012.6290044

Google Sites

Report abuse