Publications
International Journals
Hemant A. Patil, Aastha Kachhi, and Ankur T. Patil, “CQT-based cepstral features for classification of normal vs. pathological infant cry,” to appear in IEEE/ACM Trans. Audio, Speech, and Language Processing, pp. 1-14, Oct. 27, 2023. Early Access Link: https://ieeexplore.ieee.org/document/10298803
Priyanka Gupta and Hemant A. Patil, “Morse wavelet transform-based features for voice liveness detection,” to appear in Computer Speech and Language, Elsevier, vol. 84, 23 pages, March 2024. Early Access Link: https://www.sciencedirect.com/journal/computer-speech-and-language/vol/84/suppl/C
Sylvio Barbon Junior, Rodrigo Capobianco Guido, Gabriel Jonas Aguiar, Everton José Santana, Mario Lemes Proença Junior, and Hemant A. Patil, "Multiple voice disorders in the same individual: Investigating handcrafted features, multi-label classification algorithms, and base-learners," Speech Communication, Elsevier, ISSN 1872-7182, vol. 152, Jul. 2023.
Dipesh Kumar Singh, Gauri Prajapati, and Hemant A. Patil, “Voice privacy using time-scale and pitch modification,” S N Computer Science, Under Minor Revision, 2023 (Invited Paper).
Kirtana Sunil Phatnani, and Hemant A. Patil, "Modeling musical expectancy via reinforcement learning and directed graphs," Multimedia Tools and Applications, Springer, ISSN 1573-7721, pp. 1-25, 06 Sept. 2023.
Priyanka Gupta, Piyush Chodingala, and Hemant A. Patil, "Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components," Computer, Speech and Language, Elsevier, 16 June 2022, 101423.
Ankur T. Patil, Hemant A. Patil, and Kuldeep Khoria, "Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection," Computer Speech & Language, Elsevier, vol. 72, pp. 101301, Mar. 2022.
Ankur T. Patil, Rajul Acharya, Hemant A. Patil, and Rodrigo Capobianco Guido, “Improving the potential of enhanced Teager energy cepstral coefficients (ETECC) for replay attack detection,” in Special Issue on State-of-the-art Handcrafted Feature Extraction for Speech and Voice Analysis, Computer Speech & Language, Elsevier, vol. 72, pp. 27-44, March 2022.
Kuldeep Khoria, Ankur T. Patil and Hemant A. Patil, "On significance of constant-Q transform for pop noise detection," Computer, Speech and Language, ISSN 0885-2308, Elsevier, 11 June 2022, 101421.
Kirtana Sunil Phatnani and Hemant A. Patil, "Music footprint recognition via sentiment, identity, and setting identification," Multimedia Tools and Applications, vol. 81, no. 16, pp. 22247-22262, 2022.
Meet Soni and Hemant A. Patil, “Non-intrusive quality assessment of noise suppressed speech using unsupervised deep features,” in Speech Communication, Elsevier, vol. 130, pp. 27-44, Jun. 2021.
Gauri P. Prajapati, Dipesh Kumar Singh, Preet P. Amin, and Hemant A. Patil, "Voice privacy using cycleGAN and time-scale modification," Computer Speech & Language, Elsevier, 29 Jan. 2022, 101353.
Nirmalya Sen, Md Sahidullah, Hemant A. Patil, Shyamal Kumar Das Mandal, Krothapalli Sreenivasa Rao, and Tapan Kumar Basu, "Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework," International Journal of Speech Technology (IJST), Springer, vol. 24, no. 4, pp. 1067-1088, 2021.
Madhu R. Kamble, Hardik B. Sailor, Hemant A. Patil, and Haizhou Li, “Advances in anti-spoofing: From the perspective of ASVspoof challenges,” in APSIPA Transactions on Signal and Information Processing, vol. 9, pp. 1-18, 2020 (Invited Paper).
Madhu R. Kamble and Hemant A. Patil, “Amplitude weighted frequency modulation features for spoof speech detection,” in Journal of Signal Processing Systems (JSPS), Springer, pp. 1-15, 2020 (Invited Paper).
Madhu R. Kamble and Hemant A. Patil, "Detection of replay spoof speech using Teager energy feature cues," in Special issue on Advances in Automatic Speaker Verification Anti-spoofing, in Computer, Speech and Language, Elsevier, vol. 65, pp. 1-19, 2020.
Madhu R. Kamble, Hemlata Tak, and Hemant A. Patil, "Amplitude and frequency modulation-based features for detection of replay spoof speech," in Speech Communication, Elsevier, vol. 125, pp. 114-127, December 2020.
Nirmesh J. Shah and Hemant A. Patil, “Novel outliers removal approach for voice conversion,” in Computer Speech & Language, Elsevier, vol. 58, pp. 127-152, November 2019.
Maulik C. Madhavi, and Hemant A. Patil, "Vocal tract length normalization using a Gaussian mixture model framework for query-by-example spoken term detection,” in Computer Speech & Language, Elsevier, vol. 58, pp. 175-202, November 2019.
Siddhant Gupta, Ankur T. Patil, Mirali Purohit, Maitreya Patel, Hemant A. Patil, and Rodrigo Capobianco Guido, “Residual neural network precisely quantifies dysarthria severity-level based on short-duration of speech segments,” in Special Issue on Advances in Deep Learning Based Speech Processing, Neural Networks, Elsevier, vol. 139, pp. 105-117, July 2021.
Hardik B. Sailor and Hemant A. Patil, "Novel unsupervised auditory filterbank learning using convolutional RBM for speech recognition," in ACM/IEEE Trans. Audio, Speech and Language Processing, vol. 24, no. 12, pp. 2341-2353, Dec. 2016.
Hardik B. Sailor and Hemant A. Patil, “Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition,” Journal of Acoust. Soc. of America (JASA) Express Letters, vol. 141, no. 6, pp. 1–7, June 2017.
Maulik C. Madhavi, and Hemant A. Patil, "Design of mixture of GMMs for query-by-example spoken term detection," in Computer, Speech and Language, Elsevier, vol. 52, pp. 41-45, Nov. 2018.
Maulik C. Madhavi, and Hemant A. Patil, "Partial matching and search space reduction for QbE-STD," in Computer Speech & Language, Elsevier, vol. 45, pp. 58-82, Sept. 2017.
Hemant A. Patil, and Maulik C. Madhavi, "Combining evidences from magnitude and phase information using VTEO for person recognition using humming," in special issue of Recent advances in speaker and language recognition and characterization, Computer Speech and Language, Elsevier, vol. 52, pp. 225-256, November 2018.
Tanvina B. Patel and Hemant A. Patil, “Cochlear filter and instantaneous frequency based features for spoofed speech detection”, in IEEE Journal of Selected Topics in Signal Processing (JSTSP), Special Issue on Spoofing and Countermeasures for Automatic Speaker Verification, vol. 11, no. 4, pp. 618-631, June 2017.
Tanvina B. Patel and Hemant A. Patil, “Significance of source-filter interaction for classification of natural vs. spoofed speech”, in IEEE Journal of Selected Topics in Signal Processing (JSTSP), Special Issue on Spoofing and Countermeasures for Automatic Speaker Verification, vol. 11, no. 4, pp. 644–659, June 2017.
Anshu Chittora and Hemant A. Patil, “Data collection of infant's cries for research and analysis”, in Journal of Voice, Elsevier, vol. 31, no. 2, pp. 252.e15-252.e26, March 2017.
Anshu Chittora and Hemant A. Patil, “Significance of higher-order spectral analysis in infant cry classification”, in Circuits, Systems & Signal Processing (CSSP), Springer US, 23 pages, Online First April 2017.
Anshu Chittora and Hemant A. Patil, “Spectral analysis of infant cries and adult speech”, in International Journal of Speech Technology (IJST), Springer, vol. 19, no. 4, pp. 841–856, December 2016.
Anshu Chittora and Hemant A. Patil, “Newborn infant’s cry analysis”, in International Journal of Speech Technology (IJST), Springer, vol. 19, no. 4, pp. 919-928, December 2016.
Namrata Singh, Nikhil Bhendawade, and Hemant A. Patil, “Novel cochlear filter based cepstral coefficients for classification of unvoiced fricatives,” International Journal on Natural Language Computing (IJNLC), vol. 3, no. 4, pp. 21-40, August 2014.
Hemant A. Patil, Maulik C. Madhavi and Keshab K. Parhi, "Static and dynamic information derived from spectral and source-like features for person recognition from humming," in Special Issue on Speaker Recognition, Int. J. of Speech Tech., IJST, Springer, July 2012.
Hemant A. Patil and Srikant Viswanath, “Effectiveness of Teager energy operator for epoch detection from speech signals,” in Int. J. of Speech Tech., IJST, Springer, vol. 14, no. 4, pp. 321-337, Dec. 2011.
Hemant A. Patil and T. K. Basu, “LP spectra vs. Mel spectra for identification of professional mimics in Indian languages,” in Int. J. of Speech Tech., IJST, Springer-Verlag, vol. 11, no. 1, pp. 1-16, March 2008.
Hemant A. Patil and T. K. Basu, “Development of speech corpora for speaker recognition research and evaluation in Indian languages,” in Int. J. of Speech Tech., IJST, Springer-Verlag, vol. 11, no. 1, pp. 17-32, March 2008.
Hemant A. Patil and T. K. Basu, “Identifying phonetically similar languages using Teager energy based cepstrum,” special issue on “Frontiers of Language Processing and Information Retrieval for Asian Languages”, Engineering Letters, vol. 16, no. 1, 9 pages, March 2008.
Edited Journal Special Issue / Book Volumes
Guest Coeditor with Prof. (Dr.) Rodrigo Capobianco Guido (Brazil) for a special issue in Computer, Speech and Language, Elsevier.
Amy Neustein and Hemant A. Patil (Eds.), Acoustic Analysis of Pathologies From Infancy to Young Adulthood, DeGruyter, New York, vol. 7, 2020.
Hemant A. Patil and Amy Neustein (Eds.), Voice Technologies for Reconstruction and Enhancement, DeGruyter, New York, vol. 6, Feb. 2020.
Hemant A. Patil, Amy Neustein and Manisha Kulshetra (Eds.), Signal and Acoustic Modelling for Speech and Communication Disorders, DeGruyter, New York, vol. 5, 272 pages, Dec. 2018.
Hemant A. Patil (Guest Editor), Special Issue on Speaker Recognition, Int. J. of Speech Tech., IJST, Springer, vol. 15, no. 3, Sept. 2012.
Amy Neustein and Hemant A. Patil (Eds.), Forensic Speaker Recognition: Law Enforcement and Counter-terrorism, Springer-Verlag, New York, USA, Oct. 2011.
International Book Chapters
Aditya Pusuluri, Aastha Kachhi, and Hemant A. Patil, "Constant-Q based harmonic and pitch features for normal vs. pathological infant cry classification," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Uthiraa S., and Hemant A. Patil, "Analysis of Mandarin vs. English language for emotional voice conversion," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Baveet Singh Hora, Uthiraa S., and Hemant A. Patil, "Linear frequency residual cepstral coefficients for speech emotion recognition," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Uthiraa S., Aastha Kachhi, and Hemant A. Patil, "Linear frequency residual features for infant cry classification," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Priyanka Gupta, Rajul Acharya, Ankur T. Patil, and Hemant A. Patil, "On the asymptotic behaviour of the speech signal," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Kirtana Sunil Phatnani, and Hemant A. Patil, "Quantifying the emotional landscape of music with three dimensions," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Monil Charola, Siddharth Rathod, and Hemant A. Patil, "Robustness of whisper features for infant cry classification," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Siddharth Rathod, Monil Charola, and Hemant A. Patil, "Transfer learning using whisper for dysarthric automatic speech recognition," in SPECOM, Hubli, Karnataka, India, Lecture Notes in Computer Science (LNCS) by Alexey Karpov et al. (Eds.), Springer, 29 Nov.-02 Dec. 2023.
Uthiraa S., P. Aditya, and Hemant A. Patil, "Modified group delay features for emotion recognition," in PReMI, ISI Kolkata, India, Lecture Notes in Computer Science (LNCS), Springer, 12-15 Dec. 2023.
Siddharth Rathod, Monil Charola, and Hemant A. Patil, "Noise robust whisper features for dysarthric severity-level classification," in PReMI, ISI Kolkata, India, Lecture Notes in Computer Science (LNCS), Springer, 12-15 Dec. 2023.
Krishna Parmar, Baveet Singh Hora, Shrey Machhar, Hemant A. Patil, Kiran Praveen, and Balaji Radhakrishanan, "Spoken language identification using linear frequency residual cepstral coefficients," in PReMI, ISI Kolkata, India, Lecture Notes in Computer Science (LNCS), Springer, 12-15 Dec. 2023.
Aditya Pusuluri, Aastha Kachhi, and Hemant A. Patil, "Analysis of Time-Averaged Feature Extraction Techniques on Infant Cry Classification," in Speech and Computer. SPECOM 2022. Lecture Notes in Computer Science (LNCS), vol 13721, Prasanna, S.R.M., Karpov, A., Samudravijaya, K., Agrawal, S.S. (Eds), Springer, Cham, 10 Nov. 2022, pp. 590–603.
Aastha Kachhi, Anand Therattil, Priyanka Gupta, and Hemant A. Patil, "Continuous Wavelet Transform for Severity-Level Classification of Dysarthria," in Speech and Computer. SPECOM 2022. Lecture Notes in Computer Science, vol 13721, Prasanna, S.R.M., Karpov, A., Samudravijaya, K., Agrawal, S.S. (Eds), Springer, Cham, 10 Nov. 2022, pp. 312–324.
Priyanka Gupta, and Hemant A. Patil, "Significance of Distance on Pop Noise for Voice Liveness Detection," in Speech and Computer. SPECOM 2022. Lecture Notes in Computer Science, vol 13721, Prasanna, S.R.M., Karpov, A., Samudravijaya, K., Agrawal, S.S. (Eds), Springer, Cham, 10 Nov. 2022, pp. 226–237.
Aastha Kachhi, Anand Therattil, Ankur T. Patil, Hardik B. Sailor, and Hemant A. Patil, "Significance of Energy Features for Severity Classification of Dysarthria," in Speech and Computer. SPECOM 2022. Lecture Notes in Computer Science, vol 13721, Prasanna, S.R.M., Karpov, A., Samudravijaya, K., Agrawal, S.S. (Eds), Springer, Cham, 10 Nov. 2022, pp. 325–337.
Ankur T. Patil, Harsh Kotta, Rajul Acharya, and Hemant A. Patil, "Spectral root features for replay spoof detection in voice assistants," in Alexey Karpov, Rodmonga Potapova (Eds.) Lecture Notes in Computer Science (LNCS), Springer, 23rd International Conference on Speech and Computer (SPECOM), St. Petersburg, Russia, vol. 12997, pp. 504-515, September 27-30, 2021.
Shrishti Singh, Kuldeep Khoria, and Hemant A. Patil, "Modified group delay function using different spectral smoothing techniques for voice liveness detection," in Alexey Karpov, Rodmonga Potapova (Eds.) Lecture Notes in Computer Science (LNCS), Springer, 23rd International Conference on Speech and Computer (SPECOM), St. Petersburg, Russia, vol. 12997, pp. 649-659, September 27-30, 2021.
Priyanka Gupta, Siddhant Gupta, and Hemant A. Patil, "Voice liveness detection using bump wavelet with CNN," in 9th International Conference on Pattern Recognition and Machine Intelligence (PReMI 2021), ISI Kolkata, Lecture Notes in Computer Science (LNCS), Springer, 15-18 Dec. 2021.
Gauri Prajapati, Dipesh Kumar Singh, and Hemant A. Patil, “Voice privacy through time-scale and pitch modification," in 9th International Conference on Pattern Recognition and Machine Intelligence (PReMI 2021), ISI Kolkata, Lecture Notes in Computer Science (LNCS), Springer, 15-18 Dec. 2021.
Priyanka Gupta and Hemant A. Patil, "Voice biometrics: Attacker's perspective," in Voice Biometrics: Technology, Trust and Security, Carmen Garcia-Mateo and Gerard Chollet, (Eds), IET, UK, 2021, pp. 125-150. ISBN: 9781785619007
Priyanka Gupta, Shrishti Singh, Gauri Prajapati, and Hemant A. Patil, "Voice Privacy in Biometrics," in Biomedical Signal and Image Processing with Artificial Intelligence. EAI/Springer Innovations in Communication and Computing, Chirag Paunwala, Mita Paunwala, Rahul Kher, Falgun Thakkar, Heena Kher, Mohammed Atiquzzaman, Norliza Mohd. Noor (Eds.), Springer, Cham, 13 Sep. 2022, pp. 1–29.
Siddhant Gupta and Hemant A. Patil, "Analysis and Classification of Dysarthric Speech," in Biomedical Signal and Image Processing with Artificial Intelligence. EAI/Springer Innovations in Communication and Computing, Chirag Paunwala, Mita Paunwala, Rahul Kher, Falgun Thakkar, Heena Kher, Mohammed Atiquzzaman, Norliza Mohd. Noor (Eds.), Springer, Cham, 13 Sep. 2022, pp. 167–182.
Kirtana Sunil Phatnani, and Hemant A. Patil, "Change and Periodic Events: Relevance to the Pandemic," in IoT Applications for Healthcare Systems. EAI/Springer Innovations in Communication and Computing, Rahul K. Kher, Chirag Paunwala, Falgun Thakkar, Heena Kher, Mita Paunwala, Prasan Kumar Sahoo, Latif Ladid (Eds.), Springer, Cham, 2022, pp. 137–152.
Anshu Chittora and Hemant A. Patil, “Infant cry analysis and its classification,” in Acoustic Analysis of Pathologies From Infancy to Young Adulthood, DeGruyter, New York, vol. 7, pp. 1-61, 2020.
Hardik B. Sailor and Hemant A. Patil, “Unsupervised auditory filterbank learning for infant cry classification,” in Acoustic Analysis of Pathologies From Infancy to Young Adulthood, DeGruyter, New York, vol. 7, pp. 63-92, 2020.
Kirtana Sunil Phatnani and Hemant A. Patil, “Role of music on infant developments,” in Acoustic Analysis of Pathologies From Infancy to Young Adulthood, DeGruyter, New York, vol. 7, pp. 198-212, 2020.
Nirmesh J. Shah and Hemant A. Patil, “Non-audible murmur to audible speech conversion,” in Voice Technologies for Reconstruction and Enhancement, DeGruyter, New York, vol. 6, pp. 125-150, Feb. 2020.
Madhu R. Kamble, Maddala Venkata Siva Krishna, Aditya Krishna Sai Pulikonda, and Hemant A. Patil, "Novel Teager energy based subband features for audio acoustic scene detection and classification," PReMI 2019, P. Maji et al. (Eds.) Lecture Notes in Computer Science (LNCS), Springer, vol. 11941, pp. 436-444, 2019.
Hemant A. Patil and Tanvina B. Patel, “Analysis of normal and pathological voices by novel chaotic titration method,” in Signal and Acoustic Modelling for Speech and Communication Disorders, De Gruyter, vol. 5, pp. 87-120, Dec. 2018.
Madhu R. Kamble and Hemant A. Patil, "Effectiveness of Mel scale-based ESA-IFCC features for classification of natural vs. spoofed speech," in B. U. Shankar et al. (Eds.): PReMI 2017, Lecture Notes in Computer Science (LNCS), Springer, vol. 10597, pp. 308-316, 2017.
Nirmesh Shah and Hemant A. Patil, "Analysis of features and metrics for alignment in text-dependent voice conversion," in B. U. Shankar et al. (Eds.): PReMI 2017, Lecture Notes in Computer Science (LNCS), Springer, vol. 10597, pp. 299-307, 2017.
Ankit Nagpal and Hemant A. Patil, "Novel gammatone filterbank based spectro-temporal features for robust phoneme recognition," in B. U. Shankar et al. (Eds.): PReMI 2017, Lecture Notes in Computer Science (LNCS), Springer, vol. 10597, pp. 342-350, 2017.
Maulik C. Madhavi, Hemant A. Patil, and Nikhil Bhendawade, "Spoken keyword retrieval using source and system features," in B. U. Shankar et al. (Eds.): PReMI 2017, Lecture Notes in Computer Science (LNCS), Springer, vol. 10597, pp. 333-341, 2017.
Rishabh Tak, Dharmesh Agrawal, and Hemant A. Patil, "Novel phase encoded Mel filterbank energies for environmental sound classification," in B. U. Shankar et al. (Eds.): PReMI 2017, Lecture Notes in Computer Science (LNCS), Springer, vol. 10597, pp. 317-325, 2017.
Apeksha Naik, Rishabh Tak, and Hemant A. Patil, "Novel phase encoded Mel cepstral features for speaker verification," in A. Karpov et al. (Eds.) SPECOM 2017, Lecture Notes in Artificial Intelligence (LNAI), Springer, vol. 10458, pp. 572-581, 2017.
Ami Gandhi and Hemant A. Patil, "Novel linear prediction temporal phase based features for speaker recognition," in A. Karpov et al. (Eds.) SPECOM 2017, Lecture Notes in Artificial Intelligence (LNAI), Springer, vol. 10458, pp. 564-571, 2017.
Purvi Agrawal and Hemant A. Patil, "Fusion of a novel Volterra-Wiener filter based nonlinear residual phase and MFCC for speaker verification," in A. Karpov et al. (Eds.) SPECOM 2017, Lecture Notes in Artificial Intelligence (LNAI), Springer, vol. 10458, pp. 389-397, 2017.
Anshu Chittora and Hemant A. Patil, “Modified group delay-based features for Asthma and HIE infant cries classification,” in Král P. and Matoušek V. (Eds.) TSD 2015. Lecture Notes in Computer Science (LNCS), Springer, vol. 9302, pp. 595-602, 2015.
Anshu Chittora and Hemant A. Patil, “Significance of unvoiced segments and fundamental frequency for infant cry analysis,” in Král P. and Matoušek V. (Eds.) TSD 2015. Lecture Notes in Computer Science (LNCS), Springer, vol. 9302, pp. 273-281, 2015.
Aditya Raikar, Ami Gandhi, and Hemant A. Patil, “Combining evidences from Mel cepstral and cochlear cepstral features for speaker recognition for whispered speech,” in Král P. and Matoušek V. (Eds.) TSD 2015. Lecture Notes in Computer Science (LNCS), Springer, vol. 9302, pp. 405-413, 2015.
Maulik C. Madhavi, Shubham Sharma, and Hemant A. Patil, "Vocal tract length normalization features for audio search," in Král P. and Matoušek V. (Eds.) TSD 2015. Lecture Notes in Computer Science (LNCS), Springer, vol. 9302, pp. 387-395, 2015.
Maulik C. Madhavi, Shubham Sharma, and Hemant A. Patil, "VTLN using different warping functions for template matching," in Ryżko D., Gawrysiak P., Kryszkiewicz M., Rybiński H. (Eds.) Machine Intelligence and Big Data in Industry. Studies in Big Data, vol. 19, Springer, pp. 111-121, 2016.
Yashesh Gaur, Maulik C. Madhavi, and Hemant A. Patil, “Speaker recognition using sparse representation via superimposed features,” in P. Maji et al. (Eds.) PReMI, Lecture Notes in Computer Science (LNCS), Springer-Verlag, Berlin Heidelberg, Germany, vol. 8251, pp. 140-147, 2013.
Kewal D. Malde, Anshu Chittora, and Hemant A. Patil, “Classification of fricative using novel modulation spectrogram based features,” in P. Maji et al. (Eds.) PReMI, Lecture Notes in Computer Science (LNCS), Springer-Verlag, Berlin Heidelberg, Germany, vol. 8251, pp. 134-139, 2013.
Nirmalya Sen, Hemant A. Patil, Shyamal Kr. Das Mandal and Sreenivasa Rao K, “Importance of utterance partitioning in SVM classifier with GMM supervectors for text-independent speaker verification,” in R. Prasath and T. Kathirvalavakumar (Eds.): MIKE 2013, Lecture Notes in Artificial Intelligence (LNAI), vol. 8284, Springer, pp. 780–789, 2013.
Hemant A. Patil, Maulik C. Madhavi, Rahul Jain, and Alok Kumar Jain, “Combining evidences from temporal and spectral features for person recognition from humming,” in Malay K. Kundu et al. (Eds.) PerMIn, Lecture Notes in Computer Science (LNCS), vol. 7143, pp. 321-328, Springer-Verlag, 2012.
Hemant A. Patil, Parth A. Goswami, and T. K. Basu, “Novel interleaving schemes for speaker recognition over lossy networks,” in Malay K. Kundu et al. (Eds.) PerMIn, Lecture Notes in Computer Science (LNCS), vol. 7143, Springer-Verlag, pp. 329-337, 2012.
Hemant A. Patil, Pallavi N. Baljekar, and T. K. Basu, “Novel temporal and spectral features derived from TEO for classification of normal and dysphonic voices,” Frontiers in Computer Education, Advances in Intelligent Computing, Springer-Verlag, vol. 133, pp. 559-567, 2012.
Hemant A. Patil, Aaron E. Cohen, and Keshab K. Parhi, “Speaker identification over narrowband VoIP networks,” Amy Neustein and Hemant A. Patil (Eds.), Forensic Speaker Recognition: Law Enforcement and Counter-terrorism, Springer-Verlag, New York, USA, pp. 125-151, 2011.
Hemant A. Patil, “‘Cry Baby’: Using spectrographic analysis to assess neonatal health status from an infant’s cry,” in Amy Neustein (Ed.), Advances in Speech Recognition: Mobile Environments, Call Centers and Clinics, Springer-Verlag, pp. 323-348, 2010.
Hemant A. Patil and Keshab K. Parhi, “Variable length Teager energy based Mel cepstral features for identification of twins,” in S. Chaudhury et al. (Eds.) PReMI, Lecture Notes in Computer Science (LNCS), Springer-Verlag, Berlin Heidelberg, Germany, vol. 5909, pp. 525-530, 2009.
Hemant A. Patil, Robin Jain, and Prakhar Jain, “Identification of speakers from their hum,” P. Sojka et al. (Eds.) TSD, Lecture Notes in Artificial Intelligence (LNAI), Springer-Verlag, Berlin Heidelberg, Germany, pp. 461-468, 2008.
Hemant A. Patil and T. K. Basu, “A novel approach to language identification using modified polynomial networks,” B. Prasad and S.R.M. Prasanna (Eds.), Speech, Audio, Image and Biomedical Signal Processing using Neural Networks, Studies in Computational Intelligence, Springer-Verlag, Berlin Heidelberg, Germany, vol. 83, pp. 117-144, March 2008.
Hemant A. Patil and T. K. Basu, “Cepstral domain Teager energy for identifying perceptually similar languages” in A. Ghosh et al. (Eds.), PReMI, Lecture Notes in Computer Science (LNCS), Springer-Verlag, Berlin Heidelberg, Germany, vol. 4815, pp. 455-462, 2007.
Hemant A. Patil and T. K. Basu, “Design of cubic spline wavelet for open set speaker classification in Marathi,” Q. Huo et al. (Eds.) ISCSLP, Lecture Notes in Artificial Intelligence (LNAI), Springer-Verlag, Berlin Heidelberg, Germany, vol. 4274, pp. 126-137, 2006.
Hemant A. Patil, P. K. Dutta, and T. K. Basu, “The wavelet packet based cepstral features for open set speaker classification in Marathi,” M. Spiliopoulou et al. (Eds.), Studies in Classification, Data Analysis, and Knowledge Organization, Springer-Verlag, Berlin Heidelberg, Germany, pp. 134-141, 2006.
Hemant A. Patil, P. K. Dutta, and T. K. Basu, “Person authentication using voice biometrics,” J. Dittmann et al. (Eds.), New Advances in Multimedia Security, Biometrics, Watermarking and Cultural Aspects, pp. 119-134, Logos Verlag Berlin, Germany, 2006.
Hemant A. Patil, P. K. Dutta, and T. K. Basu, “On the mono-lingual and cross-lingual speaker identification for Indian and European languages,” J. Dittmann et al. (Eds.), New Advances in Multimedia Security, Biometrics, Watermarking and Cultural Aspects, pp. 213-220, Logos Verlag Berlin, Germany, 2006.
Hemant A. Patil and T. K. Basu, “The Teager energy based features for identification of identical twins in multilingual environment,” N.R. Pal et al. (Eds.): ICONIP, Lecture Notes in Computer Science (LNCS), Springer-Verlag, Berlin Heidelberg, Germany, vol. 3316, pp. 333-337, 2004.
National Book Chapters
Hemant A. Patil and T. K. Basu, “Speech corpora for speaker classification experiments in Indian languages”, in ICET, D. K. Mishra and P. N. Ramachandran (Eds.) Allied Publishers, pp. 71-78, Dec. 22-24, 2004.
Hemant A. Patil and T. K. Basu, “Identification of twins in Hindi by Teager energy Mel cepstrum”, in ICET, D. K. Mishra and P. N. Ramachandran (Eds.) Allied Publishers, pp. 79-87, Dec. 22-24, 2004.
International Conferences
Monil Charola, Aastha Kachhi, and Hemant A. Patil, "Whisper Encoder features for Infant Cry Classification," in INTERSPEECH, Dublin, Ireland, ISCA, 20-24 Aug. 2023, pp. 1773-1777.
Siddharth Rathod, Monil Charola, Akshat Vora, Yash Jogi, and Hemant A. Patil, "Whisper features for dysarthric severity-level classification," in INTERSPEECH, Dublin, Ireland, 20-24 Aug. 2023.
Uthiraa S., and Hemant A. Patil, "Analysis of emotions in speech using AESDD," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2023), Taipei, Taiwan, 31 Oct.-3 Nov. 2023.
Priyanka Gupta, Piyushkumar K. Chodingala, and Hemant A. Patil, "Relevance of quadrature phase for replay detection in voice assistants (VAs)," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2023), Taipei, Taiwan, 31 Oct.-3 Nov. 2023.
Uthiraa S., Akshat Vora, Prathamesh Bonde, Aditya Pusuluri, and Hemant A. Patil, "Spectral and pitch components of CQT spectrum for emotion recognition," accepted in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2023), Taipei, Taiwan, 31 Oct.-3 Nov. 2023.
Baveet Singh Hora, Krishna Parmar, Shrey Machhar, and Hemant A. Patil, "Exploring residual cepstral features for spoken language identification," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2023), Taipei, Taiwan, 31 Oct.-3 Nov. 2023.
Priyanka Gupta, Aastha Kachhi, and Hemant A. Patil, "Classification of normal vs. pathological infant cries using Morse wavelets," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2023), Taipei, Taiwan, 31 Oct.-3 Nov. 2023.
Siddharth Rathod, Aastha Kachhi, Priyanka Gupta, and Hemant A. Patil, "Cochlear filter based cepstral features for dysarthric severity-level classification," in 31st European Signal Processing Conference (EUSIPCO 2023), Helsinki, Finland, 04-08 Sept. 2023.
Hastin Modi, Maitreya Patel, and Hemant A. Patil, "Attentions for short duration speech classification," in 31st European Signal Processing Conference (EUSIPCO 2023), Helsinki, Finland, 04-08 Sept. 2023.
Swati Shukla, Hemant A. Patil, Deepak K. Ghodgaonkar, and CVN Rao, "Target detection improvement using synchrosqueezing transform based IF estimation in SFCW radars," accepted in India Geoscience and Remote Sensing Symposium (InGARSS-2023), IIITB, Bangalore, India, 10-13 Dec. 2023.
Hemant A. Patil, Ankur T. Patil and Aastha Kachhi, "Constant Q cepstral coefficients for classification of normal vs. pathological infant cry," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore, IEEE, 23-27 May 2022, pp. 7392-7396.
Anand Therattil, Priyanka Gupta, Piyushkumar K. Chodingala and Hemant A. Patil, "Cross-Database Evaluation for Detection of One-Point and Two-Point Replay Spoofed Speech Attacks," in Speaker Odyssey, The Speaker and Language Recognition Workshop, Beijing, China, June 28 - July 01, 2022.
Anand Therattil, Aastha Kachhi, and Hemant A. Patil, "Cross-Teager Cepstral Coefficients For Dysarthric Severity-Level Classification," in Speech for Social Good Workshop (S4SG), a satellite event of INTERSPEECH, Incheon, Korea, Sept. 24-25, 2022.
Priyanka Gupta, Piyushkumar K. Chodingala and Hemant A. Patil, "Energy Separation Based Instantaneous Frequency Estimation from Quadrature and In-phase Components for Replay Spoof Detection," In 30th European Signal Processing Conference (EUSIPCO 2022), Belgrade, Serbia, IEEE, 29 Aug. - 02 Sept., 2022. pp. 369-373
Priyanka Gupta and Hemant A. Patil, "Linear Frequency Residual Cepstral Features for Replay Spoof Detection on ASVSpoof 2019," In 30th European Signal Processing Conference (EUSIPCO 2022), Belgrade, Serbia, IEEE, 29 Aug. - 02 Sept., 2022. pp. 349-353
Priyanka Gupta, Piyushkumar K. Chodingala and Hemant A. Patil, "Morlet Wavelet-Based Voice Liveness Detection Using Convolutional Neural Network," In 30th European Signal Processing Conference (EUSIPCO 2022), Belgrade, Serbia, IEEE, 29 Aug. - 02 Sept., 2022. pp. 100-104
Ankur T. Patil, Aastha Kachhi and Hemant A. Patil, "Subband Teager Energy Representations for Infant Cry Analysis and Classification," In 30th European Signal Processing Conference (EUSIPCO 2022), Belgrade, Serbia, IEEE, 29 Aug. - 02 Sept., 2022. pp. 1313-1317
Ankur T. Patil, Kuldeep Khoria and Hemant A. Patil, "Voice Liveness Detection using Constant-Q Transform-Based Features," In European Signal Processing Conference (EUSIPCO 2022), Belgrade, Serbia, IEEE, 29 Aug. -02 Sept. 2022, pp. 110-114
Aastha Kachhi, Priyanka Gupta and Hemant A. Patil, "Features Motivated from Uncertainty Principle for Classification of Normal vs. Pathological Infant Cry," in European Signal Processing Conference (EUSIPCO 2022), Belgrade, Serbia, 29 Aug.-02 Sept., 2022.
Hemant A. Patil, Rajul Acharya, Ankur T. Patil and Priyanka Gupta, "Non-Cepstral Uncertainty Vector for Replay Spoofed Speech Detection," In European Signal Processing Conference (EUSIPCO 2022), Belgrade, Serbia, IEEE, 29 Aug. - 02 Sept., 2022. pp. 374-378
Priyanka Gupta, Piyushkumar K. Chodingala and Hemant A. Patil, "Significance of Quadrature and In-Phase Components for Synthetic Spoofed Speech Detection," In Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2022), Chiang Mai, Thailand, 7-10 Nov. 2022, pp. 1252-1258
Madhu R. Kamble, Anand Therattil, Hemant A. Patil, M. Ali Basha Shaik, and Vikram Vij, "Smoothed Teager Energy Cepstral Feature for Replay Attack Detection on Voice Assistants," In Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2022), Chiang Mai, Thailand, 7-10 Nov. 2022, pp. 82-88
Aastha Kachhi, Anand Therattil, Ankur T. Patil, Hardik B. Sailor and Hemant A. Patil, "Teager Energy Cepstral Coefficients for Classification of Dysarthric Speech Severity-Level," In Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2022), Chiang Mai, Thailand, 7-10 Nov. 2022, pp. 1462-1468
Madhu R. Kamble, and Hemant A. Patil, "The Impact of Room Acoustics on Replay Speech Signal," In 13th International Symposium on Chinese Spoken Language Processing (ISCSLP 2022), Singapore, 11-14 Dec. 2022, pp. 105-109
Aastha Kachhi, Shreya Chaturvedi, Hemant A. Patil, and Dipesh Kumar Singh, "Data Augmentation for Infant Cry Classification," In 13th International Symposium on Chinese Spoken Language Processing (ISCSLP 2022), Singapore, 11-14 Dec. 2022, pp. 433-435.
Priyanka Gupta, and Hemant A. Patil, "Effect of Speaker-Microphone Proximity on Pop Noise: Continuous Wavelet Transform-Based Approach," In 13th International Symposium on Chinese Spoken Language Processing (ISCSLP 2022), Singapore, 11-14 Dec. 2022, pp. 110-114.
Priyanka Gupta, Piyush Chodingala and Hemant A. Patil, "Morse Wavelet Features for Pop Noise Detection," In IEEE International Conference on Signal Processing and Communications (SPCOM 2022), IISc Bangalore, India, IEEE, 11-15 Jul. 2022, pp. 1-5
Shreya Chaturvedi, Hardik B. Sailor and Hemant A. Patil, "Noisy Student Teacher Training with Self Supervised Learning for Children ASR," In IEEE International Conference on Signal Processing and Communications (SPCOM 2022), IISc Bangalore, India, IEEE, 11-15 Jul. 2022, pp. 1-5
Piyush Chodingala, Shreya Chaturvedi, Ankur T. Patil and Hemant A. Patil, "Robustness of DAS Beamformer Over MVDR for Replay Attack Detection On Voice Assistants," In IEEE International Conference on Signal Processing and Communications (SPCOM 2022), IISc Bangalore, India, IEEE, 11-15 Jul. 2022, pp. 1-5
Gauri P. Prajapati, Dipesh Kumar Singh and Hemant A. Patil, "Significance of Distance Measures for Speaker Anonymization," In IEEE International Conference on Signal Processing and Communications (SPCOM 2022), IISc Bangalore, India, IEEE, 11-15 Jul. 2022, pp. 1-5
Rajul Acharya, Harsh Kotta, Ankur T. Patil, and Hemant A. Patil, "Cross-Teager energy cepstral coefficients for replay spoof detection on voice assistants," IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 Jun. 2021, pp. 6364-6368.
Gauri Prajapati, Dipesh Kumar Singh, Preet Amin, and Hemant A. Patil, "Voice privacy through x-vector and CycleGAN-based anonymization," Proc. INTERSPEECH 2021, Brno, Czechia, 30 Aug.-3 Sep. 2021, pp. 1684-1688.
Dipesh Kumar Singh, Preet Amin, Hardik Sailor, and Hemant A. Patil, "Data Augmentation Using CycleGAN for End-to-End Children ASR," in European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland, 23-27 Aug. 2021.
Nirmesh J. Shah, M. Ali Basha Shaik, Periyasamy P., Vikram Vij, and Hemant A. Patil, "Exploring Phase-Based Features for Whisper vs. Speech Classification," in European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland, 23-27 Aug. 2021.
Shrishti Singh, Kuldeep Khoria, and Hemant A. Patil, "Modified Group Delay Cepstral Coefficients (MGDCC) for Voice Liveness Detection," in European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland, 23-27 Aug. 2021.
Kuldeep Khoria, Ankur T. Patil, and Hemant A. Patil, "Significance of Constant-Q Transform for Voice Liveness Detection," in European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland, 23-27 Aug. 2021.
Madhu R. Kamble, Shekhar Nayak, M. Ali Basha Shaik, Vikram Vij, and Hemant A. Patil, "Teager Energy Subband Filtered Features for Near and Far-Field Automatic Speech Recognition," in 13th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC 2021), Tokyo, Japan, 14-17 Dec. 2021.
Siddhant Gupta, Kuldeep Khoria, Ankur T. Patil, and Hemant A. Patil, "Deep Convolutional Neural Network for Voice Liveness Detection," in 13th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC 2021), Tokyo, Japan, 14-17 Dec. 2021.
Ankur T. Patil, and Hemant A. Patil, "Significance of CMVN for Replay Spoof Detection," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), New Zealand, Dec. 7-10, 2020.
Priyanka Gupta, Gauri Prajapati, Shrishti Singh, Madhu R. Kamble, and Hemant A. Patil, "Design of Voice Privacy System using Linear Prediction," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), New Zealand, Dec. 7-10, 2020.
Harsh Kotta, Ankur T. Patil, Rajul Acharya, and Hemant A. Patil, "Subband Channel Selection Using TEO for Replay Spoof Detection in Voice Assistants," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), New Zealand, Dec. 7-10, 2020.
Kirtana S. Phatnani and Hemant A. Patil, “The Symmetry in the Structure of Musical Nodes,” in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), New Zealand, Dec. 7-10, 2020.
Neil Shah, Sreeraj R., Maulik C. Madhavi, Nirmesh J. Shah, Hemant A. Patil, “Query-by-example Spoken Term Detection Using Generative Adversarial Network,” in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), New Zealand, Dec. 7-10, 2020.
Madhu R. Kamble and Hemant A. Patil, "Novel Variable Length Teager Energy Profiles for Replay Spoof Detection", in Speaker Odyssey, Tokyo, Japan, May 18-21, 2020.
Madhu R. Kamble, Aditya Krishna Sai Pulikonda, Maddala Venkata Siva Krishna, and Hemant A. Patil, "Analysis of Teager Energy Profiles for Spoof Speech Detection", in Speaker Odyssey, Tokyo, Japan, May 18-21, 2020.
Harshit Malaviya, Jui Shah, Maitreya Patel, Jalansh Munshi, and Hemant A. Patil, “MSPEC-Net: Multi-Domain Speech Conversion Network,” IEEE Int. Conf. Acoust. Speech and Signal Process. (ICASSP), Barcelona, Spain, pp. 7764-7768, May 4-8, 2020.
Mirali Purohit, Maitreya Patel, Harshit Malaviya, Ankur Patil, Mihir Parmar, Nirmesh J. Shah, Savan Doshi, Hemant A. Patil, “Intelligibility Improvement of Dysarthric Speech using MMSE DiscoGAN,” in Int. Conf. on Signal Processing and Communications (SPCOM), IISc Bengaluru, July 20-23, 2020.
Divyesh Rajpura, Jui Shah, Maitreya Patel, Harshit Malaviya, Kirtana Phatnani, Hemant A. Patil, “Effectiveness of Transfer Learning on Singing Voice Conversion in the Presence of Background Music,” in Int. Conf. on Signal Processing and Communications (SPCOM), IISc Bengaluru, July 20-23, 2020.
Kuldeep Khoria, Madhu Kamble and Hemant A. Patil, "Teager Energy Cepstral Coefficients for Classification of Normal vs. Whisper Speech," in European Signal Processing Conference (EUSIPCO), Amsterdam, The Netherlands, 24-28 August 2020.
Mirali Purohit, Mihir Parmar, Maitreya Patel, Harshit Malaviya and Hemant A. Patil, "Weak Speech Supervision: A Case Study of Dysarthria Severity Classification," in European Signal Processing Conference (EUSIPCO), Amsterdam, The Netherlands, 24-28 August 2020.
Gauri Prajapati, Madhu Kamble and Hemant A. Patil, "Energy Separation Based Features for Replay Spoof Detection for Voice Assistant," in European Signal Processing Conference (EUSIPCO), Amsterdam, The Netherlands, 24-28 August 2020.
Maitreya Patel, Mirali Purohit, Jui Shah and Hemant A. Patil, "CINC-GAN for Effective Fo Prediction for Whisper-to-Normal Speech Conversion," in European Signal Processing Conference (EUSIPCO), Amsterdam, The Netherlands, 24-28 August 2020.
Rajul Acharya, Hemant A. Patil, Harsh Kotta, "Novel Enhanced Teager Energy Based Cepstral Coefficients for Replay Spoof Detection," in The 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2019), Sentosa, Singapore, 14-18 December 2019.
Nirmesh Shah, Hardik Sailor, and Hemant Patil, "Whether To Pretrain DNN or Not?: An Empirical Analysis for Voice Conversion", in INTERSPEECH, Graz, Austria, 2019.
Nirmesh Shah and Hemant Patil, "Phone Aware Nearest Neighbor Technique using Spectral Transition Measure for Non-Parallel Voice Conversion", in INTERSPEECH, Graz, Austria, 2019.
Ankur T. Patil, Rajul Acharya, Pulikonda Aditya Sai, Hemant A. Patil, "Combining evidences of auditory transform and ESA algorithm for replay spoof detection", in INTERSPEECH, Graz, Austria, 2019.
Hemant A. Patil, "Combining Evidences from Variable Teager Energy Source and Mel Cepstral Features for Classification of Normal vs. Pathological Voices," in European Signal Processing Conference (EUSIPCO), A Coruña, Spain, September 2-6, 2019.
Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Maitreya Patel and Hemant A. Patil, "Effectiveness of Cross-Domain Architectures for Whisper-to-Normal Speech Conversion," in European Signal Processing Conference (EUSIPCO), A Coruña, Spain, September 2-6, 2019.
Hemant A. Patil, Srikant Viswanath, "Energy Separation Algorithm Based Spectrum Estimation for Very Short Duration of Speech," in European Signal Processing Conference (EUSIPCO), A Coruña, Spain, September 2-6, 2019.
Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh J. Shah and Hemant A. Patil, "Novel Inception-GAN for Whispered-to-Normal Speech Conversion," in The 10th ISCA Speech Synthesis Workshop (SSW), Vienna, Austria, 20-22 Sep. 2019.
Madhu R. Kamble, Aditya Krishna Sai Pulikonda, Maddala Venkata Siva Krishna, Ankur Patil, Rajul Acharya and Hemant A. Patil, "Speech Demodulation-based Techniques for Replay and Presentation Attack Detection", in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Lanzhou, China, November 18-21, 2019.
Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh J. Shah and Hemant A. Patil, "Novel Adaptive Generative Adversarial Network for Voice Conversion", in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Lanzhou, China, November 18-21, 2019.
Nirmesh J. Shah and Hemant A. Patil, "Novel metric learning for alignment task in voice conversion," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, May 12-17, 2019.
Madhu Kamble and Hemant A. Patil, "Analysis of reverberation via Teager energy features via replay speech detection," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, May 12-17, 2019. [Poster]
Hemant A. Patil, Madhu R. Kamble, "A Survey on Replay Attack Detection for Automatic Speaker Verification (ASV) System," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Honolulu, Hawaii, USA, 12-15 November 2018.
Nirmesh J. Shah, Sreeraj R., Neil Shah, Hemant A. Patil, "Novel Unsupervised Sorted GMM Posteriorgram for DNN and GAN-based Voice Conversion Framework," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Honolulu, Hawaii, USA, 12-15 November 2018.
Neil Shah, Hemant A. Patil, Meet H. Soni, "Time-Frequency Mask-based Speech Enhancement using Convolutional Generative Adversarial Network," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Honolulu, Hawaii, USA, 12-15 November 2018.
Prasad Tapkir, Madhu R. Kamble, Hemant A. Patil, "Replay Spoof Detection using Power Function Based Features," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Honolulu, Hawaii, USA, 12-15 November 2018.
Prasad Tapkir, Hemant A. Patil, "Significance of Teager Energy Operator Phase for Replay Spoof Detection," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Honolulu, Hawaii, USA, 12-15 November 2018.
Prasad Tapkir, Ankur T. Patil, Neil Shah, Hemant A. Patil, "Novel Spectral Root Cepstral Features for Replay Spoof Detection," in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) 2018, Honolulu, Hawaii, USA, 12-15 November 2018.
Nirmesh J. Shah, Mihir Parmar, Neil Shah, and Hemant A. Patil, "Novel MMSE DiscoGAN for Cross-Domain Whisper-to-Speech Conversion," in Machine Learning in Speech and Language Processing Workshop, Google, Hyderabad, India, Sept. 07, 2018.
Ankur Patil, Siva Krishna Maddala, Mehak Piplani, Aditya Sai Pulikonda, Hardik B. Sailor and Hemant Patil, "DA-IICT/IIITV System for the 5th CHiME 2018 Challenge", in 5th CHiME 2018 Challenge, Hyderabad, Sept. 07, 2018.
Hardik B. Sailor and Hemant A. Patil, "Neural Networks-based Automatic Speech Recognition for Agricultural Commodity in Gujarati Language," in 6th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU'18), Gurugram, India on 29-31 Aug. 2018.
Hardik B. Sailor, Ankur T. Patil and Hemant A. Patil, "Advances in Low Resource ASR: A Deep Learning Perspective," in 6th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU'18), Gurugram, India on 29-31 Aug. 2018.
Srinivas Kantheti and Hemant A. Patil, "Relative Phase Shift Features for Replay Spoof Detection System," in 6th International Workshop on Spoken Language Technologies for Under-resourced Languages(SLTU'18), Gurugram, India on 29-31 Aug. 2018.
Ami Gandhi, and Hemant A. Patil, "Feature Extraction from Temporal Phase for Speaker Recognition," in International Conference on Signal Processing and Communications, IISc Bangalore, 16-19 Jul. 2018.
Hardik B. Sailor, and Hemant A. Patil, "Auditory Filterbank Learning Using ConvRBM for Infant Cry Classification," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Hardik B. Sailor, Maddala Venkata Siva Krishna, Diksha Chhabra, Ankur Patil, Madhu Kamble, and Hemant A. Patil, "DA-IICT/IIITV System for Low Resource Speech Recognition Challenge 2018," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Hardik B. Sailor, Madhu Kamble, and Hemant A. Patil, "Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Hemlata Tak, and Hemant A. Patil, "Novel Linear Frequency Residual Cepstral Features For Replay Attack Detection," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Madhu Kamble, Hemlata Tak, and Hemant A. Patil, "Effectiveness of Speech Demodulation-Based Features for Replay Detection," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018. [Poster]
Madhu Kamble, and Hemant A. Patil, "Novel Variable Length Energy Separation Algorithm using Instantaneous Amplitude Features For Replay Detection," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018. [Poster]
Neil Shah, Nirmesh J. Shah, and Hemant A. Patil, "Effectiveness of Generative Adversarial Network for Non-Audible Murmur-to-Whisper Speech Conversion," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Nirmesh J. Shah, Maulik C. Madhavi, and Hemant A. Patil, "Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Nirmesh J. Shah, and Hemant A. Patil, "Effectiveness of Dynamic Features in INCA and Temporal Context-INCA," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Prasad Tapkir, and Hemant A. Patil, "Novel Empirical Mode Decomposition Cepstral Features for Replay Spoof Detection," in INTERSPEECH 2018, Hyderabad, India, 2-6 Sept. 2018.
Madhu Kamble, and Hemant A. Patil, "Novel Amplitude Weighted Frequency Modulation Features for Replay Spoof Detection," to appear in 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), Taipei, Taiwan, November 26-29, 2018.
Madhu Kamble, Hemlata Tak, V. S. K. Maddala, and Hemant A. Patil, "Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection," to appear in 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), Taipei, Taiwan, November 26-29, 2018. [Poster]
Kantheti Srinivas and Hemant A. Patil, "Combining Phase-based Features for Replay Spoof Detection System," to appear in 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), Taipei, Taiwan, November 26-29, 2018.
Meet H. Soni, Neil Shah, and H. A. Patil, “Effectiveness of speech enhancement using generative adversarial network,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Alberta, Canada, 15-20 April 2018.
Hardik B. Sailor and H. A. Patil, “Representation learning for speech recognition system in agriculture commodity for Gujarati” in Global Conference on Cyberspace (GCCS), Organized by MeitY, Govt. of India under National e-Governance Division (NeGD), New Delhi, India, 23-24 November 2017. [Poster]
Maulik C. Madhavi and Hemant A. Patil, "Combining evidences from detection sources for query-by-example spoken term detection," in IEEE Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Kuala Lumpur, Malaysia, 12-15 December, 2017.
Nirmesh J. Shah, Pramod B. Bachhav and Hemant A. Patil, "A novel filtering-based Fo estimation algorithm with an application to Voice Conversion," in IEEE Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Kuala Lumpur, Malaysia, 12-15 December, 2017.
Nirmesh J. Shah and Hemant A. Patil, "On the convergence of INCA algorithm," in IEEE Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Kuala Lumpur, Malaysia, 12-15 December, 2017.
M. R. Kamble and H. A. Patil, “Novel energy separation based instantaneous frequency features for spoof speech detection," in European Signal Processing Conference (EUSIPCO), (Kos Island, Greece), pp. 116-120, 2017.
Maulik C. Madhavi and Hemant A. Patil, "VTLN-warped Gaussian posteriorgram for QbE-STD," in 25th European Signal Processing Conference (EUSIPCO), Kos island, Greece, 28 August - 2 September, 2017.
Meet Soni and Hemant A. Patil, "Effectiveness of ideal ratio mask for non-intrusive quality assessment of noise suppressed speech," in 25th European Signal Processing Conference (EUSIPCO), Kos island, Greece, 28 August - 2 September, 2017.
Pramod Bachhav and Hemant A. Patil, "A novel filterbank for epoch estimation," in 25th European Signal Processing Conference (EUSIPCO), Kos island, Greece, 28 August - 2 September, 2017.
Dharmesh kumar Agrawal, Hardik B. Sailor, Meet Soni and Hemant A. Patil, "Novel TEO-based gammatone features for environmental sound classification," in 25th European Signal Processing Conference (EUSIPCO), Kos island, Greece, 28 August - 2 September, 2017.
H. A. Patil, M. R. Kamble, T. B. Patel, and M. Soni, “Novel variable length Teager energy separation based instantaneous frequency features for replay detection," in INTERSPEECH, Stockholm, Sweden, pp. 12-16, 2017.
H. B. Sailor, M. R. Kamble, and H. A. Patil, “Unsupervised representation learning using convolutional restricted Boltzmann machine for spoof speech detection," in INTERSPEECH, Stockholm, Sweden, pp. 2601-2605, 2017.
Hardik B. Sailor, Dharmesh Agrawal and Hemant A. Patil, "Unsupervised filterbank learning using convolutional restricted Boltzmann machine for environmental sound classification," in INTERSPEECH 2017, Stockholm, Sweden, 20-24 August 2017.
Meet Soni, Rishabh Tak and Hemant A. Patil, "Novel shifted real spectrum for exact signal reconstruction," in INTERSPEECH 2017, Stockholm, Sweden, 20-24 August 2017.
Madhu R. Kamble and Hemant A. Patil, "Novel energy separation based frequency modulation features for spoofed speech classification," in 9th International Conference on Advances in Pattern Recognition (ICAPR) Bangalore, India, 2017.
Maulik C. Madhavi and Hemant A. Patil, "Two stage zero resource approaches for QbE-STD," in 9th International Conference on Advances in Pattern Recognition (ICAPR) Bangalore, India, 2017.
Hardik B. Sailor, Hemant A. Patil and Avni Rajpal, “Unsupervised Filterbank Learning for Speech-based Access System for Agricultural Commodity,” 9th International Conference on Advances in Pattern Recognition (ICAPR) Bangalore, India, 2017.
Meet Soni, Manisha Sharma, Hardik B. Sailor and Hemant A. Patil, “Subband autoencoder features for Automatic Speech Recognition,” in 9th International Conference on Advances in Pattern Recognition (ICAPR) Bangalore, India, 2017.
Nirmesh J. Shah and Hemant A. Patil, "Novel amplitude scaling method for bilinear frequency warping-based voice conversion," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans USA, 5-9 March 2017.
Avni Rajpal, Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil, "Quality assessment of voice converted speech using articulatory features," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans USA, March 5-9, 2017
Meet H. Soni, Tanvina B. Patel and Hemant A. Patil, "Novel subband autoencoder features for detection of spoofed speech," in INTERSPEECH 2016, San Francisco, USA, Sept. 08-12, 2016.
Meet H. Soni and Hemant A. Patil, "Novel subband autoencoder features for non-intrusive quality assessment of noise suppressed speech," in INTERSPEECH 2016, San Francisco, USA, Sept. 08-12, 2016.
Hardik B. Sailor and Hemant A. Patil, "Unsupervised deep auditory model using stack of convolutional RBMs for speech recognition," in INTERSPEECH 2016, San Francisco, USA, Sept. 08-12, 2016.
Himanshu Bhavsar, Tanvina B. Patel and Hemant A. Patil, "Novel nonlinear prediction based features for spoofed speech detection," in INTERSPEECH 2016, San Francisco, USA, Sept. 08-12, 2016.
Avni Rajpal, Tanvina B. Patel, Hardik B. Sailor, Maulik C. Madhavi, Hemant A. Patil and Hiroya Fujisaki, "Native language identification using spectral and source-based features," in INTERSPEECH 2016, San Francisco, USA, Sept. 08-12, 2016.
Hardik B. Sailor and Hemant A. Patil, "Unsupervised learning of temporal receptive fields using convolutional RBM for ASR task," in 24th European Signal Processing Conference (EUSIPCO), Hilton Budapest, Hungary, 29 August- 02 September, 2016.
Meet H. Soni and Hemant A. Patil, "Novel deep autoencoder features for non-intrusive speech quality assessment," in 24th European Signal Processing Conference (EUSIPCO), Hilton Budapest, Hungary, 29 August- 02 September, 2016.
Sushant V. Rao, Nirmesh J. Shah, Hemant A. Patil, "Novel pre-processing using outlier removal in voice conversion," in 9th ISCA Speech Synthesis Workshop (SSW 09), Sunnyvale, California, USA, Sept. 13-15, 2016.
Avni Rajpal and Hemant A. Patil, "Jerk minimization for acoustic-to-articulatory inversion," in 9th ISCA Speech Synthesis Workshop (SSW 09), Sunnyvale, California, USA, Sept. 13-15, 2016.
Meet H. Soni, Hemant A. Patil, "Non-intrusive quality assessment of synthesized speech using spectral features and support vector regression," 9th ISCA Speech Synthesis Workshop (SSW 09), Sunnyvale, California, USA, Sept. 13-15, 2016.
Mohammadi Zaki, Hardik B. Sailor and Hemant A. Patil, "Analysis of hierarchical bottleneck framework for improved phoneme recognition," in International Conference on Signal Processing and Communications (SPCOM), IISc Bangalore, India, 12-15 June, 2016.
Deep Gandhi, Tanvina B. Patel and Hemant A. Patil, "A novel lowpass filtering-based approach for estimating strength of excitation from speech signal," in International Conference on Signal Processing and Communications (SPCOM), IISc Bangalore, India, 12-15 June, 2016.
Maulik C. Madhavi and Hemant A. Patil, "Modification in sequential dynamic time warping for fast computation of query-by-example spoken term detection task," in International Conference on Signal Processing and Communications (SPCOM), IISc Bangalore, India, 12-15 June, 2016.
Tanvina B. Patel and Hemant A. Patil, “Effectiveness of fundamental frequency (Fo) and strength of excitation (SOE) for spoofed speech detection," in Proc. Int. Conf. Acoust., Speech and Signal Process., ICASSP’16, Shanghai, China, March 20-25, 2016.
Tanvina B. Patel and Hemant A. Patil, “Analysis of natural vs. synthetic speech using Fujisaki model for low resourced language,” in Proc. Int. Conf. Acoust., Speech and Signal Process., ICASSP’16, Shanghai, China, March 20-25, 2016.
Hardik B. Sailor and Hemant A. Patil, "Filterbank learning using convolutional restricted Boltzmann machine for speech recognition," in Proc. Int. Conf. Acoust., Speech and Signal Process., ICASSP’16, Shanghai, China, March 20-25, 2016.
Tanvina B. Patel and Hemant A. Patil, “Combining evidences from Mel cepstral, cochlear filter cepstral and instantaneous frequency features for detection of natural vs. spoofed speech,” in the 16th Annual Conference of International Speech Communication Association (ISCA), INTERSPEECH'15, Dresden, Germany, September 6-10, 2015.
Maulik C. Madhavi, Hemant A. Patil and Bhavik B. Vachhani, “Spectral transition measure for detection of obstruents,” in the 23rd European Signal Processing Conference (EUSIPCO 2015), Nice, France, 31st August - 4th September, 2015. [Poster]
Mohammadi Zaki, Nirmesh J. Shah and Hemant A. Patil, “Novel fractal dimension-based features for automatic speech recognition,” in the 23rd European Signal Processing Conference (EUSIPCO 2015), Nice, France, 31st August - 4th September, 2015.
Anshu Chittora and Hemant A. Patil, “Classification of normal and pathological infant cries using bispectrum features," in the 23rd European Signal Processing Conference (EUSIPCO 2015), Nice, France, 31st August - 4th September, 2015.
Pramod B. Bachhav, Hemant A. Patil and Tanvina B. Patel, “A novel filtering based approach for epoch extraction,” in Proc. IEEE Int. Conf. Acoust., Speech and Signal Process., ICASSP’15, Brisbane, Australia, April 19-24, 2015.
Mohammadi Zaki, Hemant A. Patil and Chinmay Maheshwari, “Effectiveness of empirical mode decomposition in financial time series prediction,” accepted in 4th Int. Conf. on Advanced Data Analysis, Business Analytics and Intelligence (ICADABAI), IIM Ahmedabad, India, April 11-12, 2015.
Anshu Chittora, Hemant A. Patil and Hardik B. Sailor, “Spectro-temporal analysis of HIE and asthma infant cries using auditory spectrogram,” in International Conference on BioSignal Analysis, Processing and System (ICBAPS 2015) Kuala Lumpur Malaysia, on 26-28 May 2015.
Anshu Chittora and Hemant A. Patil, “Analysis of normal and pathological infant cries using bispectrum features derived using HOSVD,” International Conference on BioSignal Analysis, Processing and System (ICBAPS 2015) Kuala Lumpur Malaysia, on 26-28 May 2015. (Best Paper Award)
Hardik B. Sailor, Maulik C. Madhavi and Hemant A. Patil, “Significance of phase-based features for person recognition using humming,” in 2nd Int. Conf. on Perception and Machine Intelligence (PerMin), C-DAC, Kolkata, Feb. 26-27, 2015.
Purvi Agrawal and Hemant A. Patil, “Fusion of TEO Phase with MFCC features for speaker verification,” in 2nd Int. Conf. on Perception and Machine Intelligence (PerMin), C-DAC, Kolkata, Feb. 26-27, 2015.
Shubham Sharma and Hemant A. Patil, “Combining Evidences from Bark scale and Mel scale warped features for VTLN,” in 2nd Int. Conf. on Perception and Machine Intelligence (PerMin), C-DAC, Kolkata, Feb. 26-27, 2015.
Anshu Chittora, Hemant A. Patil and Kewal D. Malde, “Classification of stop consonants using modulation spectrogram-based features,” in 2nd Int. Conf. on Perception and Machine Intelligence (PerMin), C-DAC, Kolkata, Feb. 26-27, 2015.
Hemant A. Patil, Shubham Sharma and Maulik Madhavi, "Development of vocal tract length normalized phonetic engine for Gujarati and Marathi languages," in 17th Oriental COCOSDA Conference, Phuket, Thailand, 10-12 September 2014.
Anshu Chittora, Kewal D. Malde and Hemant A. Patil, “Obstruent classification using modulation spectrogram based features,” in 17th Oriental COCOSDA Conference, Phuket, Thailand, 10-12 September 2014.
Anshu Chittora and Hemant A. Patil, "Use of glottal inverse filtering for asthma and HIE infant cries classification,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Anshu Chittora and Hemant A. Patil, "Classification of phonemes using modulation spectrogram based features for Gujarati languages,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Mohammadi Zaki, Nirmesh Shah and Hemant A. Patil, "Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Nirmesh Shah, Mohammadi Zaki and Hemant A. Patil, "Influence of various asymmetrical contextual factors for TTS in a low resource language,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Purushottam Radadia and Hemant A. Patil, “Cepstral mean subtraction based features for singer identification,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Bhavik Vachhani, Kewal D. Malde, Maulik C. Madhavi and Hemant A. Patil, “A spectral transition measure based Mel cepstral features for obstruent detection,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Shubham Sharma, Maulik C. Madhavi, and Hemant A. Patil, “Vocal tract length normalization for vowel recognition in low resource languages,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Shubham Sharma, Maulik C. Madhavi and Hemant A. Patil, “Development of language resources for speech application in Gujarati and Marathi,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
S. Adarsa and Hemant A. Patil, “Nonlinear analysis of speech signals for classification of natural vs. HMM-based synthetic speech,” in Int. Conf. Asian Lang. Process. (IALP), Kuching, Sarawak, Oct. 20-22, 2014.
Ankur Undhad, Hemant A. Patil and Maulik C. Madhavi, “Exploiting speech source information for vowel landmark detection for low resource language,” in 9th Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’14, Singapore, 12-14 September 2014.
Mohammadi Zaki, Nirmesh Shah and Hemant A. Patil, "Effectiveness of fractal dimension for ASR in low resource language," in 9th Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’14, Singapore, 12-14 September 2014.
Anshu Chittora and Hemant A. Patil, "Classification of pathological infant cries using modulation spectrogram features," in 9th Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’14, Singapore, 12-14 September 2014.
Tanvina Patel and Hemant A. Patil, "Novel approach for estimating length of the vocal folds using Fujisaki model," in 9th Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’14, Singapore, 12-14 September 2014.
Maulik C. Madhavi and Hemant A. Patil, “A novel Mel spectral based magnitude and phase information for humming based biometrics,” in 9th Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’14, Singapore, 12-14 September 2014.
Hardik B. Sailor and Hemant A. Patil, “Fusion of magnitude and phase-based features for objective evaluation of TTS voice,” in 9th Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’14, Singapore, 12-14 September 2014.
Nirmesh J. Shah, Hemant A. Patil, Maulik C. Madhavi, Hardik B. Sailor and Tanvina B. Patel, “Deterministic annealing EM algorithm for developing Gujarati TTS system,” in 9th Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’14, Singapore, 12-14 September 2014.
Hemant A. Patil and Tanvina B. Patel, “Chaotic mixed excitation source for synthesis of speech signal,” in INTERSPEECH'14, Singapore, September 14-18, 2014.
Nirmesh J. Shah, Bhavik B. Vachhani, Hardik B. Sailor and Hemant A. Patil, “Effectiveness of PLP-Based Phonetic Segmentation for TTS in Gujarati,” in Proc. Int. Conf. Acoust., Speech and Signal Process., ICASSP’14, Florence, Italy, May 4-9, 2014.
Hemant Patil, Tanvina Patel, Swati Talesara, Nirmesh Shah, Hardik Sailor, Bhavik Vachhani, Janki Akhani, Bhargav Kanakiya, Yashesh Gaur and Vibha Prajapati, "Algorithms for Speech Segmentation at Syllable-Level for Text-to-Speech Synthesis System in Gujarati," in 16th International Oriental COCOSDA Conference, November 25 - 27, 2013. Gurgaon, INDIA.
Hemant A Patil, Tanvina B Patel, Nirmesh J Shah, Hardik B Sailor, Raghava Krishnan, G R Kasthuri, T Nagarajan, Lilly Christina, Naresh Kumar, Veera Raghavendra, S P Kishore, S R M Prasanna, Nagaraj Adiga, Sanasam Ranbir Singh, Konjengbam Anand, Pranaw Kumar, Bira Chandra Singh, S L Binil Kumar, T G Bhadran, T Sajini, Arup Saha, Tulika Basu, K Sreenivasa Rao, N P Narendra, Anil Kumar Sao, Rakesh Kumar, Pranhari Talukdar, Purnendu Acharyaa, Somnath Chandra, Swaran Lata, Hema A Murthy, “A Syllable-Based Framework for Unit Selection Synthesis in 13 Indian Languages,” in 16th International Oriental COCOSDA Conference, November 25 - 27, 2013. Gurgaon, INDIA.
Kewal D. Malde, Bhavik B. Vachhani, Maulik C. Madhavi, Nirav H. Chhayani and Hemant A. Patil, “Development of speech corpora in Gujarati and Marathi for phonetic transcription,” in 16th International Oriental COCOSDA Conference, November 25 - 27, 2013. Gurgaon, INDIA. [Poster]
Anshu Chittora and Hemant A. Patil, “Corpus design for infant cry analysis,” in 16th International Oriental COCOSDA Conference, November 25 - 27, 2013. Gurgaon, INDIA.
Nirav H. Chhayani and Hemant A. Patil, “Development of corpora for person recognition using humming, singing and speech,” in 16th International Oriental COCOSDA Conference, November 25 - 27, 2013. Gurgaon, INDIA.
Hemant A. Patil, Anshu Chittora and Kewal D. Malde, “Novel modulation spectrogram based features for obstruent classification,” in International Conference on Acoustics 2013, New Delhi, Nov. 10-15, 2013.
Bhavik B. Vachhani and Hemant A. Patil, “Use of PLP cepstral features for phonetic segmentation,” in Int. Conf. Asian Language Processing (IALP), Urumqi, China, August 17-19, 2013.
Swati Talesara, Hemant A. Patil, Tanvina Patel, Hardik Sailor and Nirmesh Shah, "A novel Gaussian filter-based automatic labeling of speech data for TTS system in Gujarati language," in Int. Conf. Asian Language Processing (IALP), Urumqi, China, August 17-19, 2013.
Hemant A. Patil and Tanvina B. Patel, “Nonlinear prediction of speech using Volterra-Wiener Series,” in INTERSPEECH’13, Lyon, France, August 25-29, 2013.
Hemant A. Patil, Maulik C. Madhavi, Kewal D. Malde and Bhavikkumar Vachhani, “Phonetic transcription of fricatives and plosives for Gujarati and Marathi languages,” in Int. Conf. Asian Language Processing (IALP), Hanoi, Vietnam, Nov. 13-15, 2012.
Hemant A. Patil, Maulik C. Madhavi, and Nirav Chhayani, “Person recognition using humming, singing and speech,” in Int. Conf. Asian Language Processing (IALP), Hanoi, Vietnam, Nov. 13-15, 2012.
Hemant A. Patil and Purushottam Radadia, “Combining evidences from Mel cepstral features and cepstral mean subtracted features for singer identification,” in Int. Conf. Asian Language Processing (IALP), Hanoi, Vietnam, Nov. 13-15, 2012.
Hemant A. Patil and Shrishail S. Gajbhar, “Acoustical analysis of musical pillar of great stage of Vitthala temple at Hampi, India,” in Int. Conf. on Signal Processing and Communications, SPCOM’12, IISc, Bangalore, India, 22-25 July, 2012.
Hemant A. Patil and Pallavi N. Baljekar, “Classification of normal and pathological voices using TEO phase and Mel cepstral features,” in Int. Conf. on Signal Processing and Communications, SPCOM’12, IISc, Bangalore, India, 22-25 July, 2012.
Hemant A. Patil and Tanvina B. Patel, “Novel chaotic titration method for analysis of normal and pathological voices,” in Int. Conf. on Signal Processing and Communications, SPCOM’12, IISc, Bangalore, India, 22-25 July, 2012.
Pallavi N. Baljekar and Hemant A. Patil, “A comparison of waveform fractal dimension techniques for voice pathology classification,” in Proc. Int. Conf. Acoust., Speech and Signal Proc., ICASSP’12, Kyoto, Japan, March 25-30, 2012.
Tanvina B. Patel, Hemant A. Patil and Kunal P. Acharya, "Analysis of normal and pathological voices based on nonlinear dynamics," in ICEEE’12, Ahmedabad, Feb. 12, 2012.
Hemant A. Patil and Maulik C. Madhavi, “Significance of magnitude and phase information via VTEO for humming based biometrics,” in International Conference on Biometrics (ICB), Delhi, India, March 30-April 1, 2012.
Hemant A. Patil, Maulik C. Madhavi and Keshab K. Parhi, "Combining evidence from spectral and source-like features for person recognition from humming," in INTERSPEECH’11, Florence, Italy, pp. 369-372, 28-31 August, 2011.
Hemant A. Patil and Pallavi N. Baljekar, "Novel VTEO based Mel cepstral features for classification of normal and pathological voices," in INTERSPEECH’11, Florence, Italy, pp. 509-512, 28-31 August, 2011.
Prakhar Kant Jain, Robin Jain, Hemant A. Patil and T. K. Basu, “Design of query-by-humming system using DDTW,” in Int. Conf. on Asian Lang. Process., IALP’11, Penang, Malaysia, pp. 240-243, Nov 15-17, 2011.
Nirmalya Sen, T. K. Basu and Hemant A. Patil, “New features extracted from Nyquist filter bank for text-independent speaker identification,” in IEEE INDICON’10, Kolkata, India, Dec. 17-19, 2010.
Hemant A. Patil and Keshab K. Parhi, “Novel variable length Teager energy based features for person recognition from their hum,” in Proc. Int. Conf. Acoust., Speech and Signal Proc., ICASSP’10, Dallas, Texas, USA, pp. 4526-4529, March 2010.
Hemant A. Patil and Keshab K. Parhi, “Development of TEO phase for speaker recognition,” in Int. Conf. on Signal Processing and Communications, SPCOM’10, IISc, Bangalore, India, pp. 1-5, 18-21 July, 2010.
Nirmalya Sen, T. K. Basu and Hemant A. Patil, “Significant improvement in the closed set text-independent speaker identification using features extracted from Nyquist filter bank,” Int. Conf. Industrial and Information Systems (ICIIS), pp. 303-308, July 29-Aug. 01, 2010.
Hemant A. Patil, Robin Jain and Prakhar Jain, “A novel approach to identification of speakers from their hum,” in 7th Int. Conf. Advances in Pattern Recognition, ICAPR, ISI Kolkata, IEEE Computer Society, pp. 167-170, Feb. 4-6, 2009.
Hemant A. Patil and T. K. Basu, “A novel modified polynomial networks design for dialect recognition,” 7th Int. Conf. Advances in Pattern Recognition, ICAPR, ISI Kolkata, IEEE Computer Society, pp. 175-178, Feb. 4-6, 2009.
Hemant A. Patil, Sunayana Sitaram and Esha Sharma, “DA-IICT cross-lingual and multilingual corpora for speaker recognition,” 7th Int. Conf. Advances in Pattern Recognition, ICAPR, ISI Kolkata, IEEE Computer Society, pp. 187-190, Feb. 4-6, 2009.
Hemant A. Patil, “Infant identification from their cry,” 7th Int. Conf. Advances in Pattern Recognition, ICAPR, ISI Kolkata, IEEE Computer Society, pp. 107-109, Feb. 4-6, 2009.
Nirmalya Sen, Hemant A. Patil and T. K. Basu, “A new transform for robust text-independent speaker identification,” in IEEE INDICON’09, Ahmedabad, India, Dec. 18-20, 2009.
Siddarth Rai Mahendra, Hemant A. Patil, and Narendra Kumar Shukla, “Pitch estimation of musical notes for Indian classical music,” in IEEE INDICON’09, Ahmedabad, India, Dec. 18-20, 2009.
Mayank Mishra and Hemant A. Patil, “Design and implementation of HMM-VQ based isolated digit recognition system,” in Special Session on Speech, Audio, Image and Video Processing using AI, IICAI’09, India, pp. 1754-1763, 16-18 Dec., 2009.
Vikrant Tomar and Hemant A. Patil, “On the development of variable length Teager energy operator (VTEO),” in INTERSPEECH’08, Brisbane, Australia, 22-26 September, pp. 1056-1059, 2008.
Hemant A. Patil and T. K. Basu, “Identifying phonetically similar languages using Teager energy based cepstrum,” in special session on “Frontiers of Language Processing and Information Retrieval for Asian Languages”, in Int. Conf. on Artificial Intelligence and Pattern Recognition, AIPR’07, Florida, USA, July 9-12, pp. 1-8, 2007.
Hemant A. Patil and T. K. Basu, “Advances in Speaker Recognition: A Feature Based Approach,” Int. Conf. Artificial Intelligence and Pattern Recognition, AIPR’07, Orlando, Florida, USA, July 9-12, pp. 528-537, 2007 (Invited Paper).
Neeharika Buddha and Hemant A. Patil, “Corpora for analysis of infant cry,” in Int. Conf. on Speech Databases and Assessments, Oriental COCOSDA’07, Hanoi, Vietnam, Dec. 4-6, 2007.
Nimish Singh and Hemant A. Patil, “Speech corpus for speaker recognition research and evaluation in Urdu,” in Int. Conf. on Speech Databases and Assessments, Oriental COCOSDA’07, Hanoi, Vietnam, Dec. 4-6, 2007.
Hemant A. Patil and T. K. Basu, “Designing neural network using polynomial RBF for language identification,” in Int. Conf. Neural Information Processing, ICONIP’07, pp. 107, Japan (Abstract only).
Hemant A. Patil and T.K. Basu, “Designing quadratic spline wavelet for subband based speaker classification,” in Workshop on Image and Signal Processing, WISP’07, IIT Guwahati, Dec. 28-29, 2007.
Hemant A. Patil, P. K. Dutta and T. K. Basu, “Effectiveness of LP based features for identification of professional mimics in Indian languages”, in Int. Workshop on Multimodal User Authentication, MMUA’06, Toulouse, France, May 11-12, 2006.
Hemant A. Patil and T.K. Basu, “A new data fusion technique and performance measure for identification of twins in Marathi,” in Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’06, Singapore, Special Session on Speaker Recognition, Companion volume, Dec. 2006.
Hemant A. Patil, S. Ghosh, A. Si and T. K. Basu, “Design of cross-lingual and multilingual corpora for speaker recognition research and evaluation in Indian languages,” in Int. Symp. Chinese Spoken Lang. Proc., ISCSLP’06, Singapore, Special Session on Multilingual Corpora Development, Companion volume, Dec. 2006.
Hemant A. Patil, Debee Prakash, Bikas Kar, Bishnu Bhatta, Biswajit Kar and T. K. Basu, “Corpora for speaker recognition research and evaluation in Oriya,” in IEEE Int. Conf. on Industrial Tech., IEEE ICIT’06, Dec. 15-17, 2006, Mumbai, INDIA.
Hemant A. Patil, P. K. Dutta and T. K. Basu, “On the investigation of spectral resolution problem for identification of female speakers in Bengali,” in Special Session on Person Authentication: Voice and other biometrics, IEEE Int. Conf. on Industrial Tech., IEEE ICIT’06, Dec. 15-17, 2006, Mumbai, INDIA (regarded as excellent paper by the esteemed reviewers).
Hemant A. Patil, P. K. Dutta and T. K. Basu, “Speaker classification using wavelet packet based features,” Presented in EU-India Culture Tech Workshop, IIT Kharagpur, Nov. 7-8, 2005.
Hemant A. Patil, P. K. Dutta and T. K. Basu, “The wavelet packet based cepstral features for open set speaker classification in Marathi,” Presented in 29th Annual Conference of the German Classification Society Otto-von-Guericke-University Magdeburg (GfKl 2005) "From Data and Information Analysis to Knowledge Engineering", pp. 199, March 9-11, 2005. (Abstract only)
Gagan Porwal, Hemant A. Patil and T. K. Basu, “Effect of speech coding on text-independent speaker identification”, in Int. Conf. on Intelligent Sensing and Information Processing, ICISIP’04, Chennai, pp. 415-420, Jan. 4-7, 2005.
Hemant A. Patil and T. K. Basu, “Detection of bilingual twins by Teager energy based features,” in Int. Conf. on Signal Processing and Communications, SPCOM’04, IISc, Bangalore, pp. 32-36, Dec. 11-14, 2004.
Hemant A. Patil and T. K. Basu, “Text-independent identification of identical twins for Marathi language in noisy environments,” in Proc. Int. Conf. on Artificial Intelligence in Engineering and Technology, ICAIET’04, Malaysia, pp. 190-196, Aug. 3-5, 2004.
Hemant A. Patil and T. K. Basu, “Designing speech corpus for twin identification experiments in Indian languages”, in Int. Conf. on Natural Language Processing, ICON’04, IIIT Hyderabad, Dec. 19-22, 2004.
Gagan Porwal, Hemant A. Patil and T. K. Basu, “Effect of GSM-FR coding standard on performance of text-independent speaker identification”, in Int. Conf. on Advanced Computing and Communications, ADCOM’04, Ahmedabad, Dec. 13-15, 2004.
Hemant A. Patil and T. K. Basu, “Identification of twins in multilingual environment using Teager energy operator,” in Int. Conf. on Speech and Language Technology, ICSLT’04, Noida, India, Nov. 17-19, 2004.
Hemant A. Patil and T. K. Basu, “Design of speech corpus for ASR in multilingual environment,” in Int. Workshop on Standardization of Speech Database, Oriental COCOSDA, India, Delhi, Nov. 17-19, 2004.
National Conferences
Hemant A. Patil, Arindam Kesh, D. Krishna Bhaskar, Kumara Ganesh, S. Barathi, S. Yamini, Shauryadipta Sarkar and T. K. Basu, “Comparison of different features for identification of females in multilingual environment,” in National Conference on Communications, NCC’05, IIT Kharagpur, India, pp. 300-304, Jan. 28-30, 2005.
Gagan Porwal, Hemant A. Patil and T. K. Basu, “Speech compression strategy for text-independent speaker identification,” in SPCCN01, H. K. Abhyankar et al. (Eds.), Tata McGraw-Hill, pp. 140-145, 2005.
Hemant A. Patil and T. K. Basu, “Text-independent identification of identical twins for Hindi language in noisy environments,” in Proc. of National Conf. on Emerging Techniques in Electrical Engineering, Etee’04, Chennai, India, Jan. 23-24, 2004.
Hemant A. Patil and T. K. Basu, “Comparison and evaluation of LP-based features for text-independent identification for female speakers in Hindi language,” in Proc. of National Conf. on Emerging Techniques in Electrical Engineering, Etee’04, Chennai, India, Jan. 23-24, 2004.
Hemant A. Patil and T. K. Basu, “Comparison and evaluation of LP based features for text-independent identification for female speakers,” in Proc. of National Conf. on Control, Communication and Information Systems, CCIS’04, Goa, India, pp. 41-46, Jan. 23-24, 2004.
Hemant A. Patil and T. K. Basu, “Comparison of subband cepstrum and Mel cepstrum for open set speaker classification”, in IEEE INDICON, IIT Kharagpur, pp. 35-40, Dec. 20-22, 2004.
Hemant A. Patil and T. K. Basu, “Teager energy Mel cepstrum for identification of twins in Marathi”, in IEEE INDICON, IIT Kharagpur, pp. 58-61, Dec. 20-22, 2004.
Hemant A. Patil, Tauseef Ahmad, Snehesh Mitra and T. K. Basu, “Comparison of performance of different speech features for text-independent speaker identification of female speakers in Urdu language,” in National System Conference, NSC’04, VIT, Vellore, India, pp. 254-258, Dec. 16-18, 2004.
Hemant A. Patil and T. K. Basu, “Speech corpus for text/language independent speaker recognition in Indian languages,” Addendum to the lecture compendium, in Proc. of National Symposium on Morphology, Phonology and Language Engineering, SIMPLE’04, IIT Kharagpur, pp. A1-A4, March 19-21, 2004.
Hemant A. Patil, Tauseef Ahmad and T. K. Basu, “LP based features for multilingual speaker identification of identical twins in Indian languages,” in Proc. of Conf. on Distributed Processing and Networking, DPN’04, IIT Kharagpur, pp. 221-227, June 11-13, 2004.
Hemant A. Patil, P. K. Dutta and T. K. Basu, “Comparison of performance of different speech features for text-independent identification of professional mimic in Hindi and Urdu languages,” in National Symposium on Acoustics, NSA’04, Mysore, India, Nov. 25-27, 2004.
Hemant A. Patil, P. K. Dutta and T. K. Basu, “The Teager energy Mel Cepstrum for speaker identification in multilingual environment,” in National Symposium on Acoustics, NSA’04, Mysore, India, Nov. 25-27, 2004.
Snehesh Mitra, Hemant A. Patil and T. K. Basu, “Polynomial classifier techniques for speaker identification in Indian languages,” in Proc. of National System Conference, NSC’03, IIT Kharagpur, India, pp. 304-308, Dec. 17-19, 2003.