Publications
Citation Analytics: Google Scholar; ResearchGate; Scopus; ResearcherID; ORCiD; Semantic Scholar; DBLP
Book
Tom Bäckström, Okko Räsänen, Abraham Zewoudie, Pablo Pérez Zarazaga, Liisa Koivusalo, Sneha Das, Esteban Gómez Mellado, Mariem Bouafif Mansali, Daniel Ramos, Sudarsana Reddy Kadiri and Paavo Alku, “Introduction to Speech Processing”, 2nd Edition, 2022.
Book Chapters
Vinoo Alluri and Sudarsana Reddy Kadiri, “Neural Correlates of Timbre Processing”, Timbre: Acoustics, Perception, and Cognition, Springer Handbook in Auditory Research (SHAR) series, pp. 151 - 172, May 2019.
P. Gangamohan, Sudarsana Reddy Kadiri and B. Yegnanarayana, “Analysis of Emotional Speech - A Review”, Toward Robotic Socially Believable Behaving Systems - Volume I : Modeling Emotions, Springer International Publishing, pp. 205 - 238, March 2016.
Journals
Farhad Javanmardi, Sudarsana Reddy Kadiri and Paavo Alku, “Exploring the Impact of Fine-Tuning the Wav2vec2 Model in Database-Independent Detection of Dysarthric Speech”, IEEE Journal of Biomedical and Health Informatics, 2024.
Farhad Javanmardi, Sudarsana Reddy Kadiri and Paavo Alku, “Pre-trained Models for Detection and Severity Level Classification of Dysarthria from Speech”, Speech Communication, Vol. 158, Article 103047, March 2024.
Paavo Alku, Manila Kodali, Laura Laaksonen, and Sudarsana Reddy Kadiri, “AVID: A Speech Database for Machine Learning Studies on Vocal Intensity”, Speech Communication, Vol. 157, Article 103039, February 2024.
Farhad Javanmardi, Sudarsana Reddy Kadiri and Paavo Alku, “A Comparison of Data Augmentation Methods in Voice Pathology Detection”, Computer Speech and Language, Vol. 83, Article 101552, January 2024.
Sudarsana Reddy Kadiri, Farhad Javanmardi and Paavo Alku, “Investigation of Self-supervised Pre-trained Models for Classification of Voice Quality from Speech and Neck Surface Accelerometer Signals”, Computer Speech and Language, Vol. 83, Article 101550, January 2024.
Manila Kodali, Sudarsana Reddy Kadiri and Paavo Alku, “Automatic classification of the severity level of Parkinson’s disease: A comparison of speaking tasks, features, and classifiers”, Computer Speech and Language, Vol. 83, Article 101548, January 2024.
Hemant Kathania, Virender Kadyan, Sudarsana Reddy Kadiri and Mikko Kurimo, “Spectral Warping based Data Augmentation for Low Resource Children’s Speaker Verification”, Multimedia Tools and Applications, 2023.
Manuel Brandner, Paul Bereuter, Sudarsana Reddy Kadiri, and Alois Sontacchi, “Classification of Phonation Modes in Classical Singing using Modulation Power Spectral Features”, IEEE Access, Vol. 11, pp. 29149-29161, 2023.
Paavo Alku, Sudarsana Reddy Kadiri, and Dhananjaya Gowda, “Refining a Deep Learning-based Formant Tracker using Linear Prediction Methods”, Computer Speech and Language, Vol. 81, Article 101515, June 2023.
Saska Tirronen, Sudarsana Reddy Kadiri and Paavo Alku, “Hierarchical Multi-class Classification of Voice Disorders Using Self-supervised Models and Glottal Features”, IEEE Open Journal of Signal Processing, Vol. 4, pp. 80–88, 2023.
Sudarsana Reddy Kadiri, Paavo Alku and B. Yegnanarayana, “Analysis of Instantaneous Frequency Components of Speech Signals for Epoch Extraction”, Computer Speech and Language, Vol. 78, Article 101443, March 2023.
Hemant Kathania, Virender Kadyan, Sudarsana Reddy Kadiri and Mikko Kurimo, “Data augmentation using spectral warping for low resource children ASR”, Journal of Signal Processing Systems, Vol. 94, pp. 1507–1513, 2022.
Sudarsana Reddy Kadiri and Paavo Alku, “Subjective Evaluation of Basic Emotions from Audio-Visual Data”, Sensors, Vol. 22, Article 4931, 2022. (Invited article)
Saska Tirronen, Sudarsana Reddy Kadiri and Paavo Alku, “The Effect of the MFCC Frame Length in the Detection of Voice Pathologies”, Journal of Voice, 2022.
Rashmi Kethireddy, Sudarsana Reddy Kadiri, and Suryakanth V. Gangashetty, “Deep neural architectures for dialect classification with single frequency filtering and zero-time windowing feature representations”, The Journal of the Acoustical Society of America, Vol. 151, pp. 1077-1092, February 2022.
Rashmi Kethireddy, Sudarsana Reddy Kadiri, and Suryakanth V. Gangashetty, “Exploration of temporal dynamics of frequency domain linear prediction cepstral coefficients for dialect classification”, Applied Acoustics, Vol. 188, Article 108553, January 2022.
Hemant Kathania, Sudarsana Reddy Kadiri, Paavo Alku and Mikko Kurimo, “A Formant Modification Method for Improved ASR of Children’s Speech”, Speech Communication, Vol. 136, pp. 98-106, January 2022.
Sudarsana Reddy Kadiri, Paavo Alku and B. Yegnanarayana, “Extraction and Utilization of Excitation Information of Speech: A Review”, Proceedings of the IEEE, Vol. 109, No. 12, pp. 1920-1941, December 2021.
Dhananjaya Gowda*, Bajibabu Bollepalli*, Sudarsana Reddy Kadiri* and Paavo Alku, “Formant Tracking using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks”, IEEE Access, Vol. 9, pp. 151631-151640, 2021. (*Equal contribution)
Hemant Kathania, Sudarsana Reddy Kadiri, Paavo Alku and Mikko Kurimo, “Using Data Augmentation and Time-Scale Modification to Improve ASR of Children’s Speech in Noisy Environments”, Applied Sciences, Vol. 11, No. 18, Article 8420, 2021.
Sudarsana Reddy Kadiri and Paavo Alku, “Glottal Features for Classification of Phonation Type from Speech and Neck Surface Accelerometer Signals”, Computer Speech and Language, Vol. 70, Article 101232, November 2021.
Rashmi Kethireddy, Sudarsana Reddy Kadiri, Paavo Alku and Suryakanth V. Gangashetty, “Mel-Weighted Single Frequency Filtering Spectrogram for Dialect Identification”, IEEE Access, Vol. 8, pp. 174871-174879, 2020.
Dhananjaya Gowda, Sudarsana Reddy Kadiri, Brad Story and Paavo Alku, “Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 28, pp. 1901-1914, 2020.
Sudarsana Reddy Kadiri and B. Yegnanarayana, “Determination of Glottal Closure Instants from Clean and Telephone Quality Speech Signals using Single Frequency Filtering”, Computer Speech and Language, Vol. 64, Article 101097, November 2020.
Sudarsana Reddy Kadiri and Paavo Alku, “Excitation Features of Speech for Speaker-Specific Emotion Detection”, IEEE Access, Vol. 8, pp. 60382-60391, 2020.
Sudarsana Reddy Kadiri, Paavo Alku and B. Yegnanarayana, “Analysis and Classification of Phonation Types in Speech and Singing Voice”, Speech Communication, Vol. 118, pp. 33-47, April 2020.
Sudarsana Reddy Kadiri, P. Gangamohan, Suryakanth V Gangashetty, Paavo Alku, and B. Yegnanarayana, “Excitation Source Features of Speech for Emotion Recognition using Neutral Speech as Reference”, Circuits, Systems, and Signal Processing, Vol. 39(9), pp. 4459-4481, September 2020.
BHVS Narayana Murthy, B. Yegnanarayana and Sudarsana Reddy Kadiri, “Time Delay Estimation from Mixed Multispeaker Speech Signals using Single Frequency Filtering”, Circuits, Systems, and Signal Processing, Vol. 39(4), pp. 1988-2005, April 2020.
Sudarsana Reddy Kadiri and Paavo Alku, “Analysis and Detection of Pathological Voice using Glottal Source Features”, IEEE Journal of Selected Topics in Signal Processing, Vol. 14, No. 2, pp. 367-379, February 2020.
Sudarsana Reddy Kadiri, RaviShankar Prasad and B. Yegnanarayana, “Detection of Glottal Closure Instant and Glottal Open Region from Speech Signal using Spectral Flatness”, Speech Communication, Vol. 116, pp. 30-43, January 2020.
Sudarsana Reddy Kadiri and B. Yegnanarayana, “Analysis of aperiodicity in artistic Noh singing voice using an impulse sequence representation of excitation source”, The Journal of the Acoustical Society of America, Vol. 146, No. 6, pp. 4446–4457, December 2019.
Sudarsana Reddy Kadiri and Paavo Alku, “Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing”, The Journal of the Acoustical Society of America, Vol. 146, No. 5, pp. EL418-EL423, November 2019.
Nivedita Chennupati, Sudarsana Reddy Kadiri and B. Yegnanarayana, “Spectral and temporal manipulations of SFF envelopes for enhancement of speech intelligibility in noise”, Computer Speech and Language, Vol. 54, pp. 86-105, March 2019.
Nivedita Chennupati, Sudarsana Reddy Kadiri and B. Yegnanarayana, “Significance of Phase in Single Frequency Filtering Outputs of Speech Signals”, Speech Communication, Vol. 97, pp. 66-72, March 2018.
Sudarsana Reddy Kadiri and B. Yegnanarayana, “Epoch Extraction from Emotional Speech using Single Frequency Filtering Approach”, Speech Communication, Vol. 86, pp. 52-63, February 2017.
Hari Krishna Vydana, Sudarsana Reddy Kadiri and Anil Kumar Vuppala, “Vowel Based Non-Uniform Prosody Modification for Emotion Conversion”, Circuits, Systems, and Signal Processing, Vol. 35(5), pp. 1643-1663, 2016.
Conferences
Jihwan Lee, Aditya Kommineni, Tiantian Feng, Kleanthis Avramidis, Xuan Shi, Sudarsana Reddy Kadiri, Shrikanth Narayanan, “Toward Fully-End-to-End Listened Speech Decoding from EEG Signals”, in Proc. INTERSPEECH, 2024.
Liangyu Nie, Sudarsana Reddy Kadiri and Ruchit Agrawal, “MMSD-Net: Towards Multi-modal Stuttering Detection”, in Proc. INTERSPEECH, 2024.
Manila Kodali, Sudarsana Reddy Kadiri, and Paavo Alku, “Fine-tuning of Pre-trained Models for Classification of Vocal Intensity Category from Speech Signals”, in Proc. INTERSPEECH, 2024.
Abhijit Sinha, Mittul Singh, Sudarsana Reddy Kadiri, Mikko Kurimo, and Hemant Kathania, “Effect of speech modification on Wav2Vec2 models for Children Speech Recognition”, in Proc. International Conference on Signal Processing and Communications (SPCOM), 2024.
Sudarsana Reddy Kadiri, Manila Kodali, and Paavo Alku, “Severity Classification of Parkinson’s Disease from Speech using Single Frequency Filtering-based Features”, in Proc. INTERSPEECH, pp. 2393-2397, Dublin, Ireland, August 20-24, 2023.
Manila Kodali, Sudarsana Reddy Kadiri, and Paavo Alku, “Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings”, in Proc. INTERSPEECH, pp. 4134-4138, Dublin, Ireland, August 20-24, 2023.
Manila Kodali, Sudarsana Reddy Kadiri, Laura Laaksonen, and Paavo Alku, “Automatic Classification of Vocal Intensity Category from Speech”, in Proc. ICASSP, Rhodes Island, Greece, June 4-10, 2023.
Farhad Javanmardi, Saska Tirronen, Manila Kodali, Sudarsana Reddy Kadiri, and Paavo Alku, “Wav2vec-based Detection and Severity Level Classification of Dysarthria from Speech”, in Proc. ICASSP, Rhodes Island, Greece, June 4-10, 2023.
Saska Tirronen, Farhad Javanmardi, Manila Kodali, Sudarsana Reddy Kadiri, and Paavo Alku, “Utilizing Wav2vec in Database-independent Voice Disorder Detection”, in Proc. ICASSP, Rhodes Island, Greece, June 4-10, 2023.
Tamás Grósz, Dejan Porjazovski, Yaroslav Getman, Sudarsana Reddy Kadiri, and Mikko Kurimo, “Wav2vec2-based Paralinguistic Systems to Recognise Vocalised Emotions and Stuttering”, in Proc. ACM Multimedia, pp. 7026-7029, Lisbon, Portugal, October 10-14, 2022.
Sudarsana Reddy Kadiri, Farhad Javanmardi and Paavo Alku, “Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals”, in Proc. INTERSPEECH, pp. 5253-5257, Incheon, Korea, September 18-22, 2022.
Farhad Javanmardi, Sudarsana Reddy Kadiri, Manila Kodali and Paavo Alku, “Comparing 1-Dimensional and 2-Dimensional Spectral Feature Representations in Voice Pathology Detection using Machine Learning and Deep Learning Classifiers”, in Proc. INTERSPEECH, pp. 2173-2177, Incheon, Korea, September 18-22, 2022.
Hemant Kathania, Sudarsana Reddy Kadiri, Paavo Alku and Mikko Kurimo, “Spectral modification for recognition of children’s speech under mismatched conditions”, in Nordic Conference on Computational Linguistics (NoDaLiDa), pp. 94-100, Iceland, May-June 2021.
Sudarsana Reddy Kadiri, Rashmi Kethireddy, and Paavo Alku, “Parkinson's Disease Detection from Speech using Single Frequency Filtering Cepstral Coefficients”, in Proc. INTERSPEECH, pp. 4971-4975, Shanghai, China, October 25-29, 2020.
Rashmi Kethireddy, Sudarsana Reddy Kadiri, Santosh Kesiraju and Suryakanth V Gangashetty, “Zero-Time Windowing Cepstral Coefficients for Dialect Classification”, in Proc. ODYSSEY 2020: The Speaker and Language Recognition Workshop, pp. 32-38, Tokyo, Japan, November 01-05, 2020.
Rashmi Kethireddy, Sudarsana Reddy Kadiri, and Suryakanth V Gangashetty, “Learning Filter-banks from Raw waveform for Accent Classification”, in Proc. IJCNN, pp. 1-6, Glasgow, UK, July 19-24, 2020.
Sudarsana Reddy Kadiri, Paavo Alku and B. Yegnanarayana, “Comparison of Glottal Closure Instant Detection Algorithms for Emotional Speech”, in Proc. ICASSP, pp. 7379-7383, Barcelona, Spain, May 4-8, 2020.
Hemant Kathania, Sudarsana Reddy Kadiri, Paavo Alku and Mikko Kurimo, “Study of formant modification for Children ASR”, in Proc. ICASSP, pp. 7429-7433, Barcelona, Spain, May 4-8, 2020.
Sushmita Thakallapalli, Sudarsana Reddy Kadiri, and Suryakanth V Gangashetty, “Spectral Features derived from Single Frequency Filter for Multispeaker Localization”, in Proc. National Conference on Communications (NCC), pp. 1-6, IIT Kharagpur, India, Feb 21-23, 2020.
Sudarsana Reddy Kadiri and Paavo Alku, “Mel-cepstral coefficients of voice source waveforms for classification of phonation types in speech”, in Proc. INTERSPEECH, pp. 2508-2512, Graz, Austria, September 15-19, 2019.
Sudarsana Reddy Kadiri, “A Quantitative Comparison of Epoch Extraction Algorithms for Telephone Speech”, in Proc. IEEE Int. Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6500-6504, Brighton, UK, May 12-17, 2019.
Sudarsana Reddy Kadiri and B. Yegnanarayana, “Estimation of Fundamental Frequency From Singing Voice using Harmonics of Impulse-like Excitation Source”, in INTERSPEECH, pp. 2319-2323, September 2-6, 2018.
Sudarsana Reddy Kadiri, and B. Yegnanarayana, “Breathy to Tense Voice Discrimination using Zero-Time Windowing Cepstral Coefficients”, in INTERSPEECH, pp. 232-236, September 2-6, 2018.
Sudarsana Reddy Kadiri, and B. Yegnanarayana, “Analysis and Detection of Phonation Mode in Singing Voice using Excitation Source Features and single frequency filtering cepstral coefficients (SFFCC)”, in INTERSPEECH, pp. 441-445, September 2-6, 2018.
G. Aneeja, Sudarsana Reddy Kadiri, and B. Yegnanarayana, “Detection of glottal closure instants in degraded speech using single frequency filtering analysis”, in INTERSPEECH, pp. 2300-2304, September 2-6, 2018.
RaviShankar Prasad, Sudarsana Reddy Kadiri, Suryakanth V. Gangashetty and B. Yegnanarayana, “Discriminating nasals and approximants in English language using zero time windowing”, in INTERSPEECH, pp. 177-181, September 2-6, 2018.
K N R K Raju Alluri, Sivanand Achanta, Sudarsana Reddy Kadiri, Suryakanth V Gangashetty and Anil Kumar Vuppala, “Detection of Replay Attacks using Single Frequency Filter Cepstral Coefficients”, in INTERSPEECH, Stockholm, Sweden, pp. 2596-2600, August, 2017.
K N R K Raju Alluri, Sivanand Achanta, Sudarsana Reddy Kadiri, Suryakanth V Gangashetty and Anil Kumar Vuppala, “SFF Anti-Spoofer: IIIT-H Submission for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2017”, in INTERSPEECH, Stockholm, Sweden, pp. 107-111, August, 2017.
Bhanu Teja Nellore, RaviShankar Prasad, Sudarsana Reddy Kadiri, Suryakanth V. Gangashetty, B. Yegnanarayana, “Locating burst onsets using SFF envelope and phase information”, in INTERSPEECH, Stockholm, Sweden, pp. 3023-3027, August, 2017.
Sudarsana Reddy Kadiri and B. Yegnanarayana, “Speech Polarity Detection using Strength of Impulse-like Excitation Extracted from Speech Epochs”, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New Orleans, USA, pp. 5610-5614, March, 2017.
Vishala Pannala, G. Aneeja, Sudarsana Reddy Kadiri and B. Yegnanarayana, “Robust Estimation of Fundamental Frequency using Single Frequency Filtering Approach”, in INTERSPEECH, San Francisco, USA, pp. 2155-2159, September, 2016.
Sudarsana Reddy Kadiri, P. Gangamohan, Suryakanth V Gangashetty and B. Yegnanarayana, “Analysis of Excitation Source Features of Speech for Emotion Recognition”, in INTERSPEECH, Dresden, Germany, pp. 1324-1328, September, 2015.
Sudarsana Reddy Kadiri and B. Yegnanarayana, “Analysis of singing voice for epoch extraction using zero frequency filtering method”, in ICASSP, Brisbane, Australia, pp. 4260-4264, April, 2015.
Sudarsana Reddy Kadiri, P. Gangamohan, VK Mittal and B. Yegnanarayana, “Naturalistic Audio-Visual Emotion Database”, ICON, Goa, India, pp. 119-126, December, 2014.
Sudarsana Reddy Kadiri, P. Gangamohan and B. Yegnanarayana, “Discriminating Neutral and Emotional Speech using Neural Networks”, ICON, Goa, India, pp. 127-134, December, 2014.
Sudarsana Reddy Kadiri, P. Gangamohan, Suryakanth V Gangashetty and B. Yegnanarayana, “Analysis of Cognitive Loaded Speech using Excitation Source Features”, ICON, Goa, India, pp. 425-431, December, 2014.
P. Gangamohan, Sudarsana Reddy Kadiri, Suryakanth V Gangashetty and B. Yegnanarayana, “Discrimination of Anger and Happy Emotions using Features related to Excitation Source”, INTERSPEECH, Singapore, pp. 1253-1257, September, 2014.
Anil Kumar Vuppala and Sudarsana Reddy Kadiri, “Neutral to Anger Speech Conversion Using Non-Uniform Duration Modification”, ICIIS, Gwalior, India, pp. 1-4, December, 2014.
P. Gangamohan, Sudarsana Reddy Kadiri and B. Yegnanarayana, “Analysis of emotional speech at subsegmental level”, INTERSPEECH, Lyon, France, pp. 1916-1920, August, 2013.
Patents
“Method and Apparatus for Recognizing Human Emotion Expressions based on Speech Signal”, B. Yegnanarayana, V.K. Mittal, Pratibha Moogi, P. Gangamohan and Sudarsana Reddy Kadiri, Indian Patent: IN2013CH04854A, 2015.
PhD Thesis
Sudarsana Reddy Kadiri, “Analysis of Excitation Information in Expressive Speech”, Doctoral dissertation, International Institute of Information Technology, Hyderabad, 2018.
ArXiv preprints
Tamás Grósz, Mittul Singh, Sudarsana Reddy Kadiri, Hemant Kathania and Mikko Kurimo, “End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks”, October, 2022.
Tamás Grósz, Mittul Singh, Sudarsana Reddy Kadiri, Hemant Kathania and Mikko Kurimo, “Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge”, August, 2020.
Abstracts
Sudarsana Reddy Kadiri and Paavo Alku, “Effectiveness of Glottal Source Features in the Discrimination of Speech of Healthy Talkers from Speech of Parkinsonian Talkers With and Without Perceptual Dysarthria”, in 51st Annual Symposium: Care of the Professional Voice, June 1-5, 2022.
Sudarsana Reddy Kadiri, Johan Sundberg, and Paavo Alku, “Systematic Analysis of Phonation Modes in Singing Voice using Glottal Source Features”, in 50th Annual Symposium: Care of the Professional Voice, June 2-6, 2021.
Sudarsana Reddy Kadiri and Paavo Alku, “Analysis and Detection of Pathological Voice using Glottal Source Features”, ICASSP, 2021 (IEEE Signal Processing Society Journal Paper for Presentation).
Sushmita T, Sudarsana Reddy Kadiri, and Suryakanth V Gangashetty, “Why can’t we perceive whisper speech from a distance compared to normal speech?”, in 1st Conference of the Timing Research Forum, Strasbourg, France, October 23-25, 2017.
Sudarsana Reddy Kadiri and B. Yegnanarayana, “Extraction of Excitation Information from Speech and its Applications for Expressive Speech Processing”, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (M.Sc./Ph.D. papers), New Orleans, USA, March, 2017.