AUDIO, SPEECH and LANGUAGE Lab
Research AREAS
ASL Lab at College of Engineering Trivandrum focuses on the following areas of research.
Speech Signal Processing
Automatic Speech Recognition
Bio Acoustics and Allometry
Natural Language Processing
Music Information Retrieval
Expressive Speech Synthesis
Source Separation
People
Faculty
COLLABORATORS
Research Scholars
Amlu Anna Joshy (2019-2023)
Lekshmi C. R. (2019-2023)
Kavya Manohar (2019-2023)
Noumida A. (2021-)
Mala J B (2022-)
Bhasi K C (2022-)
M.Tech STUDENTS
Sujeesha A S (2020-2022)
Aiswarya M A (2020-2022)
Hridya Raj T V (2020-2022)
Godwin George (2020-2022)
B.Tech STUDENTS
Ananya Ayasi (2018-2022)
Jacob Joshy (2018-2022)
Sidharth (2019-2023)
Thomas Ajai (2019-2023)
Gokul G Menon (2020-2024)
Ashish Abraham (2020-2024)
PUBLICATIONS
JOURNALS
A. A. Joshy and R. Rajan, ”Severity Assessment of Dysarthria using Squeeze-and-Excitation Networks”, Biomedical Signal Processing and Control 82 (2023):104606 https://doi.org/10.1016/j.bspc.2023.104606 .
A. A. Joshy and R. Rajan, ”Dysarthria severity classification using multi-head attention and multi-task learning,” Speech Communication 147 (2023): 1-11. https://doi.org/10.1016/j.specom.2022.12.004 .
K. Manohar, A. R. Jayan and R. Rajan, "Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers," in IEEE Access, vol. 10, pp. 97555-97575, 2022, https://doi.org/10.1109/ACCESS.2022.3204403 .
Joseph, S., Rajan, R. Cycle GAN-Based Audio Source Separation Using Time–Frequency Masking. Circuits Syst Signal Process (2022). https://doi.org/10.1007/s00034-022-02178-1
Rajan, R., Chandrika Reghunath, L. & Varghese, L.T. POMET: a corpus for poetic meter classification. Lang Resources & Evaluation (2022). https://doi.org/10.1007/s10579-022-09604-5
Resna, S., Rajan, R. Multi-Voice Singing Synthesis From Lyrics. Circuits Syst Signal Process (2022). https://doi.org/10.1007/s00034-022-02122-3
Noumida, A., and Rajeev Rajan. "Multi-label bird species classification from audio recordings using attention framework." Applied Acoustics 197 (2022): 108901. https://doi.org/10.1016/j.apacoust.2022.108901
Reghunath, L.C., Rajan, R. Transformer-based ensemble method for multiple predominant instruments recognition in polyphonic music. Eurasip Journal of Audio Speech and Music Processing, 11 (2022). https://doi.org/10.1186/s13636-022-00245-8
Lekshmi C. R., Rajeev, R. ''Multiple Predominant Instruments Recognition in Polyphonic Music using Spectro/Modgd-gram Fusion'', Circuits Syst Signal Process (2023).(CSSP).https://doi.org/10.1007/s00034-022-02278-y
A. A. Joshy and R. Rajan, "Automated Dysarthria Severity Classification: A Study on Acoustic Features and Deep Learning Techniques," in IEEE Transactions on Neural Systems and Rehabilitation Engineering, doi: 10.1109/TNSRE.2022.3169814.
Rajan, R., Johnson, J. & Abdul Kareem, N. Bird Call Classification Using DNN-Based Acoustic Modelling. Circuits Syst Signal Process 41, 2669–2680 (2022). https://doi.org/10.1007/s00034-021-01896-2
Rajan, Rajeev, and BS Shajee Mohan. "Distance Metric Learnt Kernel-Based Music Classification Using Timbral Descriptors." International Journal of Pattern Recognition and Artificial Intelligence 35.13 (2021): 2151014.
CONFERENCES
A. A. Joshy, P. N. Parameswaran, S. R. Nair, and R. Rajan. "Statistical Analysis of Speech Disorder Specific Features to Characterise Dysarthria Severity Level." In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1-5. IEEE, 2023
K. Manohar, G. G. Menon, A. Abraham, R. Rajan and A. R. Jayan, "Automatic Recognition of Continuous Malayalam Speech using Pretrained Multilingual Transformers," 2023 International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS), Coimbatore, India, 2023, pp. 671-675, doi: 10.1109/ICISCoIS56541.2023.10100598.
M. A. Aiswarya, M. S. Sinith and R. Rajan, "Automatic Tonic Pitch Estimation in South Indian Classical Music using Frequency- ratio Method," 2023 International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS), Coimbatore, India, 2023, pp. 527-532, doi: 10.1109/ICISCoIS56541.2023.10100503.
Mala J B, Anisha Angel S J, Rajeev Rajan, and Alex Raj S M, "Efficacy of ELECTRA based Language Model in Sentiment Analysis", in 2023 International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS), Coimbatore, India
Rajan, R., Ayasi, A. (2022) Oktoechos Classification in Liturgical Music Using SBU-LSTM/GRU. Proc. Interspeech 2022, 2403-2407, doi: 10.21437/Interspeech.2022-136
R. Rajan and N. A, "Multi-label Bird Species Classification Using Transfer Learning," 2021 International Conference on Communication, Control and Information Sciences (ICCISc), 2021, pp. 1-5, doi: 10.1109/ICCISc52257.2021.9484858.
Rajeev Rajan, Amlu Anna Joshy and Varsha Shiburaj, "Oktoechos Classification in Liturgical Music Using Musical Texture Features", CMMR2021, https://cmmr2021.github.io/proceedings/pdffiles/cmmr2021_07.pdf
A. Krishnan, A. Vincent, G. Jos and R. Rajan, "Multimodal Fusion for Segment Classification in Folk Music," 2021 IEEE 18th India Council International Conference (INDICON), 2021, pp. 1-7, doi: 10.1109/INDICON52576.2021.9691751.
N. A. and R. Rajan, "Deep Learning-based Automatic Bird Species Identification from Isolated Recordings," 2021 8th International Conference on Smart Computing and Communications (ICSCC), 2021, pp. 252-256, doi: 10.1109/ICSCC51209.2021.9528234.
A. A. Joshy and R. Rajan, "Automated Dysarthria Severity Classification Using Deep Learning Frameworks," 2020 28th European Signal Processing Conference (EUSIPCO), 2021, pp. 116-120, doi: 10.23919/Eusipco47968.2020.9287741.
Lekshmi Reghunath, & Rajeev Rajan. (2021, June 29). Attention-Based Predominant Instruments Recognition in Polyphonic Music. 18th Sound and Music Computing Conference (SMC 2021), Virtual. https://doi.org/10.5281/zenodo.5043841
A. M. Moncy, A. M., H. Jasmin and R. Rajan, "Automatic Speech Recognition in Malayalam Using DNN-based Acoustic Modelling," 2020 IEEE Recent Advances in Intelligent Computational Systems (RAICS), 2020, pp. 170-174, doi: 10.1109/RAICS51191.2020.9332493.
R. Rajan, A. J. Joseph, E. K. Robin and N. T. K. Fathima, "Part-Of-Speech Tagger in Malayalam Using Bi-directional LSTM," 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 2020, pp. 22-27, doi: 10.1109/O-COCOSDA50338.2020.9295018.
Manohar, K., Jayan, A.R., Rajan, R. (2020). Quantitative Analysis of the Morphological Complexity of Malayalam Language. In: Sojka, P., Kopeček, I., Pala, K., Horák, A. (eds) Text, Speech, and Dialogue. TSD 2020. Lecture Notes in Computer Science(), vol 12284. Springer, Cham. https://doi.org/10.1007/978-3-030-58323-1_7