Can Cui, Imran Sheikh, Mostafa Sadeghi, Emmanuel Vincent, "Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription", working paper 2024. [arXiv]
Can Cui, Imran Sheikh, Mostafa Sadeghi, Emmanuel Vincent, "End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data", working paper 2024. [arXiv]
Can Cui, Imran Sheikh, Mostafa Sadeghi, Emmanuel Vincent, "Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications", Odyssey 2024. [link]
Can Cui, Imran Sheikh, Mostafa Sadeghi, Emmanuel Vincent, "End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis", ASRU 2023. [link] [HAL]
Imran Sheikh, Emmanuel Vincent, Irina Illina, "Transformer versus LSTM Language Models trained on Uncertain ASR Hypotheses in Limited Data Scenarios", LREC 2022. [link]
Imran Sheikh, Emmanuel Vincent, Irina Illina, "Training RNN Language Models on Uncertain ASR Hypotheses in Limited Data Scenarios". (to appear in) Computer Speech & Language. [HAL][latest]
Imran Sheikh, Emmanuel Vincent, Irina Illina, "On Semi-Supervised LF-MMI Training of Acoustic Models with Limited Data", Interspeech 2020. [link]
Swapnil Bhosale, Imran Sheikh, Sri Harsha Dunpala and Sunil Kumar Kopparapu, "End-to-End Spoken Language Understanding: Bootstrapping in Low Resource Scenarios", Interspeech 2019. [link]
Meet Soni, Imran Sheikh, Sunil Kumar Kopparapu, "Label-Driven T-F Masking For Robust Speech Command Recognition", TSD 2019. [link]
Sri Harsha Dumpala, Imran Sheikh, Rupayan Chakraborty, Sunil Kumar Kopparapu, "Improving ASR Robustness to Perturbed Speech Using Cycle-consistent Generative Adversarial Networks", IEEE ICASSP 2019. [link]
Imran Sheikh, Balamallikarjuna Garlapati, Srinivas Chalamala and Sunil Kumar Kopparapu, "A Fuzzy Approach to Mute Sensitive Information in Noisy Audio Conversations", CICLing 2019. [link] [link]
Sri Harsha Dumpala, Imran Sheikh, Rupayan Chakraborty, Sunil Kumar Kopparapu, "Sentiment Classification on Erroneous ASR Transcripts: A Multi View Learning Approach", 2018 IEEE Spoken Language Technology Workshop (SLT). [link]
Sri Harsha Dumpala, Imran Sheikh, Rupayan Chakraborty, Sunil Kumar Kopparapu, "Cycle-Consistent GAN Front-end to Improve ASR Robustness to Perturbed Speech", NeurIPS 2018 Interpretability and Robustness for Audio, Speech and Language (IRASL) Workshop. [link]
Sri Harsha Dumpala, Imran Sheikh, Rupayan Chakraborty, Sunil Kumar Kopparapu, "Audio-Visual Fusion for Sentiment Classification using Cross-Modal Autoencoder", NeurIPS 2018 Visually-Grounded Interaction and Language (ViGIL) Workshop. [link]
Imran Sheikh, Sri Harsha Dumpala, Rupayan Chakraborty and Sunil Kumar Kopparapu, "Sentiment Analysis using Imperfect Views from Spoken Language and Acoustic Modalities", ACL 2018 Workshop and Grand Challenge on Computational Modeling of Human Multimodal Language (HML). [link]
Imran Sheikh, Dominique Fohr, Irina Illina , "Topic segmentation in ASR transcripts using Bidirectional RNNs for Change Detection", IEEE Automatic Speech Recognition and Understanding Workshop, 2017. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès, "Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition", IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(3), 598-610, March 2017. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès, "Learning Word Importance with the Neural Bag-of-Words Model", In Proc. Workshop on Representation Learning for NLP (RepL4NLP) in the 54rd Annual Meeting of the Association for Computational Linguistics (ACL), 2016. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès, "Improved Neural Bag-of-Words Model to Retrieve Out-of-Vocabulary Words in Speech Recognition", In Proc. Interspeech 2016. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès "Document Level Semantic Context for Retrieving OOV Proper Names", In Proc. IEEE ICASSP, pages 6050-6054, Shanghai, China, 20-25 March 2016. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, "How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News", In Proc. 10th Language Resources and Evaluation Conference (LREC) 2016. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès "Learning to Retrieve Out-of-Vocabulary Words in Speech Recognition", arXiv:1511.05389v4 [cs.CL].
Imran Sheikh, Irina Illina, Dominique Fohr, "Study of Entity-Topic Models for OOV Proper Name Retrieval", In Proc. Interspeech 2015, pages 1344-1348, Dresden, Germany, 6-10 September 2015. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès, "OOV Proper Name Retrieval Using Topic and Lexical Context Models", In Proc. IEEE ICASSP, pages 5291-5295, South Brisbane, QLD, 19-24 April 2015. [link]
Imran Sheikh, Irina Illina, Dominique Fohr, "Recognition of OOV Proper Names in Diachronic Audio News", In Proc. IEEE SIIE 2015. [link]
Sapna Soni, Imran Ahmed and Sunil Kopparapu, "Automatic Segmentation of Broadcast News Audio using Self Similarity Matrix", [link].
Imran Ahmed and Sunil Kopparapu, "What you should know about converting speech to text?", In Proc. 10th TCS Technical Architects' Global Conference 2014.
Imran Ahmed and Sunil Kopparapu, "Improved Method for Keyword Spotting in Audio". In Proc. Acoustics2013, pages 1028-1033, New Delhi, India, 10-15 November, 2013.
Imran Ahmed, Sunil Kopparapu and Meghna P., "A Suite of Mobile Applications to Assist Speaking at Right Speed". In Proc. SLaTE 2013 (Interspeech 2013 workshop on Speech and Language Technology in Education), pages 106-108, Grenoble, France, 30-31 August & September 1st, 2013. [link]
Chitralekha Bhat, Imran Ahmed, Vikram Saxena and Sunil Kopparapu, "Visual Subtitles for Internet Videos". In Proc. SLPAT 2013 (Interspeech 2013 workshop on Speech and Language Processing for Assistive Technologies), pages 17-20, Grenoble, France, 21-22 August, 2013. [link]
Imran Ahmed and Sunil Kopparapu, "Technique for Automatic Sentence Level Alignment of Long Speech and Transcripts". In Proc. Interspeech 2013, pages 1516-1519, Lyon, France, 25-29 August 2013. [link]
Imran Ahmed and Sunil Kopparapu, "Interactive Voice Response Mashup System for Service Enhancement". Recent Patents on Telecommunication, 1(2), pages 100 - 108, December 2012. [link]
Imran Ahmed, Sunil Kopparapu and Meghna Pandharipande, "SpeakRite: Monitoring Speaking Rate on Mobile Phone in Real Time", International Journal of Mobile Human Computer Interaction, 5(1), 62-69, January-March 2013). [link]
Imran Ahmed and Sunil Kopparapu, "Speech Recognition for Resource Deficient Languages using Frugal Speech Corpus", In Proc. IEEE International Conference on Signal Processing Communication and Computing, pages 750–755, Hong Kong, August 12-15, 2012. [link]
Charudatta Jadhav, Imran Ahmed, Meghna Pandharipande Venkatakrishna T, Mithun BS, Vrushali Kulkarni, Chitralekha Bhat, Arun Pande and Sunil Kumar Kopparapu, "Challenges in Enabling Speech as a Service Channel for Indian Scenario", Regional International Telecommunications Society India Conference, New Delhi, February 22-24, 2012.
Imran Ahmed and Sunil Kopparapu, "Natural Language Mobile Speech Interface for Market Prices Access and More", 8th TCS Technical Architects' Global Conference 2012.
Imran Ahmed and Sunil Kopparapu, "Building a Natural Language Hindi Speech Interface to Access Market Information", In Proc. National Conference on Computer Vision Pattern Recognition Image Processing and Graphics, pages 58-61, Hubli, Karnataka, 15-17 December 2011. [link]
Imran Ahmed and Sunil Kopparapu, "Specifications for Mixed Language Speech Corpora: A Proposal", 14th Oriental COCOSDA (International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques) Conference, Hsinchu, Taiwan, October 26-28, 2011.
Sunil Kopparapu and Imran Ahmed, "Enabling Rapid Prototyping of an Existing Speech Solution into another Language", 14th Oriental COCOSDA (International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques) Conference, Hsinchu, Taiwan, October 26-28, 2011.
Imran Ahmed and Sunil Kopparapu, "Enhanced Quality of Experience through IVR Mashup to Access Same Service Multiple Operator Services", In Proc. International Conference on Advances in Computing & Communication, pages 317-326, Kochi, India, July 22-24, 2011.[link]
Sunil Kopparapu, Imran Ahmed and G. Sita, "A Two Pass Algorithm for Speaker Change Detection", In Proc. IEEE Region 10 Conference TENCON, pages 755-758, Fukuoka, Japan, November 21-24, 2010. [link]
Imran Ahmed and Sunil Kopparapu, "Speaker Change Detection in Telephone Speech", International Conference on Signals, Systems and Automation, Vallabh Vidyanagar, India, 28-29 December 2009.
Imran Ahmed and Sunil Kopparapu, "Implementing a low-cost Speak-to-Dial service over VoIP", 5th TCS Technical Architects' Global Conference 2009.
[EP3598444] [US10930286B2] (Granted) Method and System for Muting Classified Information from an Audio; Imran Sheikh, Sunil Kopparapu, Bhavik Vachhani, Bala Mallikarjuna G, Srinivas Rao Chalmala.
[US20140287676] (Granted) System And Method For Visual Message Communication; Imran Sheikh, Sunil Kopparapu.
[US20140056418] (Granted) System and Method Providing Multi-Modality Interaction Over Voice Channel Between Communication Devices; Imran Sheikh, Sunil Kopparapu.
[US20130030810] (Granted) A Frugal Method and System for Creating Speech Corpus; Sunil Kopparapu and Imran Sheikh.
[US20120051532] (Granted) A System and Method to Enable Access Of Multiple Service Providers In A Single Call; Arun Pande, Sunil Kopparapu, Imran Sheikh.
[US20100299133] (Granted) System for Rapid Prototyping of Speech Recognition Application in Different Language; Sunil Kopparapu, Imran Sheikh, Amol P.