Recent News:
2025:
I delivered an invited lecture on “Knowledge Extraction from Text” in the ANRF-SSR Workshop on Natural Language Processing, held at the Indian Association for the Cultivation of Science (IACS), Kolkata, during 13–14 August 2025.
We have got a short paper accepted in the 34th ACM International Conference on Information and Knowledge Management (CIKM 2025), which will be held in Seoul, Korea, during November 10–14, 2025. The paper is:
HF-RAG: Hierarchical Fusion-based RAG with Multiple Sources and Rankers. Payel Santra, Madhusudan Ghosh, Debasis Ganguly, Partha Basuchowdhuri and Sudip Kumar Naskar.
I delivered an invited lecture on “Introduction to Natural Language Processing” in the Two-Weeks Online FDP on “Natural Language Processing with Deep Learning (NLPDL-2025)”, organized by the Dept of CSE, NIT Jamshedpur, in collaboration with E&ICT Academy, NIT Patna, , 24th June - 1st July 2025.
My PhD student, Sohom Ghosh, recently submitted his PhD thesis titled “Using Computational Linguistics to Demystify Financial Texts” to Jadavpur university.
We've just had a journal paper accepted in WIRES (Wiley Interdisciplinary Reviews) - Data Mining and Knowledge Discovery. The paper is:
The Curious Case of Contexts in Retrieval-Augmented Generation with a Combination of Labelled and Unlabelled Data. Payel Santra, Madhusudan Ghosh, Debasis Ganguly, Partha Basuchowdhuri and Sudip Kumar Naskar.
I gave a keynote talk on "Challenges, Nuances and Opportunities in NLP" in the 3-Day Workshop on “Translation and Annotation of Scientific, Technical and Knowledge Texts”, organized by Linguistic Research Unit, Indian Statistical Institute, Kolkata, 5-7 March 2025.
2024:
I gave a keynote talk in the Third International Conference on Speech and Language Technologies for Low-resource Languages (SPELLL), held at Vellore Institute of Technology (VIT), Chennai, INDIA during December 4-6, 2024.
We have got a paper accepted in the 2024 ACM/IEEE Joint Conference on Digital Libraries (JCDL 2024), which will be held in Hong Kong, during 16-20 December, 2024. The paper is:
Chronological Evaluation of Emerging Methodology Extraction from AI Literature. Madhusudan Ghosh, Debasis Ganguly, Partha Basuchowdhuri and Sudip Kumar Naskar.
Sohom's PhD proposal on "Demystifying Financial Texts using Natural Language Processing" has been accepted for the PhD Symposium at the 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024, CORE rank A), to be held during October 21 – 25 2024, in Idaho, USA.
Atanu's PhD proposal on "Multilingual Speech Translation for code-mixed Indian Languages" has been accepted for presentation at the 10th ISCA-SAC Doctoral Consortium at INTERSPEECH 2024, to be held during 1-5 September 2024, in Kos Island, Greece. Interspeech is the world's premier conference in Speech Technology with CORE rank A.
We've just had a journal paper accepted in Methods, Elsevier (Impact Factor: 4.2). The paper is:
AlpaPICO: Extraction of PICO frames from clinical trial documents using LLMs. Madhusudan Ghosh, Shrimon Mukherjee, Asmit Ganguly, Partha Basuchowdhuri, Sudip Kumar Naskar and Debasis Ganguly.
We participated in the Multi-Lingual ESG Impact Duration Inference (ML-ESG-3) Shared task held in the FinNLP-KDF-ECONLP Workshop in conjunction with LREC-COLING-2024. Our team ranked 1st in the Impact Length sub-task in French and 3rd in both the sub-tasks (Impact Length and Impact Level) in English.
I delivered an invited lecture on “Natural Language Processing” in the Entrepreneurship and Skill Development Program on “Artificial Intelligence and Machine Learning”, organized by the Indian Institute of Information Technology, Manipur, India, funded by the Ministry of MSME, March 4–18, 2024.
We have got a paper accepted in the Short Papers Track of The 2024 ACM Web Conference (theWebConf 2024, formerly known as International World Wide Web Conference, abbreviated as WWW), which will be held in Singapore, during 13-17 May, 2024. Congratulations to Sohom on his first A* publication. The paper is:
Generator-Guided Crowd Reaction Assessment. Sohom Ghosh, Chung-Chi Chen and Sudip Kumar Naskar.
I delivered an invited talk on “Knowledge Extraction from Text” in the Xavier International Conference on Artificial Intelligence (XICAI 2024), International Center, XIM University, Bhubaneswar, Odisha, India, February 29 – March 2, 2024.
I served as a panelist along with co-panelist Prof. Anupam Basu in a Panel-discussion on “Machine Translation: Process, Potential, Prospect” held on 26 February, 2024, at School of International Languages, Sister Nivedita University.
We have got a paper accepted in the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) which will be held in Torino, Italy, during 20-25 May, 2024. The paper is:
IndicFinNLP: Financial Natural Language Processing for Indian Languages. Sohom Ghosh, Arnab Maji, Aswartha Narayana and Sudip Kumar Naskar.
We hosted Panini Linguistics Olympiad 2024, Round 1, in the CSE Department, Jadavpur University.
I delivered an invited lecture on “Natural Language Processing” in the Faculty Development Program on “Cutting Edge: Horizons of Data & Analytics”, organized by the Department of CSE, Meghnad Saha Institute of Technology, Kolkata, India, in association with Computer Chapter, IEEE Kolkata Section, and IEEE Young Professionals, January 22–31, 2024.
2023:
We have got 2 papers accepted in the 20th International Conference on Natural Language Processing (ICON 2023) which will be held in Goa University, Goa, India, Dec 14-17, 2023. The papers are:
Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection. Atanu Mandal, Gargi Roy, Amit Barman, Indranil Dutta and Sudip Naskar.
Convolutional Neural Networks can achieve binary bail judgement classification. Amit Barman, Devangan Roy, Debapriya Paul, Indranil Dutta, Shouvik Kumar Guha, Samir Karmakar and Sudip Naskar.
We have got 2 papers accepted in the 15th meeting of Forum for Information Retrieval Evaluation 2023 (FIRE 2023) which will be held at Goa Business School, Goa University, Panjim, India, during December 15-18, 2023. The papers are:
Financial Argument Analysis in Bengali. Rima Roy, Sohom Ghosh and Sudip Kumar Naskar.
The Mask One At a Time Framework for Detecting the Relationship between Financial Entities. Sohom Ghosh, Sachin Umrao, Chung-Chi Chen and Sudip Kumar Naskar.
We will present a tutorial in the 15th meeting of Forum for Information Retrieval Evaluation 2023 (FIRE 2023) which will be held at Goa Business School, Goa University, Panjim, India, during December 15-18, 2023.
Unleashing the Power of Large Language Models: A Hands-On Tutorial. Payel Santra, Madhusudan Ghosh, Shrimon Mukherjee, Debasis Ganguly, Partha Basuchowdhuri, and Sudip Kumar Naskar.
We've just had a journal paper accepted in Natural Language Engineering, Cambridge University Press. The paper is:
Is Attention always needed? A Case Study on Language Identification from Speech. Atanu Mandal, Santanu Pal, Indranil Dutta, Mahidas Bhattacharya, and Sudip Kumar Naskar.
We have got a paper accepted in the CIKM 2023 Workshop on Large Language Models’ Interpretation and Trustworthiness (LLMIT), which will be held in University of Birmingham and Eastside Rooms, UK, during October 22, 2023. The paper is:
Is LLM Generated Synthetic Data Augmentation Beneficial for Fact Verification?. Payel Santra, Madhusudan Ghosh, Debasis Ganguly, Partha Basuchowdhuri, Sudip Kumar Naskar.
I'm in the Program Committee of The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation in 2024 (LREC-COLING 2024) which will take place in Turin, Italy, during 20-25 May 2024.
We've got a journal paper accepted in Science Talks, Elsevier. The paper is:
Recent trends in financial natural language processing research. Sohom Ghosh, Sudip Kumar Naskar.
Our MT system (IACS-LRILT) has been adjudged as the best system in Manipuri-to-English (PRIMARY) (Subtask-4) in the Low-Resource Indic Language Translation shared task in WMT'2023.
We have got a short paper accepted in the 32nd ACM International Conference on Information and Knowledge Management (CIKM 2023, CORE rank A) which will be held in University of Birmingham and Eastside Rooms, UK, during October 21-25, 2023. The paper is:
Extracting Methodology Components from AI Research Papers: A Data-driven Factored Sequence Labeling Approach. Madhusudan Ghosh, Debasis Ganguly, Partha Basuchowdhuri, Sudip Kumar Naskar.
We've just had a journal paper accepted in SN Computer Science, Springer Nature. The paper is:
Learning Semantic Text Similarity to rank Hypernyms of Financial Terms. Sohom Ghosh, Ankush Chopra, Sudip Kumar Naskar.
I'm Serving as a Workshop/Tutorial Chair for the 20th International Conference on Natural Language Processing (ICON 2023) which will be held in Goa University, Goa, India, Dec 14-17, 2023.
I'm serving as a Senior Area Chair for the Information Extraction track in The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023, CORE Rank: A*) which will take place in Singapore during Dec 6-10, 2023.
We are recruiting 2 JRFs [link].
We have got a project, “Automated Bail Judgment Classification and Prediction using Semi-Supervised Techniques”, funded by The West Bengal National University of Jurisdical Sciences.
Sohom's paper "Using Natural Language Processing to Enhance Understandability of Financial Texts" received an Honourable Mention in the YRS track of CODS-COMAD 2023.
I'm in the Program Committee of The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023) which will take place in Toronto, Canada, during July 9-14, 2023.
Older :
2022:
We have got 2 papers accepted in the 19th International Conference on Natural Language Processing (ICON 2022) which will be held in IIIT Delhi, India, during December 15-18, 2022. The papers are:
Fine Grained Sentiment Analysis in Bengali using Deep Learning based Models. Piyal Roy, Rajat Pandit, Sudip Kumar Naskar.
A Novel Approach towards Cross Lingual Sentiment Analysis using Transliteration and Character Embedding. Rajarshi Roychoudhury, Subhrajit Dey, Md Shad Akhtar, Amitava Das and Sudip Kumar Naskar.
Sohom has got a paper accepted in the YRS track and another in the Demo track in CODS-COMAD 2023: 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), which will be held in IIT Bombay, Mumbai, India, during January 4-7, 2023, in cooperation with ACM, ACM SIGKDD & ACM SIGMOD. The papers are:
Using Natural Language Processing to Enhance Understandability of Financial Texts. Sohom Ghosh and Sudip Kumar Naskar. (YRS Track)
FLUEnT: Financial Language Understandability Enhancement Toolkit. Sohom Ghosh and Sudip Kumar Naskar. (Demo Track)
Anubhav, Swagata and Sohom have got a paper accepted in the 14th meeting of the Forum for Information Retrieval Evaluation (FIRE 2022), which will be held at ISI, Kolkata, India, during 9-13 December, 2022. The paper is based on the work carried out by Anubhav and Swagata (with the help of Sohom) during their internship at Jadavpur University. The paper is:
Evaluating Impact of Social Media Posts by Executives on Stock Prices. Anubhav Sarkar, Swagata Chakraborty, Sohom Ghosh and Sudip Kumar Naskar.
I am serving as the Area Chair for the Machine Learning in NLP track at ICON-2022 to be held at IIIT Delhi, India, during December 15th - 18th, 2022.
I am recruiting a Senior Research Assistant (Technical). [link]
I'm in the Program Committee of The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022) which will take place in Abu Dhabi, during December 7–11, 2022.
We are part of the consortium project “VIDYAAPATI: Bidirectional Machine Translation Involving Bengali, Konkani, Maithili, Marathi, and Hindi” (under the National Language Translation Mission), funded by Ministry of Electronics & Information Technology, Government of India. The project is a collaboration between IIT Bombay (Consortium Leader), IIT Patna, Jadavpur University, ISI Kolkata, CDAC Pune, CDAC Kolkata, Jawaharlal Nehru University and Goa University.
I'm in the Program Committee of The 29th International Conference on Computational Linguistics (COLING 2022) which will take place in Gyeongju, Republic of Korea, during October 12-17, 2022.
We have got a paper accepted in the 4th Financial Narrative Processing Workshop (FNP 2022) which will be held in Marseille, France, on 24 June, 2022, in conjunction with LREC 2022. The paper is:
FinRAD: Financial Readability Assessment Dataset - 13,000+ Definitions of Financial Terms for Measuring Readability. Sohom Ghosh, Shovon Sengupta, Sudip Kumar Naskar and Sunny Kumar Singh.
I delivered an invited lecture on “NLP: A Broad Perspective” in the in the One Week Online Workshop on “Natural Language Processing (NLP) : Tools and Techniques”, organized by the Department of Computer Science and Engineering, Triguna Sen School of Technology, Assam University, Silchar, India, in association with IEEE Student Branch, Assam University, Silchar, February 14–18, 2022.
I delivered an invited lecture on “Computational Semantics” in the Online Faculty Development Programme on “Natural Language Processing”, organized by the CSE Department, NIT Patna, Bihar, India, February 7–18, 2022.
Our paper got the best paper award in the International Workshop on Networking Women in Distributed Computing and Networks (NWDCN 2022), an ICDCN 2022 event, IIIT Delhi, India, January 4–7, 2022. The paper is:
Understanding the Robustness in Phoneme Production Mechanism in English and Bengali. Suparnakanti Das, Trishita Dhara, Sirshapan Mitra, Sudip Kumar Naskar.
2021:
I delivered an invited lecture on “Knowledge Graphs” in the Faculty Development Program on “Application of Data Science and Analytics” (Online), organized by the School of Computer Science and Engineering, XIM University (aforetime Xavier University), Bhubaneswar, December 13-17, 2021.
We've just had a paper accepted in The International Conference on Asian Language Processing (IALP 2021), which will be held in Yantai, China, 11-13 December 2021, organized by Ludong University, China and Chinese and Oriental Languages Information Processing Society (COLIPS), Singapore. The paper is:
Profiling Profession of Celebrities from Twitter Data. Kumar Gourav Das, Braja Gopal Patra and Sudip Kumar Naskar.
Dr. Amitava Das and myself are serving as the Workshop/Tutorial Chairs in the 18th International Conference on Natural Language Processing (ICON 2021) which will be held in NIT Silchar, December 16-19, 2021.
I'm in the Program Committee of The 13th meeting of the Forum for Information Retrieval Evaluation (FIRE 2021), 13–17 December, 2021.
I delivered an invited lecture on “Natural Language Processing” in the One Week FDP (online mode) on “Advancement of Intelligence System and Computation”, organized by the Department of Computer Science and Engineering, JIS College of Engineering, Kalyani, West Bengal, in association with CSI Kolkata Chapter, April 20–24, 2021.
I'm in the Program Committee of The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), 7–11 November, 2021.
Amit Majumder, co-supervised by myself and Dr. Asif Ekbal, successfully defended his PhD thesis on "Evolutionary Approach for Biomedical Event Extraction and Word Sense Disambiguation". Congratulations Dr. Majumder!
I'm in the Program Committee of The 13th biennial conference on Recent Advances in Natural Language Processing (RANLP), September 1-3, 2021.
We've just had a paper accepted in IEEE Transactions on Learning Technologies. The paper is:
Classifying and Solving Arithmetic Math Word Problems – an Intelligent Math Solver. Sourav Mandal and Sudip Kumar Naskar.
I'm in the Program Committee of The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Bangkok, Thailand, August 1-6, 2021.
2020:
We've just had 3 papers accepted in The 17th International Conference on Natural Language Processing (ICON 2020), which will be held in IIT Patna, India, December 18-21, 2020. The papers are:
Deep Neural Model for Manipuri Multiword Named Entity Recognition with Unsupervised Cluster Feature. Jimmy Laishram, Kishorjit Nongmeikapam and Sudip Kumar Naskar.
A New Approach to Claim Check-Worthiness Prediction and Claim Verification. Sukriti Si, Anisha Datta and Sudip Kumar Naskar.
A Rule Based Lightweight Bengali Stemmer. Souvick Das, Rajat Pandit and Sudip Kumar Naskar.
Sourav Mandal (Assistant Professor, Xavier School of Computer Science and Engineering, Xavier University Bhubaneswar, Odisha ), supervised by myself, successfully defended his PhD thesis on “Understanding and Learning to Solve Arithmetic Word Problems” . Congratulations Dr. Mandal!
I'm in the Program Committee of The 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2021) , Mexico City, Mexico, June 6–11, 2021.
I delivered an invited lecture on “Knowledge Extraction From Text” in the IEEE Computational Intelligence Society (Kolkata Chapter) sponsored One day International webinar on “State of the art and future of Text Mining”, organized by the Department of Computer and System Sciences, Visva-Bharati, on 2nd November, 2020.
I delivered an invited lecture on “Machine Translation” in the AICTE sponsored ATAL FDP on “Artificial Intelligence in Natural Language Processing”, organized by KIIT Deemed to be University, Bhubaneswar, Odisha, during 15th to 19th October, 2020.
I'm in the Program Committee of The 30th International Joint Conference on Artificial Intelligence (IJCAI-21), Montreal, Canada, 21–26 August, 2021.
We are organizing three shared tasks in the 17th International Conference on Natural Language Processing (ICON 2020). Do consider participating in these shared tasks. There will be prizes for the winners in these shared tasks. The Shared Tasks are:
We've just had a paper accepted in The International Workshop on Predictive and Learning Approaches based on Distributed-to-Centralized Machine Learning/Artificial Intelligence Techniques in Management of Large-Scale Internet of Things Networks in Smart Cities (D2C-ML&AI) (in conjunction with ICDCN 2021), which will be held in Nara, Japan, January 08, 2021. The paper is:
Deep Learning based Visual Data Analysis in Integrated Edge to Cloud Computing Environment. Atanu Mandal, Amir Sinaeepourfard and Sudip Kumar Naskar.
Dr. Sriparna Saha and myself are serving as the Workshop/Tutorial Chairs in the 17th International Conference on Natural Language Processing (ICON 2020).
We've just had a paper accepted in COLING’2020, The 28th International Conference on Computational Linguistics, which will be held in Barcelona, Spain, from 8 to 13 December 2020. The paper is:
The Transference Architecture for Automatic Post-Editing. Santanu Pal, Hongfei Xu, Nico Herbig, Sudip Kumar Naskar, Antonio Krueger, Josef van Genabith.
I delivered an invited lecture on “NLP - What, Why and How?” in “Two Days’ International Webinar on Linguistics, Machine Learning & Artificial Intelligence” August 10–11, 2020, organized by Haldia Institute of Technology, Haldia, West Bengal, India.
Rohini Basak (Assistant Professor, Department of Information Technology, Jadavpur University), co-supervised by myself and Prof. Alexander Gelbukh (CIC, IPN, Mexico), successfully defended her PhD thesis on Textual Entailment. Congratulations Dr. Basak!
I'm in the Pre-Standardization Committee on Indian Language Resources of the IEEE Standards Association looking after standards in Machine Translation Evaluation.
I'm in the Program Committee of The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 16th – 20th November 2020.
I'm in the Program Committee of The 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 9th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2020) which will take place in Suzhou, China, during December 4–7, 2020.
Rajat Pandit (Assistant Professor, Department of Computer Science, West Bengal State University), co-supervised by myself and Prof. Mohini Mohan Sardar, successfully defended his PhD thesis entitled "A Comprehensive Study on Computational Semantics in Bengali". Congratulations Dr. Pandit!
I'm in the Program Committee of The 28th International Conference on Computational Linguistics (COLING 2020) which will take place in Barcelona, Spain, during 9–13 December, 2020.
I delivered an invited lecture on “Natural Language Processing for Data Science” in the “Five-Day Workshop on Application of Python in Big Data, Natural Language Processing and Social Network Analysis”, sponsored by IEEE Computational Intelligence Society, Kolkata Chapter, February 12–16, 2020, Department of Computer and System Sciences, Institute of Science, Visva-Bharati University, Santiniketan, West Bengal, India.
I delivered an invited lecture on "Natural Language Processing" in the TEQIP sponsored Short Term Course on "AI and Machine Learning" held during 12-16 February, 2020, at CSE Department, NIT Patna, Bihar, India.
I delivered two lectures in the TEQIP sponsored 1-week Short Term Course on Applications of Computational Intelligence in Text, Image and Sensor Data Processing, during 3-8 February, 2020, at Jadavpur University, Kolkata, India.
"Intelligent Information Retrieval" on 5th February, 2020
"Statistical Machine Translation" on 6th February, 2020
We've just had a book chapter accepted in Innovations, Algorithms, and Applications in Cognitive Informatics and Natural Intelligence, IGI Global. The paper is:
A Lexico-Syntactic-Semantic Approach to Recognizing Textual Entailment. Rohini Basak, Sudip Kumar Naskar and Alexander Gelbukh.
We organized a 1-week Short Term Course on Applications of Computational Intelligence in Text, Image and Sensor Data Processing, during 3-8 February, 2020, at Jadavpur University.
We've just had a journal paper accepted in SN Computer Science, Springer Nature. The paper is:
MSIR@FIRE: A Comprehensive Report from 2013 to 2016. Somnath Banerjee, Monojit Choudhury, Kunal Chakma, Sudip Kumar Naskar, Amitava Das, Sivaji Bandyopadhyay, Paolo Rosso.
Somnath Banerjee, co-supervised by myself, Prof. Sivaji Bandyopadhyay and Prof. Paolo Rosso, successfully defended his PhD thesis entitled "Answering Monolingual and Cross-Script Restricted Domain Questions in Bengali". Congratulations Dr. Banerjee!
2019:
I joined as an Associate Professor in the Computer Science & Engineering Department, Jadavpur University.
We've just had a demonstration paper accepted in ICDCN 2020, 21st International Conference on Distributed Computing and Networking (ICDCN), which will be held during January 4th-7th, 2020, in Kolkata, India. The paper is:
Ethnicity Identification from Speech Signal in Bilingual Code Mixed Scenario. Atanu Mandal, Suparnakanti Das and Sudip Kumar Naskar.
I delivered two invited lectures in the AICTE sponsored 2 Weeks' Faculty Development Programme on “Natural Language Processing for Digital Humanities”, 3–13 December, 2019, at National Institute of Science & Technology (NIST), Institute Park, Pallur Hills, Berhampur, Odisha 761008, India.
"Natural Language Processing: A Broad Overview" on 12th December, 2019
"Machine Translation: The Fundamentals and the State of the Art" on 13th December, 2019
Rajat Pandit, co-supervised by myself and Prof. Mohini Mohan Sardar (West Bengal State University) , submitted his PhD thesis on "A Comprehensive Study on Computational Semantics in Bengali".
My PhD student, Sourav Mandal, submitted his PhD thesis on "Understanding and Learning to Solve Arithmetic Word Problems ".
We've just had a paper accepted in Neural Computing and Applications, (SCIE indexed), Springer. The paper is:
Online Bangla Handwritten Word Recognition Using HMM and Language Model. Shibaprasad Sen, Ankan Bhattacharyya, Mridul Mitra, Kaushik Roy, Sudip Kumar Naskar, Ram Sarkar.
We've just had a paper accepted in IALP 2019, International Conference on Asian Language Processing , which will be held during November 15-17, 2019, in Shanghai, China. The paper is:
Celebrity Profiling from Twitter Data. Kumar Gourav Das, Braja Gopal Patra and Sudip Kumar Naskar.
We've just had an Oral Paper accepted in LKE 2019, the 7th International Symposium on Language & Knowledge Engineering, which will be held during October 29-31, 2019, in Dublin, Ireland. The paper will be published in Journal of Intelligent & Fuzzy Systems (JIFS, SCIE indexed). The paper is:
Solving Arithmetic Word Problems: a Deep Learning Based Approach. Sourav Mandal, Sk Arif Ahmed and Sudip Kumar Naskar.
We've just had a paper accepted in Sādhanā (SCIE indexed), Springer. The paper is:
Classifier Combination Approach for Question Classification for Bengali Question Answering System . Somnath Banerjee, Sudip Kumar Naskar, Paolo Rosso and Sivaji Bandyopadhyay.
We've just had a paper accepted in TENCON 2019, the IEEE Region 10 flagship Conference, which will be held during 17-20th October 2019, in Kochi, India. The paper is the outcome of work done by 2 second year undergraduate (BCSE-II) students (well done Arpan and Avishek!). The paper is:
Word Difficulty Prediction Using Convolutional Neural Networks. Arpan Basu, Avishek Garain, Sudip Kumar Naskar.
Poulami Das (co-supervised by myself and Dr Sankar Narayan Patra) successfully defender her PhD thesis entitled "A Soft Computing based Approach for Signal Processing". Congratulations Dr. Das!
I was on a (Erasmus+) research visit to the Natural Language Engineering Lab, Pattern Recognition and Human Language Technologies (PRHLT) Research Center, Universitat Politècnica de València (UPV), Valencia, Spain, in June, 2019.
We've just had a paper accepted in Sādhanā (SCIE indexed), Springer. The paper is:
A Novel Approach to Word Sense Disambiguation in Bengali Language using Supervised Methodology. Alok Kumar Pal, Diganta Saha, Niladri Sekhar Dash, Sudip Kumar Naskar and Antara pal.
We've just had a paper accepted in International Journal on Artificial Intelligence Tools (SCIE indexed), World Scientific Press. The paper is:
Solving Arithmetic Word Problems by Object Oriented Modeling and Query-based Information Processing. Sourav Mandal and Sudip Kumar Naskar.
I am now (since May 2019) an Associate Editor on the Editorial Board of SADHANA – Academy Proceedings in Engineering Sciences, Indian Academy of Sciences, Publisher: Springer. (SCIE journal)
We've just had a paper accepted in MT Summit 2019, the 17th Machine Translation Summit, which will be held during 19-23 August, 2019, Dublin, Ireland. The paper is:
Improving CAT Tools in the Translation Workflow: New Approaches and Evaluation. Mihaela Vela, Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar and Josef van Genabith
Rohini Basak, co-supervised by myself and Prof. Alexander Gelbukh, submitted her PhD thesis on "A Study on Recognizing Textual Entailment using Different Approaches".
We've just had a paper accepted in Informatics. The paper is:
Improving Semantic Similarity with Cross-Lingual Resources: A Study in Bangla – A Low Resourced Language. Rajat Pandit, Saptarshi Sengupta, Sudip Kumar Naskar, Niladri Sekhar Dash, Mohini Mohan Sardar.
Amit Majumder, co-supervised by myself and Dr. Asif Ekbal, submitted his PhD thesis on "Evolutionary Approach for Biomedical Event Extraction and Word Sense Disambiguation".
We organized a 3-day Workshop on Machine Learning and Data Analytics during 26th -28th March, 2019. Eminent academicians and researchers from IITs and ISI served as resource persons. The workshop was attended by more than 150 participants, mostly faculty members and research scholars.
Somnath Banerjee, co-supervised by myself, Prof. Sivaji Bandyopadhyay and Prof. Paolo Rosso, submitted his PhD thesis on "Answering Monolingual and Cross-Script Restricted Domain Questions in Bengali". He joined LIMSI, CNRS, France, as a post-doctoral researcher.
I am in the Program Committee of 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019) which will take place in Minneapolis, USA, during 2–7 June, 2019.
2018:
I delivered an invited lecture on "Cross-Lingual Information Retrieval on Digital Libraries" in the International Training Programme on Library Automation for Professional Enrichment (ITPLAPE) held during 10-16 December, 2018, at Satyajit Ray Film & Television Institute (SRFTI), Kolkata, India.
Poulami Das, co-supervised by myself and Dr Sankar Narayan Patra, submitted her PhD thesis on "A Soft Computing based Approach for Signal Processing".
We have proposed a new MT evaluation metric, ITER (Improved TER), in WMT 2018. The metric provides significant improvements over Translation Error Rate (TER), one of the most commonly used MT evaluation metrics which mimics human post-editing effort and is often used as a baseline evaluation metric by MT researchers due to its simplicity and interpretability. ITER improves over TER by inclusion of stem matching, better normalization technique and optimal edit operation costs so as to improve the correlation of the metric scores with human judgement scores. ITER provided state-of-the-art results for the fi-en, zh-en, en-et, en-fi language pairs in terms of absolute Pearson correlation of system-level metrics with DA human assessment for 10K hybrid super-sampled systems in newstest2018 in WMT 2018 according to the Results of the WMT18 Metrics Shared Task reported by the Organizers.
We've just had two Oral Papers accepted in LKE 2018, the 6th International Symposium on Language & Knowledge Engineering, which will be held during October 29-31, 2018, in Puebla, Mexico. The papers will be published in Journal of Intelligent & Fuzzy Systems (JIFS, SCIE indexed). The papers are:
Automatic Short Answer Grading Using Textual Entailment. Rohini Basak, Sudip Kumar Naskar, and Alexander Gelbukh.
Word Sense Induction in Bengali Using Parallel Corpora and Distributional Semantics. Saptarshi Sengupta, Parag Mitra, Rajat Pandit, Sudip Kumar Naskar, Mohini Mohan Sardar.
We've just had a paper accepted in Sādhanā (SCIE indexed), Springer. The paper is:
Bio-molecular Event Extraction by integrating multiple Event Extraction Systems. Amit Majumder, Asif Ekbal and Sudip Kumar Naskar.
We've just had a paper accepted in Applied Soft Computing (SCIE indexed). The paper is:
Hardware Efficient FIR Filter Design using Global Best Steered Quantum Inspired Cuckoo Search Algorithm. Poulami Das, Sudip Kumar Naskar, and Sankar Narayan Patra.
I delivered an invited talk in the National Students Computing Conference held during 21-22 April, 2018, at Veer Surendra Sai University of Technology (VSSUT), Burla, Sambalpur, Odisha, India.
I delivered an invited talk in the Workshop on “Emerging Trends in Information Technology” held during 21-23 March, 2018, at Asutosh College Second Campus, Vasa, West Bengal, India.
We've just had a paper accepted in CICLing 2018, the 19th International Conference on Intelligent Text Processing and Computational Linguistics, which will be held during 18-24th March, 2018, in Hanoi, Vietnam. The paper is:
Recognizing Textual Entailment Using Weighted Dependency Relations. Tanik Saikh, Sudip Kumar Naskar and Asif Ekbal.
I'm in the Program Committee of The 27th International Conference on Computational Linguistics (COLING 2018) which will take place in Santa Fe, New-Mexico, USA, during 20–25 August, 2018.
We've just had a a Poster paper accepted in the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), an annual flagship conference of the IEEE Signal Processing Society and the world's largest (it received received 2830 paper submissions this year!) and most comprehensive technical conference on signal processing and its applications, which will be held during 15–20 April 2018, Calgary, Alberta, Canada. This is my first ever publication on speech processing. The paper is:
Says Who? Deep Learning Models for Joint Speech Recognition, Segmentation and Diarization. Amitrajit Sarkar, Surajit Dasgupta, Sudip Kumar Naskar, Sivaji Bandyopadhyay.
I'm in the Program Committee of the Workshop on Technologies for MT of Low Resource Languages (LoResMT) which will be held in Boston, Massachusetts, USA, on March 21, 2018. The Workshop is associated and collocated with the 13th biennial conference of the Association for Machine Translation in the Americas (AMTA 2018).
2017:
We organized ICON 2017 at Jadavpur University. Myself, along with Dr. Dipankar Das were the Organizing Chairs.
My first PhD student, Santanu Pal, co-advised by Prof. Josef van Genabith, defended his thesis titled "A Hybrid Machine Translation Framework for an Improved Translation Workflow" with "Magna Cum Laude" grade at Saarland University, Saarbrücken.
We've had an Oral Paper and two Posters accepted in ICON-2017, the 14th International Conference on Natural Language Processing, which will be held during December 18th to 21st, 2017, in Jadavpur, India. The papers are:
[Oral] Natural Language Programing with Automatic Code Generation towards Solving Addition-Subtraction Word Problems. Sourav Mandal and Sudip Kumar Naskar.
[Poster] Normalization of Social Media Text using Deep Neural Networks. Ajay Shankar Tiwari and Sudip Kumar Naskar.
[Poster] Joy Mahapatra and Sudip Kumar Naskar. Unsupervised Morpheme Segmentation Through Numerical Weighting and Thresholding.
We've just had two Oral Papers accepted in LKE 2017 , the 5th International Symposium on Language & Knowledge Engineering, which will be held during November 22nd to 24th, 2017, in Puebla, Mexico. The papers will be published in Journal of Intelligent & Fuzzy Systems (JIFS). The papers are:
A Simple Hybrid Approach to Recognizing Textual Entailment. Rohini Basak, Sudip Kumar Naskar, and Alexander Gelbukh.
Code Mixed Cross Script Factoid Question Classification - A Deep Learning Approach. Somnath Banerjee, Sudip Kumar Naskar, Paolo Rosso and Sivaji Bandyopadhyay.
We've just had a Short Paper and a Poster/Demo paper accepted in NLDB 2017, the 22nd International Conference on Natural Language & Information Systems, which will be held during 21-23 June 2017, 2017, in Liège, Belgium. The papers are:
[Short Paper] Feature Selection and Class-Weight tuning using Genetic Algorithm for Bio-molecular Event Extraction. Amit Majumder, Asif Ekbal and Sudip Kumar Naskar.
[Poster/Demo] Towards Generating Object-Oriented Programs Automatically from Natural Language Texts for Solving Mathematical Word Problems. Sourav Mandal and Sudip Kumar Naskar.
We've just had 2 papers accepted in CICLing 2017, the 18th International Conference on Intelligent Text Processing and Computational Linguistics, which will be held during 17-23th April, 2017, in Budapest, Hungary. The papers are:
Named Entity Recognition on Code-Mixed Cross-Script Social Media Content. Somnath Banerjee, Sudip Kumar Naskar, Paolo Rosso and Sivaji Bandyopadhyay.
Textual Entailment Using Machine Translation Evaluation Metrics. Tanik Saikh, Sudip Kumar Naskar, Asif Ekbal and Sivaji Bandyopadhyay.
We've just had a short paper accepted in EACL 2017, the 15th Conference of the European Chapter of the Association for Computational Linguistics, which will be held during 3-7th April, 2017, in Valencia, Spain. The paper is:
Neural Automatic Post-Editing Using Prior Alignment and Reranking. Santanu Pal, Sudip Kumar Naskar, Mihaela Vela, Qun Liu and Josef van Genabith.
2016:
Acted as a resource person in a 3-day national workshop on "Current Trends in Computational Linguistics and Its Applications in Odiya Language" held during 24-26th October, 2016 at Utkal University, Bhubaneswar, Odisha.
We've just had a research paper and a system demonstration paper accepted in COLING 2016, the 26th biennial International Conference on Computational Linguistics, which will be held during 11-16 December, 2016, in Osaka, Japan. The papers are:
[Research Paper] Multi-Engine and Multi-Alignment Automatic Post-Editing and its Impact on Translation Productivity. Santanu Pal, Sudip Kumar Naskar and Josef van Genabith.
[System Demonstration Paper] CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research. Santanu Pal, Sudip Kumar Naskar, Marcos Zampieri, Tapas Nayak and Josef van Genabith.
We've just had a full paper accepted in INLG 2016, the biennial conference of the Special Interest Group on Natural Language Generation (SIGGEN), which will be held during September 5-8, 2016, in Edinburgh, Scotland. The paper is:
Statistical Natural Language Generation from Tabular Non-textual Dataset. Joy Mahapatra, Sudip Kumar Naskar and Sivaji Bandyopadhyay.
I have been awarded "Young Faculty Research Fellowship" under "Visvesvaraya PhD Scheme for Electronics & IT", by Media Lab Asia, Ministry of Electronics and Information Technology, Govt. of India.
We've just had a short paper accepted in ACL 2016, The 54th annual meeting of the Association for Computational Linguistics,which will be held during August 7-12, 2016, in Berlin, Germany. The paper is:
A Neural Network based Approach to Automatic Post-Editing. Santanu Pal, Sudip Kumar Naskar, Mihaela Vela and Josef van Genabith.
We are organizing a sub-task on "Code-Mixed Cross-Script Question Classification" in the Shared Task on Mixed Script Information Retrieval (MSIR) in the 8th meeting of Forum for Information Retrieval Evaluation (FIRE 2016) which will be held in Indian Statistical Institute, Kolkata, India, during 7-10th December, 2016.
We have just launched a new web-based Computer Aided Translation (CAT) / Post-Editing tool, ONLINE CATaLOG, which offers a number of enhanced CAT functionalities and improved interface over existing CAT tools. The desktop version of CATaLog is open-source and freely available (here).
We've just had a book chapter accepted in the book "Hybrid Approaches to Machine Translation". The paper is
Hybrid Word Alignment. Santanu Pal and Sudip Kumar Naskar. In M.R. Costa-jussà et al. (eds.), "Hybrid Approaches to Machine Translation", Theory and Applications of Natural Language Processing, Springer International Publishing Switzerland. DOI 10.1007/978-3-319-21311-8_3.
I delivered an invited talk on “Machine Translation” in “Recent Trends in Computing” held during 16th-17th March, 2016 at West Bengal State University, Barasat, India.
I am on the Program Committee of IEEE International FRUCT conference on Intelligence, Social Media and Web 2016, Saint-Petersburg, Russia.
We've just had a paper accepted in CICLing 2016 which will be held during April 3–9, 2016 in Konya, Turkey. The paper is:
Forest to String Based Statistical Machine Translation with Hybrid Word Alignments. Santanu Pal, Sudip Kumar Naskar and Josef van Genabith. 2016.
We've just had a paper accepted in MultiLingMine 2016 which will be held on 20th March, 2016 in Padua, Italy. The paper is:
The First Cross-Script Code-Mixed Question Answering Corpus. Somnath Banerjee, Sudip Kumar Naskar, Paolo Rosso and Sivaji Bandyopadhyay.
We've just had a paper accepted in NLP4TM 2016 which will be held during 28th May 2016 in Portorož, Slovenia. The paper is:
Beyond Translation Memories: Generating Translation Suggestions based on Parsing and POS Tagging. Tapas Nayek, Santanu Pal, Sudip Kumar Naskar, Sivaji Bandyopadhyay and Josef van Genabith. 2016.
I delivered an invited talk on “NLP, MT and Humans in the Loop” in the “Workshop on Language Technology and Cognitive Science (LTCS-2016)” held during 10th-12th February, 2016 at Indian Statistical Institute, Kolkata, India.
I presented a 4-hour tutorial on “Machine Translation” in the “Workshop on Natural Language Processing and Its Applications” held during 1st-5th February, 2016 at Manipur Institute of Technology, Imphal, Manipur, India.
We've just had a paper accepted in LREC 2016 which will be held during 23-28 May 2016 in Portorož, Slovenia. The paper is:
CATaLog Online: Porting a Post-editing Tool to the Web. Santanu Pal, Tapas Nayek, Sudip Kumar Naskar, Marcos Zampieri, Mihaela Vela and Josef van Genabith.
2015:
I was the Student Paper Competition Chair in the Twelfth International Conference on Natural Language Processing (ICON-2015), IIITM-Kerala, Trivandrum, India, December 13-16, 2015.
I was on the Program Committee for The 7th meeting of Forum for Information Retrieval Evaluation (FIRE 2015), 4th-6th 2015, DAIICT, Gandhinagar, Gujarat, India.
We organized a subtask on “Mixed-script Question Answering” in The 7th meeting of Forum for Information Retrieval Evaluation (FIRE 2015), held during 4th-6th 2015, at DAIICT, Gandhinagar, Gujarat, India. Co-Organinzers: Somnath Banerjee, Sivaji Bandyopadhyay and Paolo Rosso.
I was the Organizing Chair (along with Dr. Nibaran Das) in the 2nd IEEE International Conference on Recent Trends in Information Systems (ReTIS-15) which was held during July 9-11, 2015, at Jadavpur University, Kolkata, India.