Information Retrieval, Natural Language Processing, Event Analysis, Machine Learning and Data Mining.
Keyphrase Extraction and Generation, Conversation Summarization, Scholarly Document Processing, Document Intelligence.
Meng, Rui, Debanjan Mahata, and Florian Boudin. "From Fundamentals to Recent Advances: A Tutorial on Keyphrasification." In European Conference on Information Retrieval, pp. 582-588. Springer, Cham, 2022 (ECIR 2022). Website, Videos
Mayank Kulkarni, Debanjan Mahata, Ravneet Arorar, Rajarshi Bhowmik. Learning Rich Representation of Keyphrases from Text. Accepted at In the Findings of 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2022). Models.
Laiba Mehnaz, Debanjan Mahata, Uma Sushmitha Gunturi, Amardeep Kumar, Riya Jain, Gauri Gupta, Isabelle Lee, Anish Acharya. Rajiv Ratn Shah. GupShup: Summarizing Open-Domain Code-Switched Conversations. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021). Data and Models.
Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata and Haimin Zhang. On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT 2021). Data.
Sarthak Anand, Pradyumna Gupta, Hemant Yadav, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah. MIDAS at SemEval-2020 task 10: Emphasis selection using label distribution learning and contextual embeddings. Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval@COLING 2020). (Best Paper Award for Result Interpretation)
Avinash Swaminathan, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah, Amanda Stent. A Preliminary Exploration of GANs for Keyphrase Generation. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). Code.
Akash Kumar Gautam, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah. Semi-Supervised Iterative Approach for Domain-Specific Complaint Detection in Social Media. Proceedings of The 3rd Workshop on e-Commerce and NLP. (ECNLP@ACL 2020). Data.
Swapnil Dhanwal, Hritwik Dutta, Hitesh Nankani, Nilay Shrivastava, Yaman Kumar, Junyi Jessy Li, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah, Amanda Stent. An Annotated Dataset of Discourse Modes in Hindi Stories. 12th Language Resource and Evaluation Conference, Marseille, France (LREC 2020). Data.
Dhruva Sahrawat, Debanjan Mahata, Mayank Kulkarni, Haimin Zhang, Rakesh Gosangi, Amanda Stent, Agniv Sharma, Yaman Kumar, Rajiv Ratn Shah, Roger Zimmermann. Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings. 42nd European Conference on Information Retrieval, Lisbon, Portugal (ECIR 2020). Data. Arxiv Preprint.
Akash Gautam, Puneet Mathur, Rakesh Gosangi, Debanjan Mahata, Ramit Sawhney, Rajiv Ratn Shah. #MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo movement. 14th International Conference on Web and Social Media. Atlanta, Georgia, USA (ICWSM 2020). Data. Arxiv Preprint.
Dhruva Sahrawat, Yaman Kumar, Shubham Maheshwari, Debanjan Mahata, Amanda Stent, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann. Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech Recognition. Accepted at The Thirty-Fourth AAAI Conference on Artificial Intelligence. New York City, NY, USA (AAAI 2020).
Gyanesh Anand, Akash Gautam, Puneet Mathur, Debanjan Mahata , Rajiv Ratn Shah, Ramit Sawhney. An Iterative Approach for Identifying Complaint Based Tweets in Social Media Platforms. Accepted at The Thirty-Fourth AAAI Conference on Artificial Intelligence, Student Paper Track. New York City, NY, USA (AAAI 2020).
Avinash Swaminathan, Raj Kuwar, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah. Keyphrase Generation for Scientific Articles using GANs. Accepted at The Thirty-Fourth AAAI Conference on Artificial Intelligence, Student Paper Track. New York City, NY, USA (AAAI 2020). Arxiv Preprint. Code.
Pradyumna Prakhar Sinha, Rohan Mishra, Ramit Sawhney, Debanjan Mahata, Rajiv Ratn Shah and Huan Liu. #suicidal - A Multipronged Approach to Identify and Explore Suicidal Ideation in Twitter. The 28th ACM International Conference on Information and Knowledge Management. Beijing, China (CIKM 2019).
Nilay Shrivastava, Astitwa Saxena, Yaman Kumar, Rajiv Ratn Shah, Amanda Stent, Debanjan Mahata, Preeti Kaur, Roger Zimmermann. MobiVSR : Efficient and Light-weight Neural Network for Visual Speech Recognition on Mobile Devices. Accepted at The 20th Annual Conference of the International Speech Communication Association INTERSPEECH 2019 | Graz, Austria, Sep. 15-19, 2019. Code.
Shashwat Uttam, Yaman Kumar, Dhruva Sahrawat, Mansi Aggarwal, Rajiv Ratn Shah, Debanjan Mahata, Amanda Stent. Hush-Hush Speak: Speech Reconstruction Using Silent Videos. Accepted at The 20th Annual Conference of the International Speech Communication Association INTERSPEECH 2019 | Graz, Austria, Sep. 15-19, 2019. Code.
Sarthak Anand, Debanjan Mahata, Haimin Zhang, Simra Shahid, Laiba Mehnaz, Yaman Kumar, Rajiv Ratn Shah. MIDAS@SMM4H-2019: Identifying Adverse Drug Reactions and Personal Health Experience Mentions from Twitter. Social Media Mining for Health Workshop, co-located with the 57th Annual Meeting of the Association for Computational Linguistics (ACL), Florence, Italy (SMM4H@ACL 2019).
Arijit Ghosh Chowdhury, Ramit Sawhney, Rajiv Ratn Shah and Debanjan Mahata. #YouToo? Detection of Personal Recollections of Sexual Harassment on Social Media. The 57th Annual Meeting of the Association for Computational Linguistics (ACL), Florence, Italy (ACL 2019).
Sarthak Anand, Debanjan Mahata, Kartik Aggarwal, Laiba Mehnaz, Simra Shahid, Haimin Zhang, Yaman Kumar, Rajiv Shah, Karan Uppal. MIDAS at SemEval-2019 Task 9: Suggestion Mining from Online Reviews using ULMFiT. International Workshop on Semantic Evaluation 2019, co-located with 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (SemEval 2019@NAACL-HLT 2019). (10 th rank out of 34 teams in SubTask A). Code.
Haimin Zhang, Debanjan Mahata, Simra Shahid, Laiba Mehnaz, Sarthak Anand, Yaman Singla, Rajiv Ratn Shah,and Karan Uppal. MIDAS at SemEval-2019 Task 6: Identifying Offensive Posts and Targeted Offense from Twitter. International Workshop on Semantic Evaluation 2019, co-located with 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (SemEval 2019@NAACL-HLT 2019). (5 th rank out of 103 teams in SubTask A and 8th rank out of 75 teams in SubTask B).
Arijit Ghosh Chowdhury, Ramit Sawhney, Puneet Mathur, Debanjan Mahata and Rajiv Ratn Shah. Speak Up, Fight Back! Detection of Social Media Disclosures of Sexual Harassment. 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Student Research Workshop (NAACL-HLT, 2019).
Rohan Mishra, Pradyumn Prakhar Sinha, Ramit Sawhney, Debanjan Mahata, Puneet Mathur and Rajiv Ratn Shah. SNAP-BATNET: Cascading Author Profiling and Social Network Graphs for Suicide Ideation Detection on Social Media. In the proceedings of 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Student Research Workshop (NAACL-HLT, 2019).
Yaman Kumar, Swati Aggarwal, Debanjan Mahata, Rajiv Ratn Shah, Ponnurangan Kumaraguru, Roger Zimmermann. Get IT Scored using AutoSAS-An Automated System for Scoring Short Answers. In the proceedings of The Ninth Symposium on Educational Advances in Artificial Intelligence (EAAI-19) in conjunction with Thirty-Third AAAI Conference on Artificial Intelligence (AAAI, 2019).
Puneet Mathur, Rajiv Shah, Ramit Sawhney, Debanjan Mahata. Detecting offensive tweets in hindi-english code-switched language . Proceedings of the Sixth International Workshop on Natural Language Processing for Social Media in conjunction with 56th Annual Meeting of the Association for Computational Linguistics., Melbourne, Australia (ACL, 2018).
Nupur Baghel, Yaman Kumar, Paavini Nanda, Rajiv Ratn Shah, Debanjan Mahata and Roger Zimmermann. Kiki Kills: Identifying Dangerous Challenge Videos from Social Media. Arxiv preprint arXiv:1812.00399 . Data.
Debanjan Mahata, John Kuriakose, Ratn Rajiv Shah, Roger Zimmermann. Key2Vec: Automatic Ranked Keyphrase Extraction from Scientific Articles Using Phrase Embeddings. In the proceedings of Human Language Technologies: The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics, New Orleans, Louisiana, U.S.A, 2018 (NAACL-HLT, 2018). Download Paper.
Debanjan Mahata, John Kuriakose, Ratn Rajiv Shah, Roger Zimmermann, John Talburt. Theme-weighted Ranking of Keywords from Text Documents using Phrase Embeddings. In the proceedings of 1st IEEE International Conference on Multimedia Information Processing and Retrieval; Miami, Florida, U.S.A, 2018 (IEEE MIPR, 2018). Download Paper.
Mayank Meghawat, Satyendra Yadav, Debanjan Mahata, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann. A Multimodal Approach to Predict Social Media Popularity. To appear in the proceedings of 1st IEEE International Conference on Multimedia Information Processing and Retrieval; Miami, Florida, U.S.A, 2018 (IEEE MIPR, 2018). Download Paper.
Jasper Friedrichs, Debanjan Mahata, Shubham Gupta. InfyNLP at SMM4H Task 2: Stacked Ensemble of Shallow Convolutional Neural Networks for Identifying Personal Medication Intake from Twitter. Proceedings of the Second Workshop on Social Media Mining for Health Applications (SMM4H, 2017). Health Language Processing Laboratory; 2017. Download Paper. (Top performing system in the Shared Task of Identifying Personal Medication Intake from Twitter).
D Mahata, A Kaduskar, J Kuriakose. Search Powered by Deep Learning. Smart Data Conference, San Francisco, 2017. Presentation.
Debanjan Mahata, John R. Talburt and Vivek Kumar Singh. From Chirps to Whistles : Discovering Event-specific Informative Content from Twitter. Proceedings of the 7th Annual ACM Web Science Conference. June 28th - July 1st, Oxford, UK (ACM Web Sci, 2015). Download Paper. Presentation.
Debanjan Mahata, John R. Talburt and Vivek Kumar Singh. A Framework for Collecting, Extracting and Managing Event Identity Information from Twitter. In Proceedings of 20th MIT- International Conference on Information Quality. Jul 24, 2015, Boston, USA (MIT ICIQ, 2015). Download Paper. Presentation.
Debanjan Mahata, John R. Talburt and Vivek Kumar Singh. Identifying and Ranking of Event-specific Entity-centric Informative Content from Twitter. 20th International Conference On Applications Of Natural Language To Information Systems (NLDB, 2015), June 17th - 19th, 2015, Passau, Germany. Download.
Debanjan Mahata and John R. Talburt. A Framework for Collecting and Managing Entity Identity Information from Social Media. In Proceedings of 19th MIT-International Conference on Information Quality. August 1-3, 2014, Xi'An, China (MIT ICIQ, 2014). Download.
Debanjan Mahata and Nitin Agarwal. Learning From The Crowd: An Evolutionary Mutual Reinforcement Model for Analyzing Events. In Proceedings of ACM/IEEE International Conference on Advances in Social Networks Analysis and Mining (ASONAM, 2013). August 25-28. Niagara Falls, Canada. Download.
Debanjan Mahata and Nitin Agarwal. What does Everybody Know? Identifying Event-Specific Sources from Social Media. In Proceedings of the fourth International Conference on Computational Aspects of Social Networks (CASoN, 2012). November 21-23, 2012. Sao Carlos, Brazil. Download.
Fatih Sen, Rolf T. Wigand, Nitin Agarwal, Debanjan Mahata, and Halil Bisgin. Identifying Focal Patterns in Social Networks. In Proceedings of the fourth International Conference on Computational Aspects of Social Networks (CASoN, 2012). November 21-23, 2012. Sao Carlos, Brazil. Download.
Debanjan Mahata and Nitin Agarwal. Analyzing Event-specific Socio-Technical Behaviors Through the Lens of Social Media. The International Sunbelt Social Network Conference (Sunbelt XXXII) organized by the International Network for Social Network Analysis (INSNA), March 12-18, 2012, Redondo Beach, California.
Singh, V.K, Mahata, D.,Adhikari, R: A Clustering and Opinion Mining Approach to Sociopolitical Analysis of the Blogosphere , in proceedings of 2010 IEEE International Conference on Computational Intelligence and Computing Research., December. 2010, Coimbatore-India, IEEE Xplore, pp. 1-4, DOI: 10.1109/ICCIC.2010.5705807 (ISBN: 978-1-4244-5965-0). Download.
Singh, V.K., Mahata, D., Adhikari R.: Mining the Blogosphere from a Socio-political Perspective, appeared in Proceedings of 6th International Conference on Next Generation Web Services Practices, Nov. 2010, Gwalior-India, IEEE Xplore, pp. 365-370, DOI:10.1109/CISIM.2010.5643634 (ISBN: 978-1-4244-7817-0). Download.
Method and System for Key Phrase Extraction and Generation from Text (IN201741042053).
Method and System for Key Phrase Extraction and Generation from Text (US20190155944A1).
Method and System for Key Phrase Extraction and Generation from Text (EP3489837A1).
Method and System for Key Phrase Extraction and Generation from Text (AU2018267618A1).
Rajiv Ratn Shah, Debanjan Mahata, Vishal Choudhary, Rajiv Bajpai. "Multimodal Semantics and Affective Computing". Intelligent Multidimensional Data and Image Processing, IGI Global, 2018.
Debanjan Mahata and Nitin Agarwal. “Grouping the Similar among the Disconnected Bloggers”,Social Network Analysis and Social Media Mining: Emerging Research. Guandong Xu and Lin Li (Eds.). IGI Global, 2012.
Debanjan Mahata and Nitin Agarwal. “Identifying Event-Specific Sources from Social Media”, Online Social Media Analytics and Visualization. Jalal Kawash (Ed.). Springer, 2014.
Nitin Agarwal, Debanjan Mahata, and Huan Liu. "Time and Event Driven Modeling of Blogger Influence", Encyclopedia of Social Network Analysis and Mining (ESNAM). Alhajj, Reda; Rokne, Jon (Eds.). Springer, 2014.
Abeed Sarker, Maksim Belousov, Jasper Friedrichs, Kai Hakala, Svetlana Kiritchenko, Farrokh Mehryary, Sifei Han, Tung Tran, Anthony Rios, Ramakanth Kavuluru, Berry de Bruijn, Filip Ginter, Debanjan Mahata, Saif M Mohammad, Goran Nenadic, and Graciela Gonzalez-Hernandez. Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task. Journal of the American Medical Informatics Association, 25(10), pp.1274-1283. (JAMIA) Download Paper.
Rajiv R S, Debanjan M, Mayank M, Roger Z. Leveraging Multimodal Semantics and Sentiments Information in Event Understanding and Summarization. Psychol Behav Sci Int J. 2017; 6(5): 555699. DOI: 10.19080/PBSIJ.2017.06.555699. Download Paper.
Debanjan Mahata, Jasper Friedrichs, Rajiv Ratn Shah, Jing Jiang. Detecting Personal Intake of Medicine from Twitter. IEEE Intelligent Systems ( Volume: 33 , Issue: 4 , Jul./Aug. 2018 ) , Page(s): 87 - 95 DOI: 10.1109/MIS.2018.043741326 (IEEE Intelligent Systems).
Chatter that Matter : A Framework for Collecting, Extracting and Managing Event Identity Information from Short Social Media Text. Student Research and Creative Works Expo, Graduate Competition, University of Arkansas at Little Rock, 2015. (Awarded First Place)
A framework for collecting, extracting and managing event identity information from textual content in social media. Download. Supervisor: Dr. John R. Talburt.
Collecting and Managing Real-life Event Information from Social Media - Challenges and Methodologies. LAP Lambert Academic Publishing AG & CO. KG. ISBN: 978-6202081979. [Amazon]