Publications
 
2013
  • "Sustainable Employment in India by Crowdsourcing Enterprise Tasks", 3rd annual Symposium on Computing for Development(DEV),2013, with Chithralekha Balamurugan and Sujit Gujar
  • "Form Digitization in BPO: From Outsourcing to Crowdsourcing?", ACM SIGCHI Conference on Human Factors in Computing Systems, 2013, with Jacki O’Neill, Antonietta Grasso and David Martin
2012
  • "“Is BPO Work crowdsourcable?", International Conference on Human Computer Interaction (India HCI) 2012; with Jacki O’Neill and Antonietta Grasso
  • “Experiences in Resource Generation for Machine Translation through crowdsourcing”, 8th international conference on Language Resources and Evaluation (LREC), 2012; with Anoop Kunchukuttan, Pratik Patel, Kushal Ladha, Sowmya Gupta, Mitesh Khapra, Pushpak Bhattacharyya
  • "Enabling Rural BPOs Move Up the Value Chain :. Secure Distribution of Office Documents", International Conference on Advances in ICT for Emerging Regions(ICTer), 2012, with Lakshmi Vaidyanathan, Bala Sankar Pazhani, and Meera Sampath
        2011
  • ""How to Assure the Quality of Human Computation Tasks When Majority Voting Fails?", 2nd NIPS workshop on Computational Social Science and the Wisdom of Crowds 2011 with Yu-An Sun, Chris Dance and Greg Little
  • "Experiences in Resource Generation for Machine Translation through Crowdsourcing", CrowdConf 2011 with Anoop Kunchukuttan, Pratik Patel, Kushal Ladha, Somya Gupta, Mitesh M. Khapra, and Pushpak Bhattacharyya
  • “One Step beyond Independent Agreement: A Tournament Selection Approach for Quality Assurance of Human Computation Tasks”, 3rd Human Computation Workshop (HCOMP) with AAAI 2011, with  Yu-An Sun, Shourya Roy, and Greg D. Little
2009
  • “A Survey of Types of Text Noise and Techniques to Handle Noisy Text”, Third Workshop on Analytics for Noisy Unstructured Text Data (AND) 2009, with L. V. Subramaniam, Tanveer  A. Faruquie, Sumit Negi
  • “Language Independent Unsupervised Learning of Short Message Service Dialect”, International Journal on Document Analysis and Recognition (IJDAR): Special Issue on Noisy Text Analytics, Springer, accepted for publication in 2009, with Sreansu Acharyya, Sumit Negi, L. V. Subramaniam
  • “Getting Insights from Real Voice of Customers: Conversation Mining at a Contact Center”, Journal on Information Sciences“, with Hironori Takeuchi, L. Venkata Subramaniam, and Tetsuya Nasukawa
  • Unsupervised Segmentation of Conversational Transcripts", Journal of Statistical Analysis and Data Mining, with Krishna Kummamuru, Deepak P, and L Venkata Subramaniam
 2008
  • “An Integrated System for Automatic Customer Satisfaction Analysis in the Services Industry”, Demonstration paper, KDD 2008, with Shantanu Godbole
  • “Unsupervised Learning of Multilingual Short Message Service (SMS) Dialect From Noisy Examples”, 2nd Workshop on Analytics for Noisy Unstructured Text Data (AND 2008), with Sreangsu Acharyya, Sumit Negi, and L Venkata Subramaniam
  • “Integrating Text Classification, Business Intelligence, and Interactive Labeling for Services Industry Deployments”, Industry/Govt Track, KDD 2008, with Shantanu Godbole
  • “Text to Intelligence: Building and Deploying a Text Mining Solution in the Services Industry for Customer Satisfaction Analysis”, IEEE International Conference on Services Computing (SCC) 2008, with Shantanu Godbole
  • “Unsupervised Segmentation of Conversational Transcripts”, SIAM DM 2008, with Krishna Kummamuru, Deepak P, and L Venkata Subramaniam
  • “Adding Sentence Boundaries to Conversational Speech Transcriptions using Noisily Labeled Examples", International Journal of Document Analysis and Recognition(IJDAR), with Hironori Takeuchi, L. Venkata Subramaniam, Diwakar Punjani, and Tetsuya Nasukawa
2007
  • “Analytical Techniques for Noisy and Unstructured Text Data - I”, Encyclopedia of Artificial Intelligence, published by Springer, with L Venkata Subramaniam
  • “Analytical Techniques for Noisy and Unstructured Text Data - II”, Encyclopedia of Artificial Intelligence, published by Springer,, with L Venkata Subramaniam
  • "Economic Freedom and Economic Growth: An Analysis of BRIC Countries"; Business Horizon: A Journal of Commerce and Economics; 2007, with Simrit Kaur (This is my only paper in Economics, in which I don't claim to have expertise :). Thanks Simrit-mam!)
  • "How Much Noise in Text is too Much: A Study in Automatic Document Classification”, ICDM 2007, with Sumeet Agarwal, Shantanu Godbole, and Diwakar Punjani
  • "Automatic Identification of Valuable Segments and Expressions for Mining of Business-Oriented Conversations at Contact Centers", EMNLP 2007, with Hironori Takeuchi, Tetsuya Nasukawa, L V Subramaniam, and Sreeram Balakrishnan
  • “ProACT: A solution for Automatic Customer Satisfaction Analysis and Business Intelligence in Contact Centers”, 16th Annual Frontiers in Service Conference, with Sumeet Agarwal, Shantanu Godbole, Raghu Krishnapuram, and Diwakar Punjani
  • “A Conversation-Mining System for Gathering Insights to Improve Agent Productivity", 9 th IEEE Conference on E-Commerce Technology (CEC' 07) and the 4th IEEE Conference on Enterprise Computing, E-Commerce and E-Services (EEE ' 07), with Hironori Takeuchi, L Venkata Subramaniam, Tetsuya Nasukawa, and Sreeram Balakrishnan
  • “Adding Sentence Boundaries to Conversational Speech Transcriptions using Noisily Labelled Examples", IJCAI-2007 Workshop on Analytics for Noisy Unstructured Text Data, with Tetsuya Nasukawa, Diwakar Punjani, L V Subramaniam, and Hironori Takeuchi
  • “A Middleware for Storage, Federation, Security and Access Control of Policies ", Special Issue of Journal of Autonomic and Trusted Computing on Autonomic and Trusted Computing Systems and Applications, with Anuradha Bhamidipaty, Manish Bhide, Rajeev Gupta, and Mukesh Mohania
  • “Analysis of Agents from Call Transcriptions of a Car Rental Process", LAICS-NLP (Language, Artificial Intelligence and Computer Science for Natural Language Processing applications), with Swati Challa, L Venkata Subramaniam
2006
  • "Automatic Generation of Domain Models for Call-Centers from Noisy Transcriptions; the joint conference of the International Committee on Computational Linguistics and the Association for Computational Linguistics (ACL-COLING), 2006, Sydney, Australia, with L Venkata Subramaniam,
  • "Identity Delegation in Policy Based Systems; with Rajeev Gupta, and Manish Bhide; poster in the 3rd IEEE International Conference on Autonomic Computing, 2006, Dublin, Ireland
  • "OPTICS On Text Data: Experiments and Test Results; with Deepak P; Text Mining Workshop (TM-2006) held in conjunction with the SIAM International Conference on Data Mining (SIAM DM-2006), Maryland, USA
  • "Scaled Entropy and DF-SE: Different and Improved Unsupervised Feature Selection Techniques for Text Clustering; with Deepak P; International Workshop on Feature Selection for Data Mining (FSDM 2006) held in conjunction with the SIAM International Conference on Data Mining (SIAM DM-2006), Maryland, USA
2004
  • "Automatic categorization of web sites based on source types; with Sachindra Joshi, and Raghu Krishnapuram; August 2004; Proceedings of the fifteenth ACM conference on Hypertext and hypermedia HYPERTEXT '04
  • "A hierarchical monothetic document clustering algorithm for summarization and browsing search results; with Krishna Kummamuru, Rohit Lotlikar, Karan Singal, and Raghu Krishnapuram; May 2004 ; Proceedings of the 13th international conference on World Wide Web 
2003
  • "Fast and accurate text classification via multiple linear discriminant projections; Soumen Chakrabarti, Shourya Roy, Mahesh V. Soundalgekar; August 2003; The VLDB Journal - The International Journal on Very Large Data Bases, Volume 12 Issue 2 
2002
  • "Fast and accurate text classification via multiple linear discriminant projections; Soumen Chakrabarti, Shourya Roy, Mahesh Soundalgekar; 28th Intenational Conference on Very Large Databases(VLDB), Hong Kong, August 2002.