Programme

AND 2008 

Home

Call for Papers

Proceedings

Programme

Important Dates

People

Submission

Registration

Contact

AND 2007 (the first workshop)





Supported by Information Retrieval Facility







Endorsed by

International Association for Pattern Recognition

Session I0830-1000
0830Keynote Address by Donna Harman. Some thoughts on failure analysis for noisy data
0905Invited Lecture by John Tait. Noise and information
0945Boasters of Poster Papers

Tea Break

1000-1030

Session II: Errors and Effects

1030-1300
1030Laurianne Sitbon and Patrice Bellot. How to cope with questions typed by dyslexic users
1050Daniel Lopresti. Optical character recognition errors and their effects on natural language processing
1110Ulrich Reffle, Annette Gotscharek, Christoph Ringlstetter and Klaus Schulz. Successfully detecting and correcting false friends using channel profiles
1130Rachit Arora and Balaraman Ravindran. Latent Dirichlet allocation based multi-document summarization
1150Amaresh Pandey and Tanveer Siddiqui. An Unsupervised Hindi stemmer with heuristic improvements
1210Discussion

Lunch

1230-1330

Session III: Named Entities and Blogs

1330-1530
1330Valentin Jijkoun, Mahboob Khalid, Maarten Marx and Maarten de Rijke. Named entity normalization in user generated content
1350Rema Ananthanarayanan, Vijil Chenthamarakshan, Prasad Deshpande and Raghu Krishnapuram. Rule based synonyms for entity extraction from noisy text
1410Jiyin He, Wouter Weerkamp, Martha Larson and Maarten de Rijke. Blogger, Stick to your story: Modeling topical noise in blogs with coherence measures
1430Robert McArthur. Uncovering deep user context from blogs
1450Discussion

Session IV: Posters Over Tea

1530-1600

*Jinfeng Zhuang, Steven C.H. Hoi and Aixin Sun. On profiling blogs with representative entries
*Soumya Datta and Sudeshna Sarkar. Comparative study of statistical features of language in blog-vs-splog
*Anurag Bhardwaj, Faisal Farooq, Huaigu Cao and Venu Govindaraju. Topic based language models for OCR correction
*Eiman Alshammari and Jessica Lin. A novel Arabic lemmatization algorithm

Session V: Noisy Environments

1600-1800
1600Sreangsu Acharyya, Sumit Negi, L Venkata Subramaniam and Shourya Roy. Unsupervised learning of multilingual short message service (SMS) dialect from noisy examples
1620Antti Järvelin, Tuomas Talvensaari and Anni Järvelin. Data driven methods for improving mono- and cross-lingual IR performance in noisy environments
1640Lipika Dey and Mirajul Haque. Opinion mining from noisy text data
1700Discussion