Program

in conjunction with CIKM 2010

The workshop starts at 9 am sharp. See you at AND!

For the detailed program see the pdf attachment below.


Keynote Talk:

Randy Goebel. What is the nature of noise in linguistic corpora?


Panel Discussion:

Seamus Ross, Yuji Matsumoto, Gareth Jones. Why is it Impossible to Handle Noisy Text With Existing Techniques: The Way Forward



Accepted Papers:

Akhil Langer, Rohit Banga, Ankush Mittal and Venkata Subramaniam. Variant Search and Syntactic Tree Similarity Approach to Retrieve Matching Questions for SMS queries

Thomas Packer, Joshua Lutes, Aaron Stewart, David Embley, Eric Ringger, Kevin Seppi and Lee Jensen. Extracting Person Names from Diverse and Noisy OCR Text

Matt Michelson and Sofus Macskassy. Discovering Users’ Topics of Interest on Twitter: A First Look

Hervé Déjean and Jean-Luc Meunier. Document:  a useful level for facing noisy data

Yoko Futagi. The effects of learner errors on the development of a collocation detection tool

Gustavo Laboreiro, Luís Sarmento, Jorge Teixeira and Eugénio Oliveira. Tokenizing Micro-Blogging Messages using a Text Classification Approach

Julien Fayolle, Fabienne Moreau, Christian Raymond and Guillaume Gravier. Reshaping automatic speech transcripts for robust high-level spoken document analysis

Bart Lamiroy and Daniel Lopresti. A Platform for Storing, Visualizing, and Interpreting Collections of Noisy Documents

Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Shouko Masuda, Hayato Goto, Megumi Ohki, Chitose Sao, Suguru Matsuyoshi, Kentaro Inui and Yuji Matsumoto. Statement Map: Reducing Web Information Credibility Noise through Opinion Classification

Pooyan Asgari, Jon Patrick and Negin Motamedi. Improving Accuracy of Identifying Clinical Concepts in Noisy un-structured clinical corpus using existing internal redundancy

Rohit Babbar and Nidhi Singh. Clustering Based Approach to Learning Regular Expressions over Large Alphabet for Noisy Unstructured Text



Download PDF version here