Keynote Speakers

Truth Finding on the Deep Web

Xin (Luna) Dong

Google, Mountain View

The Web has changed our lives enormously, and people rely more and more on it to fulfil their information needs. Compared with traditional media, information on the Web can be published quickly, but with fewer guarantees of quality and credibility. Indeed, Web sources vary in quality, and sometimes provide conflicting, out-of-date, or incomplete data. Sources can also easily copy, reformat, and modify data from other sources, propagating erroneous data.

In this talk we present a recent study of the truthfulness of Deep Web data in two domains where we believe data quality matters to people's lives: Stock and Flight. We then describe how conflicts between sources can be resolved by leveraging the accuracy of the sources and the copying relationships between them. We demo our SOLOMON system, which can effectively detect copying between data sources, leverage the results in truth discovery, and provide a user-friendly interface that helps users understand the results.
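To make the idea of accuracy-based conflict resolution concrete, here is a minimal sketch of accuracy-weighted voting. It is an illustration only, not the SOLOMON system's method: the function name `discover_truth`, the log-odds vote weight, the starting accuracy of 0.8, and the sample stock values are all assumptions, and the sketch ignores copying relationships entirely.

```python
import math
from collections import defaultdict


def discover_truth(claims, rounds=10, initial_accuracy=0.8):
    """Iteratively estimate true values and source accuracies.

    claims maps source -> {item: claimed value}. All names and
    parameters here are hypothetical, chosen for illustration.
    """
    accuracy = {source: initial_accuracy for source in claims}
    truth = {}
    for _ in range(rounds):
        # Step 1: each source votes for its claimed values, weighted
        # by the log-odds of its current estimated accuracy.
        votes = defaultdict(lambda: defaultdict(float))
        for source, items in claims.items():
            a = min(max(accuracy[source], 0.01), 0.99)  # avoid log(0)
            weight = math.log(a / (1 - a))
            for item, value in items.items():
                votes[item][value] += weight
        truth = {item: max(vals, key=vals.get) for item, vals in votes.items()}

        # Step 2: re-estimate each source's accuracy as the fraction
        # of its claims that agree with the current truth estimate.
        for source, items in claims.items():
            if items:
                agree = sum(1 for item, v in items.items() if truth[item] == v)
                accuracy[source] = agree / len(items)
    return truth, accuracy


# Made-up example data: three sources reporting two stock prices.
claims = {
    "A": {"AAPL": 100, "GOOG": 700},
    "B": {"AAPL": 100, "GOOG": 700},
    "C": {"AAPL": 101, "GOOG": 700},
}
truth, accuracy = discover_truth(claims)
print(truth)
print(accuracy)
```

In this toy data, sources A and B agree on both items while C disagrees on one, so C ends up with a lower estimated accuracy and less voting weight. Accounting for copying would additionally require discounting votes from sources that appear to replicate one another, which this sketch does not attempt.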

Making Effective Recommendations based on Multi-Source Data

Xue Li

DKE Division, School of Information Technology and Electrical Engineering

The University of Queensland

http://itee.uq.edu.au/~xueli/

The recent, striking story of Mr Nate Silver's successful prediction of the 2012 US Presidential Election with 100% accuracy has shown that the challenge is not just to invent new algorithms for large, noisy, and uncertain data, but also to link multiple data sources, structured or unstructured, in order to make effective recommendations. Information is now available everywhere: from the Web, sensor networks, social networks, and proprietary databases. Consequently, making effective and efficient recommendations is becoming a significant and urgent challenge because of the complex, fast-changing relationships between data objects. The question, therefore, is: how can we make effective recommendations based on the relevant information collected from multiple data sources in today's changing and interconnected world? This talk will introduce current research activities in the data mining research group led by Dr Xue Li and provide insight into the open issues in this research.