Modeling the Heterogeneity of Heterogeneity: 

Algorithms, Theories and Applications

Summary

Multiple types of heterogeneity naturally co-exist in a variety of high-impact real applications, such as manufacturing processes, abnormal user detection, etc. State-of-the-art data mining techniques are rich in handling a single type of heterogeneity, which are designed for a specific type of heterogeneity, and often inadequate in addressing the co-existence of multiple types of heterogeneity, referred to as Heterogeneity of Heterogeneity (HoH). To be specific, the following two questions largely remain unanswered: (Q1) how to jointly model multiple types of heterogeneity; (Q2) how to theoretically understand the model generalization performance in the context of HoH? The overall goal of this project is to model and understand the Heterogeneity of Heterogeneity in data mining, by (1) creating a suite of effective and efficient algorithms for modeling HoH, (2) characterizing the model generalization performance in the context of HoH , and (3) evaluate the proposed techniques on various real applications, such as abnormal user detection and quality control in manufacturing processes.

Publications

Tutorials