The goal of this project is to develop a quantitative data analysis framework to facilitate effective visualization of large-scale scientific data sets. By considering the process of visualization as a communication channel, we can quantitatively model the information flow between the data input and the visualization output. With information theory as the theoretical foundation, we are developing a framework to evaluate and optimize the quality of visualization based on the information content of the input data, the visualization output, and the discrepancy between the two. The framework can systematically guide the visual analysis process by iteratively optimizing the visualization result so that the information gap between the two ends of the visual analysis pipeline be quickly narrowed.
The project is supported in part by National Science Foundation [NSF project page] and Department of Energy