Visual Computing

University of Konstanz
Computer Graphics Forum

Selecting Good Views of High-dimensional Data using Class Consistency

M. Sips, B. Neubert, J. Lewis, P. Hanrahan

Abstract

Many visualization techniques involve mapping high-dimensional data spaces to lower-dimensional views. Unfortunately, mapping a high-dimensional data space into a scatterplot involves a loss of information or, even worse, can give a misleading picture of valuable structure in higher dimensions. In this paper, we propose class consistency as a measure of the quality of the mapping. Class consistency enforces the constraint that classes of n–D data are shown clearly in 2–D scatterplots. We propose two quantitative measures of class consistency, one based on the distance to the class’s center of gravity, and another based on the entropies of the spatial distributions of classes. We performed an experiment in which users chose good views, and show that class consistency has good precision and recall. We also evaluate both consistency measures over a range of data sets and show that these measures are efficient and robust.
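The two measures named in the abstract can be illustrated with a rough sketch. This is one plausible reading of the abstract, not the authors' implementation: the function names, the uniform grid used to estimate spatial distributions, the `bins` parameter, and the normalization by the logarithm of the class count are all assumptions made for illustration. Both functions take 2-D points of one scatterplot view plus a class label per point, and return a score where higher values suggest that classes are shown more clearly.

```python
import math
from collections import Counter, defaultdict


def distance_consistency(points, labels):
    """Fraction of points whose nearest class centroid belongs to their own class.

    Assumed reading of the 'distance to the class's center of gravity' measure.
    Ties between equally distant centroids go to the class seen first.
    """
    counts = Counter(labels)
    sums = defaultdict(lambda: [0.0, 0.0])
    for (x, y), c in zip(points, labels):
        sums[c][0] += x
        sums[c][1] += y
    centroids = {c: (s[0] / counts[c], s[1] / counts[c]) for c, s in sums.items()}

    hits = 0
    for (x, y), c in zip(points, labels):
        nearest = min(
            centroids,
            key=lambda k: (x - centroids[k][0]) ** 2 + (y - centroids[k][1]) ** 2,
        )
        hits += nearest == c
    return hits / len(points)


def distribution_consistency(points, labels, bins=8):
    """One minus the point-weighted, normalized entropy of the class mix per grid cell.

    Assumed reading of the entropy-based measure; the uniform grid and the
    `bins` resolution are hypothetical choices.
    """
    n_classes = len(set(labels))
    if n_classes < 2:
        return 1.0  # a single class cannot be mixed with another

    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    x_lo, x_hi, y_lo, y_hi = min(xs), max(xs), min(ys), max(ys)

    def cell(v, lo, hi):
        # Map a coordinate to a bin index, clamping the maximum into the last bin.
        if hi == lo:
            return 0
        return min(int((v - lo) / (hi - lo) * bins), bins - 1)

    cells = defaultdict(Counter)
    for (x, y), c in zip(points, labels):
        cells[(cell(x, x_lo, x_hi), cell(y, y_lo, y_hi))][c] += 1

    total = len(points)
    weighted_entropy = 0.0
    for counter in cells.values():
        n = sum(counter.values())
        entropy = -sum((k / n) * math.log2(k / n) for k in counter.values())
        weighted_entropy += (n / total) * entropy
    return 1.0 - weighted_entropy / math.log2(n_classes)


# Two well-separated classes in a single 2-D view (made-up data).
view = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
classes = ["a", "a", "a", "b", "b", "b"]
print(distance_consistency(view, classes), distribution_consistency(view, classes))
```

To rank candidate views under this sketch, one would compute either score for each 2-D projection of the same labeled data and prefer the projections with the highest values.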

BibTeX

@article{Sips2009SelectingGoodViews,
  author     = {M. Sips and B. Neubert and J. Lewis and P. Hanrahan},
  doi        = {10.1111/j.1467-8659.2009.01467.x},
  issn       = {1467-8659},
  journal    = {Computer Graphics Forum},
  publisher  = {The Eurographics Association and Blackwell Publishing Ltd.},
  title      = {Selecting Good Views of High-dimensional Data using Class Consistency},
  year       = {2009},
}

Supplemental Material

Paper (.pdf, 1.6 MB)