Latest News and Events

The SAMSI-FODAVA Workshop on Interactive Visualization and Analysis of Massive Data will be held on December 10-12, 2012.
Posted: October 02, 2012
The FODAVA Annual Meeting will immediately follow (Dec 12-13) the SAMSI/FODAVA joint workshop at the same location.
Posted: September 05, 2012
Many of the modern data sets such as text and image data can be represented in high-dimensional vector spaces and have benefited from computational methods that utilize advanced techniques from num
Posted: June 30, 2012

Parameterizing high-dimensional data sets with kernel map manifolds

Ross Whitaker

Many important data analysis problems come in in the form of a set of data points each of which contains a large number of measurements, which can be considered scattered data in a very high dimensional space. Visualizing and analyzing such data is challenging, because the dimensionality of the ambient space makes visualization and statistical analysis quite difficult. However, often such data sets do not fill the ambient space, but rather lie close to some lower dimensional manifold. If the manifold is linear, then principal component analysis and other linear models can extract the best fitting models. However, the nonlinear case demands a more sophisticated set of tools for learning the underlying structure of high-dimensional data. This talk examines the problem of manifold learning from a machine learning point of view and describes new tools that make the connection between manifold learning and the statistical generalization of PCA, called principal surfaces. We also present results on examples of visualization and analysis of high-dimensional data from graphics, perception, and medicine.