Effective Dimension Reduction with Prior Knowledge

Haesun Park

In this talk, I will give a brief overview of the research that we propose on dimension reduction and data reduction for effective and efficient data and visual analytics. I will then discuss in some detail effective dimension reduction that utilizes prior knowledge, such as cluster structure or nonnegativity in the data. Dimension reduction is imperative for efficient representation of high-dimensional data. The optimization criteria and the role of matrix decompositions are examined for many commonly used dimension reduction methods, such as Linear Discriminant Analysis (LDA), Principal Component Analysis (PCA), Latent Semantic Indexing (LSI), and Nonnegative Matrix Factorization (NMF). In particular, we discuss how the generalized LDA based on the generalized singular value decomposition (LDA/GSVD), which is applicable even when the data set is extremely high dimensional and undersampled, can be utilized in visualization of clustered data. We also propose some new directions for improving its efficiency and effectiveness. Some experimental results are presented in text classification, facial recognition, and fingerprint classification, demonstrating the effectiveness of the proposed directions.
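
To make the undersampled case concrete, the following is a minimal NumPy sketch of an LDA/GSVD-style reduction to two dimensions. It assumes the commonly described formulation in which the scatter matrices are never formed or inverted and two SVDs are applied to their factors instead. The function name `lda_gsvd`, the synthetic data, and all parameter values are illustrative assumptions, not the speaker's implementation.

```python
import numpy as np

def lda_gsvd(A, labels):
    """Sketch of an LDA/GSVD-style transform (hypothetical helper).

    A holds one data point per column (m features x n points).
    Returns G (m x (k-1)); the reduced data is G.T @ A.
    """
    classes = np.unique(labels)
    k = len(classes)
    c = A.mean(axis=1, keepdims=True)            # global centroid

    # Factors of the scatter matrices: Sb = Hb @ Hb.T and Sw = Hw @ Hw.T.
    Hb_cols, Hw_blocks = [], []
    for cls in classes:
        Ai = A[:, labels == cls]
        ci = Ai.mean(axis=1, keepdims=True)      # class centroid
        Hb_cols.append(np.sqrt(Ai.shape[1]) * (ci - c))
        Hw_blocks.append(Ai - ci)
    Hb = np.hstack(Hb_cols)                      # m x k
    Hw = np.hstack(Hw_blocks)                    # m x n

    # First SVD: stacked factor matrix, truncated to its numerical rank t.
    K = np.vstack([Hb.T, Hw.T])                  # (k + n) x m
    P, s, Qt = np.linalg.svd(K, full_matrices=False)
    t = int(np.sum(s > s[0] * max(K.shape) * np.finfo(float).eps))

    # Second SVD: the first k rows of P (the part that belongs to Hb).
    _, _, Wt = np.linalg.svd(P[:k, :t], full_matrices=False)

    # Combine both decompositions; the leading k-1 columns are the most
    # discriminative directions.
    X = Qt[:t].T @ (Wt.T / s[:t, None])
    return X[:, :k - 1]

# Synthetic undersampled data: 50 features, only 30 points, 3 clusters.
rng = np.random.default_rng(0)
m, per_class, k = 50, 10, 3
labels = np.repeat(np.arange(k), per_class)
means = 3.0 * rng.standard_normal((m, k))
A = means[:, labels] + rng.standard_normal((m, k * per_class))

G = lda_gsvd(A, labels)
Y = G.T @ A                                      # 2 x 30 coordinates for plotting
```

Here the within-class scatter matrix is singular because there are fewer points than features, so classical LDA, which inverts it, cannot be applied directly. In this sketch the training points of each cluster should collapse to nearly a single location in the two-dimensional plot. That gives a clean picture of the cluster structure, but it also suggests the plot may overstate how well unseen data would separate.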