Seminars

An Eigenvector Variability Plot

214
reads

I-Ping Tu

2008-04-29
13:30:00 - 15:00:00

An Eigenvector Variability Plot

405 , Mathematics Research Center Building (ori. New Math. Bldg.)

Principal components analysis is perhaps the most widely used method for ex-ploring multivariate data. In this talk, we present a variability plot composed of measures on the stability of each eigenvectors over samples as a data exploration tool. We also show that this variability measure gives a good measure on the intersample variability of eigenvectors through asymptotic analysis. For distinct eigenvalues, the asymptotic behavior for this variability measure is comparable to the size of the asymptotic covariance of the eigenvector in Anderson (1963). A simulation for functional data analysis with dimension p greater than sample size n is provided. The proposed variability plot is successful to distinguish the signal components, noise components and 0 eigenvalue components. Applying this method on a gene expression data set for a gastric cancer study, many hills on the proposed variability plot are observed. When the intersample variability of eigenvectors is considered, the cuto point on informative eigenvectors should not be on the top of the hill as suggested by the proposed variability plot.