RV coefficient

In statistics, the RV coefficient[1] is a multivariate generalization of the squared Pearson correlation coefficient (because the RV coefficient takes values between 0 and 1).[2] It measures the closeness of two set of points that may each be represented in a matrix.

The major approaches within statistical multivariate data analysis can all be brought into a common framework in which the RV coefficient is maximised subject to relevant constraints. Specifically, these statistical methodologies include:[1]

One application of the RV coefficient is in functional neuroimaging where it can measure the similarity between two subjects' series of brain scans[3] or between different scans of a same subject.[4]

Definitions

The definition of the RV-coefficient makes use of ideas[5] concerning the definition of scalar-valued quantities which are called the "variance" and "covariance" of vector-valued random variables. Note that standard usage is to have matrices for the variances and covariances of vector random variables. Given these innovative definitions, the RV-coefficient is then just the correlation coefficient defined in the usual way.

Suppose that X and Y are matrices of centered random vectors (column vectors) with covariance matrix given by

then the scalar-valued covariance (denoted by COVV) is defined by[5]

The scalar-valued variance is defined correspondingly:

With these definitions, the variance and covariance have certain additive properties in relation to the formation of new vector quantities by extending an existing vector with the elements of another.[5]

Then the RV-coefficient is defined by[5]

gollark: `parallel` can do that quite simply.
gollark: That does seem... relevant, but also complex to implement and probably overkill for just detecting line vs parabola.
gollark: You can *approximate* just checking if it's going in a straight line...
gollark: Actually, if you want to detect just whether it's going straight up or down as opposed to other directions that's easier.
gollark: So what you want is to figure out if three points are (roughly) on a line?

See also

References

  1. Robert, P.; Escoufier, Y. (1976). "A Unifying Tool for Linear Multivariate Statistical Methods: The RV-Coefficient". Applied Statistics. 25 (3): 257–265. doi:10.2307/2347233. JSTOR 2347233.
  2. Abdi, Hervé (2007). Salkind, Neil J (ed.). RV coefficient and congruence coefficient. Thousand Oaks. ISBN 978-1-4129-1611-0.
  3. Ferath Kherif; Jean-Baptiste Poline; Sébastien Mériaux; Habib Banali; Guillaume Plandin; Matthew Brett (2003). "Group analysis in functional neuroimaging: selecting subjects using similarity measures" (PDF). NeuroImage. 20 (4): 2197–2208. doi:10.1016/j.neuroimage.2003.08.018. PMID 14683722.
  4. Herve Abdi; Joseph P. Dunlop; Lynne J. Williams (2009). "How to compute reliability estimates and display confidence and tolerance intervals for pattern classiffers using the Bootstrap and 3-way multidimensional scaling (DISTATIS)". NeuroImage. 45 (1): 89–95. doi:10.1016/j.neuroimage.2008.11.008. PMID 19084072.
  5. Escoufier, Y. (1973). "Le Traitement des Variables Vectorielles". Biometrics. International Biometric Society. 29 (4): 751–760. doi:10.2307/2529140. JSTOR 2529140.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.