Correlation coefficient

Normalized covariance giving a scale-free measure of linear association between two random variables
Correlation coefficient

A correlation coefficient is the normalized covariance ρ(X,Y)=Cov(X,Y)Var(X)Var(Y)\rho(X,Y)=\frac{\operatorname{Cov}(X,Y)}{\sqrt{\operatorname{Var}(X)\operatorname{Var}(Y)}} for XX and YY with 0<Var(X),Var(Y)<0<\operatorname{Var}(X),\operatorname{Var}(Y)<\infty.

It is a dimensionless rescaling of and satisfies 1ρ(X,Y)1-1\le \rho(X,Y)\le 1, with the sign indicating the direction of linear association. Correlation ρ(X,Y)=0\rho(X,Y)=0 means XX and YY are uncorrelated, which is implied by but is generally weaker.

Examples:

  • If Y=aX+bY=aX+b with a0a\neq 0 and Var(X)>0\operatorname{Var}(X)>0, then ρ(X,Y)=1\rho(X,Y)=1 when a>0a>0 and ρ(X,Y)=1\rho(X,Y)=-1 when a<0a<0.
  • If XN(0,1)X\sim N(0,1) and ZN(0,1)Z\sim N(0,1) are independent and Y=X+ZY=X+Z, then ρ(X,Y)=12\rho(X,Y)=\frac{1}{\sqrt{2}}.
  • If XX and YY are independent with finite, nonzero variances, then ρ(X,Y)=0\rho(X,Y)=0.