[Statistics] Comparison of Three Correlation Coefficient: Pearson, Kendall, Spearman

2020 年 3 月 4 日
笔记

There are three popular metrics to measure the correlation between two random variables: Pearson’s correlation coefficient, Kendall’s tau and Spearman’s rank correlation coefficient. In this article, I will make a detailed comparison among the three measures and discuss how to choose among them.

Definition

Pearson Correlation

Pearson’s correlation coefficient is the covariance of the two variables divided by the product of their standard deviations.

The formula for $rho$

the formula for $rho$

Kendall’s Tau

Let (x₁, y₁), (x₂, y₂), …, (x_n, y_n) be a set of observations of the joint random variables X and Y respectively, such that all the values of ( $x_{i}$

The Kendall τ coefficient is defined as:

Consequently,

Spearman’s Rank Correlation Coefficient

The Spearman correlation coefficient is defined as the Pearson correlation coefficient between the rank variables.

For a sample of size n, the n raw scores $X_{i},Y_{i}$

${displaystyle r_{s}=rho _{operatorname {rg} _{X},operatorname {rg} _{Y}}={frac {operatorname {cov} (operatorname {rg} _{X},operatorname {rg} _{Y})}{sigma _{operatorname {rg} _{X}}sigma _{operatorname {rg} _{Y}}}},}$

To compute Spearman’s correlation, we have to compute the rank of each value, which is its index in the sorted sample. Then we compute Pearson’s correlation for the ranks.

[Statistics] Comparison of Three Correlation Coefficient: Pearson, Kendall, Spearman

Definition

Pearson Correlation

Kendall’s Tau

Spearman’s Rank Correlation Coefficient

VirMach 便宜 VPS

QNews

[Statistics] Comparison of Three Correlation Coefficient: Pearson, Kendall, Spearman

Definition

Pearson Correlation

Kendall’s Tau

Spearman’s Rank Correlation Coefficient

分享此文：

Related Posts

如何通过行为设计实现持续改变

数据库 MySQL 练习

Nginx之常用基本配置（三）

[Statistics] Comparison of Three Correlation Coefficients: Pearson, Kendall, Spearman

VirMach 便宜 VPS

QNews

热门文章

热门搜寻