The Expected Value of a Point-Biserial (or Similar) Correlation

The Expected Value of a Point-Biserial (or Similar) Correlation

Interpreting the observed value of a point-biserial correlation is made easier if we can compare the observed value with its expected value. Is the observed value much higher than the expected value (indicating dependency in the data) or much lower than expected (indicating unmodeled noise)? With knowledge of how the observed value compares with its expected value, there is no need for arbitrary rules such as "Delete items with point-biserials less than 0.2."

The general formula for a Pearson correlation coefficient is:


Point-Biserial Correlation (including all observations in the correlated raw score)

Suppose that Xn is Xni the observation of person n on item i. Yn is Rn, the raw score of person n, then the point-biserial correlation is:


where X. is the mean of the {Xni} for item i, and R. is the mean of the Rn.

According to the Rasch model, the expected value of Xni is Eni and the model variance of Xni around its expectation is Wni. The model variances of X.i, Rn, R. are ignored here. S(Eni) = S(Xni), so that E.i = X.i.

Thus an estimate of the expected value of the point-measure correlation is given by the Rasch model proposition that: Xni = Eni±√Wni


Since ±√Wni is a random residual, its cross-product with any other variable is modeled to be zero. Thus


which provides a convenient formula for computing the expected value of the point-biserial correlation.

Point-Biserial Correlation (excluding current observation from the correlated raw score)


where R.'is the mean of the Rn-Xni.



is the expected value of the point-biserial correlation excluding the current observation.

Point-Measure Correlation

Similarly, suppose that Yn is Bn, the ability measure of person n, then the point-measure correlation is:


where B. is the mean of the Bn.

Thus an estimate of the expected value of the point-measure correlation is:

Similarly, suppose that Yn is Bn, the ability measure of person n, then the point-measure correlation is:


which provides a convenient formula for computing the expected value of a point-measure correlation.

John Michael Linacre

Here is a worked example for a point-measure correlation:


Later note: Experience suggests that Wni*(N-2)/N is a better term in the divisors than Wni, so that the expected correlation for 2 observations becomes its observed value of ±1.0


The Expected Value of a Point-Biserial (or Similar) Correlation. Linacre J.M. … Rasch Measurement Transactions, 2008, 22:1 p. 1154


The URL of this page is www.rasch.org/rmt/rmt221e.htm

Website: www.rasch.org/rmt/contents.htm