Number of Person or Item Strata: (4*Separation + 1)/3

Separation or Strata?

If the outliers in your sample of person or items follow a normal distribution, so that they can be considered to be accidental, then use Separation = G.

If the outliers in your sample of person or items are heavy-tailed, so that they can be considered to represent extreme performance levels, then use Strata = H.

Wright and Masters (Rating Scale Analysis, 1982, pp. 92, 106) write: "... if we define statistically distinct levels of item difficulty as difficulty strata with centers three calibration errors apart, then this separation index G can be translated into the number of item strata defined by the test H=..." (emphasis mine) and similarly for persons.

H = (4G + 1)/3

where G = Separation index = "True" standard deviation / Average measurement error

Note: in RUMM2020 documentation, the "Separation Index" is the reliability = G² / (1+G²).
David Andrich (1982) An Index of Person Separation in Latent Trait Theory, the Traditional KR-20 Index, and the Guttman Scale Response Pattern. Education Research and Perspectives, 9:1, 95-104.

The "average measurement error" is the statistical average, i.e., the root-mean-square, of the standard errors of the measures of items of persons. The "True" standard deviation of the item or person measures is obtained from:

"True" standard deviation2 = Observed standard deviation2 - average measurement error2.

What does this mean? The plot shows the relationship between sample distribution and measurement error when H=3, i.e., when 3 strata are observed. This is when G=2, so that the "true" sample standard deviation, S.D., is twice the average measurement error, S.E. We see that the relevant range of the "true" distribution (assumed normal) is two standard deviations away from the mean, encompassing over 95% of the distribution. Compare this plot with the "Separation" plot in RMT 9:4.

3 Statistical Strata


The strata are defined as statistically distinct measures. They are located 3 S.E. apart, because this is conveniently more than 1.96 * sqrt(2) = 2.77 S.E., the distance corresponding to .05 significance. The centers of the extreme strata are also positioned 1 S.E. within the boundaries of the sample distribution. Thus 84% of even the most extreme strata lie within the 4 S.D. range.

G itself is a more conservative "Separation Index" than H. For instance, suppose that the "true" standard deviation of a sample is the same as the average measurement error. Then G=1, and the test reliability is 0.5, warning us that we don't know whether observed differences within the sample are real differences or merely measurement error. H is (4+1)/3, i.e., roughly 2. This indicates that the opposite ends of the "true" distribution are measurably different, implying that, if the observed measures are sufficiently far apart, they probably reflect real differences.

Number of Person or Item Strata: (4*Separation + 1)/3. Wright BD, Masters GN. … 16:3 p.888

Notes:

  1. In practice, for G < 3.5, H = G+1. For G >= 3.5, H = G+2.
  2. Ben Wright preferred to use G as the count of strata, rather than H, in classroom teaching.
  3. For G and H, the levels are 3 S.E.s apart, centered on the sample mean.
  4. For H, the levels in the true distribution are 12*"True S.D."/(3H - 1) apart, centered on the sample mean.
    For H, the levels in the observed distribution are 12*"Observed S.D."/(3H - 1) apart, centered on the sample mean.
  5. For G, the levels in the true distribution are 3*"True S.D."/G apart, centered on the sample mean.
    For G, the levels in the observed distribution are 3*"Observed S.D."/G apart, centered on the sample mean.


Number of Person or Item Strata. Wright BD, Masters GN. … Rasch Measurement Transactions, 2002, 16:3 p.888


The URL of this page is www.rasch.org/rmt/rmt163f.htm

Website: www.rasch.org/rmt/contents.htm