Prioritizing Misfit Indicators: an Insight based on Log-Linear Rasch Modeling

There are many misfit indices, but they are not of equal utility. Though all fit indices flag data departures from model specifications, most departures are mere wrinkles, others are pot-holes, but some are crevasses. The analyst must circumvent the crevasses before bothering with the wrinkles. Most theoretical discussions focus on one type of misfit at a time, providing little guidance for the practitioner. Examination of empirical data sets, however, quickly identifies what needs to be investigated first.

TenVergert, Gillespie and Kingma (TGK, 1993) construct Rasch measures from the responses of 1299 subjects to 4 items of the Reiss Premarital Sexual Permissiveness Scale. They use a log- linear method implemented with SPSS. The equivalent logit-linear Rasch model for these dichotomous responses is:

log_e (Pni1/Pni0) = Bn + Ei

where Bn is the ability of person n, and Ei the easiness of item i. In addition to reporting the item calibrations (without standard errors!), TGK provide an evaluation of item fit (see Table 1).

Facets TGK Item Point-Biserial INFIT/OUTFIT Log-linear rpbs Fit Report Fit Report FSL .41 Muted VLI FSC -.16 Noisy OK FSA .25 OK VLI FSE -.06 OK VLI

Table 1. Fit analysis.
Note: VLI = "Violates Local Independence"

TGK's log-linear analysis investigated fit to the Rasch specification of local independence. Local deviations from the model specification of local independence "can be measured by the size of residual covariances. Unfortunately, some computer programs for fitting the Rasch model do not give any information about these. A choice would be to examine the covariance matrix of the item residuals, not the sizes of the residuals themselves, to see if the items are indeed conditionally uncorrelated, as required by the principle of local independence" (McDonald 1985 p. 212). TGK report that three of their four items "violate local independence".

TGK's analysis was repeated using Facets. Facets' INFIT and OUTFIT are concerned with the size and distribution of residuals, not with their independence. Item FSL is reported to have the highest point-biserial correlation, rpbs. Conventional interpretation of rpbs would evaluate this as the best item. Facets detects that responses to this item are deficient in stochasticity and so problematic. TGK detects that this item lacks local independence.

TGK disagree with Facets and rpbs about Item FSC. According to TGK, it is the best item, because it is locally independent. For Facets INFIT and OUTFIT statistics and rpbs, it is the worst. Facets evaluates Item FSC to be the most obviously misfitting, because two males assented to this difficult item, but dissented from the three easier items. TGK's local independence analysis failed to identify the most blatant unmodelled behavior in the data. What TGK detected as independence, Facets identified as noise.

According to Facets and rpbs, Item FSA is acceptable. According to TGK, it is defective. According to Facets, Item FSE is also acceptable. According to TGK and rpbs, it is defective. Analysis of the matrix of standardized residuals identifies as most problematic the large correlation of -0.5 between the standardized residuals for Items FSA and FSE. Other inter-item correlations are much smaller. There is an empirical local dependency between FSA and FSE which is masked in the Facets INFIT and OUTFIT statistics by the generally stochastic pattern of interactions between all items.

These results enable us to prioritize fit indicators:

1) A negative rpbs indicates that success on the item is not associated with higher scores on the test. Unless this is an adaptive test, negative (or very low) rpbs probably contradict our definition of the variable. Often they point to miskeyed items or items with ambiguous or negatively worded stems. But once negative (or very low) rpbs have been investigated, differences in sizes between positive rpbs have little diagnostic power, due to their local dependence on targeting.

2) Misfit detected by OUTFIT and INFIT is caused by aberrant single responses or aberrant response patterns of responses within individual items. These patterns may be due to unpredicted or overly predictable responses. They reflect directly on the measuring power of individual items, and may motivate dropping an item from the analysis (e.g., a flawed item), or side-lining individual responses (e.g., response sets), or splitting the original item into several items according to respondents' response style (e.g., a curriculum-dependent item).

3) Despite the concern, often expressed in the literature, that local independence is the sine qua non of Rasch measurement, it turns out to be a tertiary consideration in practice. Local independence addresses the relationships between items. But these relationships have little practical meaning until there is evidence that the component items appear to be effective measurement devices. Rogue observation patterns to individual items are a more immediate threat to measure validity. Lack of local independence is manifested by large correlations between standardized residuals. Diagnosing the reasons for large correlations, however, requires examination of item content and response structures for pairs of items [using, for instance, principal components analysis PCA of residuals]. These investigations are more arduous than the inspection of aberrations in single items. Remedying defects is also more difficult.

McDonald RP (1985) Factor Analysis and Related Methods. Hillsdale, NJ: Lawrence Erlbaum.

TenVergert E, Gillespie M, & Kingma J (1993) Testing the assumptions and interpreting the results of the Rasch model using log-linear procedures in SPSS. Behavior Research Methods, Instruments, and Computers 25(3) 350-359.

Prioritizing misfit indicators: an Insight based on Log-Linear Rasch Modeling. Linacre JM. … Rasch Measurement Transactions, 1995, 9:2 p.422

Rasch Books and Publications

Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn. George Engelhard, Jr. & Jue Wang Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale

Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes Statistical Analyses for Language Testers (Facets), Rita Green Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland

Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind Rasch Measurement: Applications, Khine Winsteps Tutorials - free
Facets Tutorials - free Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan

Other Rasch-Related Resources: Rasch Measurement YouTube Channel

Rasch Measurement Transactions & Rasch Measurement research papers - free An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse Rasch Measurement Theory Analysis in R, Wind, Hua Applying the Rasch Model in Social Sciences Using R, Lamprianou El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.

Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Rasch Models for Measurement, David Andrich Constructing Measures, Mark Wilson Best Test Design - free, Wright & Stone
Rating Scale Analysis - free, Wright & Masters

Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias Diseño de Mejores Pruebas - free, Spanish Best Test Design A Course in Rasch Measurement Theory, Andrich, Marais Rasch Models in Health, Christensen, Kreiner, Mesba Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen

Rasch Books and Publications
Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn. George Engelhard, Jr. & Jue Wang	Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene	Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver	Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone	Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale
Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes	Statistical Analyses for Language Testers (Facets), Rita Green	Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind	Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M	Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland
Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind	Rasch Measurement: Applications, Khine	Winsteps Tutorials - free Facets Tutorials - free	Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre	Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan
Other Rasch-Related Resources: Rasch Measurement YouTube Channel
Rasch Measurement Transactions & Rasch Measurement research papers - free	An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse	Rasch Measurement Theory Analysis in R, Wind, Hua	Applying the Rasch Model in Social Sciences Using R, Lamprianou	El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar	Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch	Rasch Models for Measurement, David Andrich	Constructing Measures, Mark Wilson	Best Test Design - free, Wright & Stone Rating Scale Analysis - free, Wright & Masters
Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias	Diseño de Mejores Pruebas - free, Spanish Best Test Design	A Course in Rasch Measurement Theory, Andrich, Marais	Rasch Models in Health, Christensen, Kreiner, Mesba	Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen

Forum Rasch Measurement Forum to discuss any Rasch-related topic

Go to Top of Page
Go to index of all Rasch Measurement Transactions
AERA members: Join the Rasch Measurement SIG and receive the printed version of RMT
Some back issues of RMT are available as bound volumes
Subscribe to Journal of Applied Measurement

Go to Institute for Objective Measurement Home Page. The Rasch Measurement SIG (AERA) thanks the Institute for Objective Measurement for inviting the publication of Rasch Measurement Transactions on the Institute's website, www.rasch.org.

Coming Rasch-related Events
Jan. 16 - Feb. 13, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
Apr. 8 - Apr. 11, 2026, Wed.-Sat.	National Council for Measurement in Education - Los Angeles, CA, ncme.org/events/2026-annual-meeting
Apr. 8 - Apr. 12, 2026, Wed.-Sun.	American Educational Research Association - Los Angeles, CA, www.aera.net/AERA2026
May. 15 - June 12, 2026, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 19 - July 25, 2026, Fri.-Sat.	On-line workshop: Rasch Measurement - Further Topics (E. Smith, Winsteps), www.statistics.com

The URL of this page is www.rasch.org/rmt/rmt92b.htm

Website: www.rasch.org/rmt/contents.htm