Good Measures from Bad Data

In many assessments, there are examinees who misbehave, and items that are poorly constructed. Nevertheless, everyone must be measured, and every item must be included except those that are obviously, blatantly faulty.

Blatantly faulty items are those that we can show to a content expert (who knows nothing about statistics) and say: "Do you see this ... (typographical error, ambiguity, scoring problem, irrelevant content, ... ). This item is obviously wrong or off-topic!"

Items with conspicuous DIF are more awkward to handle, and depend on the policy of the testing agency. It is easiest to treat them as blatantly faulty and omit them, but they can be split into separate items for separate DIF groups.

But what about random guessing, doubtful items and other problematic data? A three-stage estimation process provides a solution:

i) Analyze all the data. Identify problems.

ii) Reanalyze all the data, but with items and persons with misfit problems deleted and obviously errant or off-target responses omitted. This is the "good" dataset. Save the estimates of the item difficulties and Rasch-Andrich thresholds (for polytomies).

iii) Analyze all the data. Delete only obviously, blatantly faulty items. Anchor (fix) the "good" items at their "good" difficulties, and the polytomies at their "good" thresholds. Output the final set of person measures and item difficulties.

The measure for each person is now estimated in the frame-of-reference of the "good" data with the minimum of distortion of that measure by irrelevant (to that person) "bad" data.

Timed Tests

If we have a timed test, and score all incorrect answers and all item-not-reached answers as "0", then the final items have few correct answers, "1", even if the very last item is the conceptually easiest item on the test.

To get around this problem we do the three-stage analysis. In the second stage, we use only data from examinees who have definitely reached an item (right or wrong). All unreached responses are coded "not administered" (e.g., M for missing) and excluded from the analysis. This analysis gives us the best estimates of the difficulties of the items. We save these "good" item difficulties.

In the third stage, we score all the data 0-1, but use the "good" item difficulties, so that the measures of students who responded to most of the items are not distorted by the performances of students who responded to fewer items.

John M. Linacre



Good Measures from Bad Data, J.M. Linacre ... Rasch Measurement Transactions, 2011, 24:4, 1313




Rasch Publications
Rasch Measurement Transactions (free, online) Rasch Measurement research papers (free, online) Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Applying the Rasch Model 3rd. Ed., Bond & Fox Best Test Design, Wright & Stone
Rating Scale Analysis, Wright & Masters Introduction to Rasch Measurement, E. Smith & R. Smith Introduction to Many-Facet Rasch Measurement, Thomas Eckes Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, George Engelhard, Jr. Statistical Analyses for Language Testers, Rita Green
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Journal of Applied Measurement Rasch models for measurement, David Andrich Constructing Measures, Mark Wilson Rasch Analysis in the Human Sciences, Boone, Stave, Yale
in Spanish: Análisis de Rasch para todos, Agustín Tristán Mediciones, Posicionamientos y Diagnósticos Competitivos, Juan Ramón Oreja Rodríguez

To be emailed about new material on www.rasch.org
please enter your email address here:

I want to Subscribe: & click below
I want to Unsubscribe: & click below

Please set your SPAM filter to accept emails from Rasch.org

www.rasch.org welcomes your comments:

Your email address (if you want us to reply):

 

ForumRasch Measurement Forum to discuss any Rasch-related topic

Go to Top of Page
Go to index of all Rasch Measurement Transactions
AERA members: Join the Rasch Measurement SIG and receive the printed version of RMT
Some back issues of RMT are available as bound volumes
Subscribe to Journal of Applied Measurement

Go to Institute for Objective Measurement Home Page. The Rasch Measurement SIG (AERA) thanks the Institute for Objective Measurement for inviting the publication of Rasch Measurement Transactions on the Institute's website, www.rasch.org.

Coming Rasch-related Events
June 29 - July 27, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Further Topics (E. Smith, Winsteps), www.statistics.com
July 25 - July 27, 2018, Wed.-Fri. Pacific-Rim Objective Measurement Symposium (PROMS), (Preconference workshops July 23-24, 2018) Fudan University, Shanghai, China "Applying Rasch Measurement in Language Assessment and across the Human Sciences", www.promsociety.org
July 29 - August 4, 2018 Vth International Summer School `Applied Psychometrics in Psychology and Education`, Institute of Education at the Higher School of Economics, St. Petersburg, Russia, https://ioe.hse.ru/en/announcements/215681182.html
July 30 - Nov., 2018Online Introduction to Classical and Rasch Measurement Theories (D.Andrich), University of Western Australia, Perth, Australia, http://www.education.uwa.edu.au/ppl/courses
Aug. 10 - Sept. 7, 2018, Fri.-Fri. On-line workshop: Many-Facet Rasch Measurement (E. Smith, Facets), www.statistics.com
August 25 - 28, 2018, Sat.-Tue.Análisis de Rasch introductorio (en español). (Agustín Tristán), Instituto de Evaluación e Ingeniería Avanzada. San Luis Potosí, México. www.ieia.com.mx
Sept. 3 - 6, 2018, Mon.-Thurs. IMEKO World Congress, Belfast, Northern Ireland, www.imeko2018.org
Oct. 12 - Nov. 9, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

 

The URL of this page is www.rasch.org/rmt/rmt244m.htm

Website: www.rasch.org/rmt/contents.htm