Thorndike's Scaling vs. Wood's Scoring

"Historical understanding very often depends upon our ability to recognize that, in the course of time, problems undergo subtle, and sometimes profound, changes in both formulation and substance." (Laudan, 1977, p. 241)

Ben Wood, Thorndike's student from 1919 to 1923, prepared a careful exegesis of Thorndike's essentials of a valid scale: objectivity, defined zero and unit, definition of function to be measured, consistency, and comparability. Wood claims that "practically all the material ... is taken from Thorndike's well-known treatise, or directly inferred from some of its propositions" (Wood, 1923, p. 141, referring to Thorndike, 1913, pp. 11-18). In spite of this, Wood adapts the meaning these measurement essentials carry in Thorndike's "scaling" tradition to fit his own "test score" tradition. (Wood had previously been a student of Truman Lee Kelley, a leading proponent of test reliability.)

Both Thorndike and Wood consider objectivity the cornerstone of measurement. Thorndike views objectivity in a broad sense that includes reliability and validity. In Thorndike's scaling tradition, objectivity is sought in the development of calibrated scales and variable maps that competent thinkers would agree upon as defining the latent variable. Wood's test score tradition leads him to focus instead on how tests can be scored reliably.

The development of scales with a defined zero and unit is important to both Thorndike and Wood. Wood's view reflects a test score tradition with a strong emphasis on test score reliability: "the essential quality of a good point of origin and of a good unit of measure is stability" (Wood p. 149). He recommends reporting test scores on a z-score scale, with the sample mean defining the zero point and the sample standard deviation defining the unit of measurement. Thorndike recognizes that the zero point on the scale is arbitrary, and seeks to construct scales with units defined by a set of calibrated items. This view reflects a scaling tradition with close connections to Rasch measurement. Although Thorndike does not calibrate individuals and items onto an underlying latent variable scale simultaneously, he does attempt to develop calibrated scales that can be used to measure individuals.
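A minimal sketch of the z-score scale Wood recommends may make the contrast concrete. The function name and the two score lists below are hypothetical illustrations, not data from Wood or Thorndike. The point is only that, when the sample mean and sample standard deviation define the zero and the unit, the same raw score receives a different scaled value in each sample.

```python
from statistics import mean, stdev

def z_score(raw, sample):
    """Express a raw score relative to a sample's own mean and SD.

    The zero point is the sample mean and the unit is the sample
    standard deviation, so both change whenever the sample changes.
    """
    return (raw - mean(sample)) / stdev(sample)

# Hypothetical raw scores from two different groups of examinees.
group_a = [8, 10, 12, 14, 16]
group_b = [12, 14, 16, 18, 20]

# The same raw score of 12 is average in group A but well below
# average in group B.
print(z_score(12, group_a))
print(z_score(12, group_b))
```

A scale whose units are defined by calibrated items, as Thorndike sought, is intended to avoid this dependence on the particular sample tested.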

Wood's third principle, that the function to be measured must be defined as adequately as possible, deals primarily with content validity. Wood emphasizes precise operational definitions of the variable measured and cautions against "the tendency to confuse identity of name with identity of function" (Wood p. 153). For Thorndike, the position of the facts on the item map of the latent variable plays the key role in defining the function measured: "An ideal scale, such as that for weight, is a series of perfectly defined amounts of a perfectly defined thing, the differences between any two of them being also perfectly defined, so that a series varying by steps of equal difference can readily be selected" (Thorndike, p. 13 and Wood, p. 150).

Consistency is defined by Thorndike and Wood as the unidimensionality of a scale. Thorndike thinks this so obvious that his discussion consists of two statements: "The series of facts used as a scale must be varying amounts of the same sort of thing or quality. This requirement needs no comment" (Thorndike p. 13 and Wood p. 153). Wood recommends inspecting test items in order to determine the consistency of the test, but, in his view, useful test scores can be obtained from any hodgepodge of combinations of items measuring visibly different content areas, such as reading and mathematics.

Comparability deals with what constitutes a useable scale. According to Wood, "a scale is valid only if it can be applied with reasonable ease and accuracy to the objects or facts to be measured" (Wood p. 157). But Wood's discussion of this principle is disjointed and difficult to understand because the test score tradition does not contain the concept of a calibrated item scale. Within Thorndike's scaling tradition, however, the idea of comparisons is fundamental. Items are calibrated through a comparison process, and the measurement of individuals is viewed as a comparison between an individual's ability and item difficulties on a latent variable scale. Consequently, "scales for mental and social measurements can be of great service in spite of gross inferiority [in ease of use and accuracy] to the common scales for physical facts" (Thorndike p. 16).
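The idea of measurement as a comparison between ability and difficulty can be illustrated with the later dichotomous Rasch model, to which the scaling tradition is connected. This is a sketch of that modern formulation, not a procedure Thorndike himself used. The function name and the numeric values are hypothetical.

```python
import math

def rasch_probability(ability, difficulty):
    """Probability of success under the dichotomous Rasch model.

    Both arguments are locations on the same latent scale (logits).
    The result depends only on their difference, which is the
    comparison between person and item.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

# When ability equals item difficulty, success is a 50/50 proposition.
print(rasch_probability(1.0, 1.0))

# Shifting person and item by the same amount leaves the comparison,
# and therefore the probability, unchanged: the zero point is arbitrary.
print(rasch_probability(2.0, 1.0))
print(rasch_probability(0.5, -0.5))
```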

Thorndike's and Wood's theories of measurement offer important insights into major measurement problems identified in the first quarter of the twentieth century. Their theories illustrate how definitions of measurement problems changed as the dominant measurement tradition moved from a scaling tradition (Thorndike) to a test score tradition (Wood). Evaluation of "progress" in measurement theory depends on the measurement problem being addressed and the research tradition that underlies the evaluator's perspective.

Laudan L 1977. Progress and its problems: Towards a theory of scientific growth. Berkeley, CA: University of California Press.

Thorndike EL 1913. An introduction to the theory of mental and social measurements. 2nd Ed., Revised and Enlarged. New York: Teachers College.

Wood BD 1923. Measurement in higher education. Yonkers-on-Hudson, NY: World Book Co.

Thorndike's scaling vs. Wood's scoring. Engelhard G Jr. … Rasch Measurement Transactions, 1992, 5:4 p.182
