Thorndike's Scaling vs. Wood's Scoring

"Historical understanding very often depends upon our ability to recognize that, in the course of time, problems undergo subtle, and sometimes profound, changes in both formulation and substance." (Laudan, 1977, p. 241)

Ben Wood, Thorndike's student from 1919 to 1923, prepared a careful exegesis of Thorndike's essentials of a valid scale: objectivity, defined zero and unit, definition of function to be measured, consistency, and comparability. Wood claims that "practically all the material ... is taken from Thorndike's well-known treatise, or directly inferred from some of its propositions" (Wood, 1923, p.141 referring to Thorndike, 1913, p. 11-18). In spite of this, Wood adapts Thorndike's "scaling" tradition meaning of these measurement essentials to fit his own "test score" tradition. (Wood was previously a student of Truman Lee Kelley, a leading proponent of test reliability).

Both Thorndike and Wood consider objectivity the cornerstone of measurement. Thorndike views objectivity in a broad sense which included reliability and validity. In Thorndike's scaling tradition, objectivity is sought in the development of calibrated scales and variable maps that competent thinkers would agree upon as defining the latent variable. Wood's test score tradition leads him to focus instead on how tests can be scored reliably.

The development of scales with a defined zero and unit is important to both Thorndike and Wood. Wood's view reflects a test score tradition with a strong emphasis on test score reliability: "the essential quality of a good point of origin and of a good unit of measure is stability" (Wood p. 149). He recommends reporting test scores on a z- score scale with a sample mean to define the zero point and a sample standard deviation to define the unit of measurement. Thorndike recognizes that the zero point on the scale is arbitrary, and seeks to construct scales with units defined by a set of calibrated items. This view reflects a scaling tradition with close connections to Rasch measurement. Although Thorndike does not calibrate individuals and items onto an underlying latent variable scale simultaneously, he does attempt to develop calibrated scales that can be used to measure individuals.

Wood's third principle, that the function to be measured must be defined as adequately as possible, deals primarily with content validity. Wood emphasizes precise operational definitions of the variable measured and cautions against "the tendency to confuse identity of name with identity of function" (Wood p. 153). For Thorndike, the position of the facts on the item map of the latent variable plays the key role in defining the function measured: "An ideal scale, such as that for weight, is a series of perfectly defined amounts of a perfectly defined thing, the differences between any two of them being also perfectly defined, so that a series varying by steps of equal difference can readily be selected" (Thorndike, p. 13 and Wood, p. 150).

Consistency is defined by Thorndike and Wood as the unidimensionality of a scale. Thorndike thinks this so obvious that his discussion consists of two statements: "The series of facts used as a scale must be varying amounts of the same sort of thing or quality. This requirement needs no comment" (Thorndike p. 13 and Wood p. 153). Wood recommends inspecting test items in order to determine the consistency of the test, but, in his view, useful test scores can be obtained from any hodgepodge of combinations of items measuring visibly different content areas, such as reading and mathematics.

Comparability deals with what constitutes a useable scale. According to Wood, "a scale is valid only if it can be applied with reasonable ease and accuracy to the objects or facts to be measured" (Wood p. 157). But Wood's discussion of this principle is disjointed and difficult to understand because the test score tradition does not contain the concept of a calibrated item scale. Within Thorndike's scaling tradition, however, the idea of comparisons is fundamental. Items are calibrated through a comparison process, and the measurement of individuals is viewed as a comparison between an individual's ability and item difficulties on a latent variable scale. Consequently, "scales for mental and social measurements can be of great service in spite of gross inferiority [in ease of use and accuracy] to the common scales for physical facts" (Thorndike p. 16).

Thorndike's and Wood's theories of measurement offer important insights into major measurement problems identified in the first quarter of this century. Their theories illustrate how definitions of measurement problems changed as the dominant measurement tradition moved from a scaling tradition (Thorndike) to a test score tradition (Wood). Evaluation of "progress" in measurement theory depends on the measurement problem being addressed and the research tradition that underlies the evaluator's perspective.

Laudan L 1977. Progress and its problems: Towards a theory of scientific growth. Berkeley, CA: University of California Press.

Thorndike EL 1913. An introduction to the theory of mental and social measurements. 2nd Ed. Revised and Enlarged. New York: Teachers College

Wood BD 1923. Measurement in higher education. Yonkers-on-Hudson, NY: World Book Co.

Thorndike's scaling vs. Wood's scoring. Engelhard G Jr. … Rasch Measurement Transactions, 1992, 5:4 p.182




Rasch Books and Publications
Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn. George Engelhard, Jr. & Jue Wang Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale
Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes Statistical Analyses for Language Testers (Facets), Rita Green Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland
Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind Rasch Measurement: Applications, Khine Winsteps Tutorials - free
Facets Tutorials - free
Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan
Other Rasch-Related Resources: Rasch Measurement YouTube Channel
Rasch Measurement Transactions & Rasch Measurement research papers - free An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse Rasch Measurement Theory Analysis in R, Wind, Hua Applying the Rasch Model in Social Sciences Using R, Lamprianou El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Rasch Models for Measurement, David Andrich Constructing Measures, Mark Wilson Best Test Design - free, Wright & Stone
Rating Scale Analysis - free, Wright & Masters
Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias Diseño de Mejores Pruebas - free, Spanish Best Test Design A Course in Rasch Measurement Theory, Andrich, Marais Rasch Models in Health, Christensen, Kreiner, Mesba Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen

To be emailed about new material on www.rasch.org
please enter your email address here:

I want to Subscribe: & click below
I want to Unsubscribe: & click below

Please set your SPAM filter to accept emails from Rasch.org

Rasch Measurement Transactions welcomes your comments:

Your email address (if you want us to reply):

If Rasch.org does not reply, please post your message on the Rasch Forum
 

ForumRasch Measurement Forum to discuss any Rasch-related topic

Go to Top of Page
Go to index of all Rasch Measurement Transactions
AERA members: Join the Rasch Measurement SIG and receive the printed version of RMT
Some back issues of RMT are available as bound volumes
Subscribe to Journal of Applied Measurement

Go to Institute for Objective Measurement Home Page. The Rasch Measurement SIG (AERA) thanks the Institute for Objective Measurement for inviting the publication of Rasch Measurement Transactions on the Institute's website, www.rasch.org.

Coming Rasch-related Events
Apr. 21 - 22, 2025, Mon.-Tue. International Objective Measurement Workshop (IOMW) - Boulder, CO, www.iomw.net
Jan. 17 - Feb. 21, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
Feb. - June, 2025 On-line course: Introduction to Classical Test and Rasch Measurement Theories (D. Andrich, I. Marais, RUMM2030), University of Western Australia
Feb. - June, 2025 On-line course: Advanced Course in Rasch Measurement Theory (D. Andrich, I. Marais, RUMM2030), University of Western Australia
May 16 - June 20, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 20 - July 18, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Further Topics (E. Smith, Facets), www.statistics.com
Oct. 3 - Nov. 7, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

 

The URL of this page is www.rasch.org/rmt/rmt54e.htm

Website: www.rasch.org/rmt/contents.htm