RMT 13:4 Notations and Quotations

"Measurement is the Achilles' heel of sociobehavioral research. Although most programs in sociobehavioral sciences ... require a medium of exposure to statistics and research design, few seem to require the same where measurement is concerned ... It is, therefore, not surprising that little or no attention is given to the properties of the measures used in many research studies."

Pedhazur, E.J., & Schmelkin, L.P. (1991) Measurement, Design and Analysis: An Integrated Approach. Hillsdale NJ: Erlbaum. (p. 2-3).

Also quoted in Kieffer K.M. (1999) Why generalizability theory is essential and classical test theory (CTT) is often inadequate. In B. Thompson (Ed.) Advances in Social Science Methodology. Vol. 5. Stamford, CT: JAI Press.

Unfortunately, neither the original authors nor the quoting author appear to realize that an essential property of a useful measure is linearity.

Rasch in London

"It was very unfortunate that there was a definite antagonism between [Jerzy] Neyman and [Ronald A.] Fisher. In 1934 Neyman had given his famous paper on sampling methods at the Royal Statistical Society that brought out Fisher's wrath. And this wrath continued at University College [London] during my [Churchill Eisenhart's] time there (1935-37). Fisher's approach to teaching and writing on methods was: I'll tell you what to do, and you leave it up to me what the basic theory is. But then he wouldn't always tell you all the relevant facts of the theory. He would be lecturing, say, on factorial design, and would never mention the importance of additivity. Someone would tell Neyman about this, and in Neyman's next lecture on probability he'd digress and give a bitter discourse on Professor Fisher and his factorial design."

[Who were the faculty members?] "In our group [downstairs] there [were Jerzy Neyman and] B.L. Welch. M.S. Bartlett had been there, but he had left. ... Upstairs with Fisher, there was W.L. (Tony) Stevens, Professor Paul Rider (of Washington University, St. Louis) and Professor George Rasch (Copenhagen). There was some intercourse between the two floors among the students, but you had to change your language when you went from one floor to the other. You would talk about inductive behavior with Neyman. You talked about fiducial inference when you were with Fisher."

Excerpted from Olkin I. (1992) A conversation with Churchill Eisenhart. Statistical Science 7:4 514-5.

Objectivity First or Data First?
Which Produces Intelligible Results?

Here is Martha Stocking's summary of Item Response Theory, the statistical methodology of Frederic M. Lord (1912-2000). "Building statistical models is just like this. You take a real situation with real data, messy as this is, and build a model that works to explain the behavior of real data." (New York Times, 2-10-2000)

Georg Rasch summarizes his work as follows: "The concepts of measurement introduced through the definition of the two types of parameters [item difficulty and person ability] differ radically from those employed in the psychometric theory of mental tests - we may, as an instance in point, just mention that in our theory we have no reason for considering a normal distribution of test scores as evidence for equality of units. Our concepts are more akin to psychophysical measurements in so far as these are concerned with individuals, each observed on several occasions. The most conspicuous feature of our concepts, however, seems to be that they, in a certain well defined sense, carry the same conceptual status as mass and force in classical physics." (Probabilistic Models, page 4.)

"[Duns] Scotus's [?-1308] argument for an intelligible species [grouping] was that where an agent acts directly upon an object, together they suffice to produce an effect. ... Ockham [1270-1349] on the other hand used the same argument of the concurrence of agent and object to prove the minor [conclusion] that they suffice to produce intuitive knowledge as the effect without the need of anything else." [Emphasis author's.]

Gordon Leff (1975) William of Ockham: The metamorphosis of scholastic disclosure. Manchester UK: Manchester Univ. Press. p. 35.

"The U.S. health care system is a $1 trillion industry without a definition of its product. Until population outcome measures are developed and rewarded for, we will not solve the twenty-first century challenge of maximizing health outcome management for the resources available."

David A. Kindig (1999). Purchasing population health: Aligning financial incentives to improve health outcomes. Nursing Outlook, 47, 15-22.

"In this example, items 1-12 [of a Quality of Life instrument] are plotted according to calibrated `difficulty'. The fact that they all fall within the 95% statistical control lines (dotted) indicates absence of bias [between the test versions.]" [To draw these plots, see Best Test Design, Wright & Stone, 1979, p. 93-5.]

Cella, D.F., Lloyd, S.R., Wright, B.D. (1996) Cross-cultural instrument equating: current research and future directions. Chapter 73 in B. Spilker (Ed.) Quality of Life and Pharmacoeconomics in Clinical Trials. (2nd. Ed.). Philadelphia: Lippincott-Raven.

The Impermanence of Scientific Tools

"As we use our tools, we constantly remake them. Recent years have seen the remaking of a good many ... tools and the forging of some new ones. Those of us who have participated in this effort ought to feel a bit uneasy. To the extent that our product succeeds ..., it is likely to become another one of those tools that limits subjects for future study and constrains the ways in which those subjects will be studied. Either that or it will continually threaten to undo itself - to undo what we claim to know by questioning the bases on which we claim to know it. In the end we can only hope to be honest in our account of ... the past without, however, restricting ... the future."

Don Michael Randel, President-elect, University of Chicago (1992) The canons in the musicological toolbox. In K. Bergeron & P.V. Bohlman (Eds.) Disciplining Music. Chicago: U. Chicago Press.

Data Must be Controlled!

Chaos is the natural state of things. Order must be firmly imposed. The "fit of the data to the model", i.e., "statistical control", must be enforced, if data are to guide the future.

"An inference, if it is to have scientific value, must constitute a prediction concerning future data. If the inference is to be made purely with the help of the distribution theory of statistics, the experiments that constitute evidence for the inference must arise from a state of statistical control; until that state is reached, there is no universe, normal or otherwise, and the statistician's calculations by themselves are an illusion if not a delusion. The fact is that when distribution theory is not applicable for lack of control, any inference, statistical or otherwise, is little better than a conjecture. The state of statistical control is therefore the goal of all experimentation."

W. Edwards Deming in W.A. Shewhart, 1939, Statistical Method from the Viewpoint of Quality Control. Washington: Dept. of Agriculture. p. iii.

Since statistical control is not the natural state of things, it must be imposed and then verified. The analyst must be ruthless with the data and Draconian with the process in order to enforce statistical control. Only then can the future be predicted with confidence. Walter Shewhart came to see this during the 1920's during work to improve the quality of mass-produced goods. W. Edwards Deming continued this work into wider fields of business management at all levels.

In social science, researchers are taught that the data are sacrosanct, and that the process must not be altered, or tampered with, during the experiment. In industry and the physical sciences, an aberrant observation prompts immediate investigation and corrective action, even as production and experimentation continue. Social science experiments are not conducted in conditions of statistical process control. Quality-oriented industrial operations are.

What if we discover a child is guessing on a math test, or a patient is responding carelessly to a quality-of-life survey? Then those data are useless for inference. If we really cared about the child or the patient, (at least to the extent that manufacturers care about their products), we would reject the problematic data, retest the child or interview the patient, and change the data-collection instrument or process.

Quotations and Notations … Rasch Measurement Transactions, 2000, 13:4 p. 715 etc.

Rasch Books and Publications
Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn. George Engelhard, Jr. & Jue Wang	Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene	Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver	Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone	Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale
Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes	Statistical Analyses for Language Testers (Facets), Rita Green	Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind	Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M	Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland
Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind	Rasch Measurement: Applications, Khine	Winsteps Tutorials - free Facets Tutorials - free	Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre	Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan
Other Rasch-Related Resources: Rasch Measurement YouTube Channel
Rasch Measurement Transactions & Rasch Measurement research papers - free	An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse	Rasch Measurement Theory Analysis in R, Wind, Hua	Applying the Rasch Model in Social Sciences Using R, Lamprianou	El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar	Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch	Rasch Models for Measurement, David Andrich	Constructing Measures, Mark Wilson	Best Test Design - free, Wright & Stone Rating Scale Analysis - free, Wright & Masters
Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias	Diseño de Mejores Pruebas - free, Spanish Best Test Design	A Course in Rasch Measurement Theory, Andrich, Marais	Rasch Models in Health, Christensen, Kreiner, Mesba	Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen

Go to Institute for Objective Measurement Home Page. The Rasch Measurement SIG (AERA) thanks the Institute for Objective Measurement for inviting the publication of Rasch Measurement Transactions on the Institute's website, www.rasch.org.

Coming Rasch-related Events
Apr. 21 - 22, 2025, Mon.-Tue.	International Objective Measurement Workshop (IOMW) - Boulder, CO, www.iomw.net
Jan. 17 - Feb. 21, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
Feb. - June, 2025	On-line course: Introduction to Classical Test and Rasch Measurement Theories (D. Andrich, I. Marais, RUMM2030), University of Western Australia
Feb. - June, 2025	On-line course: Advanced Course in Rasch Measurement Theory (D. Andrich, I. Marais, RUMM2030), University of Western Australia
May 16 - June 20, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 20 - July 18, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Further Topics (E. Smith, Facets), www.statistics.com
July 21 - 23, 2025, Mon.-Wed.	Pacific Rim Objective Measurement Symposium (PROMS) 2025, www.proms2025.com
Oct. 3 - Nov. 7, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

Notations and Quotations

Rasch in London

Objectivity First or Data First? Which Produces Intelligible Results?

The Impermanence of Scientific Tools

Data Must be Controlled!

Objectivity First or Data First?
Which Produces Intelligible Results?