Thresholds, Steps and Rating Scale Conceptualization

How many steps does a rock take to get to its weight? Our measurement principles should be the same for properties of rocks as it is for the properties of people. What we say has to be consistent with physical measurement. We do not model how a rock got to its weight, even though we could model our process of locating the weight. If the thresholds on our ruler come out back to front, then our process of locating the person, or the rock has gone wrong.

The word "step" has no place in measurement. You can take steps over thresholds, but you can have steps without thresholds, and thresholds without steps, and measurement is not about modelling people taking steps, but modelling the responses to check where they are. The word "step" has a connotation of a local distance in the sense that the next step starts where the previous one finished and on movement. It is never used when we are trying to explain physical measurement.

The point at which the probability is 50% of a response being in one or other category, given that it is in one of them, is a threshold. No one goes anywhere in measurement - the person location is fixed. The Rasch model is a measurement model that models probabilistically where we reckon the person is, between thresholds, as in a ruler. It does not model how a person got there.

The threshold estimates are independent of the person distribution - so we can tell if there is something wrong with our instrument independently of the distribution. All those other statistics to check if the categories are "going up" are not distribution free and they do not answer the question unequivocally and they add to the confusion.

(i) The rating scale transition points are thresholds - they are points where there is 50% chance of being in either category, given that you are in one of them. Everyone understands that as a threshold;

(ii) If in the data, the thresholds are reversed as evidenced by the estimates, there is something wrong in the data. Nothing else. And the evidence on the thresholds is distribution free - very much consistent with Rasch philosophy.

When I say that there is "something wrong in the data", I, of course, mean in the sense of measurement. It is the same kind of wrong as when we have different item discriminations in the dichotomous case. In any particular set of data, it might be perfectly explainable why it is that some item really discriminates too much or too little, and that is a property of the data that we must acknowledge. Likewise when the thresholds in a multicategory item are reversed, they may be reversed for some good reason, and we should acknowledge it. However, the data demonstrate that the ordering is not working as intended.

If the discriminations at these Rasch thresholds are equal, then we have the integer scoring function - just as an integer score arises when the items are independent. Everything is the same as in the simple dichotomous case, except that the dichotomies are not independent. Being dependent does not turn them from dichotomies to steps. The dichotomous case with one dichotomy specializes from the general case. In the special case we have a threshold, not a step. No one goes from wrong to right, they are either wrong or right, and that is a big difference.

This is central - the category probability curves are about the implied distribution of responses of a single person to the item on replications. It is important to think of this as sample- and item-distribution free, that it is about the implied distribution of a single person with a single fixed location responding to a single item on repeated occasions with nothing but random error causing the variation - that is all there is in the model - together with the item parameters, which are thresholds. Those category characteristic curves pertain to only one person responding to the item, not the distribution of persons. Most important.

The Rasch model is not a model of how long any one spends in a category. If that is an issue in the data, then this is not the right model. So we should not try to force the model to do things it cannot do. This is model of where people are. The idea of time, of motion, is not relevant to this model, and it leads to misunderstanding. A rating scale is just like dichotomous items, except that there is a constraint on the dichotomies within the items - the constraint is that response must follow a Guttman pattern.

My suggestions are simple, and consistent: the transition points should be called "thresholds", and in the context of Thurstone or other thresholds, they should be called Rasch thresholds.

Andrich D. 1978. A rating formulation for ordered response categories. Psychometrika, 43 561-573

Andrich D. 1979. A model for contingency tables having an ordered response classification. Biometrics, 35, 403-415.

Thresholds, Steps and Rating Scale Conceptualization.Andrich D.A. … Rasch Measurement Transactions, 1998, 12:3 p. 648-9.

Rasch Books and Publications
Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn. George Engelhard, Jr. & Jue Wang	Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene	Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver	Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone	Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale
Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes	Statistical Analyses for Language Testers (Facets), Rita Green	Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind	Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M	Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland
Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind	Rasch Measurement: Applications, Khine	Winsteps Tutorials - free Facets Tutorials - free	Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre	Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan
Other Rasch-Related Resources: Rasch Measurement YouTube Channel
Rasch Measurement Transactions & Rasch Measurement research papers - free	An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse	Rasch Measurement Theory Analysis in R, Wind, Hua	Applying the Rasch Model in Social Sciences Using R, Lamprianou	El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar	Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch	Rasch Models for Measurement, David Andrich	Constructing Measures, Mark Wilson	Best Test Design - free, Wright & Stone Rating Scale Analysis - free, Wright & Masters
Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias	Diseño de Mejores Pruebas - free, Spanish Best Test Design	A Course in Rasch Measurement Theory, Andrich, Marais	Rasch Models in Health, Christensen, Kreiner, Mesba	Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen

Go to Institute for Objective Measurement Home Page. The Rasch Measurement SIG (AERA) thanks the Institute for Objective Measurement for inviting the publication of Rasch Measurement Transactions on the Institute's website, www.rasch.org.

Coming Rasch-related Events
Apr. 21 - 22, 2025, Mon.-Tue.	International Objective Measurement Workshop (IOMW) - Boulder, CO, www.iomw.net
Jan. 17 - Feb. 21, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
Feb. - June, 2025	On-line course: Introduction to Classical Test and Rasch Measurement Theories (D. Andrich, I. Marais, RUMM2030), University of Western Australia
Feb. - June, 2025	On-line course: Advanced Course in Rasch Measurement Theory (D. Andrich, I. Marais, RUMM2030), University of Western Australia
May 16 - June 20, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 20 - July 18, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Further Topics (E. Smith, Facets), www.statistics.com
July 21 - 23, 2025, Mon.-Wed.	Pacific Rim Objective Measurement Symposium (PROMS) 2025, www.proms2025.com
Oct. 3 - Nov. 7, 2025, Fri.-Fri.	On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com