Comparison of Rasch and summated rating scales constructed from SF-36 physical functioning items in seven countries: Results from the IQOLA Project

Anastasia E. Raczek, John E. Ware, Jakob B. Bjorner, Barbara Gandek, Stephen M. Haley, Neil K. Aaronson, Giovanni Apolone, Per Bech, John E. Brazier, Monika Bullinger, Marianne Sullivan

Research output: Contribution to journalArticle

126 Citations (Scopus)

Abstract

Rasch models for polytomous items were used to assess the scaling assumptions and compare item response patterns in the 10-item SF-36 physical functioning scale (PF-10) for general population respondents in Denmark, Germany, Italy, the Netherlands, Sweden, the United Kingdom, and the United States. The Rasch model of physical functioning developed in the United States was compared to models for other countries, and each country was compared to a multinational composite. Strong scale congruence across the seven countries was demonstrated; items that varied between countries and from the composite may reflect unique cultural response patterns or differences in translation. Scoring algorithms based on the Rasch model for each country were superior to the current Likert scoring in tests of relative validity (RV) in discriminating among age groups in all countries. In relation to the Likert PF-10 scoring (RV = 1.00), scores estimated using the Rasch rating scale model achieve a median RV of 1.31 (range: 1.01-1.59), while the Rasch partial credit model attained a median RV of 1.44 (range: 1.01-2.23). Rasch models hold good potential for improving health status measures, estimating individual scores when responses to scale items are missing, and equating scores across countries.

Original languageEnglish
Pages (from-to)1203-1214
Number of pages12
JournalJournal of Clinical Epidemiology
Volume51
Issue number11
DOIs
Publication statusPublished - Nov 1998

Fingerprint

Denmark
Sweden
Netherlands
Italy
Health Status
Germany
Age Groups
Population
Surveys and Questionnaires
United Kingdom

Keywords

  • International comparisons
  • Item response theory
  • Physical functioning
  • Rasch model
  • SF-36 Health Survey
  • Translations
  • Validity

ASJC Scopus subject areas

  • Medicine(all)
  • Public Health, Environmental and Occupational Health
  • Epidemiology

Cite this

Comparison of Rasch and summated rating scales constructed from SF-36 physical functioning items in seven countries : Results from the IQOLA Project. / Raczek, Anastasia E.; Ware, John E.; Bjorner, Jakob B.; Gandek, Barbara; Haley, Stephen M.; Aaronson, Neil K.; Apolone, Giovanni; Bech, Per; Brazier, John E.; Bullinger, Monika; Sullivan, Marianne.

In: Journal of Clinical Epidemiology, Vol. 51, No. 11, 11.1998, p. 1203-1214.

Research output: Contribution to journalArticle

Raczek, AE, Ware, JE, Bjorner, JB, Gandek, B, Haley, SM, Aaronson, NK, Apolone, G, Bech, P, Brazier, JE, Bullinger, M & Sullivan, M 1998, 'Comparison of Rasch and summated rating scales constructed from SF-36 physical functioning items in seven countries: Results from the IQOLA Project', Journal of Clinical Epidemiology, vol. 51, no. 11, pp. 1203-1214. https://doi.org/10.1016/S0895-4356(98)00112-7
Raczek, Anastasia E. ; Ware, John E. ; Bjorner, Jakob B. ; Gandek, Barbara ; Haley, Stephen M. ; Aaronson, Neil K. ; Apolone, Giovanni ; Bech, Per ; Brazier, John E. ; Bullinger, Monika ; Sullivan, Marianne. / Comparison of Rasch and summated rating scales constructed from SF-36 physical functioning items in seven countries : Results from the IQOLA Project. In: Journal of Clinical Epidemiology. 1998 ; Vol. 51, No. 11. pp. 1203-1214.
@article{aa460a1cd6744fee8b490eec642aaa5a,
title = "Comparison of Rasch and summated rating scales constructed from SF-36 physical functioning items in seven countries: Results from the IQOLA Project",
abstract = "Rasch models for polytomous items were used to assess the scaling assumptions and compare item response patterns in the 10-item SF-36 physical functioning scale (PF-10) for general population respondents in Denmark, Germany, Italy, the Netherlands, Sweden, the United Kingdom, and the United States. The Rasch model of physical functioning developed in the United States was compared to models for other countries, and each country was compared to a multinational composite. Strong scale congruence across the seven countries was demonstrated; items that varied between countries and from the composite may reflect unique cultural response patterns or differences in translation. Scoring algorithms based on the Rasch model for each country were superior to the current Likert scoring in tests of relative validity (RV) in discriminating among age groups in all countries. In relation to the Likert PF-10 scoring (RV = 1.00), scores estimated using the Rasch rating scale model achieve a median RV of 1.31 (range: 1.01-1.59), while the Rasch partial credit model attained a median RV of 1.44 (range: 1.01-2.23). Rasch models hold good potential for improving health status measures, estimating individual scores when responses to scale items are missing, and equating scores across countries.",
keywords = "International comparisons, Item response theory, Physical functioning, Rasch model, SF-36 Health Survey, Translations, Validity",
author = "Raczek, {Anastasia E.} and Ware, {John E.} and Bjorner, {Jakob B.} and Barbara Gandek and Haley, {Stephen M.} and Aaronson, {Neil K.} and Giovanni Apolone and Per Bech and Brazier, {John E.} and Monika Bullinger and Marianne Sullivan",
year = "1998",
month = "11",
doi = "10.1016/S0895-4356(98)00112-7",
language = "English",
volume = "51",
pages = "1203--1214",
journal = "Journal of Clinical Epidemiology",
issn = "0895-4356",
publisher = "Elsevier USA",
number = "11",

}

TY - JOUR

T1 - Comparison of Rasch and summated rating scales constructed from SF-36 physical functioning items in seven countries

T2 - Results from the IQOLA Project

AU - Raczek, Anastasia E.

AU - Ware, John E.

AU - Bjorner, Jakob B.

AU - Gandek, Barbara

AU - Haley, Stephen M.

AU - Aaronson, Neil K.

AU - Apolone, Giovanni

AU - Bech, Per

AU - Brazier, John E.

AU - Bullinger, Monika

AU - Sullivan, Marianne

PY - 1998/11

Y1 - 1998/11

N2 - Rasch models for polytomous items were used to assess the scaling assumptions and compare item response patterns in the 10-item SF-36 physical functioning scale (PF-10) for general population respondents in Denmark, Germany, Italy, the Netherlands, Sweden, the United Kingdom, and the United States. The Rasch model of physical functioning developed in the United States was compared to models for other countries, and each country was compared to a multinational composite. Strong scale congruence across the seven countries was demonstrated; items that varied between countries and from the composite may reflect unique cultural response patterns or differences in translation. Scoring algorithms based on the Rasch model for each country were superior to the current Likert scoring in tests of relative validity (RV) in discriminating among age groups in all countries. In relation to the Likert PF-10 scoring (RV = 1.00), scores estimated using the Rasch rating scale model achieve a median RV of 1.31 (range: 1.01-1.59), while the Rasch partial credit model attained a median RV of 1.44 (range: 1.01-2.23). Rasch models hold good potential for improving health status measures, estimating individual scores when responses to scale items are missing, and equating scores across countries.

AB - Rasch models for polytomous items were used to assess the scaling assumptions and compare item response patterns in the 10-item SF-36 physical functioning scale (PF-10) for general population respondents in Denmark, Germany, Italy, the Netherlands, Sweden, the United Kingdom, and the United States. The Rasch model of physical functioning developed in the United States was compared to models for other countries, and each country was compared to a multinational composite. Strong scale congruence across the seven countries was demonstrated; items that varied between countries and from the composite may reflect unique cultural response patterns or differences in translation. Scoring algorithms based on the Rasch model for each country were superior to the current Likert scoring in tests of relative validity (RV) in discriminating among age groups in all countries. In relation to the Likert PF-10 scoring (RV = 1.00), scores estimated using the Rasch rating scale model achieve a median RV of 1.31 (range: 1.01-1.59), while the Rasch partial credit model attained a median RV of 1.44 (range: 1.01-2.23). Rasch models hold good potential for improving health status measures, estimating individual scores when responses to scale items are missing, and equating scores across countries.

KW - International comparisons

KW - Item response theory

KW - Physical functioning

KW - Rasch model

KW - SF-36 Health Survey

KW - Translations

KW - Validity

UR - http://www.scopus.com/inward/record.url?scp=0032211655&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0032211655&partnerID=8YFLogxK

U2 - 10.1016/S0895-4356(98)00112-7

DO - 10.1016/S0895-4356(98)00112-7

M3 - Article

C2 - 9817138

AN - SCOPUS:0032211655

VL - 51

SP - 1203

EP - 1214

JO - Journal of Clinical Epidemiology

JF - Journal of Clinical Epidemiology

SN - 0895-4356

IS - 11

ER -