Estimating meaningful thresholds for multi-item questionnaires using item response theory
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
Estimating meaningful thresholds for multi-item questionnaires using item response theory. / Terluin, Berend; Koopman, Jaimy E.; Hoogendam, Lisa; Griffiths, Pip; Terwee, Caroline B.; Bjorner, Jakob B.
In: Quality of Life Research, Vol. 32, 2023, p. 1819–1830.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Estimating meaningful thresholds for multi-item questionnaires using item response theory
AU - Terluin, Berend
AU - Koopman, Jaimy E.
AU - Hoogendam, Lisa
AU - Griffiths, Pip
AU - Terwee, Caroline B.
AU - Bjorner, Jakob B.
N1 - Publisher Copyright: © 2023, The Author(s).
PY - 2023
Y1 - 2023
N2 - Purpose: Meaningful thresholds are needed to interpret patient-reported outcome measure (PROM) results. This paper introduces a new method, based on item response theory (IRT), to estimate such thresholds. The performance of the method is examined in simulated datasets and two real datasets, and compared with other methods. Methods: The IRT method involves fitting an IRT model to the PROM items and an anchor item indicating the criterion state of interest. The difficulty parameter of the anchor item represents the meaningful threshold on the latent trait. The latent threshold is then linked to the corresponding expected PROM score. We simulated 4500 item response datasets to a 10-item PROM, and an anchor item. The datasets varied with respect to the mean and standard deviation of the latent trait, and the reliability of the anchor item. The real datasets consisted of a depression scale with a clinical depression diagnosis as anchor variable and a pain scale with a patient acceptable symptom state (PASS) question as anchor variable. Results: The new IRT method recovered the true thresholds accurately across the simulated datasets. The other methods, except one, produced biased threshold estimates if the state prevalence was smaller or greater than 0.5. The adjusted predictive modeling method matched the new IRT method (also in the real datasets) but showed some residual bias if the prevalence was smaller than 0.3 or greater than 0.7. Conclusions: The new IRT method perfectly recovers meaningful (interpretational) thresholds for multi-item questionnaires, provided that the data satisfy the assumptions for IRT analysis.
AB - Purpose: Meaningful thresholds are needed to interpret patient-reported outcome measure (PROM) results. This paper introduces a new method, based on item response theory (IRT), to estimate such thresholds. The performance of the method is examined in simulated datasets and two real datasets, and compared with other methods. Methods: The IRT method involves fitting an IRT model to the PROM items and an anchor item indicating the criterion state of interest. The difficulty parameter of the anchor item represents the meaningful threshold on the latent trait. The latent threshold is then linked to the corresponding expected PROM score. We simulated 4500 item response datasets to a 10-item PROM, and an anchor item. The datasets varied with respect to the mean and standard deviation of the latent trait, and the reliability of the anchor item. The real datasets consisted of a depression scale with a clinical depression diagnosis as anchor variable and a pain scale with a patient acceptable symptom state (PASS) question as anchor variable. Results: The new IRT method recovered the true thresholds accurately across the simulated datasets. The other methods, except one, produced biased threshold estimates if the state prevalence was smaller or greater than 0.5. The adjusted predictive modeling method matched the new IRT method (also in the real datasets) but showed some residual bias if the prevalence was smaller than 0.3 or greater than 0.7. Conclusions: The new IRT method perfectly recovers meaningful (interpretational) thresholds for multi-item questionnaires, provided that the data satisfy the assumptions for IRT analysis.
KW - Adjusted predictive modeling
KW - Cutoff point
KW - Item response theory (IRT)
KW - Meaningful threshold
KW - Patient acceptable symptom state (PASS)
KW - Receiver operating characteristic (ROC)
U2 - 10.1007/s11136-023-03355-8
DO - 10.1007/s11136-023-03355-8
M3 - Journal article
C2 - 36780033
AN - SCOPUS:85147962907
VL - 32
SP - 1819
EP - 1830
JO - Quality of Life Research
JF - Quality of Life Research
SN - 0962-9343
ER -
ID: 339321640