Estimating meaningful thresholds for multi-item questionnaires using item response theory

Research output: Contribution to journal › Journal article › Research › peer-review

Standard

Estimating meaningful thresholds for multi-item questionnaires using item response theory. / Terluin, Berend; Koopman, Jaimy E.; Hoogendam, Lisa; Griffiths, Pip; Terwee, Caroline B.; Bjorner, Jakob B.

In: Quality of Life Research, Vol. 32, 2023, p. 1819–1830.

Research output: Contribution to journal › Journal article › Research › peer-review

Harvard

Terluin, B, Koopman, JE, Hoogendam, L, Griffiths, P, Terwee, CB & Bjorner, JB 2023, 'Estimating meaningful thresholds for multi-item questionnaires using item response theory', Quality of Life Research, vol. 32, pp. 1819–1830. https://doi.org/10.1007/s11136-023-03355-8

APA

Terluin, B., Koopman, J. E., Hoogendam, L., Griffiths, P., Terwee, C. B., & Bjorner, J. B. (2023). Estimating meaningful thresholds for multi-item questionnaires using item response theory. Quality of Life Research, 32, 1819–1830. https://doi.org/10.1007/s11136-023-03355-8

Vancouver

Terluin B, Koopman JE, Hoogendam L, Griffiths P, Terwee CB, Bjorner JB. Estimating meaningful thresholds for multi-item questionnaires using item response theory. Quality of Life Research. 2023;32:1819–1830. https://doi.org/10.1007/s11136-023-03355-8

Author

Terluin, Berend ; Koopman, Jaimy E. ; Hoogendam, Lisa ; Griffiths, Pip ; Terwee, Caroline B. ; Bjorner, Jakob B. / Estimating meaningful thresholds for multi-item questionnaires using item response theory. In: Quality of Life Research. 2023 ; Vol. 32. pp. 1819–1830.

Bibtex

@article{721e37cb7cf4403ea5e4afbaa144e3bb,

title = "Estimating meaningful thresholds for multi-item questionnaires using item response theory",

abstract = "Purpose: Meaningful thresholds are needed to interpret patient-reported outcome measure (PROM) results. This paper introduces a new method, based on item response theory (IRT), to estimate such thresholds. The performance of the method is examined in simulated datasets and two real datasets, and compared with other methods. Methods: The IRT method involves fitting an IRT model to the PROM items and an anchor item indicating the criterion state of interest. The difficulty parameter of the anchor item represents the meaningful threshold on the latent trait. The latent threshold is then linked to the corresponding expected PROM score. We simulated 4500 item response datasets to a 10-item PROM, and an anchor item. The datasets varied with respect to the mean and standard deviation of the latent trait, and the reliability of the anchor item. The real datasets consisted of a depression scale with a clinical depression diagnosis as anchor variable and a pain scale with a patient acceptable symptom state (PASS) question as anchor variable. Results: The new IRT method recovered the true thresholds accurately across the simulated datasets. The other methods, except one, produced biased threshold estimates if the state prevalence was smaller or greater than 0.5. The adjusted predictive modeling method matched the new IRT method (also in the real datasets) but showed some residual bias if the prevalence was smaller than 0.3 or greater than 0.7. Conclusions: The new IRT method perfectly recovers meaningful (interpretational) thresholds for multi-item questionnaires, provided that the data satisfy the assumptions for IRT analysis.",

keywords = "Adjusted predictive modeling, Cutoff point, Item response theory (IRT), Meaningful threshold, Patient acceptable symptom state (PASS), Receiver operating characteristic (ROC)",

author = "Berend Terluin and Koopman, {Jaimy E.} and Lisa Hoogendam and Pip Griffiths and Terwee, {Caroline B.} and Bjorner, {Jakob B.}",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s).",

year = "2023",

doi = "10.1007/s11136-023-03355-8",

language = "English",

volume = "32",

pages = "1819–1830",

journal = "Quality of Life Research",

issn = "0962-9343",

publisher = "Springer",

}

RIS

TY - JOUR

T1 - Estimating meaningful thresholds for multi-item questionnaires using item response theory

AU - Terluin, Berend

AU - Koopman, Jaimy E.

AU - Hoogendam, Lisa

AU - Griffiths, Pip

AU - Terwee, Caroline B.

AU - Bjorner, Jakob B.

PY - 2023

Y1 - 2023

N2 - Purpose: Meaningful thresholds are needed to interpret patient-reported outcome measure (PROM) results. This paper introduces a new method, based on item response theory (IRT), to estimate such thresholds. The performance of the method is examined in simulated datasets and two real datasets, and compared with other methods. Methods: The IRT method involves fitting an IRT model to the PROM items and an anchor item indicating the criterion state of interest. The difficulty parameter of the anchor item represents the meaningful threshold on the latent trait. The latent threshold is then linked to the corresponding expected PROM score. We simulated 4500 item response datasets to a 10-item PROM, and an anchor item. The datasets varied with respect to the mean and standard deviation of the latent trait, and the reliability of the anchor item. The real datasets consisted of a depression scale with a clinical depression diagnosis as anchor variable and a pain scale with a patient acceptable symptom state (PASS) question as anchor variable. Results: The new IRT method recovered the true thresholds accurately across the simulated datasets. The other methods, except one, produced biased threshold estimates if the state prevalence was smaller or greater than 0.5. The adjusted predictive modeling method matched the new IRT method (also in the real datasets) but showed some residual bias if the prevalence was smaller than 0.3 or greater than 0.7. Conclusions: The new IRT method perfectly recovers meaningful (interpretational) thresholds for multi-item questionnaires, provided that the data satisfy the assumptions for IRT analysis.

AB - Purpose: Meaningful thresholds are needed to interpret patient-reported outcome measure (PROM) results. This paper introduces a new method, based on item response theory (IRT), to estimate such thresholds. The performance of the method is examined in simulated datasets and two real datasets, and compared with other methods. Methods: The IRT method involves fitting an IRT model to the PROM items and an anchor item indicating the criterion state of interest. The difficulty parameter of the anchor item represents the meaningful threshold on the latent trait. The latent threshold is then linked to the corresponding expected PROM score. We simulated 4500 item response datasets to a 10-item PROM, and an anchor item. The datasets varied with respect to the mean and standard deviation of the latent trait, and the reliability of the anchor item. The real datasets consisted of a depression scale with a clinical depression diagnosis as anchor variable and a pain scale with a patient acceptable symptom state (PASS) question as anchor variable. Results: The new IRT method recovered the true thresholds accurately across the simulated datasets. The other methods, except one, produced biased threshold estimates if the state prevalence was smaller or greater than 0.5. The adjusted predictive modeling method matched the new IRT method (also in the real datasets) but showed some residual bias if the prevalence was smaller than 0.3 or greater than 0.7. Conclusions: The new IRT method perfectly recovers meaningful (interpretational) thresholds for multi-item questionnaires, provided that the data satisfy the assumptions for IRT analysis.

KW - Adjusted predictive modeling

KW - Cutoff point

KW - Item response theory (IRT)

KW - Meaningful threshold

KW - Patient acceptable symptom state (PASS)

KW - Receiver operating characteristic (ROC)

U2 - 10.1007/s11136-023-03355-8

DO - 10.1007/s11136-023-03355-8

M3 - Journal article

C2 - 36780033

AN - SCOPUS:85147962907

VL - 32

SP - 1819

EP - 1830

JO - Quality of Life Research

JF - Quality of Life Research

SN - 0962-9343

ER -

ID: 339321640