The Net Reclassification Index (NRI): A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

The Net Reclassification Index (NRI) : A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets. / Pepe, Margaret S.; Fan, Jing; Feng, Ziding; Gerds, Thomas; Hilden, Jorgen.

I: Statistics in BioSciences, Bind 7, Nr. 2, 10.2015, s. 282-295.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Pepe, MS, Fan, J, Feng, Z, Gerds, T & Hilden, J 2015, 'The Net Reclassification Index (NRI): A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets', Statistics in BioSciences, bind 7, nr. 2, s. 282-295. https://doi.org/10.1007/s12561-014-9118-0

APA

Pepe, M. S., Fan, J., Feng, Z., Gerds, T., & Hilden, J. (2015). The Net Reclassification Index (NRI): A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets. Statistics in BioSciences, 7(2), 282-295. https://doi.org/10.1007/s12561-014-9118-0

Vancouver

Pepe MS, Fan J, Feng Z, Gerds T, Hilden J. The Net Reclassification Index (NRI): A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets. Statistics in BioSciences. 2015 okt.;7(2):282-295. https://doi.org/10.1007/s12561-014-9118-0

Author

Pepe, Margaret S. ; Fan, Jing ; Feng, Ziding ; Gerds, Thomas ; Hilden, Jorgen. / The Net Reclassification Index (NRI) : A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets. I: Statistics in BioSciences. 2015 ; Bind 7, Nr. 2. s. 282-295.

Bibtex

@article{76220ae5194140acb1d38944f70467ec,
title = "The Net Reclassification Index (NRI): A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets",
abstract = "The Net Reclassification Index (NRI) is a very popular measure for evaluating the improvement in prediction performance gained by adding a marker to a set of baseline predictors. However, the statistical properties of this novel measure have not been explored in depth. We demonstrate the alarming result that the NRI statistic calculated on a large test dataset using risk models derived from a training set is likely to be positive even when the new marker has no predictive information. A related theoretical example is provided in which an incorrect risk function that includes an uninformative marker is proven to erroneously yield a positive NRI. Some insight into this phenomenon is provided. Since large values for the NRI statistic may simply be due to use of poorly fitting risk models, we suggest caution in using the NRI as the basis for marker evaluation. Other measures of prediction performance improvement, such as measures derived from the receiver operating characteristic curve, the net benefit function, and the Brier score, cannot be large due to poorly fitting risk functions.",
author = "Pepe, {Margaret S.} and Jing Fan and Ziding Feng and Thomas Gerds and Jorgen Hilden",
year = "2015",
month = oct,
doi = "10.1007/s12561-014-9118-0",
language = "English",
volume = "7",
pages = "282--295",
journal = "Statistics in Biosciences",
issn = "1867-1764",
publisher = "Springer",
number = "2",

}

RIS

TY - JOUR

T1 - The Net Reclassification Index (NRI)

T2 - A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets

AU - Pepe, Margaret S.

AU - Fan, Jing

AU - Feng, Ziding

AU - Gerds, Thomas

AU - Hilden, Jorgen

PY - 2015/10

Y1 - 2015/10

N2 - The Net Reclassification Index (NRI) is a very popular measure for evaluating the improvement in prediction performance gained by adding a marker to a set of baseline predictors. However, the statistical properties of this novel measure have not been explored in depth. We demonstrate the alarming result that the NRI statistic calculated on a large test dataset using risk models derived from a training set is likely to be positive even when the new marker has no predictive information. A related theoretical example is provided in which an incorrect risk function that includes an uninformative marker is proven to erroneously yield a positive NRI. Some insight into this phenomenon is provided. Since large values for the NRI statistic may simply be due to use of poorly fitting risk models, we suggest caution in using the NRI as the basis for marker evaluation. Other measures of prediction performance improvement, such as measures derived from the receiver operating characteristic curve, the net benefit function, and the Brier score, cannot be large due to poorly fitting risk functions.

AB - The Net Reclassification Index (NRI) is a very popular measure for evaluating the improvement in prediction performance gained by adding a marker to a set of baseline predictors. However, the statistical properties of this novel measure have not been explored in depth. We demonstrate the alarming result that the NRI statistic calculated on a large test dataset using risk models derived from a training set is likely to be positive even when the new marker has no predictive information. A related theoretical example is provided in which an incorrect risk function that includes an uninformative marker is proven to erroneously yield a positive NRI. Some insight into this phenomenon is provided. Since large values for the NRI statistic may simply be due to use of poorly fitting risk models, we suggest caution in using the NRI as the basis for marker evaluation. Other measures of prediction performance improvement, such as measures derived from the receiver operating characteristic curve, the net benefit function, and the Brier score, cannot be large due to poorly fitting risk functions.

U2 - 10.1007/s12561-014-9118-0

DO - 10.1007/s12561-014-9118-0

M3 - Journal article

C2 - 26504496

VL - 7

SP - 282

EP - 295

JO - Statistics in Biosciences

JF - Statistics in Biosciences

SN - 1867-1764

IS - 2

ER -

ID: 140154597