The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical con text has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for bal anced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-dis eased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for ma lignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. In conclusion, the AUC is a measure of accu racy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should al ways be evaluated with an analysis of the shape of the corresponding ROC curve.

The clinical meaning of the area under a Receiver Operating Characteristic curve for the evaluation of the performance of disease markers

Marco Muselli
Ultimo
2022

Abstract

The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical con text has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for bal anced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-dis eased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for ma lignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. In conclusion, the AUC is a measure of accu racy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should al ways be evaluated with an analysis of the shape of the corresponding ROC curve.
2022
Istituto di Elettronica e di Ingegneria dell'Informazione e delle Telecomunicazioni - IEIIT
Receiver operating characteristic curve
Biomarkers
Diagnostic accuracy
Metz and Kronman test
Binormal model
File in questo prodotto:
File Dimensione Formato  
prod_485315-doc_201023.pdf

accesso aperto

Descrizione: The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 509.94 kB
Formato Adobe PDF
509.94 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/459799
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 25
  • ???jsp.display-item.citation.isi??? 22
social impact