This paper introduces some methods for outlier identification in the regression setting, motivated by the analysis of steelmaking process data. The proposed methodology extends to the regression setting the boxplot rule, commonly used for outlier screening with univariate data. The focus here is on bivariate settings with a single covariate, but extensions are possible. The proposal is based on quantile regression, including an additional transformation parameter for selecting the best scale for linearity of the conditional quantiles. The resulting method is used to perform effective labeling of potential outliers, with a quite low computational complexity, allowing for simple implementation within statistical software as well as commonly used spreadsheets. Some simulation experiments have been carried out to study the swamping and masking properties of the proposal. The methodology is also illustrated by some real-life examples, taking as the response variable the energy consumed in the melting process.
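For orientation, the univariate boxplot rule that the abstract takes as its starting point can be sketched as follows. This is only the univariate baseline, not the paper's quantile-regression method. The function name `boxplot_rule`, the fence multiplier `k = 1.5` (the common convention, assumed here), and the sample values are illustrative assumptions and do not come from the paper.

```python
import numpy as np


def boxplot_rule(values, k=1.5):
    """Flag points outside [Q1 - k*IQR, Q3 + k*IQR] (hypothetical helper).

    k = 1.5 is the conventional multiplier and is an assumption here.
    """
    x = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(x, [25, 75])  # sample quartiles (linear interpolation)
    iqr = q3 - q1                        # interquartile range
    lower, upper = q1 - k * iqr, q3 + k * iqr
    flags = (x < lower) | (x > upper)    # True marks a potential outlier
    return flags, lower, upper


# Made-up illustrative data with one clearly aberrant value.
data = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 14.5]
flags, lower, upper = boxplot_rule(data)
print(flags, lower, upper)
```

In the regression setting described in the abstract, the fixed quartiles would be replaced by conditional quantiles that depend on the covariate; that step is not shown in this sketch.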

Simple outlier labeling based on quantile regression, with application to the steelmaking process

Coletto M
2016

Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Boxplot rule
Outlier
Quantile regression
Single-index model
Steelmaking process
File: prod_344213-doc_121856.pdf (publisher's version, Adobe PDF, 803.18 kB; authorized users only)

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/301100