In the gas sensor field, principal component analysis (PCA) is still the mostly used technique for exploratory data analysis, although the human judgment of PCA plots often determines the classification results. In this paper, we propose a new approach based on cluster analysis (CA) in combination with cluster validity (CLV) that can be used to objectively infer and assess the data structure. Hierarchical and k-means clustering methods are implemented together with four indices of CLV, aiming to estimating the best partition of data. We initially demonstrate the approach on simulated data sets, and then we apply it to the experimental data coming from an electronic nose. With reference to the e-nose data, the unsupervised classification results are consistent with those achieved by the visual inspection of data through PCA plots. In addition, CLV gives quantitative information about the classification strength of the data categories; this allows to understand which of the different categories better describes the data set. The proposed method, due to its reliability and objectivity, can be used for an automated and standardized evaluation of data in gas sensor devices. (c) 2007 Elsevier B.V. All rights reserved.

Cluster validation for electronic nose data

Pardo M;Sberveglieri G
2007

Abstract

In the gas sensor field, principal component analysis (PCA) is still the mostly used technique for exploratory data analysis, although the human judgment of PCA plots often determines the classification results. In this paper, we propose a new approach based on cluster analysis (CA) in combination with cluster validity (CLV) that can be used to objectively infer and assess the data structure. Hierarchical and k-means clustering methods are implemented together with four indices of CLV, aiming to estimating the best partition of data. We initially demonstrate the approach on simulated data sets, and then we apply it to the experimental data coming from an electronic nose. With reference to the e-nose data, the unsupervised classification results are consistent with those achieved by the visual inspection of data through PCA plots. In addition, CLV gives quantitative information about the classification strength of the data categories; this allows to understand which of the different categories better describes the data set. The proposed method, due to its reliability and objectivity, can be used for an automated and standardized evaluation of data in gas sensor devices. (c) 2007 Elsevier B.V. All rights reserved.
2007
INFM
Cluster validity
Electronic nose
Exploratory data analysis
Unsupervised classification
File in questo prodotto:
File Dimensione Formato  
prod_1999-doc_30074.pdf

non disponibili

Descrizione: Cluster validation for electronic nose data
Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 740.45 kB
Formato Adobe PDF
740.45 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/161856
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 16
  • ???jsp.display-item.citation.isi??? 11
social impact