This work introduces a novel approach for detecting and spatially mapping microplastics in soil using Terahertz (THz) Time-Domain Hyperspectral Imaging. To this end, we applied a range of statistical and machine-learning techniques. First, we conducted a statistical analysis that enabled the identification of spectral and temporal signatures of the THz signal associated with low-density polyethylene contamination. These features informed a similarity analysis, which generated heatmaps exhibiting clear spatial patterns characterized by an inverse relationship between similarity to soil and to plastic. Although effective, this approach relies on prior knowledge of pure reference spectra, thereby limiting its applicability to uncharacterized soil environments. To address this limitation, we explored unsupervised learning methods—Principal Component Analysis, K-means clustering, Agglomerative Clustering, Spectral Clustering and Gaussian Mixture Models—that leverage internal variance to segment the data without reference spectra. All five methods consistently separated the mixtures into distinct phases associated with soil and plastic. Averaged spectra within these regions recovered the characteristic features of each material. The convergence of independent unsupervised methods to similar spatial and spectral distinctions highlights the robustness of the approach. Note that unsupervised approaches offer a key advantage when labeled samples are unavailable for training supervised algorithms. The key contribution of this work is the demonstration that, under controlled—albeit simplified—experimental conditions, a robust separation between plastic and soil consistently emerges across multiple, conceptually independent analytical methods, indicating that the observed discrimination is rooted in underlying physical contrasts rather than being specific to any single algorithm. This controlled regime allows the fundamental physical behavior of the plastic–soil system to be characterized before addressing more challenging scenarios.

Microplastic detection in soil by THz Time-Domain hyperspectral imaging combined with unsupervised learning analysis

Martinez A.
Primo
;
Di Sarno V.;Maddaloni P.;Cerruti P.;Cocca M.;Paturzo M.;Paparo D.
2026

Abstract

This work introduces a novel approach for detecting and spatially mapping microplastics in soil using Terahertz (THz) Time-Domain Hyperspectral Imaging. To this end, we applied a range of statistical and machine-learning techniques. First, we conducted a statistical analysis that enabled the identification of spectral and temporal signatures of the THz signal associated with low-density polyethylene contamination. These features informed a similarity analysis, which generated heatmaps exhibiting clear spatial patterns characterized by an inverse relationship between similarity to soil and to plastic. Although effective, this approach relies on prior knowledge of pure reference spectra, thereby limiting its applicability to uncharacterized soil environments. To address this limitation, we explored unsupervised learning methods—Principal Component Analysis, K-means clustering, Agglomerative Clustering, Spectral Clustering and Gaussian Mixture Models—that leverage internal variance to segment the data without reference spectra. All five methods consistently separated the mixtures into distinct phases associated with soil and plastic. Averaged spectra within these regions recovered the characteristic features of each material. The convergence of independent unsupervised methods to similar spatial and spectral distinctions highlights the robustness of the approach. Note that unsupervised approaches offer a key advantage when labeled samples are unavailable for training supervised algorithms. The key contribution of this work is the demonstration that, under controlled—albeit simplified—experimental conditions, a robust separation between plastic and soil consistently emerges across multiple, conceptually independent analytical methods, indicating that the observed discrimination is rooted in underlying physical contrasts rather than being specific to any single algorithm. This controlled regime allows the fundamental physical behavior of the plastic–soil system to be characterized before addressing more challenging scenarios.
2026
Istituto per i Polimeri, Compositi e Biomateriali - IPCB
Istituto Nazionale di Ottica - INO - Sede Secondaria di Pozzuoli
Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" - ISASI
Agglomerative clustering
K-Means clustering
Machine learning
Microplastic
Principal components analysis
Soil contamination
Spectral clustering
Spectral similarity analysis
THz imaging
THz-TDS
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/582648
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ente

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact