Noise manifests ubiquitously in nonlinear spectroscopy, where multiple sources contribute to experimental signals generating interrelated unwanted components, from random point-wise fluctuations to structured baseline signals. Mitigating strategies are usually heuristic, depending on subjective biases such as the setting of parameters in data analysis algorithms and the removal order of the unwanted components. We propose a data-driven frequency-domain denoiser based on a convolutional neural network to extract authentic vibrational features from a nonlinear background in noisy spectroscopic raw data. The different spectral scales in the problem are treated in parallel by means of filters with multiple kernel sizes, which allow the receptive field of the network to adapt to the informative features in the spectra. We test our approach by retrieving asymmetric peaks in stimulated Raman spectroscopy, an ideal test-bed due to its intrinsic complex spectral features combined with a strong background signal. By using a theoretical perturbative toolbox, we efficiently train the network with simulated datasets resembling the statistical properties and lineshapes of the experimental spectra. The developed algorithm is successfully applied to experimental data to obtain noise- and background-free stimulated Raman spectra of organic molecules and prototypical heme proteins.

Retrieving genuine nonlinear Raman responses in ultrafast spectroscopy via deep learning

Ferrante C.;
2024

Abstract

Noise manifests ubiquitously in nonlinear spectroscopy, where multiple sources contribute to experimental signals generating interrelated unwanted components, from random point-wise fluctuations to structured baseline signals. Mitigating strategies are usually heuristic, depending on subjective biases such as the setting of parameters in data analysis algorithms and the removal order of the unwanted components. We propose a data-driven frequency-domain denoiser based on a convolutional neural network to extract authentic vibrational features from a nonlinear background in noisy spectroscopic raw data. The different spectral scales in the problem are treated in parallel by means of filters with multiple kernel sizes, which allow the receptive field of the network to adapt to the informative features in the spectra. We test our approach by retrieving asymmetric peaks in stimulated Raman spectroscopy, an ideal test-bed due to its intrinsic complex spectral features combined with a strong background signal. By using a theoretical perturbative toolbox, we efficiently train the network with simulated datasets resembling the statistical properties and lineshapes of the experimental spectra. The developed algorithm is successfully applied to experimental data to obtain noise- and background-free stimulated Raman spectra of organic molecules and prototypical heme proteins.
2024
Istituto Superconduttori, materiali innovativi e dispositivi - SPIN - Sede Secondaria L'Aquila
spectroscopy, deep Learning
File in questo prodotto:
File Dimensione Formato  
066119_1_5.0198013.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 6.85 MB
Formato Adobe PDF
6.85 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/515523
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact