Cross-modal distillation by additive importance measure in HITL autonomous driving

Bano S.; Cassarà P.; Gennaro C.; Gotta A.
2025

Abstract

With the advent of Advanced Driver Assistance Systems (ADAS) and intelligent transport system applications, recognizing driver emotions has become essential for decision support systems (DSS) with humans in the loop (HITL). Multimodal approaches that combine visual cues, speech, physiological signals, and driving patterns improve emotion recognition, but they are hard to deploy in resource-constrained environments where only a subset of modalities is available. This work addresses this challenge by combining multimodal benefits with single-modality inference for emotion recognition, using unlabeled external road condition data. Unlike traditional methods that average the teachers' contributions, the proposed cross-modal distillation (CMD) weights the teachers with the aid of Shapley additive global explanation (SAGE), which improves the student model's accuracy and offers an interpretation of the model. Experimental evaluations on the PPBEmo dataset show that XA-CMD improves emotion recognition accuracy compared with other baselines and provides deeper insights into decision-making.
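
The contrast between averaging teachers and weighting them by importance can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the temperature value, and the clipping of negative scores are all assumptions made for illustration, and the per-teacher importance scores are taken as given inputs (for example, precomputed SAGE values) rather than computed here.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-softened softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def importance_to_weights(importances):
    """Map per-teacher importance scores to weights that sum to 1.

    Assumption: negative scores are clipped to zero, and the weights
    fall back to a uniform average when no score is positive.
    """
    clipped = [max(0.0, s) for s in importances]
    total = sum(clipped)
    if total == 0.0:
        return [1.0 / len(importances)] * len(importances)
    return [s / total for s in clipped]


def blended_teacher_target(teacher_logits, weights, temperature=2.0):
    """Importance-weighted mixture of the teachers' softened outputs."""
    n_classes = len(teacher_logits[0])
    target = [0.0] * n_classes
    for logits, w in zip(teacher_logits, weights):
        probs = softmax(logits, temperature)
        for k in range(n_classes):
            target[k] += w * probs[k]
    return target


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pk * math.log((pk + eps) / (qk + eps)) for pk, qk in zip(p, q))


def distillation_loss(student_logits, teacher_logits, importances, temperature=2.0):
    """KL divergence between the blended teacher target and the student."""
    weights = importance_to_weights(importances)
    target = blended_teacher_target(teacher_logits, weights, temperature)
    student = softmax(student_logits, temperature)
    return kl_divergence(target, student)


# Hypothetical example: two teachers (one per modality), three emotion classes.
teacher_logits = [[2.0, 0.5, -1.0], [0.2, 1.5, 0.1]]
importances = [3.0, 1.0]  # assumed precomputed importance scores
loss = distillation_loss([1.0, 1.0, 0.0], teacher_logits, importances)
```

Passing equal importance scores reduces the blended target to the plain average of the teachers, which corresponds to the baseline the abstract contrasts against.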
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
979-8-3315-3147-8
ADAS; SAGE; CMD; AI
Files in this record:

Cross-Modal_Distillation_by_Additive_Importance_Measure_in_Hitl_Autonomous_Driving.pdf

authorized users only

Description: Cross-modal distillation by additive importance measure in HITL autonomous driving
Type: Publisher's version (PDF)
License: NOT PUBLIC - Private/restricted access
Size: 1.72 MB
Format: Adobe PDF

Gotta et al_IEEE_VTC_2025_XModalHITL_postprint.pdf

open access

Description: Cross-modal distillation by additive importance measure in HITL autonomous driving
Type: Post-print document
License: Other license type
Size: 1.64 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/562926