With the advent of Advanced Driver Assistance Systems (ADAS) and intelligent transport system applications, recognizing driver emotions has become essential for a decision support system (DSS) with humans in the loop (HITL). Multimodal approaches using visual cues, speech, physiological signals, and driving patterns improve emotion recognition but are challenging in resource-constrained environments where only a subset of modalities is available. This work addresses these challenges by combining multi-modal benefits with single-modality inference for emotion recognition using unlabeled external road condition data. Unlike traditional methods that average teachers' contribution, the proposed cross-modal distillation (CMD) weights teachers thanks to the Shapley additive global explanation (SAGE) aid, which improves the student model's accuracy and provides an interpretation of it. Experimental evaluations of the PPBEmo dataset show that XA-CMD improves emotion recognition accuracy with other baselines and provides deeper insights into decision-making.
Cross-modal distillation by additive importance measure in HITL autonomous driving
Bano S.;Cassara' P.;Gennaro C.;Gotta A.
2025
Abstract
With the advent of Advanced Driver Assistance Systems (ADAS) and intelligent transport system applications, recognizing driver emotions has become essential for a decision support system (DSS) with humans in the loop (HITL). Multimodal approaches using visual cues, speech, physiological signals, and driving patterns improve emotion recognition but are challenging in resource-constrained environments where only a subset of modalities is available. This work addresses these challenges by combining multi-modal benefits with single-modality inference for emotion recognition using unlabeled external road condition data. Unlike traditional methods that average teachers' contribution, the proposed cross-modal distillation (CMD) weights teachers thanks to the Shapley additive global explanation (SAGE) aid, which improves the student model's accuracy and provides an interpretation of it. Experimental evaluations of the PPBEmo dataset show that XA-CMD improves emotion recognition accuracy with other baselines and provides deeper insights into decision-making.| File | Dimensione | Formato | |
|---|---|---|---|
|
Cross-Modal_Distillation_by_Additive_Importance_Measure_in_Hitl_Autonomous_Driving.pdf
solo utenti autorizzati
Descrizione: Cross-modal distillation by additive importance measure in HITL autonomous driving
Tipologia:
Versione Editoriale (PDF)
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
1.72 MB
Formato
Adobe PDF
|
1.72 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
|
Gotta et al_IEEE_VTC_2025_XModalHITL_postprint.pdf
accesso aperto
Descrizione: Cross-modal distillation by additive importance measure in HITL autonomous driving
Tipologia:
Documento in Post-print
Licenza:
Altro tipo di licenza
Dimensione
1.64 MB
Formato
Adobe PDF
|
1.64 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


