Emotion classification is a research area in which there has been very intensive literature production concerning natural language processing, multimedia data, semantic knowledge discovery, social network mining, and text and multimedia data mining. This paper addresses the issue of emotion classification and proposes a method for classifying the emotions expressed in multimodal data extracted from videos. The proposed method models multimodal data as a sequence of features extracted from facial expressions, speech, gestures, and text, using a linguistic approach. Each sequence of multimodal data is correctly associated with the emotion by a method that models each emotion using a hidden Markov model. The trained model is evaluated on samples of multimodal sentences associated with seven basic emotions. The experimental results demonstrate a good classification rate for emotions.

Emotion Classification from Speech and Text in Videos Using a Multimodal Approach

Caschera MC
Primo
;
Grifoni P
;
Ferri F
2022

Abstract

Emotion classification is a research area in which there has been very intensive literature production concerning natural language processing, multimedia data, semantic knowledge discovery, social network mining, and text and multimedia data mining. This paper addresses the issue of emotion classification and proposes a method for classifying the emotions expressed in multimodal data extracted from videos. The proposed method models multimodal data as a sequence of features extracted from facial expressions, speech, gestures, and text, using a linguistic approach. Each sequence of multimodal data is correctly associated with the emotion by a method that models each emotion using a hidden Markov model. The trained model is evaluated on samples of multimodal sentences associated with seven basic emotions. The experimental results demonstrate a good classification rate for emotions.
2022
Istituto di Ricerche sulla Popolazione e le Politiche Sociali - IRPPS
Inglese
6
28
1
23
23
https://www.mdpi.com/2414-4088/6/4/28
Sì, ma tipo non specificato
emotion classification; multimodal interaction; hidden Markov models
"Emotion Classification from Speech and Text in Videos Using a Multimodal Approach" is the winner of the MTI Best Paper Award. The award is based on an evaluation of the originality, and significance of the papers, citations, and downloads. The selection is made by the Best Paper Award Committee of Multimodal Technologies and Interaction (MTI), an international, peer-reviewed, open-access journal published monthly online by MDPI (Impact Factor: 2.5). You can view the announcement here: https://www.mdpi.com/journal/mti/awards/2337
Internazionale
Elettronico
No
3
info:eu-repo/semantics/article
262
Caschera, Mc; Grifoni, P; Ferri, F
01 Contributo su Rivista::01.01 Articolo in rivista
open
File in questo prodotto:
File Dimensione Formato  
prod_468902-doc_189682.pdf

accesso aperto

Descrizione: Emotion Classification from Speech and Text in Videos Using a Multimodal Approach
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 4.39 MB
Formato Adobe PDF
4.39 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/414841
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 28
  • ???jsp.display-item.citation.isi??? 20
social impact