In this paper, we present a further step in the development of an emotion tracking system based on phonetic syllables and machine learning algorithms. A system built on phonetically defined units has advantages both on the side of the amount of data needed to train the classifier and on the ability of improving our knowledge about how humans use speech to recognize emotions on the base of the physical meaning of each used feature. Since the features extraction frequency is intrinsically variable, however, it is necessary to study how to represent context and dynamics as well as to evaluate the effects of their introduction in the system. The goal of this study is to evaluate the effects of context in a previously presented system working on isolated syllables only. Obtained results show that the system performance is improved.

Introducing context in syllable based emotion tracking

Vincenzo;
2014

Abstract

In this paper, we present a further step in the development of an emotion tracking system based on phonetic syllables and machine learning algorithms. A system built on phonetically defined units has advantages both on the side of the amount of data needed to train the classifier and on the ability of improving our knowledge about how humans use speech to recognize emotions on the base of the physical meaning of each used feature. Since the features extraction frequency is intrinsically variable, however, it is necessary to study how to represent context and dynamics as well as to evaluate the effects of their introduction in the system. The goal of this study is to evaluate the effects of context in a previously presented system working on isolated syllables only. Obtained results show that the system performance is improved.
2014
Istituto di Scienze e Tecnologie della Cognizione - ISTC
emotion recognition
feature extraction
learning (artificial intelligence)
signal classification
speech processing
speech recognition
classifier training
context representation
dynamics representation
emotion recognition
intrinsically-variable feature extraction frequency
isolated syllables
knowledge improvement ability
machine learning algorithms
phonetic syllables
phonetically defined units
physical meaning
syllable-based emotion tracking system
system performance improvement
Context
Feature extraction
Linear approximation
Speech
Splines (mathematics)
Vectors
continuous emotion recognition
prosody
syllables
emotional speech
emotional speech
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/282486
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact