This paper reports on the analysis of the spectral variation of emotional speech. Spectral envelopes of time aligned speech frames are compared between emotionally neutral and active utterances. Statistics are computed over the resulting differential spectral envelopes for each phoneme. Finally, these statistics are classified using agglomerative hierarchical clustering and a measure of dissimilarity between statistical distributions and the resulting clusters are analysed. The results show that there are systematic changes in spectral envelopes when going from neutral to sad or happy speech, and those changes depend on the valence of the emotional content (negative, positive) as well as on the phonetic properties of the sounds such as voicing and place of articulation.

Cluster Analysis of Differential Spectral Envelopes on Emotional Speech

Fabio Tesser;Piero Cosi
2010

Abstract

This paper reports on the analysis of the spectral variation of emotional speech. Spectral envelopes of time aligned speech frames are compared between emotionally neutral and active utterances. Statistics are computed over the resulting differential spectral envelopes for each phoneme. Finally, these statistics are classified using agglomerative hierarchical clustering and a measure of dissimilarity between statistical distributions and the resulting clusters are analysed. The results show that there are systematic changes in spectral envelopes when going from neutral to sad or happy speech, and those changes depend on the valence of the emotional content (negative, positive) as well as on the phonetic properties of the sounds such as voicing and place of articulation.
2010
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Inglese
Takao Kobayashi; Keikichi Hirose; and Satoshi Nakamura
Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)
INTERSPEECH 2010
322
325
4
978-1-61782-123-3
http://www.isca-speech.org/archive/interspeech_2010/
ISCA-INST SPEECH COMMUNICATION ASSOCIATION, C/O EMMANUELLE FOXONET
ISCA, International speech communication association
LIEU DIT LOUS TOURILS, BAIXAS, F-66390
Baixas
FRANCIA
FRANCIA
Sì, ma tipo non specificato
26-20 Settembre
Makuhari, Japan
emotional speech
hierarchical clustering
spectral envelopes
CD Proceedings Source: 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), Pages: 322-325 Published: 2010 ISSN: 1990-9772 ______________________________________________________ Printed Proceedings Book IDS Number: BWO17 ISBN: 978-1-61782-123-3ISI Web of Science ______________________________________________________ Cluster Analysis of Differential Spectral Envelopes on Emotional Speech Author(s): Salvi, G (Salvi, Giampiero)1; Tesser, F (Tesser, Fabio); Zovato, E (Zovato, Enrico); Cosi, P (Cosi, Piero) Book Group Author(s): INST SPEECH COMMUN ASSOC Source: 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 Pages: 322-325 Published: 2010 Times Cited: 0 (from Web of Science) Cited References: 19 [ view related records ] Citation MapCitation Map Conference: 11th Annual Conference of the International-Speech-Communication-Association 2010 Location: Makuhari, JAPAN Date: SEP 26-30, 2010 Sponsor(s): Japan World Exposit, Commemorat Org; Japan Soc Promot Sci; Telecommunicat Advancement Fdn; KDDI Fdn; Murata Sci Fdn; Adv Telecommunicat Technol Res Fdn, Support Ctr; Chiba Convent Bur & Int Ctr; Renesas Elect Corp; Google; Microsoft Corp; Nuance Commun Inc; Appen Pty Ltd; IBM Res; Sony Corp; Hitachi Ltd; Yahoo Japan Corp; Asahi Kasei Corp; KDDI R & D Lab Inc; Yamaha Corp; Toshiba Corp; Fujitsu Ltd; Mitsubishi Elect Corp; RION Co Ltd; NEC Corp Abstract: This paper reports on the analysis of the spectral variation of emotional speech. Spectral envelopes of time aligned speech frames are compared between emotionally neutral and active utterances. Statistics are computed over the resulting differential spectral envelopes for each phoneme. Finally, these statistics are classified using agglomerative hierarchical clustering and a measure of dissimilarity between statistical distributions and the resulting clusters are analysed. The results show that there are systematic changes in spectral envelopes when going from neutral to sad or happy speech, and those changes depend on the valence of the emotional content (negative, positive) as well as on the phonetic properties of the sounds such as voicing and place of articulation. Accession Number: WOS:000294382400077 Document Type: Proceedings Paper Language: English Author Keywords: emotional speech; hierarchical clustering; spectral envelopes Reprint Address: Salvi, G (reprint author), KTH, Sch Comp Sci & Commun, Dept Speech Mus & Hearing, Stockholm, Sweden Addresses: 1. KTH, Sch Comp Sci & Commun, Dept Speech Mus & Hearing, Stockholm, Sweden E-mail Address: [email protected], [email protected], [email protected], [email protected] Publisher: ISCA-INST SPEECH COMMUNICATION ASSOC, C/O EMMANUELLE FOXONET, 4 RUE DES FAUVETTES, LIEU DIT LOUS TOURILS, BAIXAS, F-66390, FRANCE Web of Science Category: Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic Subject Category: Computer Science; Engineering IDS Number: BWO17 ISBN: 978-1-61782-123-3
2
none
Giampiero Salvi; Fabio Tesser; Enrico Zovato; Piero Cosi;
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/14164
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 4
social impact