The effectiveness of phonetic-contrast motivated adaptation on HMM-based synthetic voices was previously tested on English successfully. The aim of this paper is to prove that such adaptation can be exported with minor changes to languages having different intrinsic characteristics. The Italian language was chosen because it has no obvious phonemic configuration towards which human speech tend when hypo-articulated such as the mid-central vowel (schwa) for English. Nonetheless, low-contrastive attractors were identified and a linear transformation was trained by contrasting each phone pronunciation with its nearest acoustic neighbour. Different degree of hyper and hypo articulated synthetic speech was then achieved by scaling such adaptation along the dimension identified by each contrastive pair. The Italian synthesiser outcome adapted with both the maximum and the minimum transformation strength was evaluated with two objective assessments: the analysis of some common acoustic correlates and the measurement of a intelligibility-in-noise index. For the latter, signals were mixed with different disturbances at various energy ratios and intelligibility was compared to the standard-TTS generated speech. The experimental results proved such transformation on the Italian voices to be as effective as those on the English one.

A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices

Fabio Tesser;
2013

Abstract

The effectiveness of phonetic-contrast motivated adaptation on HMM-based synthetic voices was previously tested on English successfully. The aim of this paper is to prove that such adaptation can be exported with minor changes to languages having different intrinsic characteristics. The Italian language was chosen because it has no obvious phonemic configuration towards which human speech tend when hypo-articulated such as the mid-central vowel (schwa) for English. Nonetheless, low-contrastive attractors were identified and a linear transformation was trained by contrasting each phone pronunciation with its nearest acoustic neighbour. Different degree of hyper and hypo articulated synthetic speech was then achieved by scaling such adaptation along the dimension identified by each contrastive pair. The Italian synthesiser outcome adapted with both the maximum and the minimum transformation strength was evaluated with two objective assessments: the analysis of some common acoustic correlates and the measurement of a intelligibility-in-noise index. For the latter, signals were mixed with different disturbances at various energy ratios and intelligibility was compared to the standard-TTS generated speech. The experimental results proved such transformation on the Italian voices to be as effective as those on the English one.
2013
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Inglese
Antonio Bonafonte
8th ISCA Speech Synthesis Workshop (Publisher: UPC. Universitat Politècnica de Catalunya. BarcelonaTech.)
8th ISCA Speech Synthesis Workshop
8th
107
112
6
http://ssw8.talp.cat/papers/ssw8_OS3-1_Nicolao.pdf
Sì, ma tipo non specificato
August 31st - September 2nd, 2013
Barcelona, Spain
hypo/hyper-articulated speech synthesis
Italian HMM-based synthesis
intelligibility enhancement
speech adaptation
statistical parametric speech synthesis
on-line Proceedings of 8th ISCA Speech Synthesis Workshop Barcelona, Spain August 31st - September 2nd, 2013, Barcelona, Spain, September 2nd, 2013 (url: http://ssw8.talp.cat/papers/ssw8_OS3-1_Nicolao.pdf) pp. 107-112
1
none
Mauro Nicolao; Fabio Tesser; Roger K. Moore
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   Adaptive Strategies for Sustainable Long-Term Social Interaction
   ALIZ-E
   FP7
   248116
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/264697
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact