This paper describes the development of the Italian modules and the building of a new Italian female voice for the MaryTTS Text-To-Speech synthesis system. The building of new resources, such as Natural Language Processing (NLP) modules and corpus based voices for a new language in a Text To Speech system is a costly task. MaryTTS provides a number of useful tools for automatize and simplify this task. Nowadays two state-of-the-art speech synthesis technologies are applied on modern TTS: unit selection and HMM-based synthesis. A brief introduction about the peculiar characteristic of the HMM-based speech synthesis is given in this paper; the HMM-based synthesis approach has been chosen for its higher degree of flexibility. In the paper, the main steps necessary to built the essential NLP modules used in a TTS system using the MaryTTS tools are described. For the Italian language, more advanced NLP modules have been implemented with respect to the basic ones provided by the automatic procedures of MaryTTS. A detailed description of the Italian MaryTTS NLP modules (such as Lexicon, LTS rules and homograph pronunciation disambiguation, numbers expansion, Part of Speech Tagger and prosodic labels prediction) has been reported here. The paper finally illustrates the MaryTTS process necessary to select a phonetically and prosodic balanced text corpus for TTS and reports the details of the procedure used to build the first Italian MaryTTS voice with the HMM synthesis technology.

A New Language and a New Voice for MaryTTS

Tesser F;Paci G;Sommavilla G;Cosi P
2013

Abstract

This paper describes the development of the Italian modules and the building of a new Italian female voice for the MaryTTS Text-To-Speech synthesis system. The building of new resources, such as Natural Language Processing (NLP) modules and corpus based voices for a new language in a Text To Speech system is a costly task. MaryTTS provides a number of useful tools for automatize and simplify this task. Nowadays two state-of-the-art speech synthesis technologies are applied on modern TTS: unit selection and HMM-based synthesis. A brief introduction about the peculiar characteristic of the HMM-based speech synthesis is given in this paper; the HMM-based synthesis approach has been chosen for its higher degree of flexibility. In the paper, the main steps necessary to built the essential NLP modules used in a TTS system using the MaryTTS tools are described. For the Italian language, more advanced NLP modules have been implemented with respect to the basic ones provided by the automatic procedures of MaryTTS. A detailed description of the Italian MaryTTS NLP modules (such as Lexicon, LTS rules and homograph pronunciation disambiguation, numbers expansion, Part of Speech Tagger and prosodic labels prediction) has been reported here. The paper finally illustrates the MaryTTS process necessary to select a phonetically and prosodic balanced text corpus for TTS and reports the details of the procedure used to build the first Italian MaryTTS voice with the HMM synthesis technology.
2013
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Italiano
Inglese
Vincenzo Galata
Multimodalità e Multilingualità - La sfida più avanzata della comunicazione orale (Multimodality and Multilinguism: new Challenges for the study of Oral Communication)
435
443
9
978-88-7870-901-0
http://www.bulzoni.it/index2.php?page=shop.product_details&flypage=flypage.tpl&product_id=17503&category_id=573&option=com_virtuemart&Itemid=1
Bulzoni Editore
Roma
ITALIA
Sì, ma tipo non specificato
MaryTTS
TTS
Speech Synthesis
Abstract Book & CD-Rom Proceedings of AISV 2013, 9th Conference of Associazione Italiana di Scienze della Voce, Multimodalità e Multilingualità - La sfida più avanzata della comunicazione orale Multimodality and Multilinguism: new Challenges for the study of Oral Communication (MaMChOC) Jan 21-23, 2013, Università Ca' Foscari - Venezia, Abstract Book: 75 - (CD: 435-443).
4
02 Contributo in Volume::02.01 Contributo in volume (Capitolo o Saggio)
268
none
Tesser F.; Paci G.; Sommavilla G.; Cosi P.
info:eu-repo/semantics/bookPart
   Adaptive Strategies for Sustainable Long-Term Social Interaction
   ALIZ-E
   FP7
   248116
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/260587
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact