CNR Institutional Research Information System

This paper describes the development of the Italian modules and the building of a new Italian female voice for the MaryTTS Text-To-Speech synthesis system. The building of new resources, such as Natural Language Processing (NLP) modules and corpus based voices for a new language in a Text To Speech system is a costly task. MaryTTS provides a number of useful tools for automatize and simplify this task. Nowadays two state-of-the-art speech synthesis technologies are applied on modern TTS: unit selection and HMM-based synthesis. A brief introduction about the peculiar characteristic of the HMM-based speech synthesis is given in this paper; the HMM-based synthesis approach has been chosen for its higher degree of flexibility. In the paper, the main steps necessary to built the essential NLP modules used in a TTS system using the MaryTTS tools are described. For the Italian language, more advanced NLP modules have been implemented with respect to the basic ones provided by the automatic procedures of MaryTTS. A detailed description of the Italian MaryTTS NLP modules (such as Lexicon, LTS rules and homograph pronunciation disambiguation, numbers expansion, Part of Speech Tagger and prosodic labels prediction) has been reported here. The paper finally illustrates the MaryTTS process necessary to select a phonetically and prosodic balanced text corpus for TTS and reports the details of the procedure used to build the first Italian MaryTTS voice with the HMM synthesis technology.

A New Language and a New Voice for MaryTTS

Tesser F;Paci G;Sommavilla G;Cosi P

2013

Abstract

This paper describes the development of the Italian modules and the building of a new Italian female voice for the MaryTTS Text-To-Speech synthesis system. The building of new resources, such as Natural Language Processing (NLP) modules and corpus based voices for a new language in a Text To Speech system is a costly task. MaryTTS provides a number of useful tools for automatize and simplify this task. Nowadays two state-of-the-art speech synthesis technologies are applied on modern TTS: unit selection and HMM-based synthesis. A brief introduction about the peculiar characteristic of the HMM-based speech synthesis is given in this paper; the HMM-based synthesis approach has been chosen for its higher degree of flexibility. In the paper, the main steps necessary to built the essential NLP modules used in a TTS system using the MaryTTS tools are described. For the Italian language, more advanced NLP modules have been implemented with respect to the basic ones provided by the automatic procedures of MaryTTS. A detailed description of the Italian MaryTTS NLP modules (such as Lexicon, LTS rules and homograph pronunciation disambiguation, numbers expansion, Part of Speech Tagger and prosodic labels prediction) has been reported here. The paper finally illustrates the MaryTTS process necessary to select a phonetically and prosodic balanced text corpus for TTS and reports the details of the procedure used to build the first Italian MaryTTS voice with the HMM synthesis technology.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2013
			
	Strutture organizzative
	
				Istituto di Scienze e Tecnologie della Cognizione - ISTC
			
	Codice ISBN
	
				978-88-7870-901-0
			
	Parole chiave
	
				MaryTTS
TTS
Speech Synthesis
			
	Appare nelle tipologie:
	
				02.01 Contributo in volume (Capitolo o Saggio)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/260587

Citazioni

ND

ND

ND

social impact