In this paper, we consider two different aspects of the automatic speech recognition task: the effectiveness of using open-source ASR toolkits and the quite problematic recognition of children speech. On this difficult task, we compare three well established and widely available ASR toolkits and we finally demonstrate the feasibility of applying these results to speech recognition and spoken dialogue system design. Even if various open source ASR toolkits are now available, we were mainly interested in evaluate the usability of the relatively new BAVIECA system in comparison to two systems (SONIC and SPHINX) for which we had already various results in past experiments on children speech. This paper is intended to provide the reader with a simple overview of the solutions adopted by the three different systems under investigation and with the demonstration of their effectiveness on children speech. Furthermore, the paper provides suggestions for future research directions in the field.

Comparing Open Source ASR Toolkits on Italian Children Speech

Cosi P;Paci G;Sommavilla G;Tesser;
2014

Abstract

In this paper, we consider two different aspects of the automatic speech recognition task: the effectiveness of using open-source ASR toolkits and the quite problematic recognition of children speech. On this difficult task, we compare three well established and widely available ASR toolkits and we finally demonstrate the feasibility of applying these results to speech recognition and spoken dialogue system design. Even if various open source ASR toolkits are now available, we were mainly interested in evaluate the usability of the relatively new BAVIECA system in comparison to two systems (SONIC and SPHINX) for which we had already various results in past experiments on children speech. This paper is intended to provide the reader with a simple overview of the solutions adopted by the three different systems under investigation and with the demonstration of their effectiveness on children speech. Furthermore, the paper provides suggestions for future research directions in the field.
2014
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Inglese
Kay Berkling
WOCCI 2014, 4th Workshop on Child Computer Interaction
http://www.wocci.org/proceedings/2014/wocci2014_proceedings.pdf
ISCA, International speech communication association
Baixas
FRANCIA
Sì, ma tipo non specificato
September 19th, 2014
Singapore
Cosi, P., Nicolao, M., G., Paci, G., Sommavilla, G., Tesser, F. "Comparing Open Source ASR Toolkits on Italian Children Speech" in onLine Proceedings of WOCCI 2014, 4th Workshop on Child Computer Interaction, Satellite Event of INTERSPEECH 2014 Singapore, September 19th, 2014, http://www.wocci.org/proceedings/2014/wocci2014_proceedings.pdf
2
none
Cosi P; Nicolao M G; Paci G; Sommavilla G; Tesser; F
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   Adaptive Strategies for Sustainable Long-Term Social Interaction
   ALIZ-E
   FP7
   248116
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/264075
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact