The development of a speaker independent connected "digits" recognizer for Italian is described. The CSLU Speech Toolkit was used to develop and implement the system which is based on an hybrid ANN/HMM architecture. The recognizer is trained on contextdependent categories to account for coarticulatory variation. Various front-end processing was compared and, when the best features (MFCC with CMS + ?) were considered, there was a 98.68% word recognition accuracy (90.76% sentence recognition accuracy) on a test set of the FIELD continuous digits recognition task.

High Performance Italian Continuous "Digit" recognition

Cosi P;Tesser F
2000

Abstract

The development of a speaker independent connected "digits" recognizer for Italian is described. The CSLU Speech Toolkit was used to develop and implement the system which is based on an hybrid ANN/HMM architecture. The recognizer is trained on contextdependent categories to account for coarticulatory variation. Various front-end processing was compared and, when the best features (MFCC with CMS + ?) were considered, there was a 98.68% word recognition accuracy (90.76% sentence recognition accuracy) on a test set of the FIELD continuous digits recognition task.
2000
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Inglese
Baozong Yuan, Taiyi Huang, Xiaofang Tang
ICSLP 2000 - 6th International Conference on Spoken language Processing
242
245
748
7-80150-114-4
http://www2.pd.istc.cnr.it/Papers/PieroCosi/cp-ICSLP2000-01.pdf
China Military Friendship Publish
Beijing
REPUBBLICA POPOLARE CINESE
High Performance
Italian
Continuous ASR
Digit Recognition
Cosi P., Hosom J.P., Tesser F. "High Performance Italian Continuous "Digit" recognition" Proceedings ICSLP-2000, International Conference on Spoken Language Processing Beijing, Cina 16-20 October, 2000 Vol. IV ISBN 7-80150-114-4 http://www2.pd.istc.cnr.it/Papers/PieroCosi/cp-ICSLP2000-01.pdf pp. 242-245.
2
02 Contributo in Volume::02.01 Contributo in volume (Capitolo o Saggio)
268
none
Cosi P.; Hosom J.P.; Tesser F.
info:eu-repo/semantics/bookPart
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/18535
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact