A joint synchrony/mean-rate auditory model, recently proposed by Seneff[6], is embedded into a classical DTW-based system for the recognition of Italian digits. Its performances are evalu¬ated in both clean and noisy speech and compared with those of a system based on the Mel¬cepstrum representation. Experimental results show that the Mel representation outperforms the auditory model. Problems encountered by the auditory model in noisy speech are outlined and suggestions for noise compensation techniques both inside and outside the model are given. Simple image processing techniques aiming to clean up the synchrony spectrogram in noisy speech are suggested and some promising preliminary results are presented.

A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition

Cosi P;
1990

Abstract

A joint synchrony/mean-rate auditory model, recently proposed by Seneff[6], is embedded into a classical DTW-based system for the recognition of Italian digits. Its performances are evalu¬ated in both clean and noisy speech and compared with those of a system based on the Mel¬cepstrum representation. Experimental results show that the Mel representation outperforms the auditory model. Problems encountered by the auditory model in noisy speech are outlined and suggestions for noise compensation techniques both inside and outside the model are given. Simple image processing techniques aiming to clean up the synchrony spectrogram in noisy speech are suggested and some promising preliminary results are presented.
1990
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Inglese
Luis Torres, Enrique Masgrau, Miguel A. Lagunas
Proceedings EUSIPCO-90 - Signal Processing: Theory and Applications. Fifth European Signal Processing Conference
Proceedings EUSIPCO-90
1199
1202
2034
0444886362
North Holland Pub. Co.
Elsevier
Amsterdam
Amsterdam
PAESI BASSI
PAESI BASSI
Sì, ma tipo non specificato
18-21 September, 1990
Barcellona, Spain
Mel-Scale Cepstrum
Auditory Model Representation
Noisy Speech Recognition
Cosi P., Falavigna D., Mian G.A., Omologo M. A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition Proceedings EUSIPCO-90 - Signal Processing: Theory and Applications. Fifth European Signal Processing Conference Barcellona, Spain 18-21 September, 1990 pp. 1199-1202
4
none
Cosi, P; Falavigna, D; Mian, Ga; Omologo, M
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/16760
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact