Short time Fourier analysis in combination with filter-bank techniques or cep-strum analysis have been used for many years in order to reduce timbre repre¬sentation complexity. Recently, in speech analysis and recognition, the intro¬duction of auditory models (Cooke et al. 1993) which explicitly consider non¬linear phenomena occurring in the perception mechanism, has given promising results especially when speech is highly degraded by noise (Cosi 1993). On the other hand, Neural Networks (NN) have already proved their classification capability in various pattern recognition tasks. For these reasons, a timbre clas¬sification system, directly starting from sound signals, was conceived in which auditory modeling and neural network techniques were combined together in order to reduce timbre multidimensionality. In particular S. Seneff's auditory modeling (Seneff, 1988) was used in the analysis stage, while a bidimensional Kohonen Self Organizing Map (SOM) was used in the classification stage.

Timbre Classification by NN and Auditory Modeling

Cosi P;
1994

Abstract

Short time Fourier analysis in combination with filter-bank techniques or cep-strum analysis have been used for many years in order to reduce timbre repre¬sentation complexity. Recently, in speech analysis and recognition, the intro¬duction of auditory models (Cooke et al. 1993) which explicitly consider non¬linear phenomena occurring in the perception mechanism, has given promising results especially when speech is highly degraded by noise (Cosi 1993). On the other hand, Neural Networks (NN) have already proved their classification capability in various pattern recognition tasks. For these reasons, a timbre clas¬sification system, directly starting from sound signals, was conceived in which auditory modeling and neural network techniques were combined together in order to reduce timbre multidimensionality. In particular S. Seneff's auditory modeling (Seneff, 1988) was used in the analysis stage, while a bidimensional Kohonen Self Organizing Map (SOM) was used in the classification stage.
1994
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
978-3-540-19887-1
Timbre Classification
Neural Networks
Auditory Modeling
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/16052
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact