A Neural Network Tool to Organize Large Document Sets

Rizzo R.
2000

Abstract

Semantic document clustering is a fundamental method for helping users search and browse large collections of documents. A number of recent papers have reported applications of self-organizing artificial neural networks to semantic document clustering. In particular, Growing Neural Gas (GNG) is a growing neural network that reproduces the topological distribution of its inputs, but the resulting structure is often as complex as the structure of the input data; if the input space has more than three dimensions, neither the GNG network nor the input data structure can be visualized directly. In this paper the authors propose a modified LBG network, called LBG-m, that simplifies the GNG structure so that it can be visualized and summarized. Together, the two algorithms constitute a tool for browsing large document sets and generating a set of semantic links between clusters of similar documents.
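The abstract does not describe how LBG-m works, so the following is only a rough sketch of the general idea of summarizing many network units with a smaller codebook. It assumes that LBG refers to the Linde-Buzo-Gray vector quantization algorithm and shows the plain generalized Lloyd iteration at its core, not the authors' modified version. The function name `lbg_codebook`, the variable `gng_units`, and the sample coordinates are hypothetical and chosen purely for illustration.

```python
from math import dist


def lbg_codebook(vectors, k, max_iter=100):
    """Plain LBG-style (generalized Lloyd) iteration; NOT the paper's LBG-m.

    vectors: list of equal-length numeric tuples (here standing in for the
             weight vectors of GNG units, which is an assumption).
    k:       number of codewords used to summarize them.
    Returns (codebook, assignment), where assignment[i] is the index of the
    codeword nearest to vectors[i].
    """
    if not 0 < k <= len(vectors):
        raise ValueError("k must be between 1 and len(vectors)")

    # Crude deterministic initialization: take the first k vectors.
    codebook = [tuple(v) for v in vectors[:k]]
    assignment = None

    for _ in range(max_iter):
        # Assignment step: each vector goes to its nearest codeword.
        new_assignment = [
            min(range(k), key=lambda j: dist(v, codebook[j])) for v in vectors
        ]
        if new_assignment == assignment:
            break  # no vector changed cluster, so the codebook is stable
        assignment = new_assignment

        # Update step: move each codeword to the centroid of its members.
        for j in range(k):
            members = [v for v, a in zip(vectors, assignment) if a == j]
            if members:  # leave a codeword in place if it has no members
                codebook[j] = tuple(sum(c) / len(members) for c in zip(*members))

    return codebook, assignment


# Hypothetical 2-D unit positions forming two well-separated groups.
gng_units = [(0, 0), (10, 10), (0, 1), (1, 0), (10, 11), (11, 10)]
codebook, assignment = lbg_codebook(gng_units, k=2)
print(assignment)  # [0, 1, 0, 0, 1, 1]
```

In this toy run the six units collapse into two codewords, one per group, which is the kind of reduction that would make a complex structure easier to display. How LBG-m selects codewords or preserves the GNG links is not stated in the abstract.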
Istituto per le Tecnologie Didattiche - ITD - Sede Genova
ISBN: 978-3-540-41044-7
Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/63380