Personalised indexing and retrieval of heterogeneous structured documents.

Bordogna G; Pasi G
2005

Abstract

This paper addresses the problem of indexing heterogeneous structured documents and retrieving semi-structured documents. We propose a flexible paradigm for both indexing such documents and formulating user queries that specify soft constraints on the documents' structure and content. At the indexing level, the proposed model achieves flexibility by constructing personalised document representations based on users' views of the documents. Users specify their preferences for the document sections they consider most informative, and linguistically quantify the number of sections that determine the overall potential interest of a document. At the query level, we propose a flexible query language for expressing soft selection conditions on both the documents' structure and content.
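As an illustration only, the idea of combining per-section preferences with a linguistic quantifier can be sketched with an ordered weighted averaging (OWA) operator, one common way to model quantifiers such as "most". The abstract does not state the exact formulation used in the paper, so the function names, the quadratic definition of "most", and the way preferences scale section scores are all assumptions made for this sketch.

```python
from typing import Callable, Dict, List


def most(r: float) -> float:
    """Hypothetical fuzzy quantifier 'most' (assumption, not from the paper).

    Any non-decreasing function with most(0) == 0 and most(1) == 1 would do.
    """
    return r ** 2


def owa_weights(n: int, quantifier: Callable[[float], float]) -> List[float]:
    """Derive OWA weights from a quantifier: w_i = Q(i/n) - Q((i-1)/n)."""
    return [quantifier(i / n) - quantifier((i - 1) / n) for i in range(1, n + 1)]


def document_score(
    section_scores: Dict[str, float],
    preferences: Dict[str, float],
    quantifier: Callable[[float], float] = most,
) -> float:
    """Aggregate per-section relevance into one document-level score.

    section_scores: relevance of each section to a query, in [0, 1].
    preferences: the user's interest in each section, in [0, 1];
                 sections without a stated preference count as 0.
    """
    # Scale each section's score by the user's preference (assumed scheme).
    weighted = sorted(
        (score * preferences.get(name, 0.0) for name, score in section_scores.items()),
        reverse=True,
    )
    # OWA: weights attach to rank positions, not to particular sections.
    weights = owa_weights(len(weighted), quantifier)
    return sum(w * v for w, v in zip(weights, weighted))
```

With the identity quantifier (`lambda r: r`) the operator reduces to the arithmetic mean of the preference-scaled section scores; a convex quantifier such as the one above shifts weight towards the lower-ranked sections, so a document scores highly only when most of its preferred sections match.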
Istituto per la Dinamica dei Processi Ambientali - IDPA - Sede Venezia
Istituto per le Tecnologie della Costruzione - ITC
indexing
semi-structured documents
aggregation operators
query languages
information retrieval

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/148109