The growth of the Internet has produced a lot of advantages, together with the opportunity to provide different people with access to a large warehouse of information. However, this phenomenon produces some difficulties in the activities of searching and retrieving. A large amount of information is sometimes useless if it does not offer tools to respond to the information needs of the user. This paper introduces an approach devoted to facilitating information access and retrieval using the World Wide Web's syntactic structures and semantic organization. We consider the HTML language syntax structure and the organization of information in a general Web document, and we define some rules that people use for structuring Web information. These rules can be used for managing and retrieving Web information and its semantics. A Web document is treated as a complex informative object formed by images, tables, animations, videos and text organized into chapters, paragraphs, titles, and so on, connected according to semantic links. Knowledge associated with the information structure helps in retrieving relevant information

Toward a retrieval of HTML documents using a semantic approach

Ferri F;Grifoni P;
2000

Abstract

The growth of the Internet has produced a lot of advantages, together with the opportunity to provide different people with access to a large warehouse of information. However, this phenomenon produces some difficulties in the activities of searching and retrieving. A large amount of information is sometimes useless if it does not offer tools to respond to the information needs of the user. This paper introduces an approach devoted to facilitating information access and retrieval using the World Wide Web's syntactic structures and semantic organization. We consider the HTML language syntax structure and the organization of information in a general Web document, and we define some rules that people use for structuring Web information. These rules can be used for managing and retrieving Web information and its semantics. A Web document is treated as a complex informative object formed by images, tables, animations, videos and text organized into chapters, paragraphs, titles, and so on, connected according to semantic links. Knowledge associated with the information structure helps in retrieving relevant information
2000
Istituto di Ricerche sulla Popolazione e le Politiche Sociali - IRPPS
0-7803-6536-4
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/185658
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact