Unified Access to Extract Knowledge from Heterogeneous Web Archives
Marco Padula
2001
Abstract
This paper proposes an integration of tools that provides unified access to remote, heterogeneous archives whose contents can be grouped under the same subject, so that users can navigate them and conduct thematic searches. Because the information sources are frequently modified, added to, and removed locally, attention has been paid to the permanence of their references. Source interoperability is supported at the language, protocol, and schema levels. The architecture is based on a new common schema for the archives, defined in new representation and query languages and grounded in an ontology to avoid misunderstanding and ambiguity.