In the last decade, the field of Big Data Analytics has become increasingly important in both the academic and the business communities. Typically, data are mostly structured, collected by different actors through various heterogeneous and distributed information sources, and stored and managed often directly in XML. In order to enable large volume of data to be described in such a way that their meaning can be exploited by machines and, thus, semantic queries and automatic inferential procedures can be enabled, this paper presents an automatic method to derive OWL ontologies from XML schemas. The main contribution of this method relies on the possibility of producing a target ontology starting from multiple XML schemas, by discriminating between domain and cross-domain entities and, contextually, simplifying the overall structure of the final ontology generated, i.e. By eliminating not-used cross-domain entities. This method has been applied to a concrete application case in the healthcare domain, with the goal of generating an ontological model from the XML schemas implementing the HL7 Version 3 Clinical Document Architecture Release 2.

An automatic method for deriving OWL ontologies from XML documents

Aniello Minutolo;Angelo Esposito;Mario Ciampi;Massimo Esposito;
2014

Abstract

In the last decade, the field of Big Data Analytics has become increasingly important in both the academic and the business communities. Typically, data are mostly structured, collected by different actors through various heterogeneous and distributed information sources, and stored and managed often directly in XML. In order to enable large volume of data to be described in such a way that their meaning can be exploited by machines and, thus, semantic queries and automatic inferential procedures can be enabled, this paper presents an automatic method to derive OWL ontologies from XML schemas. The main contribution of this method relies on the possibility of producing a target ontology starting from multiple XML schemas, by discriminating between domain and cross-domain entities and, contextually, simplifying the overall structure of the final ontology generated, i.e. By eliminating not-used cross-domain entities. This method has been applied to a concrete application case in the healthcare domain, with the goal of generating an ontological model from the XML schemas implementing the HL7 Version 3 Clinical Document Architecture Release 2.
2014
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
978-1-4799-4171-1
Ontology generation
XML Schema
OWL
Ontologies
HL7 CDA
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/282899
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact