A typical structure of medical data is a sequence of ob- servations of clinical parameters taken at different time mo- ments. In this kind of contexts, the temporal dimension of data is a fundamental variable that should be taken into account in the mining process and returned as part of the extracted knowledge. Therefore, the classical and well es- tablished framework of sequential pattern mining is not enough, because it only focuses on the sequentiality of events, without extracting the typical time elapsing between two particular events. Time-annotated sequences ( TAS) is a novel mining paradigm that solves this problem. Recently defined in our laboratory [4] together with an efficient al- gorithm for extracting them, TAS are sequential patterns where each transition between two events is annotated with a typical transition time that is found frequent in the data. In this paper we report a real-world medical case study, in which the TAS mining paradigm is applied to clinical data regarding a set of patients in the follow-up of a liver transplantation. The aim of the data analysis is that of as- sessing the effectiveness of the extracorporeal photophere- sis (ECP) as a therapy to prevent rejection in solid organ transplantation. We believe that this case study does not only show the interestingness of extracting TAS patterns in this particular context but, more ambitiously, it suggests a general method- ology for clinical data mining, whenever the time dimension is an important variable of the problem under investigation.

Time-annotated sequences for medical data mining

Bonchi F;Giannotti F
2007

Abstract

A typical structure of medical data is a sequence of ob- servations of clinical parameters taken at different time mo- ments. In this kind of contexts, the temporal dimension of data is a fundamental variable that should be taken into account in the mining process and returned as part of the extracted knowledge. Therefore, the classical and well es- tablished framework of sequential pattern mining is not enough, because it only focuses on the sequentiality of events, without extracting the typical time elapsing between two particular events. Time-annotated sequences ( TAS) is a novel mining paradigm that solves this problem. Recently defined in our laboratory [4] together with an efficient al- gorithm for extracting them, TAS are sequential patterns where each transition between two events is annotated with a typical transition time that is found frequent in the data. In this paper we report a real-world medical case study, in which the TAS mining paradigm is applied to clinical data regarding a set of patients in the follow-up of a liver transplantation. The aim of the data analysis is that of as- sessing the effectiveness of the extracorporeal photophere- sis (ECP) as a therapy to prevent rejection in solid organ transplantation. We believe that this case study does not only show the interestingness of extracting TAS patterns in this particular context but, more ambitiously, it suggests a general method- ology for clinical data mining, whenever the time dimension is an important variable of the problem under investigation.
2007
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Time-annotated sequences
Medical data mining
File in questo prodotto:
File Dimensione Formato  
prod_91796-doc_131672.pdf

solo utenti autorizzati

Descrizione: Time-annotated sequences for medical data mining
Tipologia: Versione Editoriale (PDF)
Dimensione 163.23 kB
Formato Adobe PDF
163.23 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/58458
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact