We propose a data warehousing architecture for effective risk analysis in a banking scenario. The core of the archi- tecture consists in two data mining tools for improving the quality of consolidated data during the acquisition process. Specifically, we deal with schema reconciliation, i.e. seg- mentation of a string sequence according to fixed attribute schema. To this purpose we present the RecBoost method- ology which pursuits effective reconciliation via multiple stages of classification. In addition, we propose a hash- based technique for data reconciliation, i.e. the recognition of apparently different records that, as a matter of fact, refer to the same real-world entity.

Data Mining for Effective Risk Analysis in a Bank Intelligence Scenario.

Giovanni Costa;Francesco Folino;Giuseppe Manco;Riccardo Ortale
2007

Abstract

We propose a data warehousing architecture for effective risk analysis in a banking scenario. The core of the archi- tecture consists in two data mining tools for improving the quality of consolidated data during the acquisition process. Specifically, we deal with schema reconciliation, i.e. seg- mentation of a string sequence according to fixed attribute schema. To this purpose we present the RecBoost method- ology which pursuits effective reconciliation via multiple stages of classification. In addition, we propose a hash- based technique for data reconciliation, i.e. the recognition of apparently different records that, as a matter of fact, refer to the same real-world entity.
2007
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/70036
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? ND
social impact