The market basket transactions observed at microscale (each individual product bought by each individual customer at each store visit) over a large population for a long time, offer a detailed picture of customers' shopping activity. Given the high cardinality of such a detailed dataset, data mining techniques have been developed to let the hidden knowledge emerge from it. In this technical report, we propose to use the system of all customer-product connections as a whole. We create a framework able to exploit the characteristics of the customer-product matrix and we test it on a unique transaction database, recording the micro-purchases of a million customers observed for several years at the stores of the top national supermarket retailer. We propose it as a novel analytic paradigm for market basket analysis, a paradigm that is challenging both conceptually, given the high complexity of the structures we build, and computationally, given the scale of the data it needs to analyze

Calculating Product and Customer Sophistication on a Large Transactional Dataset

Giannotti F;Pedreschi D
2013

Abstract

The market basket transactions observed at microscale (each individual product bought by each individual customer at each store visit) over a large population for a long time, offer a detailed picture of customers' shopping activity. Given the high cardinality of such a detailed dataset, data mining techniques have been developed to let the hidden knowledge emerge from it. In this technical report, we propose to use the system of all customer-product connections as a whole. We create a framework able to exploit the characteristics of the customer-product matrix and we test it on a unique transaction database, recording the micro-purchases of a million customers observed for several years at the stores of the top national supermarket retailer. We propose it as a novel analytic paradigm for market basket analysis, a paradigm that is challenging both conceptually, given the high complexity of the structures we build, and computationally, given the scale of the data it needs to analyze
2013
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Customer Beha
Data Mining Economic Complexity
Database applications. Data mining
File in questo prodotto:
File Dimensione Formato  
prod_272059-doc_75828.pdf

accesso aperto

Descrizione: Calculating Product and Customer Sophistication on a Large Transactional Dataset
Dimensione 692.31 kB
Formato Adobe PDF
692.31 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/262222
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact