Unlike single-instance learning, where each data-sample is represented as a vector of features, in multiple-instance learning (MIL) data-sample are complex objects (bags) described by sets of instances. In this framework, each instance-vector is only characterized by its feature components and its membership in a bag, while a label is only available for the entire bag. In the multi-instance classification (MIC) version of the supervised problem each training bag is associated to a categorical label. Thus, the aim is to learn a prediction model, based on the training set, that allows to determine the class-label of new bags. The relevant difference of multi-instance regression (MIR), as an extension of the traditional regression paradigm, is that each bag is associated with a real-valued label rather than a class. Although there are significant motivation for developing MIR application, as testified by some recent interesting study regarding remote sensing, age estimation, landmark recognition, and drug activity prediction, still MIR is less widespread than MIC. This is mainly due to the intrinsic more challenging nature of the MIR where, instead of learning a classification surface based on the relative positioning of instances of categorical bags, one has to learn a function that associates real numbers to sets of points. This introduces an obvious difficulty as soon as one recognizes that there exist multiple possible mathematical descriptions of the same bag. After briefly surveying existing approaches to MIR, in this work we focus on models that adopt the support vector regression paradigm, discussing about the training algorithms, and addressing the issues posed by the validation phase.

Models and Algorithms for Multiple Instance Regression

Marcello Sammarra
2023

Abstract

Unlike single-instance learning, where each data-sample is represented as a vector of features, in multiple-instance learning (MIL) data-sample are complex objects (bags) described by sets of instances. In this framework, each instance-vector is only characterized by its feature components and its membership in a bag, while a label is only available for the entire bag. In the multi-instance classification (MIC) version of the supervised problem each training bag is associated to a categorical label. Thus, the aim is to learn a prediction model, based on the training set, that allows to determine the class-label of new bags. The relevant difference of multi-instance regression (MIR), as an extension of the traditional regression paradigm, is that each bag is associated with a real-valued label rather than a class. Although there are significant motivation for developing MIR application, as testified by some recent interesting study regarding remote sensing, age estimation, landmark recognition, and drug activity prediction, still MIR is less widespread than MIC. This is mainly due to the intrinsic more challenging nature of the MIR where, instead of learning a classification surface based on the relative positioning of instances of categorical bags, one has to learn a function that associates real numbers to sets of points. This introduces an obvious difficulty as soon as one recognizes that there exist multiple possible mathematical descriptions of the same bag. After briefly surveying existing approaches to MIR, in this work we focus on models that adopt the support vector regression paradigm, discussing about the training algorithms, and addressing the issues posed by the validation phase.
2023
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
978-88-907488-1-3
Multiple Instance Learning
Regression
Support Vector
File in questo prodotto:
File Dimensione Formato  
GMS.pdf

solo utenti autorizzati

Tipologia: Abstract
Licenza: Dominio pubblico
Dimensione 96.81 kB
Formato Adobe PDF
96.81 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/461944
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact