In recent times, except for sporadic cases, the trend in Computer Vision is to achieve minor improvements over considerable increases in complexity.To reverse this tendency, we propose a novel method to boost image classification performances without an increase in complexity.To this end, we revisited ensembling, a powerful approach, not often adequately used due to its nature of increased complexity and training time, making it viable by specific design choices. First, we trained end-to-end two EfficientNet-b0 models (known to be the architecture with the best overall accuracy/complexity trade-off in image classification) on disjoint subsets of data (i.e. bagging). Then, we made an efficient adaptive ensemble by performing fine-tuning of a trainable combination layer. In this way, we were able to outperform the state-of-the-art by an average of 0.5\% on the accuracy with restrained complexity both in terms of number of parameters (by 5-60 times), and FLoating point Operations Per Second (by 10-100 times) on several major benchmark datasets, fully embracing the green AI.

Efficient adaptive ensembling for image classification

Bruno A.;Moroni D.;Martinelli M.
2023

Abstract

In recent times, except for sporadic cases, the trend in Computer Vision is to achieve minor improvements over considerable increases in complexity.To reverse this tendency, we propose a novel method to boost image classification performances without an increase in complexity.To this end, we revisited ensembling, a powerful approach, not often adequately used due to its nature of increased complexity and training time, making it viable by specific design choices. First, we trained end-to-end two EfficientNet-b0 models (known to be the architecture with the best overall accuracy/complexity trade-off in image classification) on disjoint subsets of data (i.e. bagging). Then, we made an efficient adaptive ensemble by performing fine-tuning of a trainable combination layer. In this way, we were able to outperform the state-of-the-art by an average of 0.5\% on the accuracy with restrained complexity both in terms of number of parameters (by 5-60 times), and FLoating point Operations Per Second (by 10-100 times) on several major benchmark datasets, fully embracing the green AI.
2023
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Deep Learning
Ensemble
Convolutional Neural Networks
EfficientNet
Image Classification
File in questo prodotto:
File Dimensione Formato  
prod_468474-doc_201016.pdf

accesso aperto

Descrizione: Preprint - Efficient adaptive ensembling for image classification
Tipologia: Versione Editoriale (PDF)
Licenza: Nessuna licenza dichiarata (non attribuibile a prodotti successivi al 2023)
Dimensione 583.97 kB
Formato Adobe PDF
583.97 kB Adobe PDF Visualizza/Apri
prod_468474-doc_201393.pdf

accesso aperto

Descrizione: Efficient adaptive ensembling for image classification
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 1.34 MB
Formato Adobe PDF
1.34 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/414222
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 7
social impact