CNR Institutional Research Information System

Energy-efficient ranking on FPGAs through ensemble model compression (Abstract)

Gil-Costa V.;Loor F.;Molina R.;Nardini F. M.;Perego R.;Trani S.

2022

Abstract

In this talk, we present the main results of a paper accepted at ECIR 2022 [1]. We investigate novel SoC-FPGA solutions for fast and energy-efficient ranking based on machine-learned ensembles of decision trees. Since the memory footprint of ranking ensembles limits the effective exploitation of programmable logic for large-scale inference tasks [2], we investigate binning and quantization techniques to reduce the memory occupation of the learned model, and we optimize the state-of-the-art ensemble-traversal algorithm for deployment on low-cost, energy-efficient FPGA devices. Experiments on publicly available Learning-to-Rank datasets show that our model compression techniques do not significantly impact accuracy. Moreover, the reduced space requirements allow the models and the logic to be replicated on the FPGA device in order to execute several inference tasks in parallel. We discuss in detail the experimental settings and the feasibility of deploying the proposed solution in a real setting. The experiments also show that our FPGA solution achieves state-of-the-art performance and consumes from 9× to 19.8× less energy than an equivalent multi-threaded CPU implementation.

Year	2022
Organizational units	Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Language(s)	English
External supervisors and coordinators	Pasi G., Cremonesi P., Orlando S., Zanker M., Massimo D., Turati G.
Volume title	Italian Information Retrieval Workshop
Series	CEUR WORKSHOP PROCEEDINGS
Conference title	IIR 2022 - 12th Italian Information Retrieval Workshop 2022
Number of pages	1
URL	http://ceur-ws.org/Vol-3177/paper9.pdf
Refereed	Yes, type not specified
Conference dates	19-22 June 2022
Conference location	Tirrenia, Pisa, Italy
Keywords	Learning to rank; Model compression; Efficient inference; SoC FPGA
Scopus ID	2-s2.0-85136195169
Format	Electronic
International co-authors	Yes
Number of authors	6
Type	info:eu-repo/semantics/conferenceObject
Fulltext	open
MIUR login type	274
Type	04 Conference contribution::04.02 Abstract in conference proceedings
All authors	Gil-Costa, V.; Loor, F.; Molina, R.; Nardini, F. M.; Perego, R.; Trani, S.
Appears in types	04.02 Abstract in conference proceedings

Files in this item:

File	Description	Type	License	Access	Size	Format
prod_471851-doc_191803.pdf	Energy-Efficient Ranking on FPGAs through Ensemble Model Compression (Abstract)	Publisher's version (PDF)	Creative Commons	Open access	342.02 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/417708
