CNR Institutional Research Information System

In recent years, the proliferation of smart mobile devices has lead to the gradual integration of search functionality within mobile platforms. This has created an incentive to move away from the "ten blue links" metaphor, as mobile users are less likely to click on them, expecting to get the answer directly from the snippets. In turn, this has revived the interest in Question Answering. Then, along came chatbots, conversational systems, and messaging platforms, where the user needs could be better served with the system asking followup questions in order to better understand the user's intent. While typically a user would expect a single response at any utterance, a system could also return multiple options for the user to select from, based on different system understandings of the user's intent. However, this possibility should not be overused, as this practice could confuse and/or annoy the user. How to produce good variable-length lists, given the conflicting objectives of staying short while maximizing the likelihood of having a correct answer included in the list, is an underexplored problem. It is also unclear how to evaluate a system that tries to do that. Here we aim to bridge this gap. In particular, we define some necessary and some optional properties that an evaluation measure fit for this purpose should have. We further show that existing evaluation measures from the IR tradition are not entirely suitable for this setup, and we propose novel evaluation measures that address it satisfactorily.

Evaluating variable-length multiple-option lists in chatbots and mobile search

Atanasova P;Karadzhov G;Kiprov Y;Nakov P;Sebastiani F

2019

Abstract

In recent years, the proliferation of smart mobile devices has lead to the gradual integration of search functionality within mobile platforms. This has created an incentive to move away from the "ten blue links" metaphor, as mobile users are less likely to click on them, expecting to get the answer directly from the snippets. In turn, this has revived the interest in Question Answering. Then, along came chatbots, conversational systems, and messaging platforms, where the user needs could be better served with the system asking followup questions in order to better understand the user's intent. While typically a user would expect a single response at any utterance, a system could also return multiple options for the user to select from, based on different system understandings of the user's intent. However, this possibility should not be overused, as this practice could confuse and/or annoy the user. How to produce good variable-length lists, given the conflicting objectives of staying short while maximizing the likelihood of having a correct answer included in the list, is an underexplored problem. It is also unclear how to evaluate a system that tries to do that. Here we aim to bridge this gap. In particular, we define some necessary and some optional properties that an evaluation measure fit for this purpose should have. We further show that existing evaluation measures from the IR tradition are not entirely suitable for this setup, and we propose novel evaluation measures that address it satisfactorily.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2019
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Lingua/e
	
				Inglese
			
	Titolo del convegno
	
				SIGIR'19 - 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
			
	Da pagina
	
				997
			
	A pagina
	
				1000
			
	Codice ISBN
	
				978-1-4503-6172-9
			
	Codice DOI
	
				https://dx.doi.org/10.1145/3331184.3331308
			
	URL
	
				https://dl.acm.org/doi/10.1145/3331184.3331308
			
	Referee
	
				Sì, ma tipo non specificato
			
	Periodo del Convegno
	
				July, 2019
			
	Luogo del Convegno
	
				Paris, France
			
	Parole chiave
	
				Chatbots
Mobile Search
Evaluation Measures
			
	Codice Scopus
	
				2-s2.0-85073802530
			
	Codice Web of Science
	
				WOS:000501488900123
			
	Numero autori
	
				5
			
	Fulltext
	
				reserved
			
	Tutti gli autori
	
						Atanasova, P; Karadzhov, G; Kiprov, Y; Nakov, P; Sebastiani, F
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
prod_422754-doc_150372.pdf non disponibili Descrizione: Evaluating variable-length multiple-option lists in chatbots and mobile search Tipologia: Versione Editoriale (PDF) Dimensione 1.11 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.11 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/379898

Citazioni

ND

0

0

social impact