CNR Institutional Research Information System

In recent years there has been a growing interest in text quantification, a supervised learning task where the goal is to accurately estimate, in an unlabelled set of items, the prevalence (or "relative frequency") of each class c in a predefined set C. Text quantification has several applications, and is a dominant concern in fields such as market research, the social sciences, political science, and epidemiology. In this paper we tackle, for the first time, the problem of ordinal text quantification, defined as the task of performing text quantification when a total order is defined on the set of classes; estimating the prevalence of "five stars" reviews in a set of reviews of a given product, and monitoring this prevalence across time, is an example application. We present OQT, a novel tree-based OQ algorithm, and discuss experimental results obtained on a dataset of tweets classified according to sentiment strength.

Ordinal text quantification

Da San Martino G;Gao W;Sebastiani F

2016

Abstract

In recent years there has been a growing interest in text quantification, a supervised learning task where the goal is to accurately estimate, in an unlabelled set of items, the prevalence (or "relative frequency") of each class c in a predefined set C. Text quantification has several applications, and is a dominant concern in fields such as market research, the social sciences, political science, and epidemiology. In this paper we tackle, for the first time, the problem of ordinal text quantification, defined as the task of performing text quantification when a total order is defined on the set of classes; estimating the prevalence of "five stars" reviews in a set of reviews of a given product, and monitoring this prevalence across time, is an example application. We present OQT, a novel tree-based OQ algorithm, and discuss experimental results obtained on a dataset of tweets classified according to sentiment strength.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2016
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Lingua/e
	
				Inglese
			
	Supervisori e coordinatori esterni
	
				Raffaele Perego, Fabrizio sebastiani, Aslam, Javed, Ruthven, Ian, Zobel, Justin
			
	Titolo del convegno
	
				SIGIR 2016 - 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
			
	Da pagina
	
				937
			
	A pagina
	
				940
			
	Codice ISBN
	
				978-1-4503-4069-4
			
	Codice DOI
	
				https://dx.doi.org/10.1145/2911451.2914749
			
	URL
	
				http://dl.acm.org/citation.cfm?doid=2911451.2914749
			
	Nome Editore
	
				ACM Press
			
	Città Editore
	
				New York
			
	Nazione Editore
	
				STATI UNITI D'AMERICA
			
	Referee
	
				Sì, ma tipo non specificato
			
	Periodo del Convegno
	
				17-21 July 2016
			
	Luogo del Convegno
	
				Pisa, Italy
			
	Parole chiave
	
				ARTIFICIAL INTELLIGENCE. Learning
			
	Codice Scopus
	
				2-s2.0-84980349728
			
	Codice Web of Science
	
				WOS:000455100800144
			
	Numero autori
	
				3
			
	Fulltext
	
				partially_open
			
	Tutti gli autori
	
						Da San Martino, G; Gao, W; Sebastiani, F
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
prod_356992-doc_116358.pdf solo utenti autorizzati Descrizione: Ordinal Text Quantification Tipologia: Versione Editoriale (PDF) Dimensione 530.34 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	530.34 kB	Adobe PDF	Visualizza/Apri Richiedi una copia
prod_356992-doc_159209.pdf accesso aperto Descrizione: Ordinal Text Quantification Tipologia: Versione Editoriale (PDF) Dimensione 273.86 kB Formato Adobe PDF Visualizza/Apri	273.86 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/320946

Citazioni

ND

25

18

social impact