In recent years there has been a growing interest in text quantification, a supervised learning task where the goal is to accurately estimate, in an unlabelled set of items, the prevalence (or "relative frequency") of each class c in a predefined set C. Text quantification has several applications, and is a dominant concern in fields such as market research, the social sciences, political science, and epidemiology. In this paper we tackle, for the first time, the problem of ordinal text quantification, defined as the task of performing text quantification when a total order is defined on the set of classes; estimating the prevalence of "five stars" reviews in a set of reviews of a given product, and monitoring this prevalence across time, is an example application. We present OQT, a novel tree-based OQ algorithm, and discuss experimental results obtained on a dataset of tweets classified according to sentiment strength.

Ordinal text quantification

Sebastiani F
2016

Abstract

In recent years there has been a growing interest in text quantification, a supervised learning task where the goal is to accurately estimate, in an unlabelled set of items, the prevalence (or "relative frequency") of each class c in a predefined set C. Text quantification has several applications, and is a dominant concern in fields such as market research, the social sciences, political science, and epidemiology. In this paper we tackle, for the first time, the problem of ordinal text quantification, defined as the task of performing text quantification when a total order is defined on the set of classes; estimating the prevalence of "five stars" reviews in a set of reviews of a given product, and monitoring this prevalence across time, is an example application. We present OQT, a novel tree-based OQ algorithm, and discuss experimental results obtained on a dataset of tweets classified according to sentiment strength.
2016
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Raffaele Perego, Fabrizio sebastiani, Aslam, Javed, Ruthven, Ian, Zobel, Justin
SIGIR 2016 - 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
937
940
978-1-4503-4069-4
http://dl.acm.org/citation.cfm?doid=2911451.2914749
ACM Press
New York
STATI UNITI D'AMERICA
Sì, ma tipo non specificato
17-21 July 2016
Pisa, Italy
ARTIFICIAL INTELLIGENCE. Learning
3
partially_open
Da San Martino, G; Gao, W; Sebastiani, F
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_356992-doc_116358.pdf

solo utenti autorizzati

Descrizione: Ordinal Text Quantification
Tipologia: Versione Editoriale (PDF)
Dimensione 530.34 kB
Formato Adobe PDF
530.34 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
prod_356992-doc_159209.pdf

accesso aperto

Descrizione: Ordinal Text Quantification
Tipologia: Versione Editoriale (PDF)
Dimensione 273.86 kB
Formato Adobe PDF
273.86 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/320946
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 25
  • ???jsp.display-item.citation.isi??? 18
social impact