A Multi-lingual Annotated Dataset for Aspect-Oriented Opinion Mining

Esuli A; Moreo Fernández A
2015

Abstract

We present the Trip-MAML dataset, a Multi-Lingual dataset of hotel reviews that have been manually annotated at the sentence level with Multi-Aspect sentiment labels. The dataset extends an existing English-only dataset by adding documents written in Italian and Spanish. We detail the dataset construction process, covering data gathering, selection, and annotation. We present inter-annotator agreement figures and baseline experimental results, comparing the three languages. Trip-MAML is a multi-lingual dataset for aspect-oriented opinion mining that enables researchers (i) to address the problem in languages other than English and (ii) to experiment with the application of cross-lingual learning methods to the task.
2015
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
English
Conference on Empirical Methods in Natural Language Processing
2533
2538
978-1-941643-32-7
https://www.aclweb.org/anthology/D15-1302/
Yes, but type not specified
17-21/09/2015
Lisbon
multilingual
opinion mining
aspect mining
Published by the Association for Computational Linguistics
6
open
Jimenez Zafra, S; Berardi, G; Esuli, A; Marcheggiani, D; Martinvaldivia, M T; Moreo Fernández, A
273
info:eu-repo/semantics/conferenceObject
04 Conference contribution::04.01 Contribution in conference proceedings
Files in this item:
File: prod_345815-doc_108517.pdf
Access: open access
Description: A Multi-lingual Annotated Dataset for Aspect-Oriented Opinion Mining
Type: Publisher's version (PDF)
Size: 121.29 kB
Format: Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/341967
Citations
  • Scopus 14