In this paper, we introduce Profiling-UD, a new text analysis tool inspired to the principles of linguistic profiling that can support language variation research from different perspectives. It allows the extraction of more than 130 features, spanning across different levels of linguistic description. Beyond the large number of features that can be monitored, a main novelty of Profiling-UD is that it has been specifically devised to be multilingual since it is based on the Universal Dependencies framework. In the second part of the paper, we demonstrate the effectiveness of these features in a number of theoretical and applicative studies in which they were successfully used for text and author profiling.

Profiling-UD: a Tool for Linguistic Profiling of Texts

Dominique Brunato;Andrea Cimino;Felice Dell'Orletta;Simonetta Montemagni;Giulia Venturi
2020

Abstract

In this paper, we introduce Profiling-UD, a new text analysis tool inspired to the principles of linguistic profiling that can support language variation research from different perspectives. It allows the extraction of more than 130 features, spanning across different levels of linguistic description. Beyond the large number of features that can be monitored, a main novelty of Profiling-UD is that it has been specifically devised to be multilingual since it is based on the Universal Dependencies framework. In the second part of the paper, we demonstrate the effectiveness of these features in a number of theoretical and applicative studies in which they were successfully used for text and author profiling.
2020
Istituto di linguistica computazionale "Antonio Zampolli" - ILC
Inglese
Proceedings of the 12th Language Resources and Evaluation Conference - LREC 2020
Conference on Language Resources and Evaluation (LREC)
7145
7151
6
979-10-95546-34-4
http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.883.pdf
European Language Resources Association ELRA
Paris
FRANCIA
Sì, ma tipo non specificato
11-16/05/2020
Computational Language Variation Analysis
Linguistic Profiling
Universal Dependencies
5
open
Brunato, Dominique; Cimino, Andrea; Dell'Orletta, Felice; Montemagni, Simonetta; Venturi, Giulia
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
2020.lrec-1.883.pdf

accesso aperto

Licenza: Creative commons
Dimensione 570.38 kB
Formato Adobe PDF
570.38 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/384930
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact