We present the first work to our knowledge on automatic age identification for Italian texts. For this work we built a dataset consisting of more than 2.400.000 posts extracted from publicly available forums and containing authorship attribution metadata, such as age and gender. We developed an age classifier and performed a set of experiments with the aim of evaluating the possibility of assigning the correct age of an user and which information is useful to tackle this task: lexical or linguistic information spanning across different levels of linguistic descriptions. The performed experiments show the importance of lexical information in age classification, but also that exists writing style that relates to the age of an user.

Quanti anni hai? Age identification for Italian

Cimino A;Dell'Orletta F
2019

Abstract

We present the first work to our knowledge on automatic age identification for Italian texts. For this work we built a dataset consisting of more than 2.400.000 posts extracted from publicly available forums and containing authorship attribution metadata, such as age and gender. We developed an age classifier and performed a set of experiments with the aim of evaluating the possibility of assigning the correct age of an user and which information is useful to tackle this task: lexical or linguistic information spanning across different levels of linguistic descriptions. The performed experiments show the importance of lexical information in age classification, but also that exists writing style that relates to the age of an user.
Campo DC Valore Lingua
dc.authority.anceserie CEUR WORKSHOP PROCEEDINGS -
dc.authority.anceserie CEUR Workshop Proceedings -
dc.authority.people Maslennikova A it
dc.authority.people Labruna P it
dc.authority.people Cimino A it
dc.authority.people Dell'Orletta F it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 02:59:59 -
dc.date.available 2024/02/20 02:59:59 -
dc.date.issued 2019 -
dc.description.abstracteng We present the first work to our knowledge on automatic age identification for Italian texts. For this work we built a dataset consisting of more than 2.400.000 posts extracted from publicly available forums and containing authorship attribution metadata, such as age and gender. We developed an age classifier and performed a set of experiments with the aim of evaluating the possibility of assigning the correct age of an user and which information is useful to tackle this task: lexical or linguistic information spanning across different levels of linguistic descriptions. The performed experiments show the importance of lexical information in age classification, but also that exists writing style that relates to the age of an user. -
dc.description.affiliations Università di Pisa, Università di Pisa, Italy; Istituto di Linguistica Computazionale "Antonio Zampolli" (ILC-CNR), Pisa, Italy -
dc.description.allpeople Maslennikova, A; Labruna, P; Cimino, A; Dell'Orletta, F -
dc.description.allpeopleoriginal Maslennikova A.; Labruna P.; Cimino A.; Dell'Orletta F. -
dc.description.fulltext none en
dc.description.numberofauthors 4 -
dc.identifier.isbn 9791280136008 -
dc.identifier.scopus 2-s2.0-85074841934 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/390425 -
dc.identifier.url http://www.scopus.com/record/display.url?eid=2-s2.0-85074841934&origin=inward -
dc.language.iso eng -
dc.relation.conferencedate 13-15/11/2019 -
dc.relation.conferencename 6th Italian Conference on Computational Linguistics (CLiC-it) -
dc.relation.conferenceplace Bari -
dc.relation.volume 2481 -
dc.subject.keywords authorship profiling -
dc.subject.singlekeyword authorship profiling *
dc.title Quanti anni hai? Age identification for Italian en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 434841 -
iris.orcid.lastModifiedDate 2024/04/04 13:30:13 *
iris.orcid.lastModifiedMillisecond 1712230213718 *
iris.scopus.extIssued 2019 -
iris.scopus.extTitle Quanti anni hai? Age identification for Italian -
iris.sitodocente.maxattempts 2 -
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation Università di Pisa -
scopus.contributor.affiliation Università di Pisa -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57211689435 -
scopus.contributor.auid 57211693120 -
scopus.contributor.auid 57002803800 -
scopus.contributor.auid 57540567000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Aleksandra -
scopus.contributor.name Paolo -
scopus.contributor.name Andrea -
scopus.contributor.name Felice -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli” (ILC–CNR); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli” (ILC–CNR); -
scopus.contributor.surname Maslennikova -
scopus.contributor.surname Labruna -
scopus.contributor.surname Cimino -
scopus.contributor.surname Dell’Orletta -
scopus.date.issued 2019 *
scopus.description.abstracteng We present the first work to our knowledge on automatic age identification for Italian texts. For this work we built a dataset consisting of more than 2.400.000 posts extracted from publicly available forums and containing authorship attribution metadata, such as age and gender. We developed an age classifier and performed a set of experiments with the aim of evaluating the possibility of assigning the correct age of an user and which information is useful to tackle this task: lexical or linguistic information spanning across different levels of linguistic descriptions. The performed experiments show the importance of lexical information in age classification, but also that exists writing style that relates to the age of an user. *
scopus.description.allpeopleoriginal Maslennikova A.; Labruna P.; Cimino A.; Dell'Orletta F. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.funding.funders 501100009888 - Regione Toscana; *
scopus.identifier.pui 629833173 *
scopus.identifier.scopus 2-s2.0-85074841934 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2019 *
scopus.relation.conferencename 6th Italian Conference on Computational Linguistics, CLiC-it 2019 *
scopus.relation.conferenceplace ita *
scopus.relation.volume 2481 *
scopus.title Quanti anni hai? Age identification for Italian *
scopus.titleeng Quanti anni hai? Age identification for Italian *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/390425
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact