Guessing the age of acquisition of Italian lemmas through linear regression

Irene Russo
2020

Abstract

The age of acquisition (AoA) of a word is a psycholinguistic variable concerning the age at which a word is typically learned. It correlates with other psycholinguistic variables such as familiarity, concreteness, and imageability. Existing datasets for multiple languages also include linguistic variables such as the length and the frequency of lemmas in different corpora. There are substantial sets of normative values for English, but for other languages, such as Italian, the coverage is scarce. In this paper, a set of regression experiments investigates whether it is possible to guess the age of acquisition of Italian lemmas that have not been previously rated by humans. An intrinsic evaluation is proposed, correlating estimated Italian lemmas’ AoA with English lemmas’ AoA. An extrinsic evaluation, using AoA values as features for the classification of literary excerpts labeled by age appropriateness, shows how essential lexical coverage is for this task.
DC Field Value Language
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people irene russo en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/12/17 16:33:53 -
dc.date.available 2024/12/17 16:33:53 -
dc.date.firstsubmission 2024/10/09 09:40:44 *
dc.date.issued 2020 -
dc.date.submission 2025/03/07 09:01:59 *
dc.description.allpeople Russo, Irene -
dc.description.allpeopleoriginal irene russo en
dc.description.fulltext open en
dc.description.numberofauthors 1 -
dc.identifier.isbn 978-1-952148-68-2 en
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/505921 -
dc.identifier.url https://aclanthology.org/volumes/2020.cmcl-1/ en
dc.language.iso eng en
dc.relation.conferencename Workshop on Cognitive Modeling and Computational Linguistics en
dc.relation.firstpage 43 en
dc.relation.ispartofbook Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics en
dc.relation.lastpage 48 en
dc.relation.numberofpages 6 en
dc.subject.keywordseng lexical complexity, computational psycholinguistics -
dc.subject.singlekeyword lexical complexity *
dc.subject.singlekeyword computational psycholinguistics *
dc.title Guessing the age of acquisition of Italian lemmas through linear regression en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
Appears in categories: 04.01 Contributo in Atti di convegno (contribution in conference proceedings)
Files in this item:
File: 2020.cmcl-1.5(2).pdf
Access: open access
Type: Publisher's version (PDF)
License: Creative Commons
Size: 173.3 kB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/505921