The age of acquisition of a word is a psycholin- guistic variable concerning the age at which a word is typically learned. It correlates with other psycholinguistic variables such as famil- iarity, concreteness, and imageability. Exist- ing datasets for multiple languages also in- clude linguistic variables such as the length and the frequency of lemmas in different cor- pora. There are substantial sets of normative values for English, but for other languages, such as Italian, the coverage is scarce. In this paper, a set of regression experiments investigates whether it is possible to guess the age of acqui- sition of Italian lemmas that have not been pre- viously rated by humans. An intrinsic evalua- tion is proposed, correlating estimated Italian lemmas’ AoA with English lemmas’ AoA. An extrinsic evaluation - using AoA values as fea- tures for the classification of literary excerpts labeled by age appropriateness - shows how es- sential is lexical coverage for this task.
Guessing the age of acquisition of italian lemmas through linear regression
irene russo
2020
Abstract
The age of acquisition of a word is a psycholin- guistic variable concerning the age at which a word is typically learned. It correlates with other psycholinguistic variables such as famil- iarity, concreteness, and imageability. Exist- ing datasets for multiple languages also in- clude linguistic variables such as the length and the frequency of lemmas in different cor- pora. There are substantial sets of normative values for English, but for other languages, such as Italian, the coverage is scarce. In this paper, a set of regression experiments investigates whether it is possible to guess the age of acqui- sition of Italian lemmas that have not been pre- viously rated by humans. An intrinsic evalua- tion is proposed, correlating estimated Italian lemmas’ AoA with English lemmas’ AoA. An extrinsic evaluation - using AoA values as fea- tures for the classification of literary excerpts labeled by age appropriateness - shows how es- sential is lexical coverage for this task.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | irene russo | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2024/12/17 16:33:53 | - |
| dc.date.available | 2024/12/17 16:33:53 | - |
| dc.date.firstsubmission | 2024/10/09 09:40:44 | * |
| dc.date.issued | 2020 | - |
| dc.date.submission | 2025/03/07 09:01:59 | * |
| dc.description.abstracteng | The age of acquisition of a word is a psycholin- guistic variable concerning the age at which a word is typically learned. It correlates with other psycholinguistic variables such as famil- iarity, concreteness, and imageability. Exist- ing datasets for multiple languages also in- clude linguistic variables such as the length and the frequency of lemmas in different cor- pora. There are substantial sets of normative values for English, but for other languages, such as Italian, the coverage is scarce. In this paper, a set of regression experiments investigates whether it is possible to guess the age of acqui- sition of Italian lemmas that have not been pre- viously rated by humans. An intrinsic evalua- tion is proposed, correlating estimated Italian lemmas’ AoA with English lemmas’ AoA. An extrinsic evaluation - using AoA values as fea- tures for the classification of literary excerpts labeled by age appropriateness - shows how es- sential is lexical coverage for this task. | - |
| dc.description.allpeople | Russo, Irene | - |
| dc.description.allpeopleoriginal | irene russo | en |
| dc.description.fulltext | open | en |
| dc.description.numberofauthors | 1 | - |
| dc.identifier.isbn | 978-1-952148-68-2 | en |
| dc.identifier.source | manual | * |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/505921 | - |
| dc.identifier.url | https://aclanthology.org/volumes/2020.cmcl-1/ | en |
| dc.language.iso | eng | en |
| dc.relation.conferencename | Workshop on Cognitive Modeling and Computational Linguistics | en |
| dc.relation.firstpage | 43 | en |
| dc.relation.ispartofbook | Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics | en |
| dc.relation.lastpage | 48 | en |
| dc.relation.numberofpages | 6 | en |
| dc.subject.keywordseng | lexical complexity, computational psycholinguistics | - |
| dc.subject.singlekeyword | lexical complexity | * |
| dc.subject.singlekeyword | computational psycholinguistics | * |
| dc.title | Guessing the age of acquisition of italian lemmas through linear regression | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| iris.mediafilter.data | 2026/03/04 02:52:19 | * |
| iris.orcid.lastModifiedDate | 2026/03/03 17:57:43 | * |
| iris.orcid.lastModifiedMillisecond | 1772557063935 | * |
| iris.sitodocente.maxattempts | 1 | - |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
2020.cmcl-1.5(2).pdf
accesso aperto
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
173.3 kB
Formato
Adobe PDF
|
173.3 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


