We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.
Evaluating Lexical Proficiency in Neural Language Models
Ciaccio C.;Miaschi A.;Dell'Orletta F.
2025
Abstract
We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.anceserie | PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING | en |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Ciaccio C. | en |
| dc.authority.people | Miaschi A. | en |
| dc.authority.people | Dell'Orletta F. | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2026/03/03 14:35:19 | - |
| dc.date.available | 2026/03/03 14:35:19 | - |
| dc.date.firstsubmission | 2026/03/02 18:36:54 | * |
| dc.date.issued | 2025 | - |
| dc.date.submission | 2026/03/02 18:36:54 | * |
| dc.description.abstracteng | We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs. | - |
| dc.description.allpeople | Ciaccio, C.; Miaschi, A.; Dell'Orletta, F. | - |
| dc.description.allpeopleoriginal | Ciaccio C.; Miaschi A.; Dell'Orletta F. | en |
| dc.description.fulltext | open | en |
| dc.description.international | no | en |
| dc.description.numberofauthors | 3 | - |
| dc.identifier.doi | 10.18653/v1/2025.acl-long.64 | en |
| dc.identifier.scopus | 2-s2.0-105021058451 | en |
| dc.identifier.source | scopus | * |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/570462 | - |
| dc.language.iso | eng | en |
| dc.publisher.name | Association for Computational Linguistics (ACL) | en |
| dc.relation.conferencedate | 2025 | en |
| dc.relation.conferencename | 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 | en |
| dc.relation.conferenceplace | Vienna | en |
| dc.relation.firstpage | 1267 | en |
| dc.relation.ispartofbook | Proceedings of the Annual Meeting of the Association for Computational Linguistics | en |
| dc.relation.lastpage | 1286 | en |
| dc.relation.numberofpages | 20 | en |
| dc.relation.volume | 1 | en |
| dc.subject.keywords | Large Language Models (LLMs) | - |
| dc.subject.keywordseng | Interpretability | - |
| dc.subject.singlekeyword | Large Language Models (LLMs) | * |
| dc.subject.singlekeyword | Interpretability | * |
| dc.title | Evaluating Lexical Proficiency in Neural Language Models | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| iris.mediafilter.data | 2026/03/04 02:52:29 | * |
| iris.orcid.lastModifiedDate | 2026/03/03 14:35:19 | * |
| iris.orcid.lastModifiedMillisecond | 1772544919279 | * |
| iris.scopus.extIssued | 2025 | - |
| iris.scopus.extTitle | Evaluating Lexical Proficiency in Neural Language Models | - |
| iris.sitodocente.maxattempts | 1 | - |
| iris.unpaywall.bestoaversion | publishedVersion | * |
| iris.unpaywall.doi | 10.18653/v1/2025.acl-long.64 | * |
| iris.unpaywall.isoa | true | * |
| iris.unpaywall.landingpage | https://doi.org/10.18653/v1/2025.acl-long.64 | * |
| iris.unpaywall.license | cc-by | * |
| iris.unpaywall.metadataCallLastModified | 04/03/2026 04:34:01 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1772595241744 | - |
| iris.unpaywall.oastatus | gold | * |
| iris.unpaywall.pdfurl | https://aclanthology.org/2025.acl-long.64.pdf | * |
| scopus.authority.anceserie | PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING###0736-587X | * |
| scopus.category | 1203 | * |
| scopus.category | 3310 | * |
| scopus.category | 1706 | * |
| scopus.contributor.affiliation | ItaliaNLP Lab | - |
| scopus.contributor.affiliation | ItaliaNLP Lab | - |
| scopus.contributor.affiliation | ItaliaNLP Lab | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.auid | 59504212000 | - |
| scopus.contributor.auid | 57211678681 | - |
| scopus.contributor.auid | 57540567000 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | 114087935 | - |
| scopus.contributor.dptid | 114087935 | - |
| scopus.contributor.dptid | 114087935 | - |
| scopus.contributor.name | Cristiano | - |
| scopus.contributor.name | Alessio | - |
| scopus.contributor.name | Felice | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); | - |
| scopus.contributor.surname | Ciaccio | - |
| scopus.contributor.surname | Miaschi | - |
| scopus.contributor.surname | Dell'Orletta | - |
| scopus.date.issued | 2025 | * |
| scopus.description.abstracteng | We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs. | * |
| scopus.description.allpeopleoriginal | Ciaccio C.; Miaschi A.; Dell'Orletta F. | * |
| scopus.differences | scopus.identifier.isbn | * |
| scopus.differences | scopus.relation.conferenceplace | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.funding.funders | 100031478 - NextGenerationEU; 100031478 - NextGenerationEU; | * |
| scopus.funding.ids | PE0000013-FAIR; | * |
| scopus.identifier.doi | 10.18653/v1/2025.acl-long.64 | * |
| scopus.identifier.isbn | 9798891762510 | * |
| scopus.identifier.pui | 649063887 | * |
| scopus.identifier.scopus | 2-s2.0-105021058451 | * |
| scopus.journal.sourceid | 21101138302 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | Association for Computational Linguistics (ACL) | * |
| scopus.relation.conferencedate | 2025 | * |
| scopus.relation.conferencename | 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 | * |
| scopus.relation.conferenceplace | aut | * |
| scopus.relation.firstpage | 1267 | * |
| scopus.relation.lastpage | 1286 | * |
| scopus.relation.volume | 1 | * |
| scopus.title | Evaluating Lexical Proficiency in Neural Language Models | * |
| scopus.titleeng | Evaluating Lexical Proficiency in Neural Language Models | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
2025.acl-long.64.pdf
accesso aperto
Licenza:
Creative commons
Dimensione
1.06 MB
Formato
Adobe PDF
|
1.06 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


