We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.

Evaluating Lexical Proficiency in Neural Language Models

Ciaccio C.;Miaschi A.;Dell'Orletta F.
2025

Abstract

We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.
Campo DC Valore Lingua
dc.authority.anceserie PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Ciaccio C. en
dc.authority.people Miaschi A. en
dc.authority.people Dell'Orletta F. en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2026/03/03 14:35:19 -
dc.date.available 2026/03/03 14:35:19 -
dc.date.firstsubmission 2026/03/02 18:36:54 *
dc.date.issued 2025 -
dc.date.submission 2026/03/02 18:36:54 *
dc.description.abstracteng We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs. -
dc.description.allpeople Ciaccio, C.; Miaschi, A.; Dell'Orletta, F. -
dc.description.allpeopleoriginal Ciaccio C.; Miaschi A.; Dell'Orletta F. en
dc.description.fulltext open en
dc.description.international no en
dc.description.numberofauthors 3 -
dc.identifier.doi 10.18653/v1/2025.acl-long.64 en
dc.identifier.scopus 2-s2.0-105021058451 en
dc.identifier.source scopus *
dc.identifier.uri https://hdl.handle.net/20.500.14243/570462 -
dc.language.iso eng en
dc.publisher.name Association for Computational Linguistics (ACL) en
dc.relation.conferencedate 2025 en
dc.relation.conferencename 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 en
dc.relation.conferenceplace Vienna en
dc.relation.firstpage 1267 en
dc.relation.ispartofbook Proceedings of the Annual Meeting of the Association for Computational Linguistics en
dc.relation.lastpage 1286 en
dc.relation.numberofpages 20 en
dc.relation.volume 1 en
dc.subject.keywords Large Language Models (LLMs) -
dc.subject.keywordseng Interpretability -
dc.subject.singlekeyword Large Language Models (LLMs) *
dc.subject.singlekeyword Interpretability *
dc.title Evaluating Lexical Proficiency in Neural Language Models en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.mediafilter.data 2026/03/04 02:52:29 *
iris.orcid.lastModifiedDate 2026/03/03 14:35:19 *
iris.orcid.lastModifiedMillisecond 1772544919279 *
iris.scopus.extIssued 2025 -
iris.scopus.extTitle Evaluating Lexical Proficiency in Neural Language Models -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.18653/v1/2025.acl-long.64 *
iris.unpaywall.isoa true *
iris.unpaywall.landingpage https://doi.org/10.18653/v1/2025.acl-long.64 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 04/03/2026 04:34:01 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1772595241744 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://aclanthology.org/2025.acl-long.64.pdf *
scopus.authority.anceserie PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING###0736-587X *
scopus.category 1203 *
scopus.category 3310 *
scopus.category 1706 *
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 59504212000 -
scopus.contributor.auid 57211678681 -
scopus.contributor.auid 57540567000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Cristiano -
scopus.contributor.name Alessio -
scopus.contributor.name Felice -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); -
scopus.contributor.surname Ciaccio -
scopus.contributor.surname Miaschi -
scopus.contributor.surname Dell'Orletta -
scopus.date.issued 2025 *
scopus.description.abstracteng We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs. *
scopus.description.allpeopleoriginal Ciaccio C.; Miaschi A.; Dell'Orletta F. *
scopus.differences scopus.identifier.isbn *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.funding.funders 100031478 - NextGenerationEU; 100031478 - NextGenerationEU; *
scopus.funding.ids PE0000013-FAIR; *
scopus.identifier.doi 10.18653/v1/2025.acl-long.64 *
scopus.identifier.isbn 9798891762510 *
scopus.identifier.pui 649063887 *
scopus.identifier.scopus 2-s2.0-105021058451 *
scopus.journal.sourceid 21101138302 *
scopus.language.iso eng *
scopus.publisher.name Association for Computational Linguistics (ACL) *
scopus.relation.conferencedate 2025 *
scopus.relation.conferencename 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 *
scopus.relation.conferenceplace aut *
scopus.relation.firstpage 1267 *
scopus.relation.lastpage 1286 *
scopus.relation.volume 1 *
scopus.title Evaluating Lexical Proficiency in Neural Language Models *
scopus.titleeng Evaluating Lexical Proficiency in Neural Language Models *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
2025.acl-long.64.pdf

accesso aperto

Licenza: Creative commons
Dimensione 1.06 MB
Formato Adobe PDF
1.06 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/570462
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact