CNR Institutional Research Information System

We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.

Evaluating Lexical Proficiency in Neural Language Models

Ciaccio C.;Miaschi A.;Dell'Orletta F.

2025

Abstract

We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.anceserie	PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING	en
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	en
dc.authority.people	Ciaccio C.	en
dc.authority.people	Miaschi A.	en
dc.authority.people	Dell'Orletta F.	en
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.contributor.area	Non assegn	*
dc.contributor.area	Non assegn	*
dc.date.accessioned	2026/03/03 14:35:19	-
dc.date.available	2026/03/03 14:35:19	-
dc.date.firstsubmission	2026/03/02 18:36:54	*
dc.date.issued	2025	-
dc.date.submission	2026/03/02 18:36:54	*
dc.description.abstracteng	We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.	-
dc.description.allpeople	Ciaccio, C.; Miaschi, A.; Dell'Orletta, F.	-
dc.description.allpeopleoriginal	Ciaccio C.; Miaschi A.; Dell'Orletta F.	en
dc.description.fulltext	open	en
dc.description.international	no	en
dc.description.numberofauthors	3	-
dc.identifier.doi	10.18653/v1/2025.acl-long.64	en
dc.identifier.scopus	2-s2.0-105021058451	en
dc.identifier.source	scopus	*
dc.identifier.uri	https://hdl.handle.net/20.500.14243/570462	-
dc.language.iso	eng	en
dc.publisher.name	Association for Computational Linguistics (ACL)	en
dc.relation.conferencedate	2025	en
dc.relation.conferencename	63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025	en
dc.relation.conferenceplace	Vienna	en
dc.relation.firstpage	1267	en
dc.relation.ispartofbook	Proceedings of the Annual Meeting of the Association for Computational Linguistics	en
dc.relation.lastpage	1286	en
dc.relation.numberofpages	20	en
dc.relation.volume	1	en
dc.subject.keywords	Large Language Models (LLMs)	-
dc.subject.keywordseng	Interpretability	-
dc.subject.singlekeyword	Large Language Models (LLMs)	*
dc.subject.singlekeyword	Interpretability	*
dc.title	Evaluating Lexical Proficiency in Neural Language Models	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.miur	273	-
iris.mediafilter.data	2026/03/04 02:52:29	*
iris.orcid.lastModifiedDate	2026/03/03 14:35:19	*
iris.orcid.lastModifiedMillisecond	1772544919279	*
iris.scopus.extIssued	2025	-
iris.scopus.extTitle	Evaluating Lexical Proficiency in Neural Language Models	-
iris.sitodocente.maxattempts	1	-
iris.unpaywall.bestoaversion	publishedVersion	*
iris.unpaywall.doi	10.18653/v1/2025.acl-long.64	*
iris.unpaywall.isoa	true	*
iris.unpaywall.landingpage	https://doi.org/10.18653/v1/2025.acl-long.64	*
iris.unpaywall.license	cc-by	*
iris.unpaywall.metadataCallLastModified	04/03/2026 04:34:01	-
iris.unpaywall.metadataCallLastModifiedMillisecond	1772595241744	-
iris.unpaywall.oastatus	gold	*
iris.unpaywall.pdfurl	https://aclanthology.org/2025.acl-long.64.pdf	*
scopus.authority.anceserie	PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING###0736-587X	*
scopus.category	1203	*
scopus.category	3310	*
scopus.category	1706	*
scopus.contributor.affiliation	ItaliaNLP Lab	-
scopus.contributor.affiliation	ItaliaNLP Lab	-
scopus.contributor.affiliation	ItaliaNLP Lab	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.auid	59504212000	-
scopus.contributor.auid	57211678681	-
scopus.contributor.auid	57540567000	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.dptid	114087935	-
scopus.contributor.dptid	114087935	-
scopus.contributor.dptid	114087935	-
scopus.contributor.name	Cristiano	-
scopus.contributor.name	Alessio	-
scopus.contributor.name	Felice	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC);	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC);	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC);	-
scopus.contributor.surname	Ciaccio	-
scopus.contributor.surname	Miaschi	-
scopus.contributor.surname	Dell'Orletta	-
scopus.date.issued	2025	*
scopus.description.abstracteng	We present a novel evaluation framework designed to assess the lexical proficiency and linguistic creativity of Transformer-based Language Models (LMs). We validate the framework by analyzing the performance of a set of LMs of different sizes, in both mono- and multilingual configuration, across tasks involving the generation, definition, and contextual usage of lexicalized words, neologisms, and nonce words. To support these evaluations, we developed a novel dataset of lexical entries for the Italian language, including curated definitions and usage examples sourced from various online platforms. The results highlight the robustness and effectiveness of our framework in evaluating multiple dimensions of LMs' linguistic understanding and offer an insight, through the assessment of their linguistic creativity, on the lexical generalization abilities of LMs.	*
scopus.description.allpeopleoriginal	Ciaccio C.; Miaschi A.; Dell'Orletta F.	*
scopus.differences	scopus.identifier.isbn	*
scopus.differences	scopus.relation.conferenceplace	*
scopus.document.type	cp	*
scopus.document.types	cp	*
scopus.funding.funders	100031478 - NextGenerationEU; 100031478 - NextGenerationEU;	*
scopus.funding.ids	PE0000013-FAIR;	*
scopus.identifier.doi	10.18653/v1/2025.acl-long.64	*
scopus.identifier.isbn	9798891762510	*
scopus.identifier.pui	649063887	*
scopus.identifier.scopus	2-s2.0-105021058451	*
scopus.journal.sourceid	21101138302	*
scopus.language.iso	eng	*
scopus.publisher.name	Association for Computational Linguistics (ACL)	*
scopus.relation.conferencedate	2025	*
scopus.relation.conferencename	63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025	*
scopus.relation.conferenceplace	aut	*
scopus.relation.firstpage	1267	*
scopus.relation.lastpage	1286	*
scopus.relation.volume	1	*
scopus.title	Evaluating Lexical Proficiency in Neural Language Models	*
scopus.titleeng	Evaluating Lexical Proficiency in Neural Language Models	*
Appare nelle tipologie:	04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
2025.acl-long.64.pdf accesso aperto Licenza: Creative commons Dimensione 1.06 MB Formato Adobe PDF Visualizza/Apri	1.06 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/570462

Citazioni

ND

1

ND

social impact