CNR Institutional Research Information System

Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences.

Evaluating Large Language Models via Linguistic Profiling

Alessio Miaschi;Felice Dell'Orletta;Giulia Venturi

2024

Abstract

Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	en
dc.authority.people	Alessio Miaschi	en
dc.authority.people	Felice Dell'Orletta	en
dc.authority.people	Giulia Venturi	en
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.contributor.area	Non assegn	*
dc.contributor.area	Non assegn	*
dc.contributor.area	Non assegn	*
dc.date.accessioned	2024/12/16 17:04:28	-
dc.date.available	2024/12/16 17:04:28	-
dc.date.firstsubmission	2024/12/13 18:44:31	*
dc.date.issued	2024	-
dc.date.submission	2025/01/24 12:12:50	*
dc.description.abstract	Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences.	-
dc.description.allpeople	Miaschi, Alessio; Dell'Orletta, Felice; Venturi, Giulia	-
dc.description.allpeopleoriginal	Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi	en
dc.description.fulltext	open	en
dc.description.numberofauthors	3	-
dc.identifier.doi	10.18653/v1/2024.emnlp-main.166	en
dc.identifier.isbn	979-8-89176-164-3	en
dc.identifier.source	manual	*
dc.identifier.uri	https://hdl.handle.net/20.500.14243/518427	-
dc.identifier.url	https://aclanthology.org/2024.emnlp-main.166	en
dc.language.iso	eng	en
dc.publisher.country	USA	en
dc.publisher.name	Association for Computational Linguistics	en
dc.relation.conferencedate	12-16 novembre	en
dc.relation.conferencename	Conference on Empirical Methods in Natural Language Processing (EMNLP)	en
dc.relation.conferenceplace	Miami, Florida	en
dc.relation.firstpage	2835	en
dc.relation.ispartofbook	Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing	en
dc.relation.lastpage	2848	en
dc.relation.medium	ELETTRONICO	en
dc.relation.numberofpages	14	en
dc.subject.keywords	Large Language Models, Controllable Text Generation, Linguistic Profiling	-
dc.subject.singlekeyword	Large Language Models	*
dc.subject.singlekeyword	Controllable Text Generation	*
dc.subject.singlekeyword	Linguistic Profiling	*
dc.title	Evaluating Large Language Models via Linguistic Profiling	en
dc.type.circulation	Internazionale	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.impactfactor	si	en
dc.type.miur	273	-
dc.type.referee	Esperti anonimi	en
iris.mediafilter.data	2025/04/11 03:01:39	*
iris.orcid.lastModifiedDate	2025/02/07 11:27:40	*
iris.orcid.lastModifiedMillisecond	1738924060320	*
iris.sitodocente.maxattempts	1	-
iris.unpaywall.bestoaversion	publishedVersion	*
iris.unpaywall.doi	10.18653/v1/2024.emnlp-main.166	*
iris.unpaywall.isoa	true	*
iris.unpaywall.journalisindoaj	false	*
iris.unpaywall.landingpage	http://dx.doi.org/10.18653/v1/2024.emnlp-main.166	*
iris.unpaywall.license	cc-by	*
iris.unpaywall.metadataCallLastModified	13/02/2026 03:23:49	-
iris.unpaywall.metadataCallLastModifiedMillisecond	1770949429268	-
iris.unpaywall.oastatus	gold	*
iris.unpaywall.pdfurl	https://aclanthology.org/2024.emnlp-main.166.pdf	*
Appare nelle tipologie:	04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
2024.emnlp-main.166.pdf accesso aperto Tipologia: Versione Editoriale (PDF) Licenza: Creative commons Dimensione 765.08 kB Formato Adobe PDF Visualizza/Apri	765.08 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/518427

Citazioni

ND

ND

ND

social impact