Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences.

Evaluating Large Language Models via Linguistic Profiling

Alessio Miaschi;Felice Dell'Orletta;Giulia Venturi
2024

Abstract

Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Alessio Miaschi en
dc.authority.people Felice Dell'Orletta en
dc.authority.people Giulia Venturi en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/12/16 17:04:28 -
dc.date.available 2024/12/16 17:04:28 -
dc.date.firstsubmission 2024/12/13 18:44:31 *
dc.date.issued 2024 -
dc.date.submission 2025/01/24 12:12:50 *
dc.description.abstract Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences. -
dc.description.allpeople Miaschi, Alessio; Dell'Orletta, Felice; Venturi, Giulia -
dc.description.allpeopleoriginal Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi en
dc.description.fulltext open en
dc.description.numberofauthors 3 -
dc.identifier.doi 10.18653/v1/2024.emnlp-main.166 en
dc.identifier.isbn 979-8-89176-164-3 en
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/518427 -
dc.identifier.url https://aclanthology.org/2024.emnlp-main.166 en
dc.language.iso eng en
dc.publisher.country USA en
dc.publisher.name Association for Computational Linguistics en
dc.relation.conferencedate 12-16 novembre en
dc.relation.conferencename Conference on Empirical Methods in Natural Language Processing (EMNLP) en
dc.relation.conferenceplace Miami, Florida en
dc.relation.firstpage 2835 en
dc.relation.ispartofbook Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing en
dc.relation.lastpage 2848 en
dc.relation.medium ELETTRONICO en
dc.relation.numberofpages 14 en
dc.subject.keywords Large Language Models, Controllable Text Generation, Linguistic Profiling -
dc.subject.singlekeyword Large Language Models *
dc.subject.singlekeyword Controllable Text Generation *
dc.subject.singlekeyword Linguistic Profiling *
dc.title Evaluating Large Language Models via Linguistic Profiling en
dc.type.circulation Internazionale en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.impactfactor si en
dc.type.miur 273 -
dc.type.referee Esperti anonimi en
iris.mediafilter.data 2025/04/11 03:01:39 *
iris.orcid.lastModifiedDate 2025/02/07 11:27:40 *
iris.orcid.lastModifiedMillisecond 1738924060320 *
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.18653/v1/2024.emnlp-main.166 *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage http://dx.doi.org/10.18653/v1/2024.emnlp-main.166 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 13/02/2026 03:23:49 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1770949429268 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://aclanthology.org/2024.emnlp-main.166.pdf *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
2024.emnlp-main.166.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 765.08 kB
Formato Adobe PDF
765.08 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/518427
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact