Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences.
Evaluating Large Language Models via Linguistic Profiling
Alessio Miaschi;Felice Dell'Orletta;Giulia Venturi
2024
Abstract
Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Alessio Miaschi | en |
| dc.authority.people | Felice Dell'Orletta | en |
| dc.authority.people | Giulia Venturi | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2024/12/16 17:04:28 | - |
| dc.date.available | 2024/12/16 17:04:28 | - |
| dc.date.firstsubmission | 2024/12/13 18:44:31 | * |
| dc.date.issued | 2024 | - |
| dc.date.submission | 2025/01/24 12:12:50 | * |
| dc.description.abstract | Large Language Models (LLMs) undergo extensive evaluation against various benchmarks collected in established leaderboards to assess their performance across multiple tasks. However, to the best of our knowledge, there is a lack of comprehensive studies evaluating these models’ linguistic abilities independent of specific tasks. In this paper, we introduce a novel evaluation methodology designed to test LLMs’ sentence generation abilities under specific linguistic constraints. Drawing on the ‘linguistic profiling’ approach, we rigorously investigate the extent to which five LLMs of varying sizes, tested in both zero- and few-shot scenarios, effectively adhere to (morpho)syntactic constraints. Our findings shed light on the linguistic proficiency of LLMs, revealing both their capabilities and limitations in generating linguistically-constrained sentences. | - |
| dc.description.allpeople | Miaschi, Alessio; Dell'Orletta, Felice; Venturi, Giulia | - |
| dc.description.allpeopleoriginal | Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi | en |
| dc.description.fulltext | open | en |
| dc.description.numberofauthors | 3 | - |
| dc.identifier.doi | 10.18653/v1/2024.emnlp-main.166 | en |
| dc.identifier.isbn | 979-8-89176-164-3 | en |
| dc.identifier.source | manual | * |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/518427 | - |
| dc.identifier.url | https://aclanthology.org/2024.emnlp-main.166 | en |
| dc.language.iso | eng | en |
| dc.publisher.country | USA | en |
| dc.publisher.name | Association for Computational Linguistics | en |
| dc.relation.conferencedate | 12-16 novembre | en |
| dc.relation.conferencename | Conference on Empirical Methods in Natural Language Processing (EMNLP) | en |
| dc.relation.conferenceplace | Miami, Florida | en |
| dc.relation.firstpage | 2835 | en |
| dc.relation.ispartofbook | Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing | en |
| dc.relation.lastpage | 2848 | en |
| dc.relation.medium | ELETTRONICO | en |
| dc.relation.numberofpages | 14 | en |
| dc.subject.keywords | Large Language Models, Controllable Text Generation, Linguistic Profiling | - |
| dc.subject.singlekeyword | Large Language Models | * |
| dc.subject.singlekeyword | Controllable Text Generation | * |
| dc.subject.singlekeyword | Linguistic Profiling | * |
| dc.title | Evaluating Large Language Models via Linguistic Profiling | en |
| dc.type.circulation | Internazionale | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.impactfactor | si | en |
| dc.type.miur | 273 | - |
| dc.type.referee | Esperti anonimi | en |
| iris.mediafilter.data | 2025/04/11 03:01:39 | * |
| iris.orcid.lastModifiedDate | 2025/02/07 11:27:40 | * |
| iris.orcid.lastModifiedMillisecond | 1738924060320 | * |
| iris.sitodocente.maxattempts | 1 | - |
| iris.unpaywall.bestoaversion | publishedVersion | * |
| iris.unpaywall.doi | 10.18653/v1/2024.emnlp-main.166 | * |
| iris.unpaywall.isoa | true | * |
| iris.unpaywall.journalisindoaj | false | * |
| iris.unpaywall.landingpage | http://dx.doi.org/10.18653/v1/2024.emnlp-main.166 | * |
| iris.unpaywall.license | cc-by | * |
| iris.unpaywall.metadataCallLastModified | 13/02/2026 03:23:49 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1770949429268 | - |
| iris.unpaywall.oastatus | gold | * |
| iris.unpaywall.pdfurl | https://aclanthology.org/2024.emnlp-main.166.pdf | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
2024.emnlp-main.166.pdf
accesso aperto
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
765.08 kB
Formato
Adobe PDF
|
765.08 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


