Pragmatic competence presents a persistent challenge for Large Language Models (LLMs), as it requires contextdependent inference beyond literal meaning. This study examines whether few-shot prompting can reliably steer LLMs toward appropriate interpretations of indirect speech acts under small-data conditions. Focusing on Italian, we evaluate three LLMs on a small dataset that captures pragmatic ambiguity through graded plausibility judgments. We compare a zero-shot baseline with multiple few-shot prompting configurations that vary in the number and composition of demonstrations, as well as in the presence of explicit pragmatic guidance. Results show that few-shot prompting does not yield robust or monotonic improvements overall. While performance improves substantially for conventionalized indirect speech acts, gains for non-conventionalized indirect speech acts are unstable and limited. In contrast, introducing explicit pragmatic reasoning along with demonstrations through guided chain-of-thought prompting appears more promising. Overall, these findings highlight the limits of example-based steering for pragmatic inference and suggest that explicitly modeling pragmatic reasoning may be a more effective direction in small-data settings.

Steering Pragmatic Interpretation in LLMs: A Diagnostic Evaluation of Few-Shot and Reasoning-Based Prompting for Indirect Speech Acts

Dominique Brunato
2026

Abstract

Pragmatic competence presents a persistent challenge for Large Language Models (LLMs), as it requires contextdependent inference beyond literal meaning. This study examines whether few-shot prompting can reliably steer LLMs toward appropriate interpretations of indirect speech acts under small-data conditions. Focusing on Italian, we evaluate three LLMs on a small dataset that captures pragmatic ambiguity through graded plausibility judgments. We compare a zero-shot baseline with multiple few-shot prompting configurations that vary in the number and composition of demonstrations, as well as in the presence of explicit pragmatic guidance. Results show that few-shot prompting does not yield robust or monotonic improvements overall. While performance improves substantially for conventionalized indirect speech acts, gains for non-conventionalized indirect speech acts are unstable and limited. In contrast, introducing explicit pragmatic reasoning along with demonstrations through guided chain-of-thought prompting appears more promising. Overall, these findings highlight the limits of example-based steering for pragmatic inference and suggest that explicitly modeling pragmatic reasoning may be a more effective direction in small-data settings.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Massimiliano Orsini en
dc.authority.people Dominique Brunato en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.date.firstsubmission 2026/05/12 15:47:00 *
dc.date.issued 2026 -
dc.date.submission 2026/05/12 15:47:00 *
dc.description.abstracteng Pragmatic competence presents a persistent challenge for Large Language Models (LLMs), as it requires contextdependent inference beyond literal meaning. This study examines whether few-shot prompting can reliably steer LLMs toward appropriate interpretations of indirect speech acts under small-data conditions. Focusing on Italian, we evaluate three LLMs on a small dataset that captures pragmatic ambiguity through graded plausibility judgments. We compare a zero-shot baseline with multiple few-shot prompting configurations that vary in the number and composition of demonstrations, as well as in the presence of explicit pragmatic guidance. Results show that few-shot prompting does not yield robust or monotonic improvements overall. While performance improves substantially for conventionalized indirect speech acts, gains for non-conventionalized indirect speech acts are unstable and limited. In contrast, introducing explicit pragmatic reasoning along with demonstrations through guided chain-of-thought prompting appears more promising. Overall, these findings highlight the limits of example-based steering for pragmatic inference and suggest that explicitly modeling pragmatic reasoning may be a more effective direction in small-data settings. -
dc.description.allpeople Orsini, Massimiliano; Brunato, Dominique -
dc.description.allpeopleoriginal Massimiliano Orsini, Dominique Brunato en
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.isbn 978-2-493814-80-7 en
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/580582 -
dc.language.iso eng en
dc.relation.conferencename Workshop on Learning Non-Literal Expressions with Small Data @ LREC 2026 en
dc.relation.firstpage 12 en
dc.relation.ispartofbook Proceedings of the Workshop on Learning Non-Literal Expressions with Small Data, LREC 2026 en
dc.relation.lastpage 20 en
dc.relation.numberofpages 9 en
dc.subject.keywords Italian -
dc.subject.keywordseng Indirect Speech Acts -
dc.subject.keywordseng Few-shot Prompting -
dc.subject.keywordseng Large Language Models Evaluation -
dc.subject.singlekeyword Italian *
dc.subject.singlekeyword Indirect Speech Acts *
dc.subject.singlekeyword Few-shot Prompting *
dc.subject.singlekeyword Large Language Models Evaluation *
dc.title Steering Pragmatic Interpretation in LLMs: A Diagnostic Evaluation of Few-Shot and Reasoning-Based Prompting for Indirect Speech Acts en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.orcid.lastModifiedDate 2026/05/12 15:47:00 *
iris.orcid.lastModifiedMillisecond 1778593620321 *
iris.sitodocente.maxattempts 1 -
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/580582
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ente

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact