Pragmatic competence presents a persistent challenge for Large Language Models (LLMs), as it requires contextdependent inference beyond literal meaning. This study examines whether few-shot prompting can reliably steer LLMs toward appropriate interpretations of indirect speech acts under small-data conditions. Focusing on Italian, we evaluate three LLMs on a small dataset that captures pragmatic ambiguity through graded plausibility judgments. We compare a zero-shot baseline with multiple few-shot prompting configurations that vary in the number and composition of demonstrations, as well as in the presence of explicit pragmatic guidance. Results show that few-shot prompting does not yield robust or monotonic improvements overall. While performance improves substantially for conventionalized indirect speech acts, gains for non-conventionalized indirect speech acts are unstable and limited. In contrast, introducing explicit pragmatic reasoning along with demonstrations through guided chain-of-thought prompting appears more promising. Overall, these findings highlight the limits of example-based steering for pragmatic inference and suggest that explicitly modeling pragmatic reasoning may be a more effective direction in small-data settings.
Steering Pragmatic Interpretation in LLMs: A Diagnostic Evaluation of Few-Shot and Reasoning-Based Prompting for Indirect Speech Acts
Dominique Brunato
2026
Abstract
Pragmatic competence presents a persistent challenge for Large Language Models (LLMs), as it requires contextdependent inference beyond literal meaning. This study examines whether few-shot prompting can reliably steer LLMs toward appropriate interpretations of indirect speech acts under small-data conditions. Focusing on Italian, we evaluate three LLMs on a small dataset that captures pragmatic ambiguity through graded plausibility judgments. We compare a zero-shot baseline with multiple few-shot prompting configurations that vary in the number and composition of demonstrations, as well as in the presence of explicit pragmatic guidance. Results show that few-shot prompting does not yield robust or monotonic improvements overall. While performance improves substantially for conventionalized indirect speech acts, gains for non-conventionalized indirect speech acts are unstable and limited. In contrast, introducing explicit pragmatic reasoning along with demonstrations through guided chain-of-thought prompting appears more promising. Overall, these findings highlight the limits of example-based steering for pragmatic inference and suggest that explicitly modeling pragmatic reasoning may be a more effective direction in small-data settings.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Massimiliano Orsini | en |
| dc.authority.people | Dominique Brunato | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.date.firstsubmission | 2026/05/12 15:47:00 | * |
| dc.date.issued | 2026 | - |
| dc.date.submission | 2026/05/12 15:47:00 | * |
| dc.description.abstracteng | Pragmatic competence presents a persistent challenge for Large Language Models (LLMs), as it requires contextdependent inference beyond literal meaning. This study examines whether few-shot prompting can reliably steer LLMs toward appropriate interpretations of indirect speech acts under small-data conditions. Focusing on Italian, we evaluate three LLMs on a small dataset that captures pragmatic ambiguity through graded plausibility judgments. We compare a zero-shot baseline with multiple few-shot prompting configurations that vary in the number and composition of demonstrations, as well as in the presence of explicit pragmatic guidance. Results show that few-shot prompting does not yield robust or monotonic improvements overall. While performance improves substantially for conventionalized indirect speech acts, gains for non-conventionalized indirect speech acts are unstable and limited. In contrast, introducing explicit pragmatic reasoning along with demonstrations through guided chain-of-thought prompting appears more promising. Overall, these findings highlight the limits of example-based steering for pragmatic inference and suggest that explicitly modeling pragmatic reasoning may be a more effective direction in small-data settings. | - |
| dc.description.allpeople | Orsini, Massimiliano; Brunato, Dominique | - |
| dc.description.allpeopleoriginal | Massimiliano Orsini, Dominique Brunato | en |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 2 | - |
| dc.identifier.isbn | 978-2-493814-80-7 | en |
| dc.identifier.source | manual | * |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/580582 | - |
| dc.language.iso | eng | en |
| dc.relation.conferencename | Workshop on Learning Non-Literal Expressions with Small Data @ LREC 2026 | en |
| dc.relation.firstpage | 12 | en |
| dc.relation.ispartofbook | Proceedings of the Workshop on Learning Non-Literal Expressions with Small Data, LREC 2026 | en |
| dc.relation.lastpage | 20 | en |
| dc.relation.numberofpages | 9 | en |
| dc.subject.keywords | Italian | - |
| dc.subject.keywordseng | Indirect Speech Acts | - |
| dc.subject.keywordseng | Few-shot Prompting | - |
| dc.subject.keywordseng | Large Language Models Evaluation | - |
| dc.subject.singlekeyword | Italian | * |
| dc.subject.singlekeyword | Indirect Speech Acts | * |
| dc.subject.singlekeyword | Few-shot Prompting | * |
| dc.subject.singlekeyword | Large Language Models Evaluation | * |
| dc.title | Steering Pragmatic Interpretation in LLMs: A Diagnostic Evaluation of Few-Shot and Reasoning-Based Prompting for Indirect Speech Acts | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| iris.orcid.lastModifiedDate | 2026/05/12 15:47:00 | * |
| iris.orcid.lastModifiedMillisecond | 1778593620321 | * |
| iris.sitodocente.maxattempts | 1 | - |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


