Large Language Models (LLMs) are increasingly used in the social sciences and humanities (SSH) to support the analysis of complex textual data, raising methodological questions about evaluation and interpretive reliability. This paper explores the use of LLMs in Critical Discourse Analysis (CDA), considered here as a paradigmatic case of interpretive research in SSH, through a preliminary consensus-based evaluation framework. The study reports on a pilot experiment conducted on a small, theory-driven corpus of opinion articles addressing the October 7, 2023 attack and its aftermath. An LLM is asked to answer analytically motivated questions targeting different levels of discourse structure. Its responses are compared with annotations produced by multiple human analysts and aggregated through a consensus-based procedure. The results reveal an asymmetry in model performance: while LLMs align well with human consensus on macro- and superstructural features, they struggle with microstructural phenomena involving implicit meaning. These findings support the view of LLMs as epistemic support tools rather than replacements for human interpretation.
Exploring the Use of Large Language Models in Critical Discourse Analysis: A Consensus-Based Pilot Study
emiliano giovannetti
Primo
;francesca cristiano
2026
Abstract
Large Language Models (LLMs) are increasingly used in the social sciences and humanities (SSH) to support the analysis of complex textual data, raising methodological questions about evaluation and interpretive reliability. This paper explores the use of LLMs in Critical Discourse Analysis (CDA), considered here as a paradigmatic case of interpretive research in SSH, through a preliminary consensus-based evaluation framework. The study reports on a pilot experiment conducted on a small, theory-driven corpus of opinion articles addressing the October 7, 2023 attack and its aftermath. An LLM is asked to answer analytically motivated questions targeting different levels of discourse structure. Its responses are compared with annotations produced by multiple human analysts and aggregated through a consensus-based procedure. The results reveal an asymmetry in model performance: while LLMs align well with human consensus on macro- and superstructural features, they struggle with microstructural phenomena involving implicit meaning. These findings support the view of LLMs as epistemic support tools rather than replacements for human interpretation.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | emiliano giovannetti | en |
| dc.authority.people | francesca cristiano | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.firstsubmission | 2026/05/18 17:22:04 | * |
| dc.date.issued | 2026 | - |
| dc.date.submission | 2026/05/18 17:22:04 | * |
| dc.description.abstracteng | Large Language Models (LLMs) are increasingly used in the social sciences and humanities (SSH) to support the analysis of complex textual data, raising methodological questions about evaluation and interpretive reliability. This paper explores the use of LLMs in Critical Discourse Analysis (CDA), considered here as a paradigmatic case of interpretive research in SSH, through a preliminary consensus-based evaluation framework. The study reports on a pilot experiment conducted on a small, theory-driven corpus of opinion articles addressing the October 7, 2023 attack and its aftermath. An LLM is asked to answer analytically motivated questions targeting different levels of discourse structure. Its responses are compared with annotations produced by multiple human analysts and aggregated through a consensus-based procedure. The results reveal an asymmetry in model performance: while LLMs align well with human consensus on macro- and superstructural features, they struggle with microstructural phenomena involving implicit meaning. These findings support the view of LLMs as epistemic support tools rather than replacements for human interpretation. | - |
| dc.description.allpeople | Giovannetti, Emiliano; Cristiano, Francesca | - |
| dc.description.allpeopleoriginal | emiliano giovannetti,francesca cristiano | en |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 2 | - |
| dc.identifier.isbn | 978-2-493814-85-2 | en |
| dc.identifier.source | manual | * |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/582229 | - |
| dc.identifier.url | http://lrec-conf.org/proceedings/lrec2026/workshops/llms4ssh/2026.llms4ssh-1.0.pdf | en |
| dc.language.iso | eng | en |
| dc.publisher.name | ELRA Language Resources Association (ELRA) | en |
| dc.relation.allauthors | Arturo Montejo-Ráez, Cristina Grisot, Joanna Blochowiak | en |
| dc.relation.conferencedate | 11 maggio 2026 | en |
| dc.relation.conferencename | Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026 | en |
| dc.relation.conferenceplace | Palma di Maiorca | en |
| dc.relation.firstpage | 18 | en |
| dc.relation.ispartofbook | Proceedings of Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026 | en |
| dc.relation.lastpage | 22 | en |
| dc.relation.medium | ELETTRONICO | en |
| dc.relation.numberofpages | 5 | en |
| dc.subject.keywordseng | critical discourse analysis, large language models, interpretive evaluation, consensus-based analysis, social sciences and humanities | - |
| dc.subject.singlekeyword | critical discourse analysis | * |
| dc.subject.singlekeyword | large language models | * |
| dc.subject.singlekeyword | interpretive evaluation | * |
| dc.subject.singlekeyword | consensus-based analysis | * |
| dc.subject.singlekeyword | social sciences and humanities | * |
| dc.title | Exploring the Use of Large Language Models in Critical Discourse Analysis: A Consensus-Based Pilot Study | en |
| dc.type.circulation | Internazionale | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| iris.orcid.lastModifiedDate | 2026/05/18 17:22:04 | * |
| iris.orcid.lastModifiedMillisecond | 1779117724927 | * |
| iris.sitodocente.maxattempts | 1 | - |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


