Large Language Models (LLMs) are increasingly used in the social sciences and humanities (SSH) to support the analysis of complex textual data, raising methodological questions about evaluation and interpretive reliability. This paper explores the use of LLMs in Critical Discourse Analysis (CDA), considered here as a paradigmatic case of interpretive research in SSH, through a preliminary consensus-based evaluation framework. The study reports on a pilot experiment conducted on a small, theory-driven corpus of opinion articles addressing the October 7, 2023 attack and its aftermath. An LLM is asked to answer analytically motivated questions targeting different levels of discourse structure. Its responses are compared with annotations produced by multiple human analysts and aggregated through a consensus-based procedure. The results reveal an asymmetry in model performance: while LLMs align well with human consensus on macro- and superstructural features, they struggle with microstructural phenomena involving implicit meaning. These findings support the view of LLMs as epistemic support tools rather than replacements for human interpretation.

Exploring the Use of Large Language Models in Critical Discourse Analysis: A Consensus-Based Pilot Study

emiliano giovannetti
Primo
;
francesca cristiano
2026

Abstract

Large Language Models (LLMs) are increasingly used in the social sciences and humanities (SSH) to support the analysis of complex textual data, raising methodological questions about evaluation and interpretive reliability. This paper explores the use of LLMs in Critical Discourse Analysis (CDA), considered here as a paradigmatic case of interpretive research in SSH, through a preliminary consensus-based evaluation framework. The study reports on a pilot experiment conducted on a small, theory-driven corpus of opinion articles addressing the October 7, 2023 attack and its aftermath. An LLM is asked to answer analytically motivated questions targeting different levels of discourse structure. Its responses are compared with annotations produced by multiple human analysts and aggregated through a consensus-based procedure. The results reveal an asymmetry in model performance: while LLMs align well with human consensus on macro- and superstructural features, they struggle with microstructural phenomena involving implicit meaning. These findings support the view of LLMs as epistemic support tools rather than replacements for human interpretation.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people emiliano giovannetti en
dc.authority.people francesca cristiano en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.firstsubmission 2026/05/18 17:22:04 *
dc.date.issued 2026 -
dc.date.submission 2026/05/18 17:22:04 *
dc.description.abstracteng Large Language Models (LLMs) are increasingly used in the social sciences and humanities (SSH) to support the analysis of complex textual data, raising methodological questions about evaluation and interpretive reliability. This paper explores the use of LLMs in Critical Discourse Analysis (CDA), considered here as a paradigmatic case of interpretive research in SSH, through a preliminary consensus-based evaluation framework. The study reports on a pilot experiment conducted on a small, theory-driven corpus of opinion articles addressing the October 7, 2023 attack and its aftermath. An LLM is asked to answer analytically motivated questions targeting different levels of discourse structure. Its responses are compared with annotations produced by multiple human analysts and aggregated through a consensus-based procedure. The results reveal an asymmetry in model performance: while LLMs align well with human consensus on macro- and superstructural features, they struggle with microstructural phenomena involving implicit meaning. These findings support the view of LLMs as epistemic support tools rather than replacements for human interpretation. -
dc.description.allpeople Giovannetti, Emiliano; Cristiano, Francesca -
dc.description.allpeopleoriginal emiliano giovannetti,francesca cristiano en
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.isbn 978-2-493814-85-2 en
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/582229 -
dc.identifier.url http://lrec-conf.org/proceedings/lrec2026/workshops/llms4ssh/2026.llms4ssh-1.0.pdf en
dc.language.iso eng en
dc.publisher.name ELRA Language Resources Association (ELRA) en
dc.relation.allauthors Arturo Montejo-Ráez, Cristina Grisot, Joanna Blochowiak en
dc.relation.conferencedate 11 maggio 2026 en
dc.relation.conferencename Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026 en
dc.relation.conferenceplace Palma di Maiorca en
dc.relation.firstpage 18 en
dc.relation.ispartofbook Proceedings of Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026 en
dc.relation.lastpage 22 en
dc.relation.medium ELETTRONICO en
dc.relation.numberofpages 5 en
dc.subject.keywordseng critical discourse analysis, large language models, interpretive evaluation, consensus-based analysis, social sciences and humanities -
dc.subject.singlekeyword critical discourse analysis *
dc.subject.singlekeyword large language models *
dc.subject.singlekeyword interpretive evaluation *
dc.subject.singlekeyword consensus-based analysis *
dc.subject.singlekeyword social sciences and humanities *
dc.title Exploring the Use of Large Language Models in Critical Discourse Analysis: A Consensus-Based Pilot Study en
dc.type.circulation Internazionale en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.orcid.lastModifiedDate 2026/05/18 17:22:04 *
iris.orcid.lastModifiedMillisecond 1779117724927 *
iris.sitodocente.maxattempts 1 -
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/582229
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ente

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact