Cognitive signals, particularly eye-tracking data, offer a unique lens for understanding human sentence processing. Leveraging eye-gaze data from the English and Italian section of the Multilingual Eye-Movement Corpus (MECO), we designed a series of experiments aiming at exploring whether pre-trained neural language models (NLMs) encode patterns representative of human reading behavior and if directly incorporating this information through a fine-tuning process influences the cognitive plausibility of the model. Additionally, we sought to determine if such an impact persists through a downstream task. Our findings reveal that transformers encode eye-gaze-related information during pretraining and that explicitly integrating eye-tracking features increases model alignment with human attention. When investigating the effect of intermediate fine-tuning on eye-tracking data on the model's performance on a downstream task, we observe that this intermediate step does not result in catastrophic forgetting, despite the very different nature of the considered downstream task. In addition, the attention mechanism of models undergoing intermediate fine-tuning remains closely aligned with human attention. In conclusion, our comprehensive evaluation of NLMs informed by human attention patterns offers great potential for advancing the growing field of eXplainable Artificial Intelligence (XAI). Grounding language models in real-world cognitive processes enables the creation of systems that not only replicate human language output but also align with the cognitive mechanisms behind reading and comprehension. This alignment with human behavior enhances model adaptability, interpretability, and effectiveness, fostering more human-centric, transparent, and reliable AI applications across various domains.1

In the eyes of a language model: A comprehensive examination through eye-tracking data

Luca Dini;Dominique Brunato;Felice Dell'Orletta
2025

Abstract

Cognitive signals, particularly eye-tracking data, offer a unique lens for understanding human sentence processing. Leveraging eye-gaze data from the English and Italian section of the Multilingual Eye-Movement Corpus (MECO), we designed a series of experiments aiming at exploring whether pre-trained neural language models (NLMs) encode patterns representative of human reading behavior and if directly incorporating this information through a fine-tuning process influences the cognitive plausibility of the model. Additionally, we sought to determine if such an impact persists through a downstream task. Our findings reveal that transformers encode eye-gaze-related information during pretraining and that explicitly integrating eye-tracking features increases model alignment with human attention. When investigating the effect of intermediate fine-tuning on eye-tracking data on the model's performance on a downstream task, we observe that this intermediate step does not result in catastrophic forgetting, despite the very different nature of the considered downstream task. In addition, the attention mechanism of models undergoing intermediate fine-tuning remains closely aligned with human attention. In conclusion, our comprehensive evaluation of NLMs informed by human attention patterns offers great potential for advancing the growing field of eXplainable Artificial Intelligence (XAI). Grounding language models in real-world cognitive processes enables the creation of systems that not only replicate human language output but also align with the cognitive mechanisms behind reading and comprehension. This alignment with human behavior enhances model adaptability, interpretability, and effectiveness, fostering more human-centric, transparent, and reliable AI applications across various domains.1
Campo DC Valore Lingua
dc.authority.ancejournal NEUROCOMPUTING en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Luca Dini en
dc.authority.people Luca Moroni en
dc.authority.people Dominique Brunato en
dc.authority.people Felice Dell'Orletta en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2026/03/03 15:05:51 -
dc.date.available 2026/03/03 15:05:51 -
dc.date.firstsubmission 2026/03/02 18:26:56 *
dc.date.issued 2025 -
dc.date.submission 2026/03/02 18:26:56 *
dc.description.abstracteng Cognitive signals, particularly eye-tracking data, offer a unique lens for understanding human sentence processing. Leveraging eye-gaze data from the English and Italian section of the Multilingual Eye-Movement Corpus (MECO), we designed a series of experiments aiming at exploring whether pre-trained neural language models (NLMs) encode patterns representative of human reading behavior and if directly incorporating this information through a fine-tuning process influences the cognitive plausibility of the model. Additionally, we sought to determine if such an impact persists through a downstream task. Our findings reveal that transformers encode eye-gaze-related information during pretraining and that explicitly integrating eye-tracking features increases model alignment with human attention. When investigating the effect of intermediate fine-tuning on eye-tracking data on the model's performance on a downstream task, we observe that this intermediate step does not result in catastrophic forgetting, despite the very different nature of the considered downstream task. In addition, the attention mechanism of models undergoing intermediate fine-tuning remains closely aligned with human attention. In conclusion, our comprehensive evaluation of NLMs informed by human attention patterns offers great potential for advancing the growing field of eXplainable Artificial Intelligence (XAI). Grounding language models in real-world cognitive processes enables the creation of systems that not only replicate human language output but also align with the cognitive mechanisms behind reading and comprehension. This alignment with human behavior enhances model adaptability, interpretability, and effectiveness, fostering more human-centric, transparent, and reliable AI applications across various domains.1 -
dc.description.allpeople Dini, Luca; Moroni, Luca; Brunato, Dominique; Dell'Orletta, Felice -
dc.description.allpeopleoriginal Luca Dini; Luca Moroni; Dominique Brunato; Felice Dell'Orletta en
dc.description.fulltext open en
dc.description.international no en
dc.description.numberofauthors 4 -
dc.identifier.doi 10.1016/j.neucom.2025.130617 en
dc.identifier.isi WOS:001533310500001 -
dc.identifier.scopus 2-s2.0-105010133396 en
dc.identifier.source orcid *
dc.identifier.uri https://hdl.handle.net/20.500.14243/570447 -
dc.language.iso eng en
dc.relation.volume 650 en
dc.subject.keywords Cognitive plausibility -
dc.subject.keywords Eye-tracking -
dc.subject.keywords Interpretability -
dc.subject.keywords Neural attention -
dc.subject.keywords Neural Language Models -
dc.subject.singlekeyword Cognitive plausibility *
dc.subject.singlekeyword Eye-tracking *
dc.subject.singlekeyword Interpretability *
dc.subject.singlekeyword Neural attention *
dc.subject.singlekeyword Neural Language Models *
dc.title In the eyes of a language model: A comprehensive examination through eye-tracking data en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
iris.isi.extIssued 2025 -
iris.isi.extTitle In the eyes of a language model: A comprehensive examination through eye-tracking data -
iris.mediafilter.data 2026/03/04 02:52:28 *
iris.orcid.lastModifiedDate 2026/03/04 01:09:50 *
iris.orcid.lastModifiedMillisecond 1772582990917 *
iris.scopus.extIssued 2025 -
iris.scopus.extTitle In the eyes of a language model: A comprehensive examination through eye-tracking data -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost publisher *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.1016/j.neucom.2025.130617 *
iris.unpaywall.hosttype publisher *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://doi.org/10.1016/j.neucom.2025.130617 *
iris.unpaywall.license cc-by-nc-nd *
iris.unpaywall.metadataCallLastModified 04/03/2026 04:33:59 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1772595239909 -
iris.unpaywall.oastatus hybrid *
isi.authority.ancejournal NEUROCOMPUTING###0925-2312 *
isi.category EP *
isi.contributor.affiliation Inst Computat Linguist Antonio Zampolli CNR ILC -
isi.contributor.affiliation Sapienza University Rome -
isi.contributor.affiliation Inst Computat Linguist Antonio Zampolli CNR ILC -
isi.contributor.affiliation Inst Computat Linguist Antonio Zampolli CNR ILC -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Luca -
isi.contributor.name Luca -
isi.contributor.name Dominique -
isi.contributor.name Felice -
isi.contributor.researcherId EQZ-6001-2022 -
isi.contributor.researcherId PFS-3644-2026 -
isi.contributor.researcherId MCK-5206-2025 -
isi.contributor.researcherId AAX-1864-2020 -
isi.contributor.subaffiliation ItaliaNLP Lab -
isi.contributor.subaffiliation Sapienza NLP Grp -
isi.contributor.subaffiliation ItaliaNLP Lab -
isi.contributor.subaffiliation ItaliaNLP Lab -
isi.contributor.surname Dini -
isi.contributor.surname Moroni -
isi.contributor.surname Brunato -
isi.contributor.surname Dell'Orletta -
isi.date.issued 2025 *
isi.description.abstracteng Cognitive signals, particularly eye-tracking data, offer a unique lens for understanding human sentence processing. Leveraging eye-gaze data from the English and Italian section of the Multilingual Eye-Movement Corpus (MECO), we designed a series of experiments aiming at exploring whether pre-trained neural language models (NLMs) encode patterns representative of human reading behavior and if directly incorporating this information through a fine-tuning process influences the cognitive plausibility of the model. Additionally, we sought to determine if such an impact persists through a downstream task. Our findings reveal that transformers encode eye-gaze-related information during pretraining and that explicitly integrating eye-tracking features increases model alignment with human attention. When investigating the effect of intermediate fine-tuning on eye-tracking data on the model's performance on a downstream task, we observe that this intermediate step does not result in catastrophic forgetting, despite the very different nature of the considered downstream task. In addition, the attention mechanism of models undergoing intermediate fine-tuning remains closely aligned with human attention. In conclusion, our comprehensive evaluation of NLMs informed by human attention patterns offers great potential for advancing the growing field of eXplainable Artificial Intelligence (XAI). Grounding language models in real-world cognitive processes enables the creation of systems that not only replicate human language output but also align with the cognitive mechanisms behind reading and comprehension. This alignment with human behavior enhances model adaptability, interpretability, and effectiveness, fostering more human-centric, transparent, and reliable AI applications across various domains. *
isi.description.allpeopleoriginal Dini, L; Moroni, L; Brunato, D; Dell'Orletta, F; *
isi.document.sourcetype WOS.SCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.1016/j.neucom.2025.130617 *
isi.identifier.eissn 1872-8286 *
isi.identifier.isi WOS:001533310500001 *
isi.journal.journaltitle NEUROCOMPUTING *
isi.journal.journaltitleabbrev NEUROCOMPUTING *
isi.language.original English *
isi.publisher.place RADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS *
isi.relation.volume 650 *
isi.title In the eyes of a language model: A comprehensive examination through eye-tracking data *
scopus.authority.ancejournal NEUROCOMPUTING###0925-2312 *
scopus.category 1706 *
scopus.category 2805 *
scopus.category 1702 *
scopus.contributor.affiliation University of Pisa -
scopus.contributor.affiliation Sapienza University of Rome/Sapienza NLP group -
scopus.contributor.affiliation Institute of Computational Linguistics “Antonio Zampolli” (CNR_ILC)/ItaliaNLP Lab -
scopus.contributor.affiliation Institute of Computational Linguistics “Antonio Zampolli” (CNR_ILC)/ItaliaNLP Lab -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60032350 -
scopus.contributor.afid 132192851 -
scopus.contributor.afid 132192851 -
scopus.contributor.auid 35185041000 -
scopus.contributor.auid 59505682200 -
scopus.contributor.auid 55237740200 -
scopus.contributor.auid 57540567000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Luca -
scopus.contributor.name Luca -
scopus.contributor.name Dominique -
scopus.contributor.name Felice -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.surname Dini -
scopus.contributor.surname Moroni -
scopus.contributor.surname Brunato -
scopus.contributor.surname Dell'Orletta -
scopus.date.issued 2025 *
scopus.description.abstracteng Cognitive signals, particularly eye-tracking data, offer a unique lens for understanding human sentence processing. Leveraging eye-gaze data from the English and Italian section of the Multilingual Eye-Movement Corpus (MECO), we designed a series of experiments aiming at exploring whether pre-trained neural language models (NLMs) encode patterns representative of human reading behavior and if directly incorporating this information through a fine-tuning process influences the cognitive plausibility of the model. Additionally, we sought to determine if such an impact persists through a downstream task. Our findings reveal that transformers encode eye-gaze-related information during pretraining and that explicitly integrating eye-tracking features increases model alignment with human attention. When investigating the effect of intermediate fine-tuning on eye-tracking data on the model's performance on a downstream task, we observe that this intermediate step does not result in catastrophic forgetting, despite the very different nature of the considered downstream task. In addition, the attention mechanism of models undergoing intermediate fine-tuning remains closely aligned with human attention. In conclusion, our comprehensive evaluation of NLMs informed by human attention patterns offers great potential for advancing the growing field of eXplainable Artificial Intelligence (XAI). Grounding language models in real-world cognitive processes enables the creation of systems that not only replicate human language output but also align with the cognitive mechanisms behind reading and comprehension. This alignment with human behavior enhances model adaptability, interpretability, and effectiveness, fostering more human-centric, transparent, and reliable AI applications across various domains.1 *
scopus.description.allpeopleoriginal Dini L.; Moroni L.; Brunato D.; Dell'Orletta F. *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.description.allpeopleoriginal *
scopus.document.type ar *
scopus.document.types ar *
scopus.funding.funders 501100000780 - European Commission; 501100000780 - European Commission; 501100021856 - Ministero dell'Università e della Ricerca; 501100021856 - Ministero dell'Università e della Ricerca; *
scopus.funding.ids PNRR-MAD-2022-12376692_VADALA’ - CUP F83C22002470001; 2022BNE97C_SH4_PRIN2022; *
scopus.identifier.doi 10.1016/j.neucom.2025.130617 *
scopus.identifier.eissn 1872-8286 *
scopus.identifier.pui 2039518360 *
scopus.identifier.scopus 2-s2.0-105010133396 *
scopus.journal.sourceid 24807 *
scopus.language.iso eng *
scopus.publisher.name Elsevier B.V. *
scopus.relation.article 130617 *
scopus.relation.volume 650 *
scopus.subject.keywords Cognitive plausibility; Eye-tracking; Interpretability; Neural attention; Neural Language Models; *
scopus.title In the eyes of a language model: A comprehensive examination through eye-tracking data *
scopus.titleeng In the eyes of a language model: A comprehensive examination through eye-tracking data *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0925231225012895-main.pdf

accesso aperto

Licenza: Creative commons
Dimensione 15.87 MB
Formato Adobe PDF
15.87 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/570447
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact