Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space.

From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models

Dini L.;Domenichelli L.;Brunato D.;Dell'Orletta F.
2025

Abstract

Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space.
Campo DC Valore Lingua
dc.authority.anceserie PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Dini L. en
dc.authority.people Domenichelli L. en
dc.authority.people Brunato D. en
dc.authority.people Dell'Orletta F. en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2026/03/03 14:37:22 -
dc.date.available 2026/03/03 14:37:22 -
dc.date.firstsubmission 2026/03/02 18:13:39 *
dc.date.issued 2025 -
dc.date.submission 2026/03/02 18:13:39 *
dc.description.abstracteng Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space. -
dc.description.allpeople Dini, L.; Domenichelli, L.; Brunato, D.; Dell'Orletta, F. -
dc.description.allpeopleoriginal Dini L.; Domenichelli L.; Brunato D.; Dell'Orletta F. en
dc.description.fulltext open en
dc.description.international no en
dc.description.numberofauthors 4 -
dc.identifier.doi 10.18653/v1/2025.acl-long.870 en
dc.identifier.scopus 2-s2.0-105021066891 en
dc.identifier.source scopus *
dc.identifier.uri https://hdl.handle.net/20.500.14243/570446 -
dc.language.iso eng en
dc.publisher.name Association for Computational Linguistics (ACL) en
dc.relation.conferencedate 2025 en
dc.relation.conferencename 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 en
dc.relation.conferenceplace Vienna en
dc.relation.firstpage 17796 en
dc.relation.ispartofbook Proceedings of the Annual Meeting of the Association for Computational Linguistics en
dc.relation.lastpage 17813 en
dc.relation.numberofpages 18 en
dc.relation.volume 1 en
dc.subject.keywordseng Large Language Models (LLMs) -
dc.subject.keywordseng Eye-tracking -
dc.subject.keywordseng Interpretability -
dc.subject.singlekeyword Large Language Models (LLMs) *
dc.subject.singlekeyword Eye-tracking *
dc.subject.singlekeyword Interpretability *
dc.title From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.mediafilter.data 2026/03/04 02:52:19 *
iris.orcid.lastModifiedDate 2026/03/03 14:37:22 *
iris.orcid.lastModifiedMillisecond 1772545042157 *
iris.scopus.extIssued 2025 -
iris.scopus.extTitle From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.18653/v1/2025.acl-long.870 *
iris.unpaywall.isoa true *
iris.unpaywall.landingpage https://doi.org/10.18653/v1/2025.acl-long.870 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 04/03/2026 04:34:00 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1772595240275 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://aclanthology.org/2025.acl-long.870.pdf *
scopus.authority.anceserie PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING###0736-587X *
scopus.category 1203 *
scopus.category 3310 *
scopus.category 1706 *
scopus.contributor.affiliation University of Pisa -
scopus.contributor.affiliation University of Pisa -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 35185041000 -
scopus.contributor.auid 60169651800 -
scopus.contributor.auid 55237740200 -
scopus.contributor.auid 57540567000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Luca -
scopus.contributor.name Lucia -
scopus.contributor.name Dominique -
scopus.contributor.name Felice -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); -
scopus.contributor.surname Dini -
scopus.contributor.surname Domenichelli -
scopus.contributor.surname Brunato -
scopus.contributor.surname Dell'Orletta -
scopus.date.issued 2025 *
scopus.description.abstracteng Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space. *
scopus.description.allpeopleoriginal Dini L.; Domenichelli L.; Brunato D.; Dell'Orletta F. *
scopus.differences scopus.identifier.isbn *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.funding.funders 501100000780 - European Commission; 501100000780 - European Commission; *
scopus.funding.ids PNRRMAD-2022-12376692_VADALA' - CUP F83C22002470001; IsC93_HiNLM; *
scopus.identifier.doi 10.18653/v1/2025.acl-long.870 *
scopus.identifier.isbn 9798891762510 *
scopus.identifier.pui 649064082 *
scopus.identifier.scopus 2-s2.0-105021066891 *
scopus.journal.sourceid 21101138302 *
scopus.language.iso eng *
scopus.publisher.name Association for Computational Linguistics (ACL) *
scopus.relation.conferencedate 2025 *
scopus.relation.conferencename 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 *
scopus.relation.conferenceplace aut *
scopus.relation.firstpage 17796 *
scopus.relation.lastpage 17813 *
scopus.relation.volume 1 *
scopus.title From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models *
scopus.titleeng From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
2025.acl-long.870.pdf

accesso aperto

Licenza: Creative commons
Dimensione 509.98 kB
Formato Adobe PDF
509.98 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/570446
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact