Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space.
From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models
Dini L.;Domenichelli L.;Brunato D.;Dell'Orletta F.
2025
Abstract
Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.anceserie | PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING | en |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Dini L. | en |
| dc.authority.people | Domenichelli L. | en |
| dc.authority.people | Brunato D. | en |
| dc.authority.people | Dell'Orletta F. | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2026/03/03 14:37:22 | - |
| dc.date.available | 2026/03/03 14:37:22 | - |
| dc.date.firstsubmission | 2026/03/02 18:13:39 | * |
| dc.date.issued | 2025 | - |
| dc.date.submission | 2026/03/02 18:13:39 | * |
| dc.description.abstracteng | Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space. | - |
| dc.description.allpeople | Dini, L.; Domenichelli, L.; Brunato, D.; Dell'Orletta, F. | - |
| dc.description.allpeopleoriginal | Dini L.; Domenichelli L.; Brunato D.; Dell'Orletta F. | en |
| dc.description.fulltext | open | en |
| dc.description.international | no | en |
| dc.description.numberofauthors | 4 | - |
| dc.identifier.doi | 10.18653/v1/2025.acl-long.870 | en |
| dc.identifier.scopus | 2-s2.0-105021066891 | en |
| dc.identifier.source | scopus | * |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/570446 | - |
| dc.language.iso | eng | en |
| dc.publisher.name | Association for Computational Linguistics (ACL) | en |
| dc.relation.conferencedate | 2025 | en |
| dc.relation.conferencename | 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 | en |
| dc.relation.conferenceplace | Vienna | en |
| dc.relation.firstpage | 17796 | en |
| dc.relation.ispartofbook | Proceedings of the Annual Meeting of the Association for Computational Linguistics | en |
| dc.relation.lastpage | 17813 | en |
| dc.relation.numberofpages | 18 | en |
| dc.relation.volume | 1 | en |
| dc.subject.keywordseng | Large Language Models (LLMs) | - |
| dc.subject.keywordseng | Eye-tracking | - |
| dc.subject.keywordseng | Interpretability | - |
| dc.subject.singlekeyword | Large Language Models (LLMs) | * |
| dc.subject.singlekeyword | Eye-tracking | * |
| dc.subject.singlekeyword | Interpretability | * |
| dc.title | From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| iris.mediafilter.data | 2026/03/04 02:52:19 | * |
| iris.orcid.lastModifiedDate | 2026/03/03 14:37:22 | * |
| iris.orcid.lastModifiedMillisecond | 1772545042157 | * |
| iris.scopus.extIssued | 2025 | - |
| iris.scopus.extTitle | From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models | - |
| iris.sitodocente.maxattempts | 1 | - |
| iris.unpaywall.bestoaversion | publishedVersion | * |
| iris.unpaywall.doi | 10.18653/v1/2025.acl-long.870 | * |
| iris.unpaywall.isoa | true | * |
| iris.unpaywall.landingpage | https://doi.org/10.18653/v1/2025.acl-long.870 | * |
| iris.unpaywall.license | cc-by | * |
| iris.unpaywall.metadataCallLastModified | 04/03/2026 04:34:00 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1772595240275 | - |
| iris.unpaywall.oastatus | gold | * |
| iris.unpaywall.pdfurl | https://aclanthology.org/2025.acl-long.870.pdf | * |
| scopus.authority.anceserie | PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING###0736-587X | * |
| scopus.category | 1203 | * |
| scopus.category | 3310 | * |
| scopus.category | 1706 | * |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | ItaliaNLP Lab | - |
| scopus.contributor.affiliation | ItaliaNLP Lab | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.auid | 35185041000 | - |
| scopus.contributor.auid | 60169651800 | - |
| scopus.contributor.auid | 55237740200 | - |
| scopus.contributor.auid | 57540567000 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | 114087935 | - |
| scopus.contributor.dptid | 114087935 | - |
| scopus.contributor.name | Luca | - |
| scopus.contributor.name | Lucia | - |
| scopus.contributor.name | Dominique | - |
| scopus.contributor.name | Felice | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC); | - |
| scopus.contributor.surname | Dini | - |
| scopus.contributor.surname | Domenichelli | - |
| scopus.contributor.surname | Brunato | - |
| scopus.contributor.surname | Dell'Orletta | - |
| scopus.date.issued | 2025 | * |
| scopus.description.abstracteng | Cognitive signals, particularly eye-tracking data, offer valuable insights into human language processing. Leveraging eye-gaze data from the Ghent Eye-Tracking Corpus, we conducted a series of experiments to examine how integrating knowledge of human reading behavior impacts Neural Language Models (NLMs) across multiple dimensions: task performance, attention mechanisms, and the geometry of their embedding space. We explored several fine-tuning methodologies to inject eyetracking features into the models. Our results reveal that incorporating these features does not degrade downstream task performance, enhances alignment between model attention and human attention patterns, and compresses the geometry of the embedding space. | * |
| scopus.description.allpeopleoriginal | Dini L.; Domenichelli L.; Brunato D.; Dell'Orletta F. | * |
| scopus.differences | scopus.identifier.isbn | * |
| scopus.differences | scopus.relation.conferenceplace | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.funding.funders | 501100000780 - European Commission; 501100000780 - European Commission; | * |
| scopus.funding.ids | PNRRMAD-2022-12376692_VADALA' - CUP F83C22002470001; IsC93_HiNLM; | * |
| scopus.identifier.doi | 10.18653/v1/2025.acl-long.870 | * |
| scopus.identifier.isbn | 9798891762510 | * |
| scopus.identifier.pui | 649064082 | * |
| scopus.identifier.scopus | 2-s2.0-105021066891 | * |
| scopus.journal.sourceid | 21101138302 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | Association for Computational Linguistics (ACL) | * |
| scopus.relation.conferencedate | 2025 | * |
| scopus.relation.conferencename | 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 | * |
| scopus.relation.conferenceplace | aut | * |
| scopus.relation.firstpage | 17796 | * |
| scopus.relation.lastpage | 17813 | * |
| scopus.relation.volume | 1 | * |
| scopus.title | From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models | * |
| scopus.titleeng | From Human Reading to NLM Understanding: Evaluating the Role of Eye-Tracking Data in Encoder-Based Models | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
2025.acl-long.870.pdf
accesso aperto
Licenza:
Creative commons
Dimensione
509.98 kB
Formato
Adobe PDF
|
509.98 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


