Extracting and formally representing the knowledge embedded in textbooks, such as the concepts explained and the relations between them, can support the provision of advanced knowledge-based services for learning environments and digital libraries. In this paper, we consider a specific type of relation in textbooks referred to as prerequisite relations (PR). PRs represent precedence relations between concepts aimed to provide the reader with the knowledge needed to understand a further concept(s). Their annotation in educational texts produces datasets that can be represented as a graph of concepts connected by PRs. However, building good-quality and reliable datasets of PRs from a textbook is still an open issue, not just for automated annotation methods but even for manual annotation. In turn, the lack of good-quality datasets and well-defined criteria to identify PRs affect the development and validation of automated methods for prerequisite identification. As a contribution to this issue, in this paper, we propose PREAP, a protocol for the annotation of prerequisite relations in textbooks aimed at obtaining reliable annotated data that can be shared, compared, and reused in the research community. PREAP defines a novel textbook-driven annotation method aimed to capture the structure of prerequisites underlying the text. The protocol has been evaluated against baseline methods for manual and automatic annotation. The findings show that PREAP enables the creation of prerequisite knowledge graphs that have higher inter-annotator agreement, accuracy, and alignment with text than the baseline methods. This suggests that the protocol is able to accurately capture the PRs expressed in the text. Furthermore, the findings show that the time required to complete the annotation using PREAP are significantly shorter than with the other manual baseline methods. The paper includes also guidelines for using PREAP in three annotation scenarios, experimentally tested. We also provide example datasets and a user interface that we developed to support prerequisite annotation.

Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph

Chiara Alzetta;
2023

Abstract

Extracting and formally representing the knowledge embedded in textbooks, such as the concepts explained and the relations between them, can support the provision of advanced knowledge-based services for learning environments and digital libraries. In this paper, we consider a specific type of relation in textbooks referred to as prerequisite relations (PR). PRs represent precedence relations between concepts aimed to provide the reader with the knowledge needed to understand a further concept(s). Their annotation in educational texts produces datasets that can be represented as a graph of concepts connected by PRs. However, building good-quality and reliable datasets of PRs from a textbook is still an open issue, not just for automated annotation methods but even for manual annotation. In turn, the lack of good-quality datasets and well-defined criteria to identify PRs affect the development and validation of automated methods for prerequisite identification. As a contribution to this issue, in this paper, we propose PREAP, a protocol for the annotation of prerequisite relations in textbooks aimed at obtaining reliable annotated data that can be shared, compared, and reused in the research community. PREAP defines a novel textbook-driven annotation method aimed to capture the structure of prerequisites underlying the text. The protocol has been evaluated against baseline methods for manual and automatic annotation. The findings show that PREAP enables the creation of prerequisite knowledge graphs that have higher inter-annotator agreement, accuracy, and alignment with text than the baseline methods. This suggests that the protocol is able to accurately capture the PRs expressed in the text. Furthermore, the findings show that the time required to complete the annotation using PREAP are significantly shorter than with the other manual baseline methods. The paper includes also guidelines for using PREAP in three annotation scenarios, experimentally tested. We also provide example datasets and a user interface that we developed to support prerequisite annotation.
Campo DC Valore Lingua
dc.authority.ancejournal TECHNOLOGY, KNOWLEDGE AND LEARNING en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Chiara Alzetta en
dc.authority.people Ilaria Torre en
dc.authority.people Frosina Koceva en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/02/20 03:26:31 -
dc.date.available 2024/02/20 03:26:31 -
dc.date.firstsubmission 2025/03/03 18:37:23 *
dc.date.issued 2023 -
dc.date.submission 2025/03/03 18:37:23 *
dc.description.abstracteng Extracting and formally representing the knowledge embedded in textbooks, such as the concepts explained and the relations between them, can support the provision of advanced knowledge-based services for learning environments and digital libraries. In this paper, we consider a specific type of relation in textbooks referred to as prerequisite relations (PR). PRs represent precedence relations between concepts aimed to provide the reader with the knowledge needed to understand a further concept(s). Their annotation in educational texts produces datasets that can be represented as a graph of concepts connected by PRs. However, building good-quality and reliable datasets of PRs from a textbook is still an open issue, not just for automated annotation methods but even for manual annotation. In turn, the lack of good-quality datasets and well-defined criteria to identify PRs affect the development and validation of automated methods for prerequisite identification. As a contribution to this issue, in this paper, we propose PREAP, a protocol for the annotation of prerequisite relations in textbooks aimed at obtaining reliable annotated data that can be shared, compared, and reused in the research community. PREAP defines a novel textbook-driven annotation method aimed to capture the structure of prerequisites underlying the text. The protocol has been evaluated against baseline methods for manual and automatic annotation. The findings show that PREAP enables the creation of prerequisite knowledge graphs that have higher inter-annotator agreement, accuracy, and alignment with text than the baseline methods. This suggests that the protocol is able to accurately capture the PRs expressed in the text. Furthermore, the findings show that the time required to complete the annotation using PREAP are significantly shorter than with the other manual baseline methods. The paper includes also guidelines for using PREAP in three annotation scenarios, experimentally tested. We also provide example datasets and a user interface that we developed to support prerequisite annotation. -
dc.description.affiliations Institute of Computational Linguistics "A. Zampolli", ItaliaNLP Lab, CNR-ILC, Pisa, Italy Department of Informatics, Bioengineering, Robotics and Systems Engineering, University of Genoa, Genoa, Italy E-Learning and Knowledge Management Lab, DIBRIS, University of Genoa, Genoa, Italy -
dc.description.allpeople Alzetta, Chiara; Torre, Ilaria; Koceva, Frosina -
dc.description.allpeopleoriginal Chiara Alzetta, Ilaria Torre & Frosina Koceva en
dc.description.fulltext open en
dc.description.numberofauthors 3 -
dc.identifier.doi 10.1007/s10758-023-09682-6 en
dc.identifier.isi WOS:001067462800001 -
dc.identifier.scopus 2-s2.0-85171573789 en
dc.identifier.uri https://hdl.handle.net/20.500.14243/450158 -
dc.identifier.url https://rdcu.be/dxjsm en
dc.language.iso eng en
dc.miur.last.status.update 2024-06-28T14:02:16Z *
dc.subject.keywords Text annotation -
dc.subject.keywords annotation protocol -
dc.subject.keywords knowledge engeneering -
dc.subject.keywords educational textbook -
dc.subject.singlekeyword Text annotation *
dc.subject.singlekeyword annotation protocol *
dc.subject.singlekeyword knowledge engeneering *
dc.subject.singlekeyword educational textbook *
dc.title Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Sì, ma tipo non specificato en
dc.ugov.descaux1 492339 -
iris.isi.extIssued 2024 -
iris.isi.extTitle Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph -
iris.mediafilter.data 2025/04/04 04:36:33 *
iris.orcid.lastModifiedDate 2025/03/03 18:41:29 *
iris.orcid.lastModifiedMillisecond 1741023689674 *
iris.scopus.extIssued 2024 -
iris.scopus.extTitle Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph -
iris.scopus.ideLinkStatusDate 2024/06/18 15:14:24 *
iris.scopus.ideLinkStatusMillisecond 1718716464916 *
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost publisher *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.1007/s10758-023-09682-6 *
iris.unpaywall.hosttype publisher *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://doi.org/10.1007/s10758-023-09682-6 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 05/05/2026 04:49:51 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1777949391163 -
iris.unpaywall.oastatus hybrid *
iris.unpaywall.pdfurl https://link.springer.com/content/pdf/10.1007/s10758-023-09682-6.pdf *
isi.authority.ancejournal TECHNOLOGY, KNOWLEDGE AND LEARNING###2211-1662 *
isi.authority.sdg Goal 4: Quality education###12084 *
isi.category HA *
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation University of Genoa -
isi.contributor.affiliation University of Genoa -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Chiara -
isi.contributor.name Ilaria -
isi.contributor.name Frosina -
isi.contributor.researcherId KVX-9760-2024 -
isi.contributor.researcherId GCR-9224-2022 -
isi.contributor.researcherId CZR-6648-2022 -
isi.contributor.subaffiliation Inst Computat Linguist A Zampolli -
isi.contributor.subaffiliation Dept Informat Bioengn Robot & Syst Engn -
isi.contributor.subaffiliation Elearning & Knowledge Management Lab -
isi.contributor.surname Alzetta -
isi.contributor.surname Torre -
isi.contributor.surname Koceva -
isi.date.issued 2024 *
isi.description.abstracteng Extracting and formally representing the knowledge embedded in textbooks, such as the concepts explained and the relations between them, can support the provision of advanced knowledge-based services for learning environments and digital libraries. In this paper, we consider a specific type of relation in textbooks referred to as prerequisite relations (PR). PRs represent precedence relations between concepts aimed to provide the reader with the knowledge needed to understand a further concept(s). Their annotation in educational texts produces datasets that can be represented as a graph of concepts connected by PRs. However, building good-quality and reliable datasets of PRs from a textbook is still an open issue, not just for automated annotation methods but even for manual annotation. In turn, the lack of good-quality datasets and well-defined criteria to identify PRs affect the development and validation of automated methods for prerequisite identification. As a contribution to this issue, in this paper, we propose PREAP, a protocol for the annotation of prerequisite relations in textbooks aimed at obtaining reliable annotated data that can be shared, compared, and reused in the research community. PREAP defines a novel textbook-driven annotation method aimed to capture the structure of prerequisites underlying the text. The protocol has been evaluated against baseline methods for manual and automatic annotation. The findings show that PREAP enables the creation of prerequisite knowledge graphs that have higher inter-annotator agreement, accuracy, and alignment with text than the baseline methods. This suggests that the protocol is able to accurately capture the PRs expressed in the text. Furthermore, the findings show that the time required to complete the annotation using PREAP are significantly shorter than with the other manual baseline methods. The paper includes also guidelines for using PREAP in three annotation scenarios, experimentally tested. We also provide example datasets and a user interface that we developed to support prerequisite annotation. *
isi.description.allpeopleoriginal Alzetta, C; Torre, I; Koceva, F; *
isi.document.sourcetype WOS.ESCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.1007/s10758-023-09682-6 *
isi.identifier.eissn 2211-1670 *
isi.identifier.isi WOS:001067462800001 *
isi.journal.journaltitle TECHNOLOGY KNOWLEDGE AND LEARNING *
isi.journal.journaltitleabbrev TECHNOL KNOWL LEARN *
isi.language.original English *
isi.publisher.place VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS *
isi.relation.firstpage 197 *
isi.relation.issue 1 *
isi.relation.lastpage 228 *
isi.relation.volume 29 *
isi.title Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph *
scopus.authority.ancejournal TECHNOLOGY, KNOWLEDGE AND LEARNING###2211-1662 *
scopus.category 2601 *
scopus.category 3304 *
scopus.category 1709 *
scopus.category 1706 *
scopus.contributor.affiliation CNR-ILC -
scopus.contributor.affiliation University of Genoa -
scopus.contributor.affiliation University of Genoa -
scopus.contributor.afid 60021199 -
scopus.contributor.afid 60121711 -
scopus.contributor.afid 60025153 -
scopus.contributor.auid 57192938832 -
scopus.contributor.auid 57220754587 -
scopus.contributor.auid 56406555300 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 121833164 -
scopus.contributor.dptid -
scopus.contributor.dptid 109230150 -
scopus.contributor.name Chiara -
scopus.contributor.name Ilaria -
scopus.contributor.name Frosina -
scopus.contributor.subaffiliation Institute of Computational Linguistics “A. Zampolli”;ItaliaNLP Lab; -
scopus.contributor.subaffiliation Department of Informatics;Bioengineering;Robotics and Systems Engineering; -
scopus.contributor.subaffiliation E-Learning and Knowledge Management Lab;DIBRIS; -
scopus.contributor.surname Alzetta -
scopus.contributor.surname Torre -
scopus.contributor.surname Koceva -
scopus.date.issued 2024 *
scopus.description.abstracteng Extracting and formally representing the knowledge embedded in textbooks, such as the concepts explained and the relations between them, can support the provision of advanced knowledge-based services for learning environments and digital libraries. In this paper, we consider a specific type of relation in textbooks referred to as prerequisite relations (PR). PRs represent precedence relations between concepts aimed to provide the reader with the knowledge needed to understand a further concept(s). Their annotation in educational texts produces datasets that can be represented as a graph of concepts connected by PRs. However, building good-quality and reliable datasets of PRs from a textbook is still an open issue, not just for automated annotation methods but even for manual annotation. In turn, the lack of good-quality datasets and well-defined criteria to identify PRs affect the development and validation of automated methods for prerequisite identification. As a contribution to this issue, in this paper, we propose PREAP, a protocol for the annotation of prerequisite relations in textbooks aimed at obtaining reliable annotated data that can be shared, compared, and reused in the research community. PREAP defines a novel textbook-driven annotation method aimed to capture the structure of prerequisites underlying the text. The protocol has been evaluated against baseline methods for manual and automatic annotation. The findings show that PREAP enables the creation of prerequisite knowledge graphs that have higher inter-annotator agreement, accuracy, and alignment with text than the baseline methods. This suggests that the protocol is able to accurately capture the PRs expressed in the text. Furthermore, the findings show that the time required to complete the annotation using PREAP are significantly shorter than with the other manual baseline methods. The paper includes also guidelines for using PREAP in three annotation scenarios, experimentally tested. We also provide example datasets and a user interface that we developed to support prerequisite annotation. *
scopus.description.allpeopleoriginal Alzetta C.; Torre I.; Koceva F. *
scopus.differences scopus.relation.lastpage *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.relation.firstpage *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.relation.issue *
scopus.differences scopus.date.issued *
scopus.differences scopus.relation.volume *
scopus.document.type ar *
scopus.document.types ar *
scopus.funding.funders 100007921 - University of Pittsburgh; 501100004702 - Università degli Studi di Genova; *
scopus.identifier.doi 10.1007/s10758-023-09682-6 *
scopus.identifier.eissn 2211-1670 *
scopus.identifier.pui 2025510516 *
scopus.identifier.scopus 2-s2.0-85171573789 *
scopus.journal.sourceid 19700200832 *
scopus.language.iso eng *
scopus.publisher.name Springer Science and Business Media B.V. *
scopus.relation.firstpage 197 *
scopus.relation.issue 1 *
scopus.relation.lastpage 228 *
scopus.relation.volume 29 *
scopus.subject.keywords Annotation protocol; Educational textbooks; Knowledge engineering; Text annotation; *
scopus.title Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph *
scopus.titleeng Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
File Dimensione Formato  
prod_492339-doc_205433.pdf

accesso aperto

Descrizione: Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 1.39 MB
Formato Adobe PDF
1.39 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/450158
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 4
social impact