This paper proposes a graph-based Named Entity Linking (NEL) algorithm named REDEN for the disambiguation of authors' names in French literary criticism texts and scientific essays from the 19th and early 20th centuries. The algorithm is described and evaluated according to the two phases of NEL as reported in current state of the art, namely, candidate retrieval and candidate selection. REDEN leverages knowledge from different Linked Data sources in order to select candidates for each author mention, subsequently crawls data from other Linked Data sets using equivalence links (e.g., owl:sameAs), and, finally, fuses graphs of homologous individuals into a non-redundant graph well-suited for graph centrality calculation; the resulting graph is used for choosing the best referent. The REDEN algorithm is distributed in open-source and follows current standards in digital editions (TEI) and semantic Web (RDF). Its integration into an editorial workflow of digital editions in Digital humanities and cultural heritage projects is entirely plausible. Experiments are conducted along with the corresponding error analysis in order to test our approach and to help us to study the weaknesses and strengths of our algorithm, thereby to further improvements of REDEN.

REDEN: Named Entity Linking in Digital Literary Editions Using Linked Data Sets

2016

Abstract

This paper proposes a graph-based Named Entity Linking (NEL) algorithm named REDEN for the disambiguation of authors' names in French literary criticism texts and scientific essays from the 19th and early 20th centuries. The algorithm is described and evaluated according to the two phases of NEL as reported in current state of the art, namely, candidate retrieval and candidate selection. REDEN leverages knowledge from different Linked Data sources in order to select candidates for each author mention, subsequently crawls data from other Linked Data sets using equivalence links (e.g., owl:sameAs), and, finally, fuses graphs of homologous individuals into a non-redundant graph well-suited for graph centrality calculation; the resulting graph is used for choosing the best referent. The REDEN algorithm is distributed in open-source and follows current standards in digital editions (TEI) and semantic Web (RDF). Its integration into an editorial workflow of digital editions in Digital humanities and cultural heritage projects is entirely plausible. Experiments are conducted along with the corresponding error analysis in order to test our approach and to help us to study the weaknesses and strengths of our algorithm, thereby to further improvements of REDEN.
Campo DC Valore Lingua
dc.authority.ancejournal COMPLEX SYSTEMS INFORMATICS AND MODELING QUARTERLY -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Carmen BrandoFrancesca FrontiniJeanGabriel Ganascia it
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 21:25:17 -
dc.date.available 2024/02/19 21:25:17 -
dc.date.issued 2016 -
dc.description.abstracteng This paper proposes a graph-based Named Entity Linking (NEL) algorithm named REDEN for the disambiguation of authors' names in French literary criticism texts and scientific essays from the 19th and early 20th centuries. The algorithm is described and evaluated according to the two phases of NEL as reported in current state of the art, namely, candidate retrieval and candidate selection. REDEN leverages knowledge from different Linked Data sources in order to select candidates for each author mention, subsequently crawls data from other Linked Data sets using equivalence links (e.g., owl:sameAs), and, finally, fuses graphs of homologous individuals into a non-redundant graph well-suited for graph centrality calculation; the resulting graph is used for choosing the best referent. The REDEN algorithm is distributed in open-source and follows current standards in digital editions (TEI) and semantic Web (RDF). Its integration into an editorial workflow of digital editions in Digital humanities and cultural heritage projects is entirely plausible. Experiments are conducted along with the corresponding error analysis in order to test our approach and to help us to study the weaknesses and strengths of our algorithm, thereby to further improvements of REDEN. -
dc.description.affiliations Centre de Recherches Historiques, School for Advanced Studies in the Social Sciences (EHESS), UMR 8558, 190-198 avenue de France, 75013 Paris, France Istituto di Linguistica Computazionale "Antonio Zampolli", Consiglio Nazionale delle Ricerche, Area della Ricerca di Pisa, Via Giuseppe Moruzzi No 1, 56124 Pisa, Italy Labex Observatoire de la vie littéraire (OBVIL). Laboratoire d'Informatique de Paris 6 (LIP6), Pierre and Marie Curie University, UMR 7606, 4 place Jussieu, 75005, Paris, France -
dc.description.allpeople Carmen BrandoFrancesca FrontiniJeanGabriel Ganascia -
dc.description.allpeopleoriginal Carmen Brando Francesca Frontini Jean-Gabriel Ganascia -
dc.description.fulltext none en
dc.description.numberofauthors 1 -
dc.identifier.doi 10.7250/csimq.2016-7.04 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/320494 -
dc.identifier.url https://csimq-journals.rtu.lv/article/view/csimq.2016-7.04 -
dc.language.iso eng -
dc.relation.firstpage 60 -
dc.relation.lastpage 80 -
dc.relation.volume 7 -
dc.subject.keywords Named Entity Linking -
dc.subject.keywords graph centrality -
dc.subject.keywords linked data -
dc.subject.keywords data fusion -
dc.subject.keywords digital humanities -
dc.subject.singlekeyword Named Entity Linking *
dc.subject.singlekeyword graph centrality *
dc.subject.singlekeyword linked data *
dc.subject.singlekeyword data fusion *
dc.subject.singlekeyword digital humanities *
dc.title REDEN: Named Entity Linking in Digital Literary Editions Using Linked Data Sets en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 357602 -
iris.orcid.lastModifiedDate 2024/03/01 11:59:49 *
iris.orcid.lastModifiedMillisecond 1709290789384 *
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost publisher *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.7250/csimq.2016-7.04 *
iris.unpaywall.hosttype publisher *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj true *
iris.unpaywall.landingpage https://doi.org/10.7250/csimq.2016-7.04 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 27/12/2025 04:16:48 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1766805408715 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://csimq-journals.rtu.lv/article/download/csimq.2016-7.04/pdf_6 *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/320494
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact