CNR Institutional Research Information System

This paper proposes a graph-based Named Entity Linking (NEL) algorithm named REDEN for the disambiguation of authors' names in French literary criticism texts and scientific essays from the 19th and early 20th centuries. The algorithm is described and evaluated according to the two phases of NEL as reported in current state of the art, namely, candidate retrieval and candidate selection. REDEN leverages knowledge from different Linked Data sources in order to select candidates for each author mention, subsequently crawls data from other Linked Data sets using equivalence links (e.g., owl:sameAs), and, finally, fuses graphs of homologous individuals into a non-redundant graph well-suited for graph centrality calculation; the resulting graph is used for choosing the best referent. The REDEN algorithm is distributed in open-source and follows current standards in digital editions (TEI) and semantic Web (RDF). Its integration into an editorial workflow of digital editions in Digital humanities and cultural heritage projects is entirely plausible. Experiments are conducted along with the corresponding error analysis in order to test our approach and to help us to study the weaknesses and strengths of our algorithm, thereby to further improvements of REDEN.

REDEN: Named Entity Linking in Digital Literary Editions Using Linked Data Sets

2016

Abstract

This paper proposes a graph-based Named Entity Linking (NEL) algorithm named REDEN for the disambiguation of authors' names in French literary criticism texts and scientific essays from the 19th and early 20th centuries. The algorithm is described and evaluated according to the two phases of NEL as reported in current state of the art, namely, candidate retrieval and candidate selection. REDEN leverages knowledge from different Linked Data sources in order to select candidates for each author mention, subsequently crawls data from other Linked Data sets using equivalence links (e.g., owl:sameAs), and, finally, fuses graphs of homologous individuals into a non-redundant graph well-suited for graph centrality calculation; the resulting graph is used for choosing the best referent. The REDEN algorithm is distributed in open-source and follows current standards in digital editions (TEI) and semantic Web (RDF). Its integration into an editorial workflow of digital editions in Digital humanities and cultural heritage projects is entirely plausible. Experiments are conducted along with the corresponding error analysis in order to test our approach and to help us to study the weaknesses and strengths of our algorithm, thereby to further improvements of REDEN.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.ancejournal	COMPLEX SYSTEMS INFORMATICS AND MODELING QUARTERLY	-
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	-
dc.authority.people	Carmen BrandoFrancesca FrontiniJeanGabriel Ganascia	it
dc.collection.id.s	b3f88f24-048a-4e43-8ab1-6697b90e068e	*
dc.collection.name	01.01 Articolo in rivista	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.date.accessioned	2024/02/19 21:25:17	-
dc.date.available	2024/02/19 21:25:17	-
dc.date.issued	2016	-
dc.description.abstracteng	This paper proposes a graph-based Named Entity Linking (NEL) algorithm named REDEN for the disambiguation of authors' names in French literary criticism texts and scientific essays from the 19th and early 20th centuries. The algorithm is described and evaluated according to the two phases of NEL as reported in current state of the art, namely, candidate retrieval and candidate selection. REDEN leverages knowledge from different Linked Data sources in order to select candidates for each author mention, subsequently crawls data from other Linked Data sets using equivalence links (e.g., owl:sameAs), and, finally, fuses graphs of homologous individuals into a non-redundant graph well-suited for graph centrality calculation; the resulting graph is used for choosing the best referent. The REDEN algorithm is distributed in open-source and follows current standards in digital editions (TEI) and semantic Web (RDF). Its integration into an editorial workflow of digital editions in Digital humanities and cultural heritage projects is entirely plausible. Experiments are conducted along with the corresponding error analysis in order to test our approach and to help us to study the weaknesses and strengths of our algorithm, thereby to further improvements of REDEN.	-
dc.description.affiliations	Centre de Recherches Historiques, School for Advanced Studies in the Social Sciences (EHESS), UMR 8558, 190-198 avenue de France, 75013 Paris, France Istituto di Linguistica Computazionale "Antonio Zampolli", Consiglio Nazionale delle Ricerche, Area della Ricerca di Pisa, Via Giuseppe Moruzzi No 1, 56124 Pisa, Italy Labex Observatoire de la vie littéraire (OBVIL). Laboratoire d'Informatique de Paris 6 (LIP6), Pierre and Marie Curie University, UMR 7606, 4 place Jussieu, 75005, Paris, France	-
dc.description.allpeople	Carmen BrandoFrancesca FrontiniJeanGabriel Ganascia	-
dc.description.allpeopleoriginal	Carmen Brando Francesca Frontini Jean-Gabriel Ganascia	-
dc.description.fulltext	none	en
dc.description.numberofauthors	1	-
dc.identifier.doi	10.7250/csimq.2016-7.04	-
dc.identifier.uri	https://hdl.handle.net/20.500.14243/320494	-
dc.identifier.url	https://csimq-journals.rtu.lv/article/view/csimq.2016-7.04	-
dc.language.iso	eng	-
dc.relation.firstpage	60	-
dc.relation.lastpage	80	-
dc.relation.volume	7	-
dc.subject.keywords	Named Entity Linking	-
dc.subject.keywords	graph centrality	-
dc.subject.keywords	linked data	-
dc.subject.keywords	data fusion	-
dc.subject.keywords	digital humanities	-
dc.subject.singlekeyword	Named Entity Linking	*
dc.subject.singlekeyword	graph centrality	*
dc.subject.singlekeyword	linked data	*
dc.subject.singlekeyword	data fusion	*
dc.subject.singlekeyword	digital humanities	*
dc.title	REDEN: Named Entity Linking in Digital Literary Editions Using Linked Data Sets	en
dc.type.driver	info:eu-repo/semantics/article	-
dc.type.full	01 Contributo su Rivista::01.01 Articolo in rivista	it
dc.type.miur	262	-
dc.type.referee	Sì, ma tipo non specificato	-
dc.ugov.descaux1	357602	-
iris.orcid.lastModifiedDate	2024/03/01 11:59:49	*
iris.orcid.lastModifiedMillisecond	1709290789384	*
iris.sitodocente.maxattempts	1	-
iris.unpaywall.bestoahost	publisher	*
iris.unpaywall.bestoaversion	publishedVersion	*
iris.unpaywall.doi	10.7250/csimq.2016-7.04	*
iris.unpaywall.hosttype	publisher	*
iris.unpaywall.isoa	true	*
iris.unpaywall.journalisindoaj	true	*
iris.unpaywall.landingpage	https://doi.org/10.7250/csimq.2016-7.04	*
iris.unpaywall.license	cc-by	*
iris.unpaywall.metadataCallLastModified	27/12/2025 04:16:48	-
iris.unpaywall.metadataCallLastModifiedMillisecond	1766805408715	-
iris.unpaywall.oastatus	gold	*
iris.unpaywall.pdfurl	https://csimq-journals.rtu.lv/article/download/csimq.2016-7.04/pdf_6	*
Appare nelle tipologie:	01.01 Articolo in rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/320494

Citazioni

ND

ND

ND

social impact