This paper presents the retrodigitization project of the Grande Dizionario della Lingua Italiana (GDLI), the largest historical dictionary of the Italian language. The GDLI’s 23,000 pages - originally designed for human consultation - constitute an exceptional repository of linguistic and cultural-historical information, while posing significant challenges to large-scale digitization and data structuring. The project, still ongoing, will result in the development of a set of interoperable and interlinked resources: (i) a TEI-XML edition of the dictionary text, encoding its complex lexicographic structure; (ii) an annotated corpus of the quoted examples, enabling linguistic and historical research across centuries; and (iii) a database of quoted authors and works. Together, these components form a hybrid lexical resource that establishes the foundations for innovative and advanced modes of accessing and exploring the rich and multifaceted content of this historical dictionary.

From Print to Digital and Beyond: The Retrodigitization of a Historical Dictionary of Italian as a Hybrid Lexical Resource

Sebastiana Cucurullo;Manuel Favaro;Elisa Guadagnini;Simonetta Montemagni;Eva Sassolini
2026

Abstract

This paper presents the retrodigitization project of the Grande Dizionario della Lingua Italiana (GDLI), the largest historical dictionary of the Italian language. The GDLI’s 23,000 pages - originally designed for human consultation - constitute an exceptional repository of linguistic and cultural-historical information, while posing significant challenges to large-scale digitization and data structuring. The project, still ongoing, will result in the development of a set of interoperable and interlinked resources: (i) a TEI-XML edition of the dictionary text, encoding its complex lexicographic structure; (ii) an annotated corpus of the quoted examples, enabling linguistic and historical research across centuries; and (iii) a database of quoted authors and works. Together, these components form a hybrid lexical resource that establishes the foundations for innovative and advanced modes of accessing and exploring the rich and multifaceted content of this historical dictionary.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Marco Biffi en
dc.authority.people Sebastiana Cucurullo en
dc.authority.people Manuel Favaro en
dc.authority.people Elisa Guadagnini en
dc.authority.people Simonetta Montemagni en
dc.authority.people Eva Sassolini en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.firstsubmission 2026/05/11 14:54:15 *
dc.date.issued 2026 -
dc.date.submission 2026/05/11 14:54:15 *
dc.description.abstracteng This paper presents the retrodigitization project of the Grande Dizionario della Lingua Italiana (GDLI), the largest historical dictionary of the Italian language. The GDLI’s 23,000 pages - originally designed for human consultation - constitute an exceptional repository of linguistic and cultural-historical information, while posing significant challenges to large-scale digitization and data structuring. The project, still ongoing, will result in the development of a set of interoperable and interlinked resources: (i) a TEI-XML edition of the dictionary text, encoding its complex lexicographic structure; (ii) an annotated corpus of the quoted examples, enabling linguistic and historical research across centuries; and (iii) a database of quoted authors and works. Together, these components form a hybrid lexical resource that establishes the foundations for innovative and advanced modes of accessing and exploring the rich and multifaceted content of this historical dictionary. -
dc.description.allpeople Biffi, Marco; Cucurullo, Sebastiana; Favaro, Manuel; Guadagnini, Elisa; Montemagni, Simonetta; Sassolini, Eva -
dc.description.allpeopleoriginal Marco Biffi, Sebastiana Cucurullo, Manuel Favaro, Elisa Guadagnini, Simonetta Montemagni, Eva Sassolini en
dc.description.fulltext none en
dc.description.international no en
dc.description.numberofauthors 6 -
dc.identifier.doi 10.63317/338howsz93sg en
dc.identifier.isbn 9782493814494 en
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/580341 -
dc.language.iso eng en
dc.publisher.name European Language Resources Association (ELRA) en
dc.relation.firstpage 770 en
dc.relation.ispartofbook Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026) en
dc.relation.lastpage 777 en
dc.relation.numberofpages 8 en
dc.subject.keywordseng Historical Dictionary, Retro-digitization, Knowledge Organization, e-Lexicography -
dc.subject.singlekeyword Historical Dictionary *
dc.subject.singlekeyword Retro-digitization *
dc.subject.singlekeyword Knowledge Organization *
dc.subject.singlekeyword e-Lexicography *
dc.title From Print to Digital and Beyond: The Retrodigitization of a Historical Dictionary of Italian as a Hybrid Lexical Resource en
dc.type.circulation Internazionale en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.orcid.lastModifiedDate 2026/05/11 14:54:15 *
iris.orcid.lastModifiedMillisecond 1778504055984 *
iris.sitodocente.maxattempts 4 -
iris.unpaywall.doi 10.63317/338howsz93sg *
iris.unpaywall.isoa false *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.metadataCallLastModified 22/05/2026 04:47:52 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1779418072859 -
iris.unpaywall.oastatus closed *
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/580341
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ente

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact