This paper presents the retrodigitization project of the Grande Dizionario della Lingua Italiana (GDLI), the largest historical dictionary of the Italian language. The GDLI’s 23,000 pages - originally designed for human consultation - constitute an exceptional repository of linguistic and cultural-historical information, while posing significant challenges to large-scale digitization and data structuring. The project, still ongoing, will result in the development of a set of interoperable and interlinked resources: (i) a TEI-XML edition of the dictionary text, encoding its complex lexicographic structure; (ii) an annotated corpus of the quoted examples, enabling linguistic and historical research across centuries; and (iii) a database of quoted authors and works. Together, these components form a hybrid lexical resource that establishes the foundations for innovative and advanced modes of accessing and exploring the rich and multifaceted content of this historical dictionary.
From Print to Digital and Beyond: The Retrodigitization of a Historical Dictionary of Italian as a Hybrid Lexical Resource
Sebastiana Cucurullo;Manuel Favaro;Elisa Guadagnini;Simonetta Montemagni;Eva Sassolini
2026
Abstract
This paper presents the retrodigitization project of the Grande Dizionario della Lingua Italiana (GDLI), the largest historical dictionary of the Italian language. The GDLI’s 23,000 pages - originally designed for human consultation - constitute an exceptional repository of linguistic and cultural-historical information, while posing significant challenges to large-scale digitization and data structuring. The project, still ongoing, will result in the development of a set of interoperable and interlinked resources: (i) a TEI-XML edition of the dictionary text, encoding its complex lexicographic structure; (ii) an annotated corpus of the quoted examples, enabling linguistic and historical research across centuries; and (iii) a database of quoted authors and works. Together, these components form a hybrid lexical resource that establishes the foundations for innovative and advanced modes of accessing and exploring the rich and multifaceted content of this historical dictionary.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Marco Biffi | en |
| dc.authority.people | Sebastiana Cucurullo | en |
| dc.authority.people | Manuel Favaro | en |
| dc.authority.people | Elisa Guadagnini | en |
| dc.authority.people | Simonetta Montemagni | en |
| dc.authority.people | Eva Sassolini | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.firstsubmission | 2026/05/11 14:54:15 | * |
| dc.date.issued | 2026 | - |
| dc.date.submission | 2026/05/11 14:54:15 | * |
| dc.description.abstracteng | This paper presents the retrodigitization project of the Grande Dizionario della Lingua Italiana (GDLI), the largest historical dictionary of the Italian language. The GDLI’s 23,000 pages - originally designed for human consultation - constitute an exceptional repository of linguistic and cultural-historical information, while posing significant challenges to large-scale digitization and data structuring. The project, still ongoing, will result in the development of a set of interoperable and interlinked resources: (i) a TEI-XML edition of the dictionary text, encoding its complex lexicographic structure; (ii) an annotated corpus of the quoted examples, enabling linguistic and historical research across centuries; and (iii) a database of quoted authors and works. Together, these components form a hybrid lexical resource that establishes the foundations for innovative and advanced modes of accessing and exploring the rich and multifaceted content of this historical dictionary. | - |
| dc.description.allpeople | Biffi, Marco; Cucurullo, Sebastiana; Favaro, Manuel; Guadagnini, Elisa; Montemagni, Simonetta; Sassolini, Eva | - |
| dc.description.allpeopleoriginal | Marco Biffi, Sebastiana Cucurullo, Manuel Favaro, Elisa Guadagnini, Simonetta Montemagni, Eva Sassolini | en |
| dc.description.fulltext | none | en |
| dc.description.international | no | en |
| dc.description.numberofauthors | 6 | - |
| dc.identifier.doi | 10.63317/338howsz93sg | en |
| dc.identifier.isbn | 9782493814494 | en |
| dc.identifier.source | manual | * |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/580341 | - |
| dc.language.iso | eng | en |
| dc.publisher.name | European Language Resources Association (ELRA) | en |
| dc.relation.firstpage | 770 | en |
| dc.relation.ispartofbook | Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026) | en |
| dc.relation.lastpage | 777 | en |
| dc.relation.numberofpages | 8 | en |
| dc.subject.keywordseng | Historical Dictionary, Retro-digitization, Knowledge Organization, e-Lexicography | - |
| dc.subject.singlekeyword | Historical Dictionary | * |
| dc.subject.singlekeyword | Retro-digitization | * |
| dc.subject.singlekeyword | Knowledge Organization | * |
| dc.subject.singlekeyword | e-Lexicography | * |
| dc.title | From Print to Digital and Beyond: The Retrodigitization of a Historical Dictionary of Italian as a Hybrid Lexical Resource | en |
| dc.type.circulation | Internazionale | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| iris.orcid.lastModifiedDate | 2026/05/11 14:54:15 | * |
| iris.orcid.lastModifiedMillisecond | 1778504055984 | * |
| iris.sitodocente.maxattempts | 4 | - |
| iris.unpaywall.doi | 10.63317/338howsz93sg | * |
| iris.unpaywall.isoa | false | * |
| iris.unpaywall.journalisindoaj | false | * |
| iris.unpaywall.metadataCallLastModified | 22/05/2026 04:47:52 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1779418072859 | - |
| iris.unpaywall.oastatus | closed | * |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


