LibMovIt: text corpus of travel literature is a textual resource created within the LibMovIt project (Progetto finanziato dall’Unione Europea – NextGenerationEU a valere sul Piano Nazionale di Ripresa e Resilienza (PNRR) – Missione 4 Istruzione e ricerca – Componente 2 Dalla ricerca all’impresa – Investimento 1.1, Avviso Prin 2022 indetto con DD N. 104 del 2/2/2022, Progetto dal titolo LIBMOVIT – Libraries on the move: scholars, books, ideas trave-ling in Italy in the 18th century, codice proposta 2022CP88KY). Version 1.0 of the corpus contains 52 works for 7,9 milion words: 27 in English (3,590,000 words), 12 in French (2,065,000), 8 in German (1,550,000), 4 in Italian (450,000) and 1 in Spanish (255,000). A detailed description of the corpus and the status of each text (tags: "Revision completed" or "Revision to be completed") is available in a Zotero library at the following link: https://www.zotero.org/groups/5540957/libmovit/library Texts are published in .txt format. They are the result of both automatic text recognition and acquisition from other projects (indicated as tags in the corpus description). All the newly recognised texts have been reviewed with scripts to clean up the most common errors and delete paratextual elements (page numbers, catchwords, signature marks etc.). The editors of the corpus made manual corrections in all the texts, however, due to their lenghth, some of them still need further revision. For this reason, minor updates of the corpus (1.1, 1.2, 1.3 etc.) will be released regularly to improve the texts with further corrections, text mark-up and conversion to other formats; a major update of the corpus will be released once a year and will also include new texts. Additional information about the corpus development are described in the papers listed in the references section.
LibMovIt: text corpus of travel literature
Lorenzo Mancini;Sara Congregati
2025
Abstract
LibMovIt: text corpus of travel literature is a textual resource created within the LibMovIt project (Progetto finanziato dall’Unione Europea – NextGenerationEU a valere sul Piano Nazionale di Ripresa e Resilienza (PNRR) – Missione 4 Istruzione e ricerca – Componente 2 Dalla ricerca all’impresa – Investimento 1.1, Avviso Prin 2022 indetto con DD N. 104 del 2/2/2022, Progetto dal titolo LIBMOVIT – Libraries on the move: scholars, books, ideas trave-ling in Italy in the 18th century, codice proposta 2022CP88KY). Version 1.0 of the corpus contains 52 works for 7,9 milion words: 27 in English (3,590,000 words), 12 in French (2,065,000), 8 in German (1,550,000), 4 in Italian (450,000) and 1 in Spanish (255,000). A detailed description of the corpus and the status of each text (tags: "Revision completed" or "Revision to be completed") is available in a Zotero library at the following link: https://www.zotero.org/groups/5540957/libmovit/library Texts are published in .txt format. They are the result of both automatic text recognition and acquisition from other projects (indicated as tags in the corpus description). All the newly recognised texts have been reviewed with scripts to clean up the most common errors and delete paratextual elements (page numbers, catchwords, signature marks etc.). The editors of the corpus made manual corrections in all the texts, however, due to their lenghth, some of them still need further revision. For this reason, minor updates of the corpus (1.1, 1.2, 1.3 etc.) will be released regularly to improve the texts with further corrections, text mark-up and conversion to other formats; a major update of the corpus will be released once a year and will also include new texts. Additional information about the corpus development are described in the papers listed in the references section.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


