CNR Institutional Research Information System

LibMovIt: text corpus of travel literature is a textual resource created within the LibMovIt project (Progetto finanziato dall’Unione Europea – NextGenerationEU a valere sul Piano Nazionale di Ripresa e Resilienza (PNRR) – Missione 4 Istruzione e ricerca – Componente 2 Dalla ricerca all’impresa – Investimento 1.1, Avviso Prin 2022 indetto con DD N. 104 del 2/2/2022, Progetto dal titolo LIBMOVIT – Libraries on the move: scholars, books, ideas trave-ling in Italy in the 18th century, codice proposta 2022CP88KY). Version 1.0 of the corpus contains 52 works for 7,9 milion words: 27 in English (3,590,000 words), 12 in French (2,065,000), 8 in German (1,550,000), 4 in Italian (450,000) and 1 in Spanish (255,000). A detailed description of the corpus and the status of each text (tags: "Revision completed" or "Revision to be completed") is available in a Zotero library at the following link: https://www.zotero.org/groups/5540957/libmovit/library Texts are published in .txt format. They are the result of both automatic text recognition and acquisition from other projects (indicated as tags in the corpus description). All the newly recognised texts have been reviewed with scripts to clean up the most common errors and delete paratextual elements (page numbers, catchwords, signature marks etc.). The editors of the corpus made manual corrections in all the texts, however, due to their lenghth, some of them still need further revision. For this reason, minor updates of the corpus (1.1, 1.2, 1.3 etc.) will be released regularly to improve the texts with further corrections, text mark-up and conversion to other formats; a major update of the corpus will be released once a year and will also include new texts. Additional information about the corpus development are described in the papers listed in the references section.

LibMovIt: text corpus of travel literature

Lorenzo Mancini;Sara Congregati

2025

Abstract

LibMovIt: text corpus of travel literature is a textual resource created within the LibMovIt project (Progetto finanziato dall’Unione Europea – NextGenerationEU a valere sul Piano Nazionale di Ripresa e Resilienza (PNRR) – Missione 4 Istruzione e ricerca – Componente 2 Dalla ricerca all’impresa – Investimento 1.1, Avviso Prin 2022 indetto con DD N. 104 del 2/2/2022, Progetto dal titolo LIBMOVIT – Libraries on the move: scholars, books, ideas trave-ling in Italy in the 18th century, codice proposta 2022CP88KY). Version 1.0 of the corpus contains 52 works for 7,9 milion words: 27 in English (3,590,000 words), 12 in French (2,065,000), 8 in German (1,550,000), 4 in Italian (450,000) and 1 in Spanish (255,000). A detailed description of the corpus and the status of each text (tags: "Revision completed" or "Revision to be completed") is available in a Zotero library at the following link: https://www.zotero.org/groups/5540957/libmovit/library Texts are published in .txt format. They are the result of both automatic text recognition and acquisition from other projects (indicated as tags in the corpus description). All the newly recognised texts have been reviewed with scripts to clean up the most common errors and delete paratextual elements (page numbers, catchwords, signature marks etc.). The editors of the corpus made manual corrections in all the texts, however, due to their lenghth, some of them still need further revision. For this reason, minor updates of the corpus (1.1, 1.2, 1.3 etc.) will be released regularly to improve the texts with further corrections, text mark-up and conversion to other formats; a major update of the corpus will be released once a year and will also include new texts. Additional information about the corpus development are described in the papers listed in the references section.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Strutture organizzative
	
				Istituto per il Lessico Intellettuale Europeo e Storia delle Idee - ILIESI
			
	Parole chiave
	
				Grand Tour, Travel Literature
			
	Appare nelle tipologie:
	
				05.10 Dataset

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/567943

Citazioni

ND

ND

ND

social impact