Italian in the Trenches: Linguistic annotation and analysis of texts of the great war

Dell'Orletta F; Venturi G; Montemagni S
2018

Abstract

The paper illustrates the design and development of a textual corpus representative of the historical variants of Italian during the Great War, enriched with linguistic (lemmatization and POS tagging) and meta-linguistic annotation. After manual revision of the linguistic annotation, the corpus was used to specialize existing NLP tools for processing historical texts, with promising results.
DC Field Value Language
dc.authority.anceserie CEUR WORKSHOP PROCEEDINGS -
dc.authority.anceserie CEUR Workshop Proceedings -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people De Felice I it
dc.authority.people Dell'Orletta F it
dc.authority.people Venturi G it
dc.authority.people Lenci A it
dc.authority.people Montemagni S it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 06:03:21 -
dc.date.available 2024/02/21 06:03:21 -
dc.date.issued 2018 -
dc.description.abstracteng The paper illustrates the design and development of a textual corpus representative of the historical variants of Italian during the Great War, which was enriched with linguistic (lemmatization and pos-tagging) and meta-linguistic annotation. The corpus, after a manual revision of the linguistic annotation, was used for specializing existing NLP tools to process historical texts with promising results. -
dc.description.affiliations University of Pisa, CoLing Lab., , Italy; Istituto di Linguistica Computazionale A. Zampolli, ItaliaNLP Lab., , Italy -
dc.description.allpeople De Felice I.; Dell'Orletta F.; Venturi G.; Lenci A.; Montemagni S. -
dc.description.allpeopleoriginal De Felice I.; Dell'Orletta F.; Venturi G.; Lenci A.; Montemagni S. -
dc.description.fulltext none en
dc.description.numberofauthors 3 -
dc.identifier.scopus 2-s2.0-85057734451 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/403578 -
dc.identifier.url http://www.scopus.com/record/display.url?eid=2-s2.0-85057734451&origin=inward -
dc.language.iso eng -
dc.relation.conferencedate 10-12/12/2018 -
dc.relation.conferencename 5th Italian Conference on Computational Linguistics (CLiC-it) -
dc.relation.conferenceplace Pisa -
dc.relation.firstpage 1 -
dc.relation.lastpage 5 -
dc.relation.numberofpages 5 -
dc.relation.volume 2253 -
dc.subject.keywords Natural Language Processing -
dc.subject.keywords Automatic Linguistic Annotation -
dc.subject.singlekeyword Natural Language Processing *
dc.subject.singlekeyword Automatic Linguistic Annotation *
dc.title Italian in the Trenches: Linguistic annotation and analysis of texts of the great war en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Yes, but type not specified -
dc.ugov.descaux1 423872 -
iris.orcid.lastModifiedDate 2024/03/22 09:32:46 *
iris.orcid.lastModifiedMillisecond 1711096366037 *
iris.scopus.extIssued 2018 -
iris.scopus.extTitle Italian in the Trenches: Linguistic annotation and analysis of texts of the great war -
iris.sitodocente.maxattempts 1 -
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation CoLing Lab. -
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.affiliation CoLing Lab. -
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57198816847 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 27568199800 -
scopus.contributor.auid 8286541500 -
scopus.contributor.auid 15056781100 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 117970377 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 117970377 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Irene -
scopus.contributor.name Felice -
scopus.contributor.name Giulia -
scopus.contributor.name Alessandro -
scopus.contributor.name Simonetta -
scopus.contributor.subaffiliation University of Pisa; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale A. Zampolli; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale A. Zampolli; -
scopus.contributor.subaffiliation University of Pisa; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale A. Zampolli; -
scopus.contributor.surname De Felice -
scopus.contributor.surname Dell'Orletta -
scopus.contributor.surname Venturi -
scopus.contributor.surname Lenci -
scopus.contributor.surname Montemagni -
scopus.date.issued 2018 *
scopus.description.abstracteng The paper illustrates the design and development of a textual corpus representative of the historical variants of Italian during the Great War, which was enriched with linguistic (lemmatization and pos-tagging) and meta-linguistic annotation. The corpus, after a manual revision of the linguistic annotation, was used for specializing existing NLP tools to process historical texts with promising results. *
scopus.description.allpeopleoriginal De Felice I.; Dell'Orletta F.; Venturi G.; Lenci A.; Montemagni S. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.identifier.doi *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.doi 10.4000/books.aaccademia.3273 *
scopus.identifier.pui 625356860 *
scopus.identifier.scopus 2-s2.0-85057734451 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2018 *
scopus.relation.conferencename 5th Italian Conference on Computational Linguistics, CLiC-it 2018 *
scopus.relation.conferenceplace ita *
scopus.relation.volume 2253 *
scopus.title Italian in the Trenches: Linguistic annotation and analysis of texts of the great war *
scopus.titleeng Italian in the Trenches: Linguistic annotation and analysis of texts of the great war *
Appears in types: 04.01 Contributo in Atti di convegno
Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/403578