This paper presents a pilot study towards the creation of a monolingual written-spoken parallel corpus in Italian, featuring two main novelties in the general landscape of spoken corpora: the alignment with the written counterpart of the same content and the spoken variety dealt with, represented by transcriptions of radio news broadcasting.

Building an Italian written-spoken parallel corpus: A pilot study

Dell'Orletta F;Montemagni S;Quochi V
2019

Abstract

This paper presents a pilot study towards the creation of a monolingual written-spoken parallel corpus in Italian, featuring two main novelties in the general landscape of spoken corpora: the alignment with the written counterpart of the same content and the spoken variety dealt with, represented by transcriptions of radio news broadcasting.
Campo DC Valore Lingua
dc.authority.anceserie CEUR WORKSHOP PROCEEDINGS -
dc.authority.anceserie CEUR Workshop Proceedings -
dc.authority.people Dominutti E it
dc.authority.people Pifferi L it
dc.authority.people Dell'Orletta F it
dc.authority.people Montemagni S it
dc.authority.people Quochi V it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 03:02:17 -
dc.date.available 2024/02/20 03:02:17 -
dc.date.issued 2019 -
dc.description.abstracteng This paper presents a pilot study towards the creation of a monolingual written-spoken parallel corpus in Italian, featuring two main novelties in the general landscape of spoken corpora: the alignment with the written counterpart of the same content and the spoken variety dealt with, represented by transcriptions of radio news broadcasting. -
dc.description.affiliations Università di Pisa, Università di Pisa, Italy; Istituto di Linguistica Computazionale (ILC), CNR, Pisa, Italy -
dc.description.allpeople Dominutti, E; Pifferi, L; Dell'Orletta, F; Montemagni, S; Quochi, V -
dc.description.allpeopleoriginal Dominutti E.; Pifferi L.; Dell'Orletta F.; Montemagni S.; Quochi V. -
dc.description.fulltext none en
dc.description.numberofauthors 5 -
dc.identifier.isbn 9791280136008 -
dc.identifier.scopus 2-s2.0-85074817518 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/390432 -
dc.identifier.url http://www.scopus.com/record/display.url?eid=2-s2.0-85074817518&origin=inward -
dc.language.iso eng -
dc.relation.conferencedate 13-15/112019 -
dc.relation.conferencename 6th Italian Conference on Computational Linguistics (CLiC-it) -
dc.relation.conferenceplace Bari -
dc.relation.volume 2481 -
dc.subject.keywords Written-Spoken Parallel Corpus -
dc.subject.singlekeyword Written-Spoken Parallel Corpus *
dc.title Building an Italian written-spoken parallel corpus: A pilot study en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 434848 -
iris.orcid.lastModifiedDate 2024/04/04 14:33:36 *
iris.orcid.lastModifiedMillisecond 1712234016056 *
iris.scopus.extIssued 2019 -
iris.scopus.extTitle Building an Italian written-spoken parallel corpus: A pilot study -
iris.sitodocente.maxattempts 1 -
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation Università di Pisa -
scopus.contributor.affiliation Università di Pisa -
scopus.contributor.affiliation ILC–CNR -
scopus.contributor.affiliation ILC–CNR -
scopus.contributor.affiliation ILC–CNR -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60021199 -
scopus.contributor.afid 60021199 -
scopus.contributor.afid 60021199 -
scopus.contributor.auid 57211680934 -
scopus.contributor.auid 57211689144 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 15056781100 -
scopus.contributor.auid 34977412400 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Elisa -
scopus.contributor.name Lucia -
scopus.contributor.name Felice -
scopus.contributor.name Simonetta -
scopus.contributor.name Valeria -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.surname Dominutti -
scopus.contributor.surname Pifferi -
scopus.contributor.surname Dell’Orletta -
scopus.contributor.surname Montemagni -
scopus.contributor.surname Quochi -
scopus.date.issued 2019 *
scopus.description.abstracteng This paper presents a pilot study towards the creation of a monolingual written–spoken parallel corpus in Italian, featuring two main novelties in the general landscape of spoken corpora: the alignment with the written counterpart of the same content and the spoken variety dealt with, represented by transcriptions of radio news broadcasting. *
scopus.description.allpeopleoriginal Dominutti E.; Pifferi L.; Dell'Orletta F.; Montemagni S.; Quochi V. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.funding.funders 501100006084 - Aeronautical Development Agency; 501100009888 - Regione Toscana; *
scopus.identifier.pui 629833106 *
scopus.identifier.scopus 2-s2.0-85074817518 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2019 *
scopus.relation.conferencename 6th Italian Conference on Computational Linguistics, CLiC-it 2019 *
scopus.relation.conferenceplace ita *
scopus.relation.volume 2481 *
scopus.title Building an Italian written-spoken parallel corpus: A pilot study *
scopus.titleeng Building an Italian written-spoken parallel corpus: A pilot study *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/390432
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact