From Collection to Transcription: a Workflow for Managing Speech Data by the CLARIN Trainers' Network

Giulia Pedonese
2025

Abstract

This presentation shares the experience of members of the CLARIN Trainers’ Network in reusing, adapting and localising existing learning content on speech and oral data management to meet the learning needs of the CLARIN-IT research community within the Humanities and Cultural Heritage Italian Open Science Cloud (H2IOSC) project. After a brief introduction to the H2IOSC project and its training strategy, it describes the transcription workshop and an accompanying workflow for managing speech and oral data, covering data collection, privacy considerations, and the transcription chain. The authors then show how they used the Skills4EOSC FAIR-by-Design methodology to convert the workshop into reusable training material that other trainers can adapt to the needs of researchers in other communities working with oral history data.

The H2IOSC project (Humanities and Cultural Heritage Italian Open Science Cloud) is funded by the European Union, NextGenerationEU, PNRR M4C2, project code IR0000029, CUP B63C22000730005.
Istituto di linguistica computazionale "Antonio Zampolli" - ILC
transcription chain
Linguistics


Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/561729