This presentation shares the experience of members of the CLARIN Trainers’ Network in reusing, adapting and localising existing learning content related to speech and oral data management to meet the learning needs of the CLARIN-IT research community in the context of the Humanities and Cultural Heritage Italian Open Science Cloud (H2IOSC) project. Following a brief introduction to the H2IOSC project and its training strategy, the article describes the transcription workshop and an accompanying workflow for managing speech and oral data, including data collection, privacy considerations, and the transcription chain. Furthermore, the authors show how they used the Skills4EOSC FAIR-by-Design methodology to convert the workshop into reusable training material, which other trainers can take and adapt to meet the needs of researchers in other communities working with oral history data. -- Progetto H2IOSC - Humanities and cultural Heritage Italian Open Science Cloud finanziato dall’Unione Europea NextGenerationEU – PNRR M4C2 – Codice progetto IR0000029 – CUP B63C22000730005.
From Collection to Transcription: a Workflow for Managing Speech Data by the CLARIN Trainers' Network
Giulia Pedonese
2025
Abstract
This presentation shares the experience of members of the CLARIN Trainers’ Network in reusing, adapting and localising existing learning content related to speech and oral data management to meet the learning needs of the CLARIN-IT research community in the context of the Humanities and Cultural Heritage Italian Open Science Cloud (H2IOSC) project. Following a brief introduction to the H2IOSC project and its training strategy, the article describes the transcription workshop and an accompanying workflow for managing speech and oral data, including data collection, privacy considerations, and the transcription chain. Furthermore, the authors show how they used the Skills4EOSC FAIR-by-Design methodology to convert the workshop into reusable training material, which other trainers can take and adapt to meet the needs of researchers in other communities working with oral history data. -- Progetto H2IOSC - Humanities and cultural Heritage Italian Open Science Cloud finanziato dall’Unione Europea NextGenerationEU – PNRR M4C2 – Codice progetto IR0000029 – CUP B63C22000730005.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


