Digital tools based on automatic speech recognition (ASR) could be a useful support for teachers in assessing the reading skills of the students. We focus on the evaluation of the decoding accuracy of children with grade level ranging from the 3rd to the 6th performing a reading aloud task on a narrative text displayed on an ordinary tablet using the ReadLet platform. On the basis of previously collected data, we built a gold dataset with sentences characterised by the audio data, the original text to be read, and the text actually spoken by the child. By using the open-source Kaldi toolkit an ASR system based on the GMM-HMM model was trained on the training portion of the gold dataset. The accuracy of the ASR system was calculated as the ability to correctly decode the test audio data with respect to the annotated text, and the decoding accuracy of the children was estimated by measuring the gap between the results obtained with the annotated text and the original text. A consistent trend with increasing grade level was found in terms of word correctness, substitutions and insertions, while the trained model appears to be significantly able to evaluate the children decoding accuracy.

Evaluating the accuracy of decoding in children who read aloud

Bruno E;Cappa C;Ferro M
2021

Abstract

Digital tools based on automatic speech recognition (ASR) could be a useful support for teachers in assessing the reading skills of the students. We focus on the evaluation of the decoding accuracy of children with grade level ranging from the 3rd to the 6th performing a reading aloud task on a narrative text displayed on an ordinary tablet using the ReadLet platform. On the basis of previously collected data, we built a gold dataset with sentences characterised by the audio data, the original text to be read, and the text actually spoken by the child. By using the open-source Kaldi toolkit an ASR system based on the GMM-HMM model was trained on the training portion of the gold dataset. The accuracy of the ASR system was calculated as the ability to correctly decode the test audio data with respect to the annotated text, and the decoding accuracy of the children was estimated by measuring the gap between the results obtained with the annotated text and the original text. A consistent trend with increasing grade level was found in terms of word correctness, substitutions and insertions, while the trained model appears to be significantly able to evaluate the children decoding accuracy.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di Fisiologia Clinica - IFC -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Bruno E it
dc.authority.people Giulivi S it
dc.authority.people Cappa C it
dc.authority.people Marini M it
dc.authority.people Ferro M it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di Fisiologia Clinica - IFC *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 885 *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 14:48:07 -
dc.date.available 2024/02/19 14:48:07 -
dc.date.issued 2021 -
dc.description.abstracteng Digital tools based on automatic speech recognition (ASR) could be a useful support for teachers in assessing the reading skills of the students. We focus on the evaluation of the decoding accuracy of children with grade level ranging from the 3rd to the 6th performing a reading aloud task on a narrative text displayed on an ordinary tablet using the ReadLet platform. On the basis of previously collected data, we built a gold dataset with sentences characterised by the audio data, the original text to be read, and the text actually spoken by the child. By using the open-source Kaldi toolkit an ASR system based on the GMM-HMM model was trained on the training portion of the gold dataset. The accuracy of the ASR system was calculated as the ability to correctly decode the test audio data with respect to the annotated text, and the decoding accuracy of the children was estimated by measuring the gap between the results obtained with the annotated text and the original text. A consistent trend with increasing grade level was found in terms of word correctness, substitutions and insertions, while the trained model appears to be significantly able to evaluate the children decoding accuracy. -
dc.description.affiliations ILC-CNR; SUPSI; IFC-CNR; DICI-UNIPI; ILC-CNR; -
dc.description.allpeople Bruno, E; Giulivi, S; Cappa, C; Marini, M; Ferro, M -
dc.description.allpeopleoriginal Bruno E., Giulivi S., Cappa C., Marini M., Ferro M. -
dc.description.fulltext open en
dc.description.numberofauthors 5 -
dc.identifier.doi 10.36253/978-88-5518-449-6 -
dc.identifier.isbn 978-88-5518-449-6 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/446977 -
dc.language.iso eng -
dc.publisher.country ITA -
dc.publisher.name Firenze University Press -
dc.publisher.place Firenze -
dc.relation.alleditors Manfredi C. -
dc.relation.conferencedate 14-16/12/2021 -
dc.relation.conferencename 12th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA'21) -
dc.relation.conferenceplace Firenze (Italy) -
dc.relation.firstpage 145 -
dc.relation.ispartofbook Proceedings of the 12th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA'21) -
dc.relation.lastpage 148 -
dc.relation.numberofpages 4 -
dc.subject.keywords speech recognition -
dc.subject.keywords decoding accuracy -
dc.subject.keywords reading aloud -
dc.subject.keywords voice parameters -
dc.subject.keywords Kaldi -
dc.subject.keywords GMM-HMM acoustic model -
dc.subject.singlekeyword speech recognition *
dc.subject.singlekeyword decoding accuracy *
dc.subject.singlekeyword reading aloud *
dc.subject.singlekeyword voice parameters *
dc.subject.singlekeyword Kaldi *
dc.subject.singlekeyword GMM-HMM acoustic model *
dc.title Evaluating the accuracy of decoding in children who read aloud en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 461393 -
iris.mediafilter.data 2025/04/23 04:01:40 *
iris.orcid.lastModifiedDate 2024/04/24 14:34:00 *
iris.orcid.lastModifiedMillisecond 1713962040821 *
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost repository *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.36253/978-88-5518-449-6 *
iris.unpaywall.hosttype *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://hdl.handle.net/11384/134262 *
iris.unpaywall.license other-oa *
iris.unpaywall.metadataCallLastModified 15/12/2024 14:45:27 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1734270327656 -
iris.unpaywall.oastatus green *
iris.unpaywall.pdfurl https://ricerca.sns.it/bitstream/11384/134262/1/IAVARONE_publication5.pdf *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_461393-doc_180010.pdf

accesso aperto

Descrizione: Evaluating the accuracy of decoding in children who read aloud
Tipologia: Versione Editoriale (PDF)
Dimensione 219.61 kB
Formato Adobe PDF
219.61 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/446977
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact