ISACCO: a corpus for investigating spoken and written language development in Italian school-age children

Dominique Brunato; Felice Dell'Orletta
2016

Abstract

In this paper we present ISACCO (Italian School-Age Children COrpus), a corpus of oral and written retellings produced by Italian-speaking children attending primary school. All texts were digitized and automatically enriched with multi-level linguistic annotation. Preliminary explorations of both the form and the content of the children's productions were carried out using a set of features automatically extracted by NLP tools. Written retellings were manually annotated with a typology of errors spanning three linguistic levels. The resource, which has been made publicly available, is designed to support research on and computational modeling of "later language acquisition", with an emphasis on comparing how oral and written language competencies evolve in the early school grades.
DC Field Value Language
dc.authority.ancejournal IJCOL -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Dominique Brunato it
dc.authority.people Felice Dell'Orletta it
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Journal article *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 02:09:48 -
dc.date.available 2024/02/21 02:09:48 -
dc.date.issued 2016 -
dc.description.affiliations Istituto di Linguistica Computazionale "A.Zampolli", ILC-CNR -
dc.description.allpeople Dominique Brunato; Felice Dell'Orletta -
dc.description.allpeopleoriginal Dominique Brunato, Felice Dell'Orletta -
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/325818 -
dc.identifier.url http://www.italianlp.it/wp-content/uploads/2016/09/04_brunato_dell-orletta.pdf -
dc.language.iso eng -
dc.relation.firstpage 63 -
dc.relation.issue 1 -
dc.relation.lastpage 76 -
dc.relation.numberofpages 14 -
dc.relation.volume 2 -
dc.subject.keywords Child language acquisition -
dc.subject.keywords Oral and Written language -
dc.subject.keywords multi-level linguistic analysis -
dc.subject.singlekeyword Child language acquisition *
dc.subject.singlekeyword Oral and Written language *
dc.subject.singlekeyword multi-level linguistic analysis *
dc.title ISACCO: a corpus for investigating spoken and written language development in Italian school-age children en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Journal contribution::01.01 Journal article it
dc.type.miur 262 -
dc.type.referee Yes, but type not specified -
dc.ugov.descaux1 366755 -
iris.orcid.lastModifiedDate 2024/02/22 23:14:53 *
iris.orcid.lastModifiedMillisecond 1708640093055 *
iris.sitodocente.maxattempts 1 -
Appears in types: 01.01 Journal article
Files in this item:
There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/325818