ISACCO: a corpus for investigating spoken and written language development in Italian school-age children
Dominique Brunato; Felice Dell'Orletta
2016
Abstract
In this paper we present ISACCO (Italian School-Age Children COrpus), a corpus of oral and written retellings of Italian-speaking children attending primary school. All texts were digitalized and automatically enriched with multi-level linguistic annotation. Preliminary explorations of both the form and the content of children's productions were carried out based on a set of features automatically extracted by NLP tools. Written retellings were manually annotated with a typology of errors belonging to three different linguistic levels. The resource, which has been made publicly available, is conceived to support research and computational modeling of "later language acquisition", with an emphasis on comparative assessment of the evolution of oral and written language competencies in early school grades.

| DC field | Value | Language |
|---|---|---|
| dc.authority.ancejournal | IJCOL | - |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Dominique Brunato | it |
| dc.authority.people | Felice Dell'Orletta | it |
| dc.collection.id.s | b3f88f24-048a-4e43-8ab1-6697b90e068e | * |
| dc.collection.name | 01.01 Journal article | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/21 02:09:48 | - |
| dc.date.available | 2024/02/21 02:09:48 | - |
| dc.date.issued | 2016 | - |
| dc.description.abstracteng | In this paper we present ISACCO (Italian School-Age Children COrpus), a corpus of oral and written retellings of Italian-speaking children attending primary school. All texts were digitalized and automatically enriched with multi-level linguistic annotation. Preliminary explorations of both the form and the content of children's productions were carried out based on a set of features automatically extracted by NLP tools. Written retellings were manually annotated with a typology of errors belonging to three different linguistic levels. The resource, which has been made publicly available, is conceived to support research and computational modeling of "later language acquisition", with an emphasis on comparative assessment of the evolution of oral and written language competencies in early school grades. | - |
| dc.description.affiliations | Istituto di Linguistica Computazionale "A.Zampolli", ILC-CNR | - |
| dc.description.allpeople | Dominique Brunato; Felice Dell'Orletta | - |
| dc.description.allpeopleoriginal | Dominique Brunato, Felice Dell'Orletta | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 2 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/325818 | - |
| dc.identifier.url | http://www.italianlp.it/wp-content/uploads/2016/09/04_brunato_dell-orletta.pdf | - |
| dc.language.iso | eng | - |
| dc.relation.firstpage | 63 | - |
| dc.relation.issue | 1 | - |
| dc.relation.lastpage | 76 | - |
| dc.relation.numberofpages | 14 | - |
| dc.relation.volume | 2 | - |
| dc.subject.keywords | Child language acquisition | - |
| dc.subject.keywords | Oral and Written language | - |
| dc.subject.keywords | multi-level linguistic analysis | - |
| dc.subject.singlekeyword | Child language acquisition | * |
| dc.subject.singlekeyword | Oral and Written language | * |
| dc.subject.singlekeyword | multi-level linguistic analysis | * |
| dc.title | ISACCO: a corpus for investigating spoken and written language development in Italian school-age children | en |
| dc.type.driver | info:eu-repo/semantics/article | - |
| dc.type.full | 01 Journal contribution::01.01 Journal article | it |
| dc.type.miur | 262 | - |
| dc.type.referee | Yes, but type not specified | - |
| dc.ugov.descaux1 | 366755 | - |
| iris.orcid.lastModifiedDate | 2024/02/22 23:14:53 | * |
| iris.orcid.lastModifiedMillisecond | 1708640093055 | * |
| iris.sitodocente.maxattempts | 1 | - |
| Appears in types: | 01.01 Journal article | |

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.


