DARC-IT: A DAtaset for reading comprehension in Italian

Brunato D; Valeriani M; Dell'Orletta F
2018

Abstract

In this paper, we present DARC-IT, a new reading comprehension dataset for the Italian language aimed at identifying 'question-worthy' sentences, i.e. sentences in a text which contain information that is worth asking a question about. The purpose of the corpus is twofold: to investigate the linguistic profile of question-worthy sentences and to support the development of automatic question generation systems.
DC Field Value Language
dc.authority.anceserie CEUR WORKSHOP PROCEEDINGS -
dc.authority.anceserie CEUR Workshop Proceedings -
dc.authority.people Brunato D it
dc.authority.people Valeriani M it
dc.authority.people Dell'Orletta F it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 02:44:43 -
dc.date.available 2024/02/21 02:44:43 -
dc.date.issued 2018 -
dc.description.abstracteng In this paper, we present DARC-IT, a new reading comprehension dataset for the Italian language aimed at identifying 'question-worthy' sentences, i.e. sentences in a text which contain information that is worth asking a question about. The purpose of the corpus is twofold: to investigate the linguistic profile of question-worthy sentences and to support the development of automatic question generation systems. -
dc.description.affiliations University of Pisa, Italy; Istituto di Linguistica Computazionale Antonio Zampolli (ILC), CNR, Italy -
dc.description.allpeople Brunato D.; Valeriani M.; Dell'Orletta F. -
dc.description.allpeopleoriginal Brunato D.; Valeriani M.; Dell'Orletta F. -
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.doi 10.4000/books.aaccademia.3099 -
dc.identifier.scopus 2-s2.0-85057748908 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/392547 -
dc.identifier.url http://www.scopus.com/record/display.url?eid=2-s2.0-85057748908&origin=inward -
dc.language.iso eng -
dc.relation.conferencedate 10-12/12/2018 -
dc.relation.conferencename 5th Italian Conference on Computational Linguistics (CLiC-it) -
dc.relation.conferenceplace Torino -
dc.relation.volume 2253 -
dc.subject.keywords Reading Comprehension -
dc.subject.singlekeyword Reading Comprehension *
dc.title DARC-IT: A DAtaset for reading comprehension in Italian en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 434878 -
iris.orcid.lastModifiedDate 2024/02/22 19:00:25 *
iris.orcid.lastModifiedMillisecond 1708624825427 *
iris.scopus.extIssued 2018 -
iris.scopus.extTitle DARC-IT: A DAtaset for reading comprehension in Italian -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.doi 10.4000/books.aaccademia.3099 *
iris.unpaywall.isoa false *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.metadataCallLastModified 27/12/2025 04:06:46 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1766804806715 -
iris.unpaywall.oastatus closed *
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.affiliation University of Pisa -
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 55237740200 -
scopus.contributor.auid 57204902765 -
scopus.contributor.auid 57540567000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Dominique -
scopus.contributor.name Martina -
scopus.contributor.name Felice -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale Antonio Zampolli (ILC-CNR); -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale Antonio Zampolli (ILC-CNR); -
scopus.contributor.surname Brunato -
scopus.contributor.surname Valeriani -
scopus.contributor.surname Dell'Orletta -
scopus.date.issued 2018 *
scopus.description.abstracteng In this paper, we present DARC-IT, a new reading comprehension dataset for the Italian language aimed at identifying 'question-worthy' sentences, i.e. sentences in a text which contain information that is worth asking a question about. The purpose of the corpus is twofold: to investigate the linguistic profile of question-worthy sentences and to support the development of automatic question generation systems. *
scopus.description.allpeopleoriginal Brunato D.; Valeriani M.; Dell'Orletta F. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.doi 10.4000/books.aaccademia.3099 *
scopus.identifier.pui 625360186 *
scopus.identifier.scopus 2-s2.0-85057748908 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2018 *
scopus.relation.conferencename 5th Italian Conference on Computational Linguistics, CLiC-it 2018 *
scopus.relation.conferenceplace ita *
scopus.relation.volume 2253 *
scopus.title DARC-IT: A DAtaset for reading comprehension in Italian *
scopus.titleeng DARC-IT: A DAtaset for reading comprehension in Italian *
Appears in types: 04.01 Contributo in Atti di convegno

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/392547