Starting from a wide set of linguistic features, we present the first in depth feature analysis in two different Native Language Identification (NLI) scenarios. We compare the results obtained in a traditional NLI document classification task and in a newly introduced sentence classification task, investigating the different role played by the considered features. Finally, we study the impact of a set of selected features extracted from the sentence classifier in document classification.

Sentences and documents in native language identification

Cimino A;Dell'Orletta F;Brunato D;Venturi G
2018

Abstract

Starting from a wide set of linguistic features, we present the first in depth feature analysis in two different Native Language Identification (NLI) scenarios. We compare the results obtained in a traditional NLI document classification task and in a newly introduced sentence classification task, investigating the different role played by the considered features. Finally, we study the impact of a set of selected features extracted from the sentence classifier in document classification.
Campo DC Valore Lingua
dc.authority.anceserie CEUR WORKSHOP PROCEEDINGS -
dc.authority.anceserie CEUR Workshop Proceedings -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Cimino A it
dc.authority.people Dell'Orletta F it
dc.authority.people Brunato D it
dc.authority.people Venturi G it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 06:02:38 -
dc.date.available 2024/02/21 06:02:38 -
dc.date.issued 2018 -
dc.description.abstracteng Starting from a wide set of linguistic features, we present the first in depth feature analysis in two different Native Language Identification (NLI) scenarios. We compare the results obtained in a traditional NLI document classification task and in a newly introduced sentence classification task, investigating the different role played by the considered features. Finally, we study the impact of a set of selected features extracted from the sentence classifier in document classification. -
dc.description.affiliations Istituto di Linguistica Computazionale Antonio Zampolli (ILC-CNR), ItaliaNLP Lab., , Italy -
dc.description.allpeople Cimino A.; Dell'Orletta F.; Brunato D.; Venturi G. -
dc.description.allpeopleoriginal Cimino A.; Dell'Orletta F.; Brunato D.; Venturi G. -
dc.description.fulltext none en
dc.description.numberofauthors 4 -
dc.identifier.scopus 2-s2.0-85057749754 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/403576 -
dc.identifier.url http://www.scopus.com/record/display.url?eid=2-s2.0-85057749754&origin=inward -
dc.language.iso eng -
dc.relation.conferencedate 10-12/12/2018 -
dc.relation.conferencename 5th Italian Conference on Computational Linguistics (CLiC-it) -
dc.relation.conferenceplace Torino -
dc.relation.firstpage 1 -
dc.relation.lastpage 6 -
dc.relation.numberofpages 6 -
dc.relation.volume 2253 -
dc.subject.keywords Natural Language Processing -
dc.subject.keywords Native Language Identification -
dc.subject.singlekeyword Natural Language Processing *
dc.subject.singlekeyword Native Language Identification *
dc.title Sentences and documents in native language identification en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 423870 -
iris.orcid.lastModifiedDate 2024/03/27 09:56:37 *
iris.orcid.lastModifiedMillisecond 1711529797301 *
iris.scopus.extIssued 2018 -
iris.scopus.extTitle Sentences and documents in native language identification -
iris.sitodocente.maxattempts 1 -
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.affiliation ItaliaNLP Lab. -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57002803800 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 55237740200 -
scopus.contributor.auid 27568199800 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Andrea -
scopus.contributor.name Felice -
scopus.contributor.name Dominique -
scopus.contributor.name Giulia -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale Antonio Zampolli (ILC-CNR); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale Antonio Zampolli (ILC-CNR); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale Antonio Zampolli (ILC-CNR); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale Antonio Zampolli (ILC-CNR); -
scopus.contributor.surname Cimino -
scopus.contributor.surname Dell'Orletta -
scopus.contributor.surname Brunato -
scopus.contributor.surname Venturi -
scopus.date.issued 2018 *
scopus.description.abstracteng Starting from a wide set of linguistic features, we present the first in depth feature analysis in two different Native Language Identification (NLI) scenarios. We compare the results obtained in a traditional NLI document classification task and in a newly introduced sentence classification task, investigating the different role played by the considered features. Finally, we study the impact of a set of selected features extracted from the sentence classifier in document classification. *
scopus.description.allpeopleoriginal Cimino A.; Dell'Orletta F.; Brunato D.; Venturi G. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.identifier.doi *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.doi 10.4000/books.aaccademia.3204 *
scopus.identifier.pui 625359952 *
scopus.identifier.scopus 2-s2.0-85057749754 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2018 *
scopus.relation.conferencename 5th Italian Conference on Computational Linguistics, CLiC-it 2018 *
scopus.relation.conferenceplace ita *
scopus.relation.volume 2253 *
scopus.title Sentences and documents in native language identification *
scopus.titleeng Sentences and documents in native language identification *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/403576
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact