Starting from a wide set of linguistic features, we present the first in depth feature analysis in two different Native Language Identification (NLI) scenarios. We compare the results obtained in a traditional NLI document classification task and in a newly introduced sentence classification task, investigating the different role played by the considered features. Finally, we study the impact of a set of selected features extracted from the sentence classifier in document classification.

Sentences and documents in native language identification

Cimino A;Dell'Orletta F;Brunato D;Venturi G
2018

Abstract

Starting from a wide set of linguistic features, we present the first in depth feature analysis in two different Native Language Identification (NLI) scenarios. We compare the results obtained in a traditional NLI document classification task and in a newly introduced sentence classification task, investigating the different role played by the considered features. Finally, we study the impact of a set of selected features extracted from the sentence classifier in document classification.
2018
Istituto di linguistica computazionale "Antonio Zampolli" - ILC
Inglese
5th Italian Conference on Computational Linguistics (CLiC-it)
2253
1
6
6
http://www.scopus.com/record/display.url?eid=2-s2.0-85057749754&origin=inward
10-12/12/2018
Torino
Natural Language Processing
Native Language Identification
4
none
Cimino A.; Dell'Orletta F.; Brunato D.; Venturi G.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/403576
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact