This paper deals with Domain Adaptation for automatic syntactic annotation. Until the half of the 1980s, automatic linguistic annotation was based on algorithms built on groups of hand-written rules, defined a priori on the basis of the knowledge of the system to formalise. Subsequently, thanks to the progress of research in the field of Artificial Intelligence and to the development of linguistic resources, algorithms based on machine learning techniques began to be employed. The major difficulties of those algorithms were due to certain aspects of natural language such as ambiguities, diachronic evolutions, or language variations from the original domain of knowledge. More specifically, the issue of Domain Adaptation can be put in the following terms: "can an annotated corpus [which is representative of a specific linguistic variety] be used for the syntactic analysis of a second corpus [which is representative of a different linguistic variety]?". The author answer presenting an algorithm called ULISSE (Unsupervised LInguistically-driven Selection of dEpendency parses), which selects in an optima way the most representative sentences of a new target domain and feed them to the parser in addition to the original training set.
ULISSE: una strategia di adattamento al dominio per l'annotazione sintattica automatica
Dell'Orletta F;Venturi G
2016
Abstract
This paper deals with Domain Adaptation for automatic syntactic annotation. Until the half of the 1980s, automatic linguistic annotation was based on algorithms built on groups of hand-written rules, defined a priori on the basis of the knowledge of the system to formalise. Subsequently, thanks to the progress of research in the field of Artificial Intelligence and to the development of linguistic resources, algorithms based on machine learning techniques began to be employed. The major difficulties of those algorithms were due to certain aspects of natural language such as ambiguities, diachronic evolutions, or language variations from the original domain of knowledge. More specifically, the issue of Domain Adaptation can be put in the following terms: "can an annotated corpus [which is representative of a specific linguistic variety] be used for the syntactic analysis of a second corpus [which is representative of a different linguistic variety]?". The author answer presenting an algorithm called ULISSE (Unsupervised LInguistically-driven Selection of dEpendency parses), which selects in an optima way the most representative sentences of a new target domain and feed them to the parser in addition to the original training set.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Dell'Orletta F | it |
| dc.authority.people | Venturi G | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/21 02:08:51 | - |
| dc.date.available | 2024/02/21 02:08:51 | - |
| dc.date.issued | 2016 | - |
| dc.description.abstracteng | This paper deals with Domain Adaptation for automatic syntactic annotation. Until the half of the 1980s, automatic linguistic annotation was based on algorithms built on groups of hand-written rules, defined a priori on the basis of the knowledge of the system to formalise. Subsequently, thanks to the progress of research in the field of Artificial Intelligence and to the development of linguistic resources, algorithms based on machine learning techniques began to be employed. The major difficulties of those algorithms were due to certain aspects of natural language such as ambiguities, diachronic evolutions, or language variations from the original domain of knowledge. More specifically, the issue of Domain Adaptation can be put in the following terms: "can an annotated corpus [which is representative of a specific linguistic variety] be used for the syntactic analysis of a second corpus [which is representative of a different linguistic variety]?". The author answer presenting an algorithm called ULISSE (Unsupervised LInguistically-driven Selection of dEpendency parses), which selects in an optima way the most representative sentences of a new target domain and feed them to the parser in addition to the original training set. | - |
| dc.description.affiliations | Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR) | - |
| dc.description.allpeople | Dell'Orletta, F; Venturi, G | - |
| dc.description.allpeopleoriginal | Dell'Orletta F., Venturi G. | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 2 | - |
| dc.identifier.isbn | 978-88-6952-038-9 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/325815 | - |
| dc.identifier.url | http://www.italianlp.it/wp-content/uploads/2016/10/Compter_Parler_Soigner_ULISSE.pdf | - |
| dc.language.iso | ita | - |
| dc.relation.conferencedate | 15-17 dicembre 2014 | - |
| dc.relation.conferencename | Atti del convegno "Compter parler soigner: tra linguistica e intelligenza artificiale" | - |
| dc.relation.conferenceplace | Pavia | - |
| dc.relation.firstpage | 55 | - |
| dc.relation.lastpage | 79 | - |
| dc.relation.numberofpages | 24 | - |
| dc.subject.keywords | Domain Adaptation | - |
| dc.subject.keywords | annotazione sintattica automatica | - |
| dc.subject.singlekeyword | Domain Adaptation | * |
| dc.subject.singlekeyword | annotazione sintattica automatica | * |
| dc.title | ULISSE: una strategia di adattamento al dominio per l'annotazione sintattica automatica | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.ugov.descaux1 | 366752 | - |
| iris.orcid.lastModifiedDate | 2024/04/05 05:04:44 | * |
| iris.orcid.lastModifiedMillisecond | 1712286284471 | * |
| iris.sitodocente.maxattempts | 3 | - |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


