CNR Institutional Research Information System

This paper deals with Domain Adaptation for automatic syntactic annotation. Until the half of the 1980s, automatic linguistic annotation was based on algorithms built on groups of hand-written rules, defined a priori on the basis of the knowledge of the system to formalise. Subsequently, thanks to the progress of research in the field of Artificial Intelligence and to the development of linguistic resources, algorithms based on machine learning techniques began to be employed. The major difficulties of those algorithms were due to certain aspects of natural language such as ambiguities, diachronic evolutions, or language variations from the original domain of knowledge. More specifically, the issue of Domain Adaptation can be put in the following terms: "can an annotated corpus [which is representative of a specific linguistic variety] be used for the syntactic analysis of a second corpus [which is representative of a different linguistic variety]?". The author answer presenting an algorithm called ULISSE (Unsupervised LInguistically-driven Selection of dEpendency parses), which selects in an optima way the most representative sentences of a new target domain and feed them to the parser in addition to the original training set.

ULISSE: una strategia di adattamento al dominio per l'annotazione sintattica automatica

Dell'Orletta F;Venturi G

2016

Abstract

This paper deals with Domain Adaptation for automatic syntactic annotation. Until the half of the 1980s, automatic linguistic annotation was based on algorithms built on groups of hand-written rules, defined a priori on the basis of the knowledge of the system to formalise. Subsequently, thanks to the progress of research in the field of Artificial Intelligence and to the development of linguistic resources, algorithms based on machine learning techniques began to be employed. The major difficulties of those algorithms were due to certain aspects of natural language such as ambiguities, diachronic evolutions, or language variations from the original domain of knowledge. More specifically, the issue of Domain Adaptation can be put in the following terms: "can an annotated corpus [which is representative of a specific linguistic variety] be used for the syntactic analysis of a second corpus [which is representative of a different linguistic variety]?". The author answer presenting an algorithm called ULISSE (Unsupervised LInguistically-driven Selection of dEpendency parses), which selects in an optima way the most representative sentences of a new target domain and feed them to the parser in addition to the original training set.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	-
dc.authority.people	Dell'Orletta F	it
dc.authority.people	Venturi G	it
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.date.accessioned	2024/02/21 02:08:51	-
dc.date.available	2024/02/21 02:08:51	-
dc.date.issued	2016	-
dc.description.abstracteng	This paper deals with Domain Adaptation for automatic syntactic annotation. Until the half of the 1980s, automatic linguistic annotation was based on algorithms built on groups of hand-written rules, defined a priori on the basis of the knowledge of the system to formalise. Subsequently, thanks to the progress of research in the field of Artificial Intelligence and to the development of linguistic resources, algorithms based on machine learning techniques began to be employed. The major difficulties of those algorithms were due to certain aspects of natural language such as ambiguities, diachronic evolutions, or language variations from the original domain of knowledge. More specifically, the issue of Domain Adaptation can be put in the following terms: "can an annotated corpus [which is representative of a specific linguistic variety] be used for the syntactic analysis of a second corpus [which is representative of a different linguistic variety]?". The author answer presenting an algorithm called ULISSE (Unsupervised LInguistically-driven Selection of dEpendency parses), which selects in an optima way the most representative sentences of a new target domain and feed them to the parser in addition to the original training set.	-
dc.description.affiliations	Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR)	-
dc.description.allpeople	Dell'Orletta, F; Venturi, G	-
dc.description.allpeopleoriginal	Dell'Orletta F., Venturi G.	-
dc.description.fulltext	none	en
dc.description.numberofauthors	2	-
dc.identifier.isbn	978-88-6952-038-9	-
dc.identifier.uri	https://hdl.handle.net/20.500.14243/325815	-
dc.identifier.url	http://www.italianlp.it/wp-content/uploads/2016/10/Compter_Parler_Soigner_ULISSE.pdf	-
dc.language.iso	ita	-
dc.relation.conferencedate	15-17 dicembre 2014	-
dc.relation.conferencename	Atti del convegno "Compter parler soigner: tra linguistica e intelligenza artificiale"	-
dc.relation.conferenceplace	Pavia	-
dc.relation.firstpage	55	-
dc.relation.lastpage	79	-
dc.relation.numberofpages	24	-
dc.subject.keywords	Domain Adaptation	-
dc.subject.keywords	annotazione sintattica automatica	-
dc.subject.singlekeyword	Domain Adaptation	*
dc.subject.singlekeyword	annotazione sintattica automatica	*
dc.title	ULISSE: una strategia di adattamento al dominio per l'annotazione sintattica automatica	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.miur	273	-
dc.ugov.descaux1	366752	-
iris.orcid.lastModifiedDate	2024/04/05 05:04:44	*
iris.orcid.lastModifiedMillisecond	1712286284471	*
iris.sitodocente.maxattempts	3	-
Appare nelle tipologie:	04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/325815

Citazioni

ND

ND

ND

social impact