In this paper we present a novel approach to multi-word terminology extraction combining a well-known automatic term recognition approach, the C-NC value method, with a contrastive ranking technique, aimed at refining obtained results either by filtering noise due to common words or by discerning between semantically different types of terms within heterogeneous terminologies. The proposed methodology has been tested in two case studies carried out in the History of Art and Legal domains with promising results.

A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora

Dell'Orletta F;Montemagni S;Venturi G
2010

Abstract

In this paper we present a novel approach to multi-word terminology extraction combining a well-known automatic term recognition approach, the C-NC value method, with a contrastive ranking technique, aimed at refining obtained results either by filtering noise due to common words or by discerning between semantically different types of terms within heterogeneous terminologies. The proposed methodology has been tested in two case studies carried out in the History of Art and Legal domains with promising results.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Bonin F it
dc.authority.people Dell'Orletta F it
dc.authority.people Montemagni S it
dc.authority.people Venturi G it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 20:06:32 -
dc.date.available 2024/02/19 20:06:32 -
dc.date.issued 2010 -
dc.description.abstracteng In this paper we present a novel approach to multi-word terminology extraction combining a well-known automatic term recognition approach, the C-NC value method, with a contrastive ranking technique, aimed at refining obtained results either by filtering noise due to common words or by discerning between semantically different types of terms within heterogeneous terminologies. The proposed methodology has been tested in two case studies carried out in the History of Art and Legal domains with promising results. -
dc.description.affiliations Bonin F.: Dipartimento di Informatica Pisa, Università di Pisa; Language Interaction and Computation Lab, University of Trento. Dell'Orletta F.; Montemagni S.; Venturi G.: ILC - Istituto di linguistica computazionale "Antonio Zampolli" -
dc.description.allpeople Bonin, F; Dell'Orletta, F; Montemagni, S; Venturi, G -
dc.description.allpeopleoriginal Bonin F.; Dell'Orletta F.; Montemagni S.; Venturi G. -
dc.description.fulltext none en
dc.description.numberofauthors 4 -
dc.identifier.isbn 2-9517408-6-7 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/65162 -
dc.language.iso eng -
dc.relation.conferencedate 19-21 maggio 2010 -
dc.relation.conferencename Seventh International Conference on Language Resources and Evaluation -
dc.relation.conferenceplace Valletta, Malta -
dc.relation.firstpage 3222 -
dc.relation.lastpage 3229 -
dc.subject.keywords Terminology Extraction -
dc.subject.keywords Domain-specific Corpora -
dc.subject.keywords Multi-word Expression -
dc.subject.singlekeyword Terminology Extraction *
dc.subject.singlekeyword Domain-specific Corpora *
dc.subject.singlekeyword Multi-word Expression *
dc.title A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 84796 -
iris.isi.extIssued 2010 -
iris.isi.extTitle A contrastive Approach to Multi-word Term Extraction from Domain-specific Corpora -
iris.orcid.lastModifiedDate 2024/04/05 12:44:25 *
iris.orcid.lastModifiedMillisecond 1712313865432 *
iris.scopus.extIssued 2010 -
iris.scopus.extTitle A contrastive approach to multi-word term extraction from domain corpora -
iris.sitodocente.maxattempts 4 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/65162
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact