In this paper we present a novel approach to multi-word terminology extraction combining a well-known automatic term recognition approach, the C-NC value method, with a contrastive ranking technique, aimed at refining obtained results either by filtering noise due to common words or by discerning between semantically different types of terms within heterogeneous terminologies. The proposed methodology has been tested in two case studies carried out in the History of Art and Legal domains with promising results.
A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora
Dell'Orletta F;Montemagni S;Venturi G
2010
Abstract
In this paper we present a novel approach to multi-word terminology extraction combining a well-known automatic term recognition approach, the C-NC value method, with a contrastive ranking technique, aimed at refining obtained results either by filtering noise due to common words or by discerning between semantically different types of terms within heterogeneous terminologies. The proposed methodology has been tested in two case studies carried out in the History of Art and Legal domains with promising results.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Bonin F | it |
| dc.authority.people | Dell'Orletta F | it |
| dc.authority.people | Montemagni S | it |
| dc.authority.people | Venturi G | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/19 20:06:32 | - |
| dc.date.available | 2024/02/19 20:06:32 | - |
| dc.date.issued | 2010 | - |
| dc.description.abstracteng | In this paper we present a novel approach to multi-word terminology extraction combining a well-known automatic term recognition approach, the C-NC value method, with a contrastive ranking technique, aimed at refining obtained results either by filtering noise due to common words or by discerning between semantically different types of terms within heterogeneous terminologies. The proposed methodology has been tested in two case studies carried out in the History of Art and Legal domains with promising results. | - |
| dc.description.affiliations | Bonin F.: Dipartimento di Informatica Pisa, Università di Pisa; Language Interaction and Computation Lab, University of Trento. Dell'Orletta F.; Montemagni S.; Venturi G.: ILC - Istituto di linguistica computazionale "Antonio Zampolli" | - |
| dc.description.allpeople | Bonin, F; Dell'Orletta, F; Montemagni, S; Venturi, G | - |
| dc.description.allpeopleoriginal | Bonin F.; Dell'Orletta F.; Montemagni S.; Venturi G. | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 4 | - |
| dc.identifier.isbn | 2-9517408-6-7 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/65162 | - |
| dc.language.iso | eng | - |
| dc.relation.conferencedate | 19-21 maggio 2010 | - |
| dc.relation.conferencename | Seventh International Conference on Language Resources and Evaluation | - |
| dc.relation.conferenceplace | Valletta, Malta | - |
| dc.relation.firstpage | 3222 | - |
| dc.relation.lastpage | 3229 | - |
| dc.subject.keywords | Terminology Extraction | - |
| dc.subject.keywords | Domain-specific Corpora | - |
| dc.subject.keywords | Multi-word Expression | - |
| dc.subject.singlekeyword | Terminology Extraction | * |
| dc.subject.singlekeyword | Domain-specific Corpora | * |
| dc.subject.singlekeyword | Multi-word Expression | * |
| dc.title | A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 84796 | - |
| iris.isi.extIssued | 2010 | - |
| iris.isi.extTitle | A contrastive Approach to Multi-word Term Extraction from Domain-specific Corpora | - |
| iris.orcid.lastModifiedDate | 2024/04/05 12:44:25 | * |
| iris.orcid.lastModifiedMillisecond | 1712313865432 | * |
| iris.scopus.extIssued | 2010 | - |
| iris.scopus.extTitle | A contrastive approach to multi-word term extraction from domain corpora | - |
| iris.sitodocente.maxattempts | 4 | - |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
File in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


