Domain portability and adaptation of NLP components and Word Sense Disambiguation systems present new challenges. The difficulties found by supervised systems to adapt might change the way we assess the strengths and weaknesses of supervised and knowledge-based WSD systems. Unfortunately, all existing evaluation datasets for specific domains are lexical-sample corpora. This task presented all-words datasets on the environment domain for WSD in four languages (Chinese, Dutch, English, Italian). 11 teams participated, with supervised and knowledge-based systems, mainly in the English dataset. The results show that in all languages the participants where able to beat the most frequent sense heuristic as estimated from general corpora. The most successful approaches used some sort of supervision in the form of hand-tagged examples from the domain.
SemEval-2010 task 17: All-words word sense disambiguation on a specific domain
Tesconi M;Monachini M;
2010
Abstract
Domain portability and adaptation of NLP components and Word Sense Disambiguation systems present new challenges. The difficulties found by supervised systems to adapt might change the way we assess the strengths and weaknesses of supervised and knowledge-based WSD systems. Unfortunately, all existing evaluation datasets for specific domains are lexical-sample corpora. This task presented all-words datasets on the environment domain for WSD in four languages (Chinese, Dutch, English, Italian). 11 teams participated, with supervised and knowledge-based systems, mainly in the English dataset. The results show that in all languages the participants where able to beat the most frequent sense heuristic as estimated from general corpora. The most successful approaches used some sort of supervision in the form of hand-tagged examples from the domain.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di informatica e telematica - IIT | - |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Agirre E | it |
| dc.authority.people | López De Lacalle O | it |
| dc.authority.people | Fellbaum C | it |
| dc.authority.people | Hsieh S | it |
| dc.authority.people | Tesconi M | it |
| dc.authority.people | Monachini M | it |
| dc.authority.people | Vossen P | it |
| dc.authority.people | Vossen P | it |
| dc.authority.people | Segers R | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di informatica e telematica - IIT | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 912 | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/18 19:49:52 | - |
| dc.date.available | 2024/02/18 19:49:52 | - |
| dc.date.issued | 2010 | - |
| dc.description.abstract | Domain portability and adaptation of NLP components and Word Sense Disambiguation systems present new challenges. The difficulties found by supervised systems to adapt might change the way we assess the strengths and weaknesses of supervised and knowledge-based WSD systems. Unfortunately, all existing evaluation datasets for specific domains are lexical-sample corpora. This task presented all-words datasets on the environment domain for WSD in four languages (Chinese, Dutch, English, Italian). 11 teams participated, with supervised and knowledge-based systems, mainly in the English dataset. The results show that in all languages the participants where able to beat the most frequent sense heuristic as estimated from general corpora. The most successful approaches used some sort of supervision in the form of hand-tagged examples from the domain. | - |
| dc.description.affiliations | University of the Basque Country, Spain, Princeton University, National Taiwan Normal Univ., CNR-IIT, Pisa, CNR-ILC, Pisa, Vrije Universiteit, Amsterdam | - |
| dc.description.allpeople | Agirre E.; López De Lacalle O.; Fellbaum C.; Hsieh S.; Tesconi M.; Monachini M.; Vossen P.; Vossen P.; Segers R. | - |
| dc.description.allpeopleoriginal | Agirre E.; López De Lacalle O.; Fellbaum C.; Hsieh S.; Tesconi M.; Monachini M.; Vossen P.; Vossen P.; Segers R. | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 2 | - |
| dc.identifier.isbn | 978-1-932432-70-1 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/151404 | - |
| dc.language.iso | eng | - |
| dc.relation.alleditors | Katrin Erk, Carlo Strapparava (eds.) | - |
| dc.relation.conferencedate | 15-16 Luglio 2010 | - |
| dc.relation.conferencename | ACL 2010- SemEval 2010: 5th International Workshop on Semantic Evaluation | - |
| dc.relation.conferenceplace | Uppsala, Sweden | - |
| dc.relation.firstpage | 75 | - |
| dc.relation.lastpage | 80 | - |
| dc.subject.keywords | I.2.7 Natural Language Processing | - |
| dc.subject.keywords | Word Sense Disambiguation sy | - |
| dc.subject.keywords | Semantic Annotation | - |
| dc.subject.keywords | Word-sense disambiguation | - |
| dc.subject.singlekeyword | I.2.7 Natural Language Processing | * |
| dc.subject.singlekeyword | Word Sense Disambiguation sy | * |
| dc.subject.singlekeyword | Semantic Annotation | * |
| dc.subject.singlekeyword | Word-sense disambiguation | * |
| dc.title | SemEval-2010 task 17: All-words word sense disambiguation on a specific domain | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 172865 | - |
| iris.orcid.lastModifiedDate | 2024/03/01 16:36:50 | * |
| iris.orcid.lastModifiedMillisecond | 1709307410610 | * |
| iris.scopus.extIssued | 2010 | - |
| iris.scopus.extTitle | SemEval-2010 task 17: All-words word sense disambiguation on a specific domain | - |
| iris.sitodocente.maxattempts | 1 | - |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


