Domain portability and adaptation of NLP components and Word Sense Disambiguation systems present new challenges. The difficulties found by supervised systems to adapt might change the way we assess the strengths and weaknesses of supervised and knowledge-based WSD systems. Unfortunately, all existing evaluation datasets for specific domains are lexical-sample corpora. This task presented all-words datasets on the environment domain for WSD in four languages (Chinese, Dutch, English, Italian). 11 teams participated, with supervised and knowledge-based systems, mainly in the English dataset. The results show that in all languages the participants where able to beat the most frequent sense heuristic as estimated from general corpora. The most successful approaches used some sort of supervision in the form of hand-tagged examples from the domain.

SemEval-2010 task 17: All-words word sense disambiguation on a specific domain

Tesconi M;Monachini M;
2010

Abstract

Domain portability and adaptation of NLP components and Word Sense Disambiguation systems present new challenges. The difficulties found by supervised systems to adapt might change the way we assess the strengths and weaknesses of supervised and knowledge-based WSD systems. Unfortunately, all existing evaluation datasets for specific domains are lexical-sample corpora. This task presented all-words datasets on the environment domain for WSD in four languages (Chinese, Dutch, English, Italian). 11 teams participated, with supervised and knowledge-based systems, mainly in the English dataset. The results show that in all languages the participants where able to beat the most frequent sense heuristic as estimated from general corpora. The most successful approaches used some sort of supervision in the form of hand-tagged examples from the domain.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di informatica e telematica - IIT -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Agirre E it
dc.authority.people López De Lacalle O it
dc.authority.people Fellbaum C it
dc.authority.people Hsieh S it
dc.authority.people Tesconi M it
dc.authority.people Monachini M it
dc.authority.people Vossen P it
dc.authority.people Vossen P it
dc.authority.people Segers R it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di informatica e telematica - IIT *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 912 *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/18 19:49:52 -
dc.date.available 2024/02/18 19:49:52 -
dc.date.issued 2010 -
dc.description.abstract Domain portability and adaptation of NLP components and Word Sense Disambiguation systems present new challenges. The difficulties found by supervised systems to adapt might change the way we assess the strengths and weaknesses of supervised and knowledge-based WSD systems. Unfortunately, all existing evaluation datasets for specific domains are lexical-sample corpora. This task presented all-words datasets on the environment domain for WSD in four languages (Chinese, Dutch, English, Italian). 11 teams participated, with supervised and knowledge-based systems, mainly in the English dataset. The results show that in all languages the participants where able to beat the most frequent sense heuristic as estimated from general corpora. The most successful approaches used some sort of supervision in the form of hand-tagged examples from the domain. -
dc.description.affiliations University of the Basque Country, Spain, Princeton University, National Taiwan Normal Univ., CNR-IIT, Pisa, CNR-ILC, Pisa, Vrije Universiteit, Amsterdam -
dc.description.allpeople Agirre E.; López De Lacalle O.; Fellbaum C.; Hsieh S.; Tesconi M.; Monachini M.; Vossen P.; Vossen P.; Segers R. -
dc.description.allpeopleoriginal Agirre E.; López De Lacalle O.; Fellbaum C.; Hsieh S.; Tesconi M.; Monachini M.; Vossen P.; Vossen P.; Segers R. -
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.isbn 978-1-932432-70-1 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/151404 -
dc.language.iso eng -
dc.relation.alleditors Katrin Erk, Carlo Strapparava (eds.) -
dc.relation.conferencedate 15-16 Luglio 2010 -
dc.relation.conferencename ACL 2010- SemEval 2010: 5th International Workshop on Semantic Evaluation -
dc.relation.conferenceplace Uppsala, Sweden -
dc.relation.firstpage 75 -
dc.relation.lastpage 80 -
dc.subject.keywords I.2.7 Natural Language Processing -
dc.subject.keywords Word Sense Disambiguation sy -
dc.subject.keywords Semantic Annotation -
dc.subject.keywords Word-sense disambiguation -
dc.subject.singlekeyword I.2.7 Natural Language Processing *
dc.subject.singlekeyword Word Sense Disambiguation sy *
dc.subject.singlekeyword Semantic Annotation *
dc.subject.singlekeyword Word-sense disambiguation *
dc.title SemEval-2010 task 17: All-words word sense disambiguation on a specific domain en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 172865 -
iris.orcid.lastModifiedDate 2024/03/01 16:36:50 *
iris.orcid.lastModifiedMillisecond 1709307410610 *
iris.scopus.extIssued 2010 -
iris.scopus.extTitle SemEval-2010 task 17: All-words word sense disambiguation on a specific domain -
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/151404
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact