Over the course of the last few years, lexicography has witnessed the burgeoning of increasingly reliable automaticapproaches supporting the creation of lexicographic resources such as dictionaries, lexical knowledge bases andannotated datasets. In fact, recent achievements in the field of Natural Language Processing and particularly inWord Sense Disambiguation have widely demonstrated their effectiveness not only for the creation of lexicographicresources, but also for enabling a deeper analysis of lexical-semantic data both within and across languages.Nevertheless, we argue that the potential derived from the connections between the two fields is far from exhausted.In this work, we address a serious limitation affecting both lexicography and Word Sense Disambiguation, i.e. thelack of high-quality sense-annotated data and describe our efforts aimed at constructing a novel entirely manuallyannotated parallel dataset in 10 European languages. For the purposes of the present paper, we concentrate on theannotation of morpho-syntactic features. Finally, unlike many of the currently available sense-annotated datasets,we will annotate semantically by using senses derived from high-quality lexicographic repositories.
Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages
Quochi;Valeria;Monachini;Monica;Frontini;Francesca;
2021
Abstract
Over the course of the last few years, lexicography has witnessed the burgeoning of increasingly reliable automaticapproaches supporting the creation of lexicographic resources such as dictionaries, lexical knowledge bases andannotated datasets. In fact, recent achievements in the field of Natural Language Processing and particularly inWord Sense Disambiguation have widely demonstrated their effectiveness not only for the creation of lexicographicresources, but also for enabling a deeper analysis of lexical-semantic data both within and across languages.Nevertheless, we argue that the potential derived from the connections between the two fields is far from exhausted.In this work, we address a serious limitation affecting both lexicography and Word Sense Disambiguation, i.e. thelack of high-quality sense-annotated data and describe our efforts aimed at constructing a novel entirely manuallyannotated parallel dataset in 10 European languages. For the purposes of the present paper, we concentrate on theannotation of morpho-syntactic features. Finally, unlike many of the currently available sense-annotated datasets,we will annotate semantically by using senses derived from high-quality lexicographic repositories.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.anceserie | ELECTRONIC LEXICOGRAPHY IN THE 21ST CENTURY. PROCEEDINGS OF ELEX ... CONFERENCE | en |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Martelli | en |
| dc.authority.people | Federico | en |
| dc.authority.people | Navigli | en |
| dc.authority.people | Roberto | en |
| dc.authority.people | Krek | en |
| dc.authority.people | Simon | en |
| dc.authority.people | Tiberius | en |
| dc.authority.people | Carole | en |
| dc.authority.people | Kallas | en |
| dc.authority.people | Jelena | en |
| dc.authority.people | Gantar | en |
| dc.authority.people | Polona | en |
| dc.authority.people | Koeva | en |
| dc.authority.people | Svetla | en |
| dc.authority.people | Nimb | en |
| dc.authority.people | Sanni | en |
| dc.authority.people | Pedersen | en |
| dc.authority.people | Bolette Sandford | en |
| dc.authority.people | Olsen | en |
| dc.authority.people | Sussi | en |
| dc.authority.people | Langements | en |
| dc.authority.people | Margit | en |
| dc.authority.people | Koppel | en |
| dc.authority.people | Kristina | en |
| dc.authority.people | ksik | en |
| dc.authority.people | Tiiu | en |
| dc.authority.people | Dobrovolijc | en |
| dc.authority.people | Kaja | en |
| dc.authority.people | UreaRuiz | en |
| dc.authority.people | RafaelJ | en |
| dc.authority.people | SanchoSnchez | en |
| dc.authority.people | JosLuis | en |
| dc.authority.people | Lipp | en |
| dc.authority.people | Veronika | en |
| dc.authority.people | Varadi | en |
| dc.authority.people | Tamas | en |
| dc.authority.people | Gyrffy | en |
| dc.authority.people | Andrs | en |
| dc.authority.people | Lszl | en |
| dc.authority.people | Simon | en |
| dc.authority.people | Quochi | en |
| dc.authority.people | Valeria | en |
| dc.authority.people | Monachini | en |
| dc.authority.people | Monica | en |
| dc.authority.people | Frontini | en |
| dc.authority.people | Francesca | en |
| dc.authority.people | Tempelaars | en |
| dc.authority.people | Rob | en |
| dc.authority.people | Costa | en |
| dc.authority.people | Rute | en |
| dc.authority.people | Salgado | en |
| dc.authority.people | Ana | en |
| dc.authority.people | ibej | en |
| dc.authority.people | Jaka | en |
| dc.authority.people | Munda | en |
| dc.authority.people | Tina | en |
| dc.authority.project | European Lexicographic Infrastructure | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2024/02/21 04:33:16 | - |
| dc.date.available | 2024/02/21 04:33:16 | - |
| dc.date.firstsubmission | 2025/01/31 17:43:50 | * |
| dc.date.issued | 2021 | - |
| dc.date.submission | 2025/03/03 12:18:54 | * |
| dc.description.abstracteng | Over the course of the last few years, lexicography has witnessed the burgeoning of increasingly reliable automaticapproaches supporting the creation of lexicographic resources such as dictionaries, lexical knowledge bases andannotated datasets. In fact, recent achievements in the field of Natural Language Processing and particularly inWord Sense Disambiguation have widely demonstrated their effectiveness not only for the creation of lexicographicresources, but also for enabling a deeper analysis of lexical-semantic data both within and across languages.Nevertheless, we argue that the potential derived from the connections between the two fields is far from exhausted.In this work, we address a serious limitation affecting both lexicography and Word Sense Disambiguation, i.e. thelack of high-quality sense-annotated data and describe our efforts aimed at constructing a novel entirely manuallyannotated parallel dataset in 10 European languages. For the purposes of the present paper, we concentrate on theannotation of morpho-syntactic features. Finally, unlike many of the currently available sense-annotated datasets,we will annotate semantically by using senses derived from high-quality lexicographic repositories. | - |
| dc.description.affiliations | Sapienza NLP Group, Department of Computer Science, Sapienza University of Rome, Italy Artificial Intelligence Laboratory, Jo?ef Stefan Institute, Slovenia Institute of the Estonian Language, Estonia Faculty of Arts, University of Ljubljana, Slovenia Institute for Bulgarian Language, Bulgarian Academy of Sciences, Bulgaria NOVA CLUNL, Centro de Linguística da Universidade NOVA de Lisboa, Portugal Academia das Ciências de Lisboa, Portugal University of Copenhagen, Denmark Centro de Estudios de la Real Academia Española, Spain Society for Danish Language and Literature, Copenhagen, Denmark Hungarian Research Centre for Linguistics, Institute for Lexicology, Hungary Hungarian Research Centre for Linguistics, Institute for Language Technologies and Applied Linguistics, Hungary Instituut voor de Nederlandse Taal, The Netherlands Istituto di Linguistica Computazionale "A.Zampolli", Centro Nazionale delle Ricerche, Italy | - |
| dc.description.allpeople | Martelli, ; Federico, ; Navigli, ; Roberto, ; Krek, ; Simon, ; Tiberius, ; Carole, ; Kallas, ; Jelena, ; Gantar, ; Polona, ; Koeva, ; Svetla, ; Nimb, ; Sanni, ; Pedersen, ; Bolette, Sandford; Olsen, ; Sussi, ; Langements, ; Margit, ; Koppel, ; Kristina, ; Ksik, ; Tiiu, ; Dobrovolijc, ; Kaja, ; Urearuiz, ; Rafaelj, ; Sanchosnchez, ; Josluis, ; Lipp, ; Veronika, ; Varadi, ; Tamas, ; Gyrffy, ; Andrs, ; Lszl, ; Simon, ; Quochi, Valeria; Quochi, Valeria; Monachini, Monica; Monachini, Monica; Frontini, Francesca; Frontini, Francesca; Tempelaars, ; Rob, ; Costa, ; Rute, ; Salgado, ; Ana, ; Ibej, ; Jaka, ; Munda, ; Tina, | - |
| dc.description.allpeopleoriginal | Martelli, Federico and Navigli, Roberto and Krek, Simon and Tiberius, Carole and Kallas, Jelena and Gantar, Polona and Koeva, Svetla and Nimb, Sanni and Pedersen, Bolette Sandford and Olsen, Sussi and Langements, Margit and Koppel, Kristina and ?ksik, Tiiu and Dobrovolijc, Kaja and Ure?a-Ruiz, Rafael-J. and Sancho-S?nchez, Jos?-Luis and Lipp, Veronika and Varadi, Tamas and Gy?rffy, Andr?s and L?szl?, Simon and Quochi, Valeria and Monachini, Monica and Frontini, Francesca and Tempelaars, Rob and Costa, Rute and Salgado, Ana and ?ibej, Jaka and Munda, Tina | en |
| dc.description.fulltext | open | en |
| dc.description.international | si | en |
| dc.description.numberofauthors | 56 | - |
| dc.identifier.scopus | 2-s2.0-85137076090 | en |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/443238 | - |
| dc.identifier.url | https://static-curis.ku.dk/portal/files/279888836/eLex_2021_22_pp377_395.pdf | en |
| dc.language.iso | eng | en |
| dc.miur.last.status.update | 2025-02-14T14:20:27Z | * |
| dc.publisher.country | CZE | en |
| dc.publisher.name | Lexical Computing | en |
| dc.publisher.place | Brno | en |
| dc.relation.allauthors | Kosem, I., Cukr, M., Jakubíček, M., Kallas, J., Krek, S., and Tiberius, C. | en |
| dc.relation.conferencedate | 05/-7/2021-07/07/2021 | en |
| dc.relation.conferencename | eLex 2021 | en |
| dc.relation.conferenceplace | Virtuale | en |
| dc.relation.firstpage | 377 | en |
| dc.relation.ispartofbook | Electronic lexicography in the 21st century (eLex 2021): Post-editing lexicography | en |
| dc.relation.lastpage | 395 | en |
| dc.relation.medium | ELETTRONICO | en |
| dc.relation.numberofpages | 19 | en |
| dc.relation.projectAcronym | ELEXIS | en |
| dc.relation.projectAwardNumber | 731015 | en |
| dc.relation.projectAwardTitle | European Lexicographic Infrastructure | en |
| dc.relation.projectFunderName | European Commission | en |
| dc.relation.projectFundingStream | H2020 | en |
| dc.relation.volume | 2021 | en |
| dc.subject.keywordseng | Digital lexicography | - |
| dc.subject.keywordseng | Word Sense Disambiguation | - |
| dc.subject.keywordseng | Computational Linguistics | - |
| dc.subject.keywordseng | Corpus Linguistics | - |
| dc.subject.keywordseng | Natural Language Processing | - |
| dc.subject.singlekeyword | Digital lexicography | * |
| dc.subject.singlekeyword | Word Sense Disambiguation | * |
| dc.subject.singlekeyword | Computational Linguistics | * |
| dc.subject.singlekeyword | Corpus Linguistics | * |
| dc.subject.singlekeyword | Natural Language Processing | * |
| dc.title | Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages | en |
| dc.type.circulation | Internazionale | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.impactfactor | si | en |
| dc.type.invited | contributo | en |
| dc.type.miur | 273 | - |
| dc.type.referee | Esperti anonimi | en |
| dc.ugov.descaux1 | 461705 | - |
| dc.ugov.descaux2 | CC BY-SA | - |
| iris.mediafilter.data | 2025/04/04 04:37:26 | * |
| iris.orcid.lastModifiedDate | 2025/03/05 11:39:00 | * |
| iris.orcid.lastModifiedMillisecond | 1741171140042 | * |
| iris.scopus.extIssued | 2021 | - |
| iris.scopus.extTitle | Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages | - |
| iris.sitodocente.maxattempts | 1 | - |
| scopus.authority.anceserie | ELECTRONIC LEXICOGRAPHY IN THE 21ST CENTURY. PROCEEDINGS OF ELEX ... CONFERENCE###2533-5626 | * |
| scopus.category | 1203 | * |
| scopus.category | 3310 | * |
| scopus.contributor.affiliation | Sapienza University of Rome | - |
| scopus.contributor.affiliation | Sapienza University of Rome | - |
| scopus.contributor.affiliation | Jožef Stefan Institute | - |
| scopus.contributor.affiliation | Institute of the Estonian Language | - |
| scopus.contributor.affiliation | University of Ljubljana | - |
| scopus.contributor.affiliation | Bulgarian Academy of Sciences | - |
| scopus.contributor.affiliation | Society for Danish Language and Literature | - |
| scopus.contributor.affiliation | University of Copenhagen | - |
| scopus.contributor.affiliation | University of Copenhagen | - |
| scopus.contributor.affiliation | Institute of the Estonian Language | - |
| scopus.contributor.affiliation | Institute of the Estonian Language | - |
| scopus.contributor.affiliation | Institute of the Estonian Language | - |
| scopus.contributor.affiliation | Jožef Stefan Institute | - |
| scopus.contributor.affiliation | Centro de Estudios de la Real Academia Española | - |
| scopus.contributor.affiliation | Centro de Estudios de la Real Academia Española | - |
| scopus.contributor.affiliation | Institute for Lexicology | - |
| scopus.contributor.affiliation | Institute for Language Technologies and Applied Linguistics | - |
| scopus.contributor.affiliation | Institute for Lexicology | - |
| scopus.contributor.affiliation | Institute for Lexicology | - |
| scopus.contributor.affiliation | Centro Nazionale delle Ricerche | - |
| scopus.contributor.affiliation | Centro Nazionale delle Ricerche | - |
| scopus.contributor.affiliation | Centro Nazionale delle Ricerche | - |
| scopus.contributor.affiliation | Instituut voor de Nederlandse Taal | - |
| scopus.contributor.affiliation | Instituut voor de Nederlandse Taal | - |
| scopus.contributor.affiliation | Universidade NOVA de Lisboa | - |
| scopus.contributor.affiliation | Academia das Ciências de Lisboa | - |
| scopus.contributor.affiliation | Jožef Stefan Institute | - |
| scopus.contributor.affiliation | Jožef Stefan Institute | - |
| scopus.contributor.afid | 60032350 | - |
| scopus.contributor.afid | 60032350 | - |
| scopus.contributor.afid | 60023955 | - |
| scopus.contributor.afid | 60104696 | - |
| scopus.contributor.afid | 60031106 | - |
| scopus.contributor.afid | 60024147 | - |
| scopus.contributor.afid | 120383809 | - |
| scopus.contributor.afid | 60030840 | - |
| scopus.contributor.afid | 60030840 | - |
| scopus.contributor.afid | 60104696 | - |
| scopus.contributor.afid | 60104696 | - |
| scopus.contributor.afid | 60104696 | - |
| scopus.contributor.afid | 60023955 | - |
| scopus.contributor.afid | 127035857 | - |
| scopus.contributor.afid | 127035857 | - |
| scopus.contributor.afid | 60020907 | - |
| scopus.contributor.afid | 60020907 | - |
| scopus.contributor.afid | 60020907 | - |
| scopus.contributor.afid | 60020907 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 127316562 | - |
| scopus.contributor.afid | 127316562 | - |
| scopus.contributor.afid | 60031875 | - |
| scopus.contributor.afid | 113155708 | - |
| scopus.contributor.afid | 60023955 | - |
| scopus.contributor.afid | 60023955 | - |
| scopus.contributor.auid | 57210165202 | - |
| scopus.contributor.auid | 6507102454 | - |
| scopus.contributor.auid | 55581031400 | - |
| scopus.contributor.auid | 53871611800 | - |
| scopus.contributor.auid | 55699375300 | - |
| scopus.contributor.auid | 23090522300 | - |
| scopus.contributor.auid | 31967847600 | - |
| scopus.contributor.auid | 7201713480 | - |
| scopus.contributor.auid | 57188766062 | - |
| scopus.contributor.auid | 36061015300 | - |
| scopus.contributor.auid | 57192269211 | - |
| scopus.contributor.auid | 57357398800 | - |
| scopus.contributor.auid | 56888719100 | - |
| scopus.contributor.auid | 56150844000 | - |
| scopus.contributor.auid | 57870318200 | - |
| scopus.contributor.auid | 57220028718 | - |
| scopus.contributor.auid | 55368420100 | - |
| scopus.contributor.auid | 57220032553 | - |
| scopus.contributor.auid | 57220027306 | - |
| scopus.contributor.auid | 34977412400 | - |
| scopus.contributor.auid | 23397766600 | - |
| scopus.contributor.auid | 55162070400 | - |
| scopus.contributor.auid | 26632410700 | - |
| scopus.contributor.auid | 57869723600 | - |
| scopus.contributor.auid | 55958196000 | - |
| scopus.contributor.auid | 57198203815 | - |
| scopus.contributor.auid | 57003329500 | - |
| scopus.contributor.auid | 57869566500 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Slovenia | - |
| scopus.contributor.country | Estonia | - |
| scopus.contributor.country | Slovenia | - |
| scopus.contributor.country | Bulgaria | - |
| scopus.contributor.country | Denmark | - |
| scopus.contributor.country | Denmark | - |
| scopus.contributor.country | Denmark | - |
| scopus.contributor.country | Estonia | - |
| scopus.contributor.country | Estonia | - |
| scopus.contributor.country | Estonia | - |
| scopus.contributor.country | Slovenia | - |
| scopus.contributor.country | Spain | - |
| scopus.contributor.country | Spain | - |
| scopus.contributor.country | Hungary | - |
| scopus.contributor.country | Hungary | - |
| scopus.contributor.country | Hungary | - |
| scopus.contributor.country | Hungary | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Netherlands | - |
| scopus.contributor.country | Netherlands | - |
| scopus.contributor.country | Portugal | - |
| scopus.contributor.country | Portugal | - |
| scopus.contributor.country | Slovenia | - |
| scopus.contributor.country | Slovenia | - |
| scopus.contributor.dptid | 113210625 | - |
| scopus.contributor.dptid | 113210625 | - |
| scopus.contributor.dptid | 105775097 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | 116279878 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | 105775097 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | 107060993 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | 105775097 | - |
| scopus.contributor.dptid | 105775097 | - |
| scopus.contributor.name | Federico | - |
| scopus.contributor.name | Roberto | - |
| scopus.contributor.name | Simon | - |
| scopus.contributor.name | Jelena | - |
| scopus.contributor.name | Polona | - |
| scopus.contributor.name | Svetla | - |
| scopus.contributor.name | Sanni | - |
| scopus.contributor.name | Bolette Sandford | - |
| scopus.contributor.name | Sussi | - |
| scopus.contributor.name | Margit | - |
| scopus.contributor.name | Kristina | - |
| scopus.contributor.name | Tiiu | - |
| scopus.contributor.name | Kaja | - |
| scopus.contributor.name | Rafael J. | - |
| scopus.contributor.name | José-Luis | - |
| scopus.contributor.name | Veronika | - |
| scopus.contributor.name | Tamás | - |
| scopus.contributor.name | András | - |
| scopus.contributor.name | Simon | - |
| scopus.contributor.name | Valeria | - |
| scopus.contributor.name | Monica | - |
| scopus.contributor.name | Francesca | - |
| scopus.contributor.name | Carole | - |
| scopus.contributor.name | Rob | - |
| scopus.contributor.name | Rute | - |
| scopus.contributor.name | Ana | - |
| scopus.contributor.name | Jaka | - |
| scopus.contributor.name | Tina | - |
| scopus.contributor.subaffiliation | Sapienza NLP Group;Department of Computer Science; | - |
| scopus.contributor.subaffiliation | Sapienza NLP Group;Department of Computer Science; | - |
| scopus.contributor.subaffiliation | Artificial Intelligence Laboratory; | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | Faculty of Arts; | - |
| scopus.contributor.subaffiliation | Institute for Bulgarian Language; | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | Artificial Intelligence Laboratory; | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | Hungarian Research Centre for Linguistics; | - |
| scopus.contributor.subaffiliation | Hungarian Research Centre for Linguistics; | - |
| scopus.contributor.subaffiliation | Hungarian Research Centre for Linguistics; | - |
| scopus.contributor.subaffiliation | Hungarian Research Centre for Linguistics; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale "A. Zampolli"; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale "A. Zampolli"; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale "A. Zampolli"; | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | NOVA CLUNL;Centro de Linguística; | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | Artificial Intelligence Laboratory; | - |
| scopus.contributor.subaffiliation | Artificial Intelligence Laboratory; | - |
| scopus.contributor.surname | Martelli | - |
| scopus.contributor.surname | Navigli | - |
| scopus.contributor.surname | Krek | - |
| scopus.contributor.surname | Kallas | - |
| scopus.contributor.surname | Gantar | - |
| scopus.contributor.surname | Koeva | - |
| scopus.contributor.surname | Nimb | - |
| scopus.contributor.surname | Pedersen | - |
| scopus.contributor.surname | Olsen | - |
| scopus.contributor.surname | Langemets | - |
| scopus.contributor.surname | Koppel | - |
| scopus.contributor.surname | Üksik | - |
| scopus.contributor.surname | Dobrovoljc | - |
| scopus.contributor.surname | Ureña-Ruiz | - |
| scopus.contributor.surname | Sancho-Sánchez | - |
| scopus.contributor.surname | Lipp | - |
| scopus.contributor.surname | Váradi | - |
| scopus.contributor.surname | Győrffy | - |
| scopus.contributor.surname | László | - |
| scopus.contributor.surname | Quochi | - |
| scopus.contributor.surname | Monachini | - |
| scopus.contributor.surname | Frontini | - |
| scopus.contributor.surname | Tiberius | - |
| scopus.contributor.surname | Tempelaars | - |
| scopus.contributor.surname | Costa | - |
| scopus.contributor.surname | Salgado | - |
| scopus.contributor.surname | Čibej | - |
| scopus.contributor.surname | Munda | - |
| scopus.date.issued | 2021 | * |
| scopus.description.abstracteng | Over the course of the last few years, lexicography has witnessed the burgeoning of increasingly reliable automatic approaches supporting the creation of lexicographic resources such as dictionaries, lexical knowledge bases and annotated datasets. In fact, recent achievements in the field of Natural Language Processing and particularly in Word Sense Disambiguation have widely demonstrated their effectiveness not only for the creation of lexicographic resources, but also for enabling a deeper analysis of lexical-semantic data both within and across languages. Nevertheless, we argue that the potential derived from the connections between the two fields is far from exhausted. In this work, we address a serious limitation affecting both lexicography and Word Sense Disambiguation, i.e. the lack of high-quality sense-annotated data and describe our efforts aimed at constructing a novel entirely manually annotated parallel dataset in 10 European languages. For the purposes of the present paper, we concentrate on the annotation of morpho-syntactic features. Finally, unlike many of the currently available sense-annotated datasets, we will annotate semantically by using senses derived from high-quality lexicographic repositories. | * |
| scopus.description.allpeopleoriginal | Martelli F.; Navigli R.; Krek S.; Kallas J.; Gantar P.; Koeva S.; Nimb S.; Pedersen B.S.; Olsen S.; Langemets M.; Koppel K.; Uksik T.; Dobrovoljc K.; Urena-Ruiz R.J.; Sancho-Sanchez J.-L.; Lipp V.; Varadi T.; Gyorffy A.; Laszlo S.; Quochi V.; Monachini M.; Frontini F.; Tiberius C.; Tempelaars R.; Costa R.; Salgado A.; Cibej J.; Munda T. | * |
| scopus.differences | scopus.relation.conferencename | * |
| scopus.differences | scopus.publisher.name | * |
| scopus.differences | scopus.subject.keywords | * |
| scopus.differences | scopus.relation.conferencedate | * |
| scopus.differences | scopus.description.allpeopleoriginal | * |
| scopus.differences | scopus.description.abstracteng | * |
| scopus.differences | scopus.relation.volume | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.funding.funders | 100010661 - Horizon 2020 Framework Programme; 501100001871 - Fundação para a Ciência e a Tecnologia; 501100007601 - Horizon 2020; | * |
| scopus.identifier.eissn | 2533-5626 | * |
| scopus.identifier.pui | 638912671 | * |
| scopus.identifier.scopus | 2-s2.0-85137076090 | * |
| scopus.journal.sourceid | 21100936369 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | Lexical Computing CZ s.r.o. | * |
| scopus.relation.conferencedate | 2021 | * |
| scopus.relation.conferencename | 7th Biennial Conference on Electronic Lexicography, eLex 2021 | * |
| scopus.relation.firstpage | 377 | * |
| scopus.relation.lastpage | 395 | * |
| scopus.relation.volume | 2021- | * |
| scopus.subject.keywords | Digital lexicography; Natural Language Processing, Computational Linguistics, Corpus Linguistics; Word Sense Disambiguation; | * |
| scopus.title | Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages | * |
| scopus.titleeng | Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
prod_461705-doc_180174.pdf
accesso aperto
Descrizione: Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
587.64 kB
Formato
Adobe PDF
|
587.64 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


