The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation requires the identication of viewpoints from a large diversity of stakeholders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at assessing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal questions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specic and interactive reference taxonomies (i.e., structured classications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learning of the meaning of technical and domain-specic terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specic term extraction. Furthermore, we crawl Wikipedia to enrich the taxonomies with additional categories and denitions. We plan to validate the taxonomies through fieeld studies within the Living Labs.
Using NLP to support terminology extraction and domain scoping: report on the H2020 DESIRA project
Bacco FM;Dell'Orletta F;Ferrari A
2020
Abstract
The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation requires the identication of viewpoints from a large diversity of stakeholders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at assessing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal questions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specic and interactive reference taxonomies (i.e., structured classications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learning of the meaning of technical and domain-specic terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specic term extraction. Furthermore, we crawl Wikipedia to enrich the taxonomies with additional categories and denitions. We plan to validate the taxonomies through fieeld studies within the Living Labs.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.orgunit | Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI | - |
| dc.authority.people | Bacco FM | it |
| dc.authority.people | Brunori G | it |
| dc.authority.people | Dell'Orletta F | it |
| dc.authority.people | Ferrari A | it |
| dc.authority.project | Digitisation: Economic and Social Impacts in Rural Areas | - |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.appartenenza.mi | 973 | * |
| dc.date.accessioned | 2024/02/17 19:42:49 | - |
| dc.date.available | 2024/02/17 19:42:49 | - |
| dc.date.issued | 2020 | - |
| dc.description.abstracteng | The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation requires the identication of viewpoints from a large diversity of stakeholders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at assessing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal questions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specic and interactive reference taxonomies (i.e., structured classications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learning of the meaning of technical and domain-specic terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specic term extraction. Furthermore, we crawl Wikipedia to enrich the taxonomies with additional categories and denitions. We plan to validate the taxonomies through fieeld studies within the Living Labs. | - |
| dc.description.affiliations | CNR-ISTI, Pisa, Italy; University of Pisa, Pisa, Italy; CNR-ILC, Pisa, Italy; CNR-ISTI, Pisa, Italy | - |
| dc.description.allpeople | Bacco, Fm; Brunori, G; Dell'Orletta, F; Ferrari, A | - |
| dc.description.allpeopleoriginal | Bacco F.M.; Brunori G.; Dell'Orletta F.; Ferrari A. | - |
| dc.description.fulltext | open | en |
| dc.description.numberofauthors | 4 | - |
| dc.identifier.scopus | 2-s2.0-85082691650 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/370407 | - |
| dc.identifier.url | http://ceur-ws.org/Vol-2584/ | - |
| dc.language.iso | eng | - |
| dc.publisher.country | DEU | - |
| dc.publisher.name | CEUR-WS.org | - |
| dc.publisher.place | Aachen | - |
| dc.relation.conferencedate | 24 March 2020 | - |
| dc.relation.conferencename | Third Workshop on Natural Language Processing for Requirements Engineering | - |
| dc.relation.conferenceplace | Pisa, Italy | - |
| dc.relation.firstpage | 1 | - |
| dc.relation.lastpage | 5 | - |
| dc.relation.numberofpages | 5 | - |
| dc.relation.projectAcronym | DESIRA | - |
| dc.relation.projectAwardNumber | 818194 | - |
| dc.relation.projectAwardTitle | Digitisation: Economic and Social Impacts in Rural Areas | - |
| dc.relation.projectFunderName | - | en |
| dc.relation.projectFundingStream | H2020 | - |
| dc.subject.keywords | NLP | - |
| dc.subject.keywords | WIkipedia | - |
| dc.subject.keywords | Socio-economic impact | - |
| dc.subject.keywords | Taxonomy | - |
| dc.subject.keywords | Knowledge graph | - |
| dc.subject.keywords | Terminology extraction | - |
| dc.subject.keywords | Domain scoping | - |
| dc.subject.singlekeyword | NLP | * |
| dc.subject.singlekeyword | WIkipedia | * |
| dc.subject.singlekeyword | Socio-economic impact | * |
| dc.subject.singlekeyword | Taxonomy | * |
| dc.subject.singlekeyword | Knowledge graph | * |
| dc.subject.singlekeyword | Terminology extraction | * |
| dc.subject.singlekeyword | Domain scoping | * |
| dc.title | Using NLP to support terminology extraction and domain scoping: report on the H2020 DESIRA project | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 417821 | - |
| iris.mediafilter.data | 2025/04/23 04:10:44 | * |
| iris.orcid.lastModifiedDate | 2024/04/04 19:42:00 | * |
| iris.orcid.lastModifiedMillisecond | 1712252520987 | * |
| iris.scopus.extIssued | 2020 | - |
| iris.scopus.extTitle | Using NLP to support terminology extraction and domain scoping: Report on the H2020 DESIRA Project | - |
| iris.sitodocente.maxattempts | 2 | - |
| scopus.authority.anceserie | CEUR WORKSHOP PROCEEDINGS###1613-0073 | * |
| scopus.category | 1700 | * |
| scopus.contributor.affiliation | CNR-ISTI | - |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | CNR-ILC | - |
| scopus.contributor.affiliation | CNR-ISTI | - |
| scopus.contributor.afid | 60085207 | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 60021199 | - |
| scopus.contributor.afid | 60085207 | - |
| scopus.contributor.auid | 57189197221 | - |
| scopus.contributor.auid | 57950785200 | - |
| scopus.contributor.auid | 57540567000 | - |
| scopus.contributor.auid | 55765001561 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | 123211339 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | 121833164 | - |
| scopus.contributor.dptid | 124176773 | - |
| scopus.contributor.name | Manlio | - |
| scopus.contributor.name | Gianluca | - |
| scopus.contributor.name | Felice | - |
| scopus.contributor.name | Alessio | - |
| scopus.contributor.subaffiliation | Wn Lab; | - |
| scopus.contributor.subaffiliation | Page;Disaaa; | - |
| scopus.contributor.subaffiliation | ItaliaNLP Lab; | - |
| scopus.contributor.subaffiliation | Fmt Lab; | - |
| scopus.contributor.surname | Bacco | - |
| scopus.contributor.surname | Brunori | - |
| scopus.contributor.surname | Dell'Orletta | - |
| scopus.contributor.surname | Ferrari | - |
| scopus.date.issued | 2020 | * |
| scopus.description.abstracteng | The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation re- quires the identification of viewpoints from a large diversity of stake- holders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at asses- sing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal ques- tions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specific and interactive reference taxonomies (i.e., structured classifications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learn- ing of the meaning of technical and domain-specific terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specific term extraction. Further- more, we crawl Wikipedia to enrich the taxonomies with additional categories and definitions. We plan to validate the taxonomies through field studies within the Living Labs. | * |
| scopus.description.allpeopleoriginal | Bacco M.; Brunori G.; Dell'Orletta F.; Ferrari A. | * |
| scopus.differences | scopus.relation.conferencename | * |
| scopus.differences | scopus.authority.anceserie | * |
| scopus.differences | scopus.publisher.name | * |
| scopus.differences | scopus.relation.conferencedate | * |
| scopus.differences | scopus.description.allpeopleoriginal | * |
| scopus.differences | scopus.description.abstracteng | * |
| scopus.differences | scopus.relation.conferenceplace | * |
| scopus.differences | scopus.relation.volume | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.funding.funders | 100010661 - Horizon 2020 Framework Programme; | * |
| scopus.funding.ids | 818194; | * |
| scopus.identifier.pui | 631394304 | * |
| scopus.identifier.scopus | 2-s2.0-85082691650 | * |
| scopus.journal.sourceid | 21100218356 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | CEUR-WS | * |
| scopus.relation.conferencedate | 2020 | * |
| scopus.relation.conferencename | Joint 26th International Conference on Requirements Engineering: Foundation for Software Quality Workshops, Doctoral Symposium, Live Studies Track, and Poster Track, REFSQ-JP 2020 | * |
| scopus.relation.conferenceplace | ita | * |
| scopus.relation.volume | 2584 | * |
| scopus.title | Using NLP to support terminology extraction and domain scoping: Report on the H2020 DESIRA Project | * |
| scopus.titleeng | Using NLP to support terminology extraction and domain scoping: Report on the H2020 DESIRA Project | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
prod_417821-doc_147993.pdf
accesso aperto
Descrizione: Using NLP to support terminology extraction and domain scoping: report on the H2020 DESIRA project
Tipologia:
Versione Editoriale (PDF)
Dimensione
501.39 kB
Formato
Adobe PDF
|
501.39 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


