The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation requires the identication of viewpoints from a large diversity of stakeholders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at assessing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal questions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specic and interactive reference taxonomies (i.e., structured classications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learning of the meaning of technical and domain-specic terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specic term extraction. Furthermore, we crawl Wikipedia to enrich the taxonomies with additional categories and denitions. We plan to validate the taxonomies through fieeld studies within the Living Labs.

Using NLP to support terminology extraction and domain scoping: report on the H2020 DESIRA project

Bacco FM;Dell'Orletta F;Ferrari A
2020

Abstract

The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation requires the identication of viewpoints from a large diversity of stakeholders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at assessing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal questions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specic and interactive reference taxonomies (i.e., structured classications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learning of the meaning of technical and domain-specic terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specic term extraction. Furthermore, we crawl Wikipedia to enrich the taxonomies with additional categories and denitions. We plan to validate the taxonomies through fieeld studies within the Living Labs.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.orgunit Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI -
dc.authority.people Bacco FM it
dc.authority.people Brunori G it
dc.authority.people Dell'Orletta F it
dc.authority.people Ferrari A it
dc.authority.project Digitisation: Economic and Social Impacts in Rural Areas -
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.appartenenza.mi 973 *
dc.date.accessioned 2024/02/17 19:42:49 -
dc.date.available 2024/02/17 19:42:49 -
dc.date.issued 2020 -
dc.description.abstracteng The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation requires the identication of viewpoints from a large diversity of stakeholders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at assessing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal questions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specic and interactive reference taxonomies (i.e., structured classications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learning of the meaning of technical and domain-specic terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specic term extraction. Furthermore, we crawl Wikipedia to enrich the taxonomies with additional categories and denitions. We plan to validate the taxonomies through fieeld studies within the Living Labs. -
dc.description.affiliations CNR-ISTI, Pisa, Italy; University of Pisa, Pisa, Italy; CNR-ILC, Pisa, Italy; CNR-ISTI, Pisa, Italy -
dc.description.allpeople Bacco, Fm; Brunori, G; Dell'Orletta, F; Ferrari, A -
dc.description.allpeopleoriginal Bacco F.M.; Brunori G.; Dell'Orletta F.; Ferrari A. -
dc.description.fulltext open en
dc.description.numberofauthors 4 -
dc.identifier.scopus 2-s2.0-85082691650 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/370407 -
dc.identifier.url http://ceur-ws.org/Vol-2584/ -
dc.language.iso eng -
dc.publisher.country DEU -
dc.publisher.name CEUR-WS.org -
dc.publisher.place Aachen -
dc.relation.conferencedate 24 March 2020 -
dc.relation.conferencename Third Workshop on Natural Language Processing for Requirements Engineering -
dc.relation.conferenceplace Pisa, Italy -
dc.relation.firstpage 1 -
dc.relation.lastpage 5 -
dc.relation.numberofpages 5 -
dc.relation.projectAcronym DESIRA -
dc.relation.projectAwardNumber 818194 -
dc.relation.projectAwardTitle Digitisation: Economic and Social Impacts in Rural Areas -
dc.relation.projectFunderName - en
dc.relation.projectFundingStream H2020 -
dc.subject.keywords NLP -
dc.subject.keywords WIkipedia -
dc.subject.keywords Socio-economic impact -
dc.subject.keywords Taxonomy -
dc.subject.keywords Knowledge graph -
dc.subject.keywords Terminology extraction -
dc.subject.keywords Domain scoping -
dc.subject.singlekeyword NLP *
dc.subject.singlekeyword WIkipedia *
dc.subject.singlekeyword Socio-economic impact *
dc.subject.singlekeyword Taxonomy *
dc.subject.singlekeyword Knowledge graph *
dc.subject.singlekeyword Terminology extraction *
dc.subject.singlekeyword Domain scoping *
dc.title Using NLP to support terminology extraction and domain scoping: report on the H2020 DESIRA project en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 417821 -
iris.mediafilter.data 2025/04/23 04:10:44 *
iris.orcid.lastModifiedDate 2024/04/04 19:42:00 *
iris.orcid.lastModifiedMillisecond 1712252520987 *
iris.scopus.extIssued 2020 -
iris.scopus.extTitle Using NLP to support terminology extraction and domain scoping: Report on the H2020 DESIRA Project -
iris.sitodocente.maxattempts 2 -
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation CNR-ISTI -
scopus.contributor.affiliation University of Pisa -
scopus.contributor.affiliation CNR-ILC -
scopus.contributor.affiliation CNR-ISTI -
scopus.contributor.afid 60085207 -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60021199 -
scopus.contributor.afid 60085207 -
scopus.contributor.auid 57189197221 -
scopus.contributor.auid 57950785200 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 55765001561 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 123211339 -
scopus.contributor.dptid -
scopus.contributor.dptid 121833164 -
scopus.contributor.dptid 124176773 -
scopus.contributor.name Manlio -
scopus.contributor.name Gianluca -
scopus.contributor.name Felice -
scopus.contributor.name Alessio -
scopus.contributor.subaffiliation Wn Lab; -
scopus.contributor.subaffiliation Page;Disaaa; -
scopus.contributor.subaffiliation ItaliaNLP Lab; -
scopus.contributor.subaffiliation Fmt Lab; -
scopus.contributor.surname Bacco -
scopus.contributor.surname Brunori -
scopus.contributor.surname Dell'Orletta -
scopus.contributor.surname Ferrari -
scopus.date.issued 2020 *
scopus.description.abstracteng The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation re- quires the identification of viewpoints from a large diversity of stake- holders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at asses- sing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal ques- tions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specific and interactive reference taxonomies (i.e., structured classifications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learn- ing of the meaning of technical and domain-specific terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specific term extraction. Further- more, we crawl Wikipedia to enrich the taxonomies with additional categories and definitions. We plan to validate the taxonomies through field studies within the Living Labs. *
scopus.description.allpeopleoriginal Bacco M.; Brunori G.; Dell'Orletta F.; Ferrari A. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.conferenceplace *
scopus.differences scopus.relation.volume *
scopus.document.type cp *
scopus.document.types cp *
scopus.funding.funders 100010661 - Horizon 2020 Framework Programme; *
scopus.funding.ids 818194; *
scopus.identifier.pui 631394304 *
scopus.identifier.scopus 2-s2.0-85082691650 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2020 *
scopus.relation.conferencename Joint 26th International Conference on Requirements Engineering: Foundation for Software Quality Workshops, Doctoral Symposium, Live Studies Track, and Poster Track, REFSQ-JP 2020 *
scopus.relation.conferenceplace ita *
scopus.relation.volume 2584 *
scopus.title Using NLP to support terminology extraction and domain scoping: Report on the H2020 DESIRA Project *
scopus.titleeng Using NLP to support terminology extraction and domain scoping: Report on the H2020 DESIRA Project *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_417821-doc_147993.pdf

accesso aperto

Descrizione: Using NLP to support terminology extraction and domain scoping: report on the H2020 DESIRA project
Tipologia: Versione Editoriale (PDF)
Dimensione 501.39 kB
Formato Adobe PDF
501.39 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/370407
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact