This paper presents the results of a study on textual resources in the field of Human Language Technology (HLT). A statistical representation of the most significant terms in HLT and other interrelated disciplines associates old and new words, highlighting the terminological changes that have taken place in the course of time. Aim of our study is to contribute to the creation of language resources for the extraction of documentation coming from the Web in order to help preventing the disappearance of documents containing HLT words that have undergone rapid development over the last decades. This paper is organised as follows: after a general introduction to our work, section 2 provides a historical overview of HLT; sections 3 and 4 offer an account of the most relevant terms used by specialists in different periods, and those indicative of the changes that have taken place; section 5 describes the methodology we have used and also contains information on our database and a graphical representation of the data. Finally, the conclusions stress the need to integrate pre-existing or obsolete words and expressions, creating HLT synonym relations.

Terminology Extraction from the web

Sassi M;Pardelli G;Goggi S
2009

Abstract

This paper presents the results of a study on textual resources in the field of Human Language Technology (HLT). A statistical representation of the most significant terms in HLT and other interrelated disciplines associates old and new words, highlighting the terminological changes that have taken place in the course of time. Aim of our study is to contribute to the creation of language resources for the extraction of documentation coming from the Web in order to help preventing the disappearance of documents containing HLT words that have undergone rapid development over the last decades. This paper is organised as follows: after a general introduction to our work, section 2 provides a historical overview of HLT; sections 3 and 4 offer an account of the most relevant terms used by specialists in different periods, and those indicative of the changes that have taken place; section 5 describes the methodology we have used and also contains information on our database and a graphical representation of the data. Finally, the conclusions stress the need to integrate pre-existing or obsolete words and expressions, creating HLT synonym relations.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Sassi M it
dc.authority.people Pardelli G it
dc.authority.people Goggi S it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 19:56:09 -
dc.date.available 2024/02/19 19:56:09 -
dc.date.issued 2009 -
dc.description.abstracteng This paper presents the results of a study on textual resources in the field of Human Language Technology (HLT). A statistical representation of the most significant terms in HLT and other interrelated disciplines associates old and new words, highlighting the terminological changes that have taken place in the course of time. Aim of our study is to contribute to the creation of language resources for the extraction of documentation coming from the Web in order to help preventing the disappearance of documents containing HLT words that have undergone rapid development over the last decades. This paper is organised as follows: after a general introduction to our work, section 2 provides a historical overview of HLT; sections 3 and 4 offer an account of the most relevant terms used by specialists in different periods, and those indicative of the changes that have taken place; section 5 describes the methodology we have used and also contains information on our database and a graphical representation of the data. Finally, the conclusions stress the need to integrate pre-existing or obsolete words and expressions, creating HLT synonym relations. -
dc.description.affiliations CNR-ILC Pisa -
dc.description.allpeople Sassi, M; Pardelli, G; Goggi, S -
dc.description.allpeopleoriginal Sassi M.; Pardelli G.; Goggi S. -
dc.description.fulltext none en
dc.description.numberofauthors 3 -
dc.identifier.isbn 978-83-7177-746-2 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/65130 -
dc.language.iso eng -
dc.relation.alleditors Zygmunt Vetulani (ed.) -
dc.relation.conferencedate November 6-8, 2009 -
dc.relation.conferencename 4th Language Technology Conference: Human Language Technology as a challenge for Computer Science and Linguistics -
dc.relation.conferenceplace Poznan, PL -
dc.relation.firstpage 417 -
dc.relation.lastpage 420 -
dc.relation.numberofpages 4 -
dc.subject.keywords Terminology -
dc.subject.keywords Computational Linguistics -
dc.subject.keywords Web-based information -
dc.subject.singlekeyword Terminology *
dc.subject.singlekeyword Computational Linguistics *
dc.subject.singlekeyword Web-based information *
dc.title Terminology Extraction from the web en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 84757 -
iris.orcid.lastModifiedDate 2024/04/04 11:05:14 *
iris.orcid.lastModifiedMillisecond 1712221514338 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/65130
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact