Corpus-based approaches and statistical approaches have been the main stream of natural language processing research for the past two decades. Language resources play a key role in such approaches, but there is an insufficient amount of language resources in many Asian languages. In this situation, standardisation of language resources would be of great help in developing resources in new languages. This paper presents the latest development efforts of our project which aims at creating a common standard for Asian language resources that is compatible with an international standard. In particular, the paper focuses on i) lexical specification and data categories relevant for building multilingual lexical resources for Asian languages; ii) a core upper-layer ontology needed for ensuring multilingual interoperability and iii) the evaluation platform used to test the entire architectural framework.

Adapting International Standard for Asian Language Technologies

Monachini M;Soria C;
2008

Abstract

Corpus-based approaches and statistical approaches have been the main stream of natural language processing research for the past two decades. Language resources play a key role in such approaches, but there is an insufficient amount of language resources in many Asian languages. In this situation, standardisation of language resources would be of great help in developing resources in new languages. This paper presents the latest development efforts of our project which aims at creating a common standard for Asian language resources that is compatible with an international standard. In particular, the paper focuses on i) lexical specification and data categories relevant for building multilingual lexical resources for Asian languages; ii) a core upper-layer ontology needed for ensuring multilingual interoperability and iii) the evaluation platform used to test the entire architectural framework.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Takenobu T it
dc.authority.people Kaplan D it
dc.authority.people Huang C it
dc.authority.people Hsieh S it
dc.authority.people Calzolari N it
dc.authority.people Monachini M it
dc.authority.people Soria C it
dc.authority.people Shirai K it
dc.authority.people Sornlertlamvanich V it
dc.authority.people Charoenporn T it
dc.authority.people Yingju X it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 19:38:33 -
dc.date.available 2024/02/19 19:38:33 -
dc.date.issued 2008 -
dc.description.abstracteng Corpus-based approaches and statistical approaches have been the main stream of natural language processing research for the past two decades. Language resources play a key role in such approaches, but there is an insufficient amount of language resources in many Asian languages. In this situation, standardisation of language resources would be of great help in developing resources in new languages. This paper presents the latest development efforts of our project which aims at creating a common standard for Asian language resources that is compatible with an international standard. In particular, the paper focuses on i) lexical specification and data categories relevant for building multilingual lexical resources for Asian languages; ii) a core upper-layer ontology needed for ensuring multilingual interoperability and iii) the evaluation platform used to test the entire architectural framework. -
dc.description.affiliations Takenobu Tokunaga, Kaplan Dain: Tokyo Institute of Technology, Tokyo, Japan; Huang Chu-Ren, Hsieh Shu-Kai: Academia Sinica, Taipei, Taiwan; Shirai Kiyoaki: JAIST, Ishikawa, Japan; Sornlertlamvanich Virach, Charoenporn Thatsanee: TCL/NICT, Bangkok, Thailand; YingJu Xia: Fujitsu R&D Center LTD, Beijing, China. -
dc.description.allpeople Takenobu, T; Kaplan, D; Huang, C; Hsieh, S; Calzolari, N; Monachini, M; Soria, C; Shirai, K; Sornlertlamvanich, V; Charoenporn, T; Yingju, X -
dc.description.allpeopleoriginal Takenobu T.; Kaplan D.; Huang C.; Hsieh S.; Calzolari N.; Monachini M.; Soria C.; Shirai K.; Sornlertlamvanich V.; Charoenporn T.; Yingju X. -
dc.description.fulltext none en
dc.description.numberofauthors 11 -
dc.identifier.isbn 2-9517408-4-0 -
dc.identifier.isi WOS:000324028901126 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/65077 -
dc.identifier.url http://www.lrec-conf.org/proceedings/lrec2008/pdf/422_paper.pdf -
dc.publisher.country FRA -
dc.publisher.name European Language Resources Association ELRA -
dc.publisher.place Paris -
dc.relation.conferencedate 26-05/1-06-2008 -
dc.relation.conferencename LREC 2008, Sixth International Conference on Language Resources and Evaluation -
dc.relation.conferenceplace Marrakech, Morocco -
dc.relation.lastpage 1658 -
dc.relation.numberofpages 1663 -
dc.subject.keywords LR national/international projects -
dc.subject.keywords Organizational/policy issues -
dc.subject.keywords LR Infrastructures and Architectures -
dc.subject.keywords Lexicon -
dc.subject.keywords Lexical database -
dc.subject.singlekeyword LR national/international projects *
dc.subject.singlekeyword Organizational/policy issues *
dc.subject.singlekeyword LR Infrastructures and Architectures *
dc.subject.singlekeyword Lexicon *
dc.subject.singlekeyword Lexical database *
dc.title Adapting International Standard for Asian Language Technologies en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 84701 -
iris.isi.metadataErrorDescription 0 -
iris.isi.metadataErrorType ERROR_NO_MATCH -
iris.isi.metadataStatus ERROR -
iris.orcid.lastModifiedDate 2025/03/02 08:01:37 *
iris.orcid.lastModifiedMillisecond 1740898897194 *
iris.scopus.extIssued 2008 -
iris.scopus.extTitle Adapting international standard for Asian language technologies -
iris.sitodocente.maxattempts 4 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/65077
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact