This paper describes the development of a web-service tool for the automatic extraction of Multi-word expressions lexicons, which has been integrated in a distributed platform for the automatic creation of linguistic resources. The main purpose of the work described is thus to provide a (computationally "light") tool that produces a full lexical resource: multi-word terms/items with relevant and useful attached information that can be used for more complex processing tasks and applications (e.g. parsing, MT, IE, query expansion, etc.). The output of our tool is a MW lexicon formatted and encoded in XML according to the Lexical Mark-up Framework. The tool is already functional and available as a service. Evaluation experiments show that the tool precision is of about 80%.

A MWE Acquisition and Lexicon Builder Web Service

Quochi Valeria;Frontini Francesca;
2012

Abstract

This paper describes the development of a web-service tool for the automatic extraction of Multi-word expressions lexicons, which has been integrated in a distributed platform for the automatic creation of linguistic resources. The main purpose of the work described is thus to provide a (computationally "light") tool that produces a full lexical resource: multi-word terms/items with relevant and useful attached information that can be used for more complex processing tasks and applications (e.g. parsing, MT, IE, query expansion, etc.). The output of our tool is a MW lexicon formatted and encoded in XML according to the Lexical Mark-up Framework. The tool is already functional and available as a service. Evaluation experiments show that the tool precision is of about 80%.
2012
Istituto di linguistica computazionale "Antonio Zampolli" - ILC
Inglese
Martin Kay and Christian Boitet
Proceedings of COLING 2012: Technical Papers
International Conference on Computational Linguistics (COLING)
2291
2306
16
9781627483896
http://aclweb.org/anthology/C/C12/C12-1140.pdf
Curran Associates
Red Hook, NY 12571
STATI UNITI D'AMERICA
Sì, ma tipo non specificato
December 2012
Mumbai, India
Multiword extraction
lexical resources
LMF
web services.
ID_PUMA: /cnr.ilc/2012-A3-007 Il volume degli atti reso disponibile da The COLING 2012 Organizing Committee, Indian Institute of Technology Bombay, a https://aclanthology.org/volumes/C12-1/ (Creative Commons Attribution 4.0 International License)
2
none
Quochi, Valeria; Frontini, Francesca; Rubino, Francesco
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies
   PANACEA
   FP7
   248064
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/128266
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact