Next Generation Language Resources using Grid

Sassolini E; Sassi M; Cucurullo S; Picchi E; Enea A; Monachini M; Soria C
2006

Abstract

This paper presents a case study on the challenges and requirements posed by next-generation language resources, conceived as an overall model of an open, distributed, and collaborative language infrastructure. If a new paradigm for language resource sharing is required, we believe that the emerging and still-evolving technology of Grid computing is a promising and suitable basis for a concrete realization of this vision. Given the current limitations of Grid computing, it is important to test the new environment on basic language analysis tools, in order to assess its potential and its possible limitations for NLP. To this end, we ran experiments on one module of the Linguistic Miner, namely the extraction of linguistic patterns from restricted-domain corpora. The Grid environment produced the expected results (reduced processing time, large storage capacity, data redundancy) at no additional cost to the end user.
DC Field Value Language
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Calzolari F it
dc.authority.people Sassolini E it
dc.authority.people Sassi M it
dc.authority.people Cucurullo S it
dc.authority.people Picchi E it
dc.authority.people Bertagna F it
dc.authority.people Enea A it
dc.authority.people Monachini M it
dc.authority.people Soria C it
dc.authority.people Calzolari N it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 17:53:10 -
dc.date.available 2024/02/19 17:53:10 -
dc.date.issued 2006 -
dc.description.abstract This paper presents a case study on the challenges and requirements posed by next-generation language resources, conceived as an overall model of an open, distributed, and collaborative language infrastructure. If a new paradigm for language resource sharing is required, we believe that the emerging and still-evolving technology of Grid computing is a promising and suitable basis for a concrete realization of this vision. Given the current limitations of Grid computing, it is important to test the new environment on basic language analysis tools, in order to assess its potential and its possible limitations for NLP. To this end, we ran experiments on one module of the Linguistic Miner, namely the extraction of linguistic patterns from restricted-domain corpora. The Grid environment produced the expected results (reduced processing time, large storage capacity, data redundancy) at no additional cost to the end user. -
dc.description.affiliations Istituto di Linguistica Computazionale "A. Zampolli", CNR - Pisa; Calzolari F. (Scuola Normale Superiore, Pisa). -
dc.description.allpeople Calzolari, F; Sassolini, E; Sassi, M; Cucurullo, S; Picchi, E; Bertagna, F; Enea, A; Monachini, M; Soria, C; Calzolari, N -
dc.description.allpeopleoriginal Calzolari F., Sassolini E., Sassi M., Cucurullo S., Picchi E., Bertagna F., Enea A., Monachini M., Soria C., Calzolari N. -
dc.description.fulltext none en
dc.description.numberofauthors 10 -
dc.identifier.isbn 2-9517408-2-4 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/64246 -
dc.language.iso eng -
dc.relation.conferencedate 24-26 May 2006 -
dc.relation.conferencename LREC 2006: 5th International Conference on Language Resources and Evaluation -
dc.relation.conferenceplace Genova -
dc.relation.firstpage 1858 -
dc.relation.lastpage 1861 -
dc.subject.keywords grid -
dc.subject.keywords acquisition -
dc.subject.keywords topic classification -
dc.subject.singlekeyword grid *
dc.subject.singlekeyword acquisition *
dc.subject.singlekeyword topic classification *
dc.title Next Generation Language Resources using Grid en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Yes, but type not specified -
dc.ugov.descaux1 84625 -
iris.orcid.lastModifiedDate 2024/04/04 15:28:10 *
iris.orcid.lastModifiedMillisecond 1712237290688 *
iris.scopus.extIssued 2006 -
iris.scopus.extTitle Next generation language resources using grid -
iris.sitodocente.maxattempts 3 -
Appears in types: 04.01 Contributo in Atti di convegno
Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/64246