Relations among phenomena at different linguistic levels are at the essence of language properties but today we focus mostly on one specific linguistic layer at a time, without (having the possibility of) paying attention to the relations among the different layers. At the same time our efforts are too much scattered without much possibility of exploiting other people's achievements. To address the complexities hidden in multilayer interrelations even small amounts of processed data can be useful, improving the performance of complex systems. Exploiting the current trend towards sharing we want to initiate a collective movement that works towards creating synergies and harmonisation among different annotation efforts that are now dispersed. In this paper we present the general architecture of the Language Library, an initiative which is conceived as a facility for gathering and making available through simple functionalities the linguistic knowledge the field is able to produce, putting in place new ways of collaboration within the LRT community. In order to reach this goal, a first population round of the Language Library has started around a core of parallel/comparable texts that have been annotated by several contributors submitting a paper for LREC2012. The Language Library has also an ancillary aim related to language documentation and archiving and it is conceived as a theory-neutral space which allows for several language processing philosophies to coexist.

The Language Library: supporting community effort for collective resource production

Del Gratta Riccardo;Frontini Francesca;Russo Irene;
2012

Abstract

Relations among phenomena at different linguistic levels are at the essence of language properties but today we focus mostly on one specific linguistic layer at a time, without (having the possibility of) paying attention to the relations among the different layers. At the same time our efforts are too much scattered without much possibility of exploiting other people's achievements. To address the complexities hidden in multilayer interrelations even small amounts of processed data can be useful, improving the performance of complex systems. Exploiting the current trend towards sharing we want to initiate a collective movement that works towards creating synergies and harmonisation among different annotation efforts that are now dispersed. In this paper we present the general architecture of the Language Library, an initiative which is conceived as a facility for gathering and making available through simple functionalities the linguistic knowledge the field is able to produce, putting in place new ways of collaboration within the LRT community. In order to reach this goal, a first population round of the Language Library has started around a core of parallel/comparable texts that have been annotated by several contributors submitting a paper for LREC2012. The Language Library has also an ancillary aim related to language documentation and archiving and it is conceived as a theory-neutral space which allows for several language processing philosophies to coexist.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Del Gratta Riccardo en
dc.authority.people Frontini Francesca en
dc.authority.people Rubino Francesco en
dc.authority.people Russo Irene en
dc.authority.people Calzolari Nicoletta en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/15 22:59:57 -
dc.date.available 2024/02/15 22:59:57 -
dc.date.firstsubmission 2024/10/02 15:07:54 *
dc.date.issued 2012 -
dc.date.submission 2024/10/03 10:13:25 *
dc.description.abstracteng Relations among phenomena at different linguistic levels are at the essence of language properties but today we focus mostly on one specific linguistic layer at a time, without (having the possibility of) paying attention to the relations among the different layers. At the same time our efforts are too much scattered without much possibility of exploiting other people's achievements. To address the complexities hidden in multilayer interrelations even small amounts of processed data can be useful, improving the performance of complex systems. Exploiting the current trend towards sharing we want to initiate a collective movement that works towards creating synergies and harmonisation among different annotation efforts that are now dispersed. In this paper we present the general architecture of the Language Library, an initiative which is conceived as a facility for gathering and making available through simple functionalities the linguistic knowledge the field is able to produce, putting in place new ways of collaboration within the LRT community. In order to reach this goal, a first population round of the Language Library has started around a core of parallel/comparable texts that have been annotated by several contributors submitting a paper for LREC2012. The Language Library has also an ancillary aim related to language documentation and archiving and it is conceived as a theory-neutral space which allows for several language processing philosophies to coexist. -
dc.description.affiliations CNR-ILC, Pisa -
dc.description.allpeople DEL GRATTA, Riccardo; Frontini, Francesca; Rubino, Francesco; Russo, Irene; Calzolari, Nicoletta -
dc.description.allpeopleoriginal Del Gratta, Riccardo; Frontini, Francesca; Rubino, Francesco; Russo, Irene; Calzolari, Nicoletta en
dc.description.fulltext none en
dc.description.note ID_PUMA: /cnr.ilc/2012-A2-016 en
dc.description.numberofauthors 5 -
dc.identifier.isi WOS:000355611004120 en
dc.identifier.uri https://hdl.handle.net/20.500.14243/119634 -
dc.language.iso eng en
dc.miur.last.status.update 2024-10-02T13:05:12Z *
dc.relation.conferencedate 23-25 may 2012 en
dc.relation.conferencename The Eight International Conference on Language Resources and Evaluation (LREC'12) en
dc.relation.conferenceplace Istanbul, Turkey en
dc.relation.firstpage 43 en
dc.relation.ispartofbook The Eight International Conference on Language Resources and Evaluation (LREC'12) en
dc.relation.lastpage 49 en
dc.relation.numberofpages 7 en
dc.subject.keywords annotation -
dc.subject.keywords metadata -
dc.subject.keywords scientific crowdsourcing -
dc.subject.singlekeyword annotation *
dc.subject.singlekeyword metadata *
dc.subject.singlekeyword scientific crowdsourcing *
dc.title The Language Library: supporting community effort for collective resource production en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 220182 -
iris.isi.extIssued 2014 -
iris.isi.extTitle From Synsets to Videos: Enriching ItalWordNet Multimodally -
iris.isi.metadataErrorDescription 0 -
iris.isi.metadataErrorType ERROR_NO_MATCH -
iris.isi.metadataStatus ERROR -
iris.orcid.lastModifiedDate 2024/12/03 14:59:14 *
iris.orcid.lastModifiedMillisecond 1733234354501 *
iris.scopus.extIssued 2012 -
iris.scopus.extTitle The language library: Supporting community effort for collective resource production -
iris.sitodocente.maxattempts 1 -
isi.category OT -
isi.category OY -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Roberto -
isi.contributor.name Valeria -
isi.contributor.name Irene -
isi.contributor.name Irene -
isi.contributor.name Monica -
isi.contributor.researcherId -
isi.contributor.researcherId -
isi.contributor.researcherId -
isi.contributor.researcherId -
isi.contributor.researcherId -
isi.contributor.subaffiliation Ist Linguist Computaz Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz Zampolli -
isi.contributor.surname Bartolini -
isi.contributor.surname Quochi -
isi.contributor.surname De Felice -
isi.contributor.surname Russo -
isi.contributor.surname Monachini -
isi.date.issued 2014 -
isi.description.abstract The paper describes the multimodal enrichment of ItalWordNet action verbs' entries by means of an automatic mapping with a conceptual ontology of action types instantiated by video scenes (ImagAct). The two resources present significative differences as well as interesting complementary features, such that a mapping of these two resources can lead to a an enrichment of IWN, through the connection between synsets and videos apt to illustrate the meaning described by glosses. Here, we describe an approach inspired by ontology matching methods for the automatic mapping of ImagAct video scenes onto ItalWordNet. The experiments described in the paper are conducted on Italian, but the same methodology can be extended to other languages for which WordNets have been created, since ImagAct is available also for English, Chinese and Spanish. This source of multimodal information can be exploited to design second language learning tools, as well as for language grounding in action recognition in video sources and potentially for robotics. -
isi.description.allpeopleoriginal Bartolini, R; Quochi, V; De Felice, I; Russo, I; Monachini, M; -
isi.document.sourcetype WOS.ISSHP -
isi.document.type Meeting -
isi.document.types Meeting -
isi.identifier.isi WOS:000355611004120 -
isi.journal.journaltitle LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION -
isi.language.original English -
isi.publisher.place 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE -
isi.relation.firstpage 3110 -
isi.relation.lastpage 3117 -
isi.title From Synsets to Videos: Enriching ItalWordNet Multimodally -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/119634
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 1
social impact