Linguistic Miner is a project carried out at ILC whose objective is the development of an integrated system to build, organise and manage a corpus of Italian texts (of various origins and formats), and to design and constantly add new tools for the automatic extraction of tiered linguistic knowledge to be made available for many teaching, publishing, and other cultural purposes. The project is based on a notion that is preliminary to all the systems for corpus-based linguistic analysis: a language represented by the largest possible collection of heterogeneous texts is the best source of linguistic information at any level of analysis considered. The first goals of such a system are the semi-automated construction of an Italian data mine for the extraction of linguistic information, the validation of linguistic patterns, the installation of useful tools and resources for a range of different categories of Italian language users. The main feature of the project is its purpose of building large language reference corpora allowing for the creation and use of effective tools for the handling and processing, as well as the automatic linguistic synthesis, of such corpora.

Linguistic Miner. An Italian Linguistic Knowledge System

Picchi E;Cucurullo S;Sassi M;Sassolini E
2004

Abstract

Linguistic Miner is a project carried out at ILC whose objective is the development of an integrated system to build, organise and manage a corpus of Italian texts (of various origins and formats), and to design and constantly add new tools for the automatic extraction of tiered linguistic knowledge to be made available for many teaching, publishing, and other cultural purposes. The project is based on a notion that is preliminary to all the systems for corpus-based linguistic analysis: a language represented by the largest possible collection of heterogeneous texts is the best source of linguistic information at any level of analysis considered. The first goals of such a system are the semi-automated construction of an Italian data mine for the extraction of linguistic information, the validation of linguistic patterns, the installation of useful tools and resources for a range of different categories of Italian language users. The main feature of the project is its purpose of building large language reference corpora allowing for the creation and use of effective tools for the handling and processing, as well as the automatic linguistic synthesis, of such corpora.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Picchi E it
dc.authority.people Ceccotti ML it
dc.authority.people Cucurullo S it
dc.authority.people Sassi M it
dc.authority.people Sassolini E it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 17:51:40 -
dc.date.available 2024/02/19 17:51:40 -
dc.date.issued 2004 -
dc.description.abstracteng Linguistic Miner is a project carried out at ILC whose objective is the development of an integrated system to build, organise and manage a corpus of Italian texts (of various origins and formats), and to design and constantly add new tools for the automatic extraction of tiered linguistic knowledge to be made available for many teaching, publishing, and other cultural purposes. The project is based on a notion that is preliminary to all the systems for corpus-based linguistic analysis: a language represented by the largest possible collection of heterogeneous texts is the best source of linguistic information at any level of analysis considered. The first goals of such a system are the semi-automated construction of an Italian data mine for the extraction of linguistic information, the validation of linguistic patterns, the installation of useful tools and resources for a range of different categories of Italian language users. The main feature of the project is its purpose of building large language reference corpora allowing for the creation and use of effective tools for the handling and processing, as well as the automatic linguistic synthesis, of such corpora. -
dc.description.affiliations ILC-CNR -
dc.description.allpeople Picchi E.; Ceccotti M.L.; Cucurullo S.; Sassi M.; Sassolini E. -
dc.description.allpeopleoriginal Picchi E., Ceccotti M.L., Cucurullo S., Sassi M., Sassolini E. -
dc.description.fulltext none en
dc.description.numberofauthors 4 -
dc.identifier.isbn 2-9517408-1-6 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/64236 -
dc.identifier.url http://www.lrec-conf.org/lrec2004/ -
dc.language.iso eng -
dc.relation.conferencedate 26-27-28 Maggio 2004 -
dc.relation.conferencename LREC 2004: Fourth International Conference on Language Resources and Evaluation -
dc.relation.conferenceplace Lisbona -
dc.relation.firstpage 1811 -
dc.relation.ispartofbook Proceedings of the 4th International Conference on Language Resources and Evaluation -
dc.relation.lastpage 1814 -
dc.relation.numberofpages 4 -
dc.relation.volume V -
dc.subject.keywords linguistic analysis -
dc.subject.keywords information extraction -
dc.subject.singlekeyword linguistic analysis *
dc.subject.singlekeyword information extraction *
dc.title Linguistic Miner. An Italian Linguistic Knowledge System en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 84615 -
iris.orcid.lastModifiedDate 2024/03/02 02:32:44 *
iris.orcid.lastModifiedMillisecond 1709343164048 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/64236
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact