Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic
ItalWordNet: building a large semantic database for the automatic treatment of Italian
Roventini A;Marinelli R;
2003
Abstract
Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Roventini A | it |
| dc.authority.people | Alonge A | it |
| dc.authority.people | Bertagna F | it |
| dc.authority.people | Calzolari N | it |
| dc.authority.people | Cancila J | it |
| dc.authority.people | Girardi C | it |
| dc.authority.people | Magnini B | it |
| dc.authority.people | Marinelli R | it |
| dc.authority.people | Speranza M | it |
| dc.authority.people | Zampolli A | it |
| dc.collection.id.s | b3f88f24-048a-4e43-8ab1-6697b90e068e | * |
| dc.collection.name | 01.01 Articolo in rivista | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/19 00:46:04 | - |
| dc.date.available | 2024/02/19 00:46:04 | - |
| dc.date.issued | 2003 | - |
| dc.description.abstract | Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic | - |
| dc.description.affiliations | Alonge A: Università di Perugia; MagniniB.: IRST-Trento; Speranza M.: IRST-Trento; Cancila: IRST-Trento; Zampolli A. Direttore ILC Uni Pisa; Cancila J.: Girardi C.: | - |
| dc.description.allpeople | Roventini, A; Alonge, A; Bertagna, F; Calzolari, N; Cancila, J; Girardi, C; Magnini, B; Marinelli, R; Speranza, M; Zampolli, A | - |
| dc.description.allpeopleoriginal | Roventini A. , Alonge A. , Bertagna F. , Calzolari N. , Cancila J. , Girardi C. , Magnini B. , Marinelli R. 8, Speranza M. 9, Zampolli A. | - |
| dc.description.fulltext | none | en |
| dc.description.note | La risorsa IWN viene distribuita attraverso ELDA (ne sono state vendute diverse copie). IWN è compatibile con gli standard WordNet ed EWN ed è disponibile in formato XML. IWN è stata usata come risorsa lessicale di riferimento per la codifica semantica della ISST (Italian Syntactic Semantic Treebank) nel progetto TAL, per la seconda e terza edizione della competizione internazionale di sistemi di disambiguazione SENSEVAL e come base di conoscenza per un sistema di Question Answering per litaliano sviluppato presso l'ILC. L'attività di ricerca legata allo sviluppo di IWN è accompagnata da una ricca produzione di pubblicazioni e presentata in numerosi congressi internazionali. Per la rilevanza del congresso nel panorama della disciplina e per la severità della selezione, ricordiamo l'articolo Alonge A., Bertagna F., Calzolari N., Roventini A., Zampolli A., Encoding information on adjectives in a lexical-semantic net for computational applications, in Proceedings NAACL 2001. | - |
| dc.description.numberofauthors | 10 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/37667 | - |
| dc.relation.firstpage | 745 | - |
| dc.relation.lastpage | 791 | - |
| dc.relation.volume | 18-19 | - |
| dc.subject.keywords | Database lessicale | - |
| dc.subject.keywords | Rete semantica | - |
| dc.subject.keywords | Relazioni semantiche | - |
| dc.subject.keywords | Risorse linguistiche | - |
| dc.subject.singlekeyword | Database lessicale | * |
| dc.subject.singlekeyword | Rete semantica | * |
| dc.subject.singlekeyword | Relazioni semantiche | * |
| dc.subject.singlekeyword | Risorse linguistiche | * |
| dc.title | ItalWordNet: building a large semantic database for the automatic treatment of Italian | en |
| dc.type.driver | info:eu-repo/semantics/article | - |
| dc.type.full | 01 Contributo su Rivista::01.01 Articolo in rivista | it |
| dc.type.miur | 262 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 64479 | - |
| iris.orcid.lastModifiedDate | 2024/04/04 10:48:30 | * |
| iris.orcid.lastModifiedMillisecond | 1712220510551 | * |
| iris.sitodocente.maxattempts | 3 | - |
| Appare nelle tipologie: | 01.01 Articolo in rivista | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


