Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic

ItalWordNet: building a large semantic database for the automatic treatment of Italian

Roventini A;Marinelli R;
2003

Abstract

Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic
2003
Istituto di linguistica computazionale "Antonio Zampolli" - ILC
18-19
745
791
Sì, ma tipo non specificato
Database lessicale
Rete semantica
Relazioni semantiche
Risorse linguistiche
La risorsa IWN viene distribuita attraverso ELDA (ne sono state vendute diverse copie). IWN è compatibile con gli standard WordNet ed EWN ed è disponibile in formato XML. IWN è stata usata come risorsa lessicale di riferimento per la codifica semantica della ISST (Italian Syntactic Semantic Treebank) nel progetto TAL, per la seconda e terza edizione della competizione internazionale di sistemi di disambiguazione SENSEVAL e come base di conoscenza per un sistema di Question Answering per l’italiano sviluppato presso l'ILC. L'attività di ricerca legata allo sviluppo di IWN è accompagnata da una ricca produzione di pubblicazioni e presentata in numerosi congressi internazionali. Per la rilevanza del congresso nel panorama della disciplina e per la severità della selezione, ricordiamo l'articolo Alonge A., Bertagna F., Calzolari N., Roventini A., Zampolli A., Encoding information on adjectives in a lexical-semantic net for computational applications, in Proceedings NAACL 2001.
10
info:eu-repo/semantics/article
262
Roventini, A; Alonge, A; Bertagna, F; Calzolari, N; Cancila, J; Girardi, C; Magnini, B; Marinelli, R; Speranza, M; Zampolli, A
01 Contributo su Rivista::01.01 Articolo in rivista
none
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/37667
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact