The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization.
Learning properties of Noun Phrases: from data to functions
Quochi Valeria;
2008
Abstract
The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Quochi Valeria | it |
| dc.authority.people | Calderone Basilio | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/19 17:30:14 | - |
| dc.date.available | 2024/02/19 17:30:14 | - |
| dc.date.issued | 2008 | - |
| dc.description.abstracteng | The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization. | - |
| dc.description.affiliations | Consiglio Nazionale delle Ricerche (CNR), Scuola Normale Superiore | - |
| dc.description.allpeople | Quochi, Valeria; Calderone, Basilio | - |
| dc.description.allpeopleoriginal | Quochi, Valeria; Calderone, Basilio | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 1 | - |
| dc.identifier.isbn | 2-9517408-4-0 | - |
| dc.identifier.isi | WOS:000324028902114 | - |
| dc.identifier.scopus | 2-s2.0-85037530722 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/227370 | - |
| dc.identifier.url | http://www.lrec-conf.org/proceedings/lrec2008/summaries/644.html | - |
| dc.language.iso | eng | - |
| dc.relation.conferencedate | 28-30 Maggio | - |
| dc.relation.conferencename | Sixth International Conference on Language Resources and Evaluation (LREC'08) | - |
| dc.relation.conferenceplace | Marrakech, Morocco | - |
| dc.relation.firstpage | 2596 | - |
| dc.relation.lastpage | 2602 | - |
| dc.relation.numberofpages | 7 | - |
| dc.subject.keywords | cognitive linguistics | - |
| dc.subject.keywords | noun phrase | - |
| dc.subject.singlekeyword | cognitive linguistics | * |
| dc.subject.singlekeyword | noun phrase | * |
| dc.title | Learning properties of Noun Phrases: from data to functions | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 288714 | - |
| iris.isi.extIssued | 2008 | - |
| iris.isi.extTitle | Learning properties of Noun Phrases: from data to functions | - |
| iris.orcid.lastModifiedDate | 2025/04/23 01:12:23 | * |
| iris.orcid.lastModifiedMillisecond | 1745363543967 | * |
| iris.scopus.extIssued | 2008 | - |
| iris.scopus.extTitle | Learning properties of noun phrases: From data to functions | - |
| iris.scopus.ideLinkStatusDate | 2024/04/10 09:22:08 | * |
| iris.scopus.ideLinkStatusMillisecond | 1712733728913 | * |
| iris.sitodocente.maxattempts | 3 | - |
| isi.category | OT | * |
| isi.contributor.affiliation | Scuola Normale Superiore di Pisa | - |
| isi.contributor.affiliation | Scuola Normale Superiore di Pisa | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.name | Valeria | - |
| isi.contributor.name | Basilio | - |
| isi.contributor.researcherId | E-7468-2011 | - |
| isi.contributor.researcherId | EMG-3709-2022 | - |
| isi.contributor.subaffiliation | CNR | - |
| isi.contributor.subaffiliation | CNR | - |
| isi.contributor.surname | Quochi | - |
| isi.contributor.surname | Calderone | - |
| isi.date.issued | 2008 | * |
| isi.description.abstracteng | The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization. | * |
| isi.description.allpeopleoriginal | Quochi, V; Calderone, B; | * |
| isi.document.sourcetype | WOS.ISSHP | * |
| isi.document.type | Proceedings Paper | * |
| isi.document.types | Proceedings Paper | * |
| isi.identifier.isi | WOS:000324028902114 | * |
| isi.journal.journaltitle | SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008 | * |
| isi.language.original | English | * |
| isi.publisher.place | 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE | * |
| isi.relation.firstpage | 2596 | * |
| isi.relation.lastpage | 2602 | * |
| isi.title | Learning properties of Noun Phrases: from data to functions | * |
| scopus.category | 1203 | * |
| scopus.category | 3304 | * |
| scopus.category | 3310 | * |
| scopus.category | 3309 | * |
| scopus.contributor.affiliation | Scuola Normale Superiore | - |
| scopus.contributor.affiliation | Scuola Normale Superiore | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.auid | 34977412400 | - |
| scopus.contributor.auid | 35291233300 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.name | Valeria | - |
| scopus.contributor.name | Basilio | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale CNR; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale CNR; | - |
| scopus.contributor.surname | Quochi | - |
| scopus.contributor.surname | Calderone | - |
| scopus.date.issued | 2008 | * |
| scopus.description.abstracteng | The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 36 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization. | * |
| scopus.description.allpeopleoriginal | Quochi V.; Calderone B. | * |
| scopus.differences | scopus.relation.conferencename | * |
| scopus.differences | scopus.publisher.name | * |
| scopus.differences | scopus.relation.conferencedate | * |
| scopus.differences | scopus.identifier.isbn | * |
| scopus.differences | scopus.description.allpeopleoriginal | * |
| scopus.differences | scopus.description.abstracteng | * |
| scopus.differences | scopus.relation.conferenceplace | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.identifier.isbn | 9782951740846 | * |
| scopus.identifier.pui | 619617224 | * |
| scopus.identifier.scopus | 2-s2.0-85037530722 | * |
| scopus.journal.sourceid | 21100842264 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | European Language Resources Association (ELRA) | * |
| scopus.relation.conferencedate | 2008 | * |
| scopus.relation.conferencename | 6th International Conference on Language Resources and Evaluation, LREC 2008 | * |
| scopus.relation.conferenceplace | Palais des Congres Mansour Eddahbi, mar | * |
| scopus.relation.firstpage | 2596 | * |
| scopus.relation.lastpage | 2602 | * |
| scopus.title | Learning properties of noun phrases: From data to functions | * |
| scopus.titleeng | Learning properties of noun phrases: From data to functions | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


