The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization.

Learning properties of Noun Phrases: from data to functions

Quochi Valeria;
2008

Abstract

The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Quochi Valeria it
dc.authority.people Calderone Basilio it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 17:30:14 -
dc.date.available 2024/02/19 17:30:14 -
dc.date.issued 2008 -
dc.description.abstracteng The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization. -
dc.description.affiliations Consiglio Nazionale delle Ricerche (CNR), Scuola Normale Superiore -
dc.description.allpeople Quochi, Valeria; Calderone, Basilio -
dc.description.allpeopleoriginal Quochi, Valeria; Calderone, Basilio -
dc.description.fulltext none en
dc.description.numberofauthors 1 -
dc.identifier.isbn 2-9517408-4-0 -
dc.identifier.isi WOS:000324028902114 -
dc.identifier.scopus 2-s2.0-85037530722 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/227370 -
dc.identifier.url http://www.lrec-conf.org/proceedings/lrec2008/summaries/644.html -
dc.language.iso eng -
dc.relation.conferencedate 28-30 Maggio -
dc.relation.conferencename Sixth International Conference on Language Resources and Evaluation (LREC'08) -
dc.relation.conferenceplace Marrakech, Morocco -
dc.relation.firstpage 2596 -
dc.relation.lastpage 2602 -
dc.relation.numberofpages 7 -
dc.subject.keywords cognitive linguistics -
dc.subject.keywords noun phrase -
dc.subject.singlekeyword cognitive linguistics *
dc.subject.singlekeyword noun phrase *
dc.title Learning properties of Noun Phrases: from data to functions en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 288714 -
iris.isi.extIssued 2008 -
iris.isi.extTitle Learning properties of Noun Phrases: from data to functions -
iris.orcid.lastModifiedDate 2025/04/23 01:12:23 *
iris.orcid.lastModifiedMillisecond 1745363543967 *
iris.scopus.extIssued 2008 -
iris.scopus.extTitle Learning properties of noun phrases: From data to functions -
iris.scopus.ideLinkStatusDate 2024/04/10 09:22:08 *
iris.scopus.ideLinkStatusMillisecond 1712733728913 *
iris.sitodocente.maxattempts 3 -
isi.category OT *
isi.contributor.affiliation Scuola Normale Superiore di Pisa -
isi.contributor.affiliation Scuola Normale Superiore di Pisa -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Valeria -
isi.contributor.name Basilio -
isi.contributor.researcherId E-7468-2011 -
isi.contributor.researcherId EMG-3709-2022 -
isi.contributor.subaffiliation CNR -
isi.contributor.subaffiliation CNR -
isi.contributor.surname Quochi -
isi.contributor.surname Calderone -
isi.date.issued 2008 *
isi.description.abstracteng The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 3 6 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization. *
isi.description.allpeopleoriginal Quochi, V; Calderone, B; *
isi.document.sourcetype WOS.ISSHP *
isi.document.type Proceedings Paper *
isi.document.types Proceedings Paper *
isi.identifier.isi WOS:000324028902114 *
isi.journal.journaltitle SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008 *
isi.language.original English *
isi.publisher.place 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE *
isi.relation.firstpage 2596 *
isi.relation.lastpage 2602 *
isi.title Learning properties of Noun Phrases: from data to functions *
scopus.category 1203 *
scopus.category 3304 *
scopus.category 3310 *
scopus.category 3309 *
scopus.contributor.affiliation Scuola Normale Superiore -
scopus.contributor.affiliation Scuola Normale Superiore -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 34977412400 -
scopus.contributor.auid 35291233300 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Valeria -
scopus.contributor.name Basilio -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale CNR; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale CNR; -
scopus.contributor.surname Quochi -
scopus.contributor.surname Calderone -
scopus.date.issued 2008 *
scopus.description.abstracteng The paper presents two experiments of unsupervised classification of Italian noun phrases. The goal of the experiments is to identify the most prominent contextual properties that allow for a functional classification of noun phrases. For this purpose, we used a Self Organizing Map is trained with syntactically-annotated contexts containing noun phrases. The contexts are defined by means of a set of features representing morpho-syntactic properties of both nouns and their wider contexts. Two types of experiments have been run: one based on noun types and the other based on noun tokens. The results of the type simulation show that when frequency is the most prominent classification factor, the network isolates idiomatic or fixed phrases. The results of the token simulation experiment, instead, show that, of the 36 attributes represented in the original input matrix, only a few of them are prominent in the re-organization of the map. In particular, key features in the emergent macro-classification are the type of determiner and the grammatical number of the noun. An additional but not less interesting result is an organization into semantic/pragmatic micro-classes. In conclusions, our result confirm the relative prominence of determiner type and grammatical number in the task of noun (phrase) categorization. *
scopus.description.allpeopleoriginal Quochi V.; Calderone B. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.identifier.isbn *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.isbn 9782951740846 *
scopus.identifier.pui 619617224 *
scopus.identifier.scopus 2-s2.0-85037530722 *
scopus.journal.sourceid 21100842264 *
scopus.language.iso eng *
scopus.publisher.name European Language Resources Association (ELRA) *
scopus.relation.conferencedate 2008 *
scopus.relation.conferencename 6th International Conference on Language Resources and Evaluation, LREC 2008 *
scopus.relation.conferenceplace Palais des Congres Mansour Eddahbi, mar *
scopus.relation.firstpage 2596 *
scopus.relation.lastpage 2602 *
scopus.title Learning properties of noun phrases: From data to functions *
scopus.titleeng Learning properties of noun phrases: From data to functions *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/227370
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact