Due to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we propose to investigate the issue from a processing-oriented standpoint, using data processed by a type of recurrent neural network to quantitatively model the dynamic of word processing and learning in different input conditions. We evaluate the relative complexity of a set of typologically different inflectional systems (Greek, Italian, Spanish, German, English and Standard Modern Arabic) by training a Temporal Self-Organizing Map (TSOM), a recurrent variant of Kohonen's Self-Organizing Maps, on a fixed set of verb forms from top-frequency verb paradigms, with no information about the morphosemantic and morphosyntactic content conveyed by the forms. After training, the behavior of each language-specific TSOM is assessed on different tasks, looking at self-organizing patterns of temporal connectivity and functional responses. Our simulations show that word processing is facilitated by maximally contrastive inflectional systems, where verb forms exhibit the earliest possible point of lexical discrimination. Conversely, word learning is favored by a maximally generalizable system, where forms are inferred from the smallest possible number of their paradigm companions. Based on evidence from the literature and our own data, we conjecture that the resulting balance is the outcome of the interaction between form frequency and morphological regularity. Big families of stem-sharing, regularly inflected forms are the productive core of an inflectional system. Such a core is easier to learn but slower to discriminate. In contrast, less predictable verb forms, based on alternating and possibly suppletive stems, are easier to process but are learned by rote. Inflection systems thus strike a balance between these conflicting processing and communicative requirements, while staying within tight learnability bounds, in line with Ackermann and Malouf's Low Conditional Entropy Conjecture. Our quantitative investigation supports a discriminative view of morphological inflection as a collective, emergent system, whose global self-organization rests on a surprisingly small handful of language-independent principles of word coactivation and competition.

A processing-oriented investigation of inflectional complexity

Marzi Claudia
Primo
;
Ferro Marcello
Secondo
;
Pirrelli Vito
Ultimo
2019

Abstract

Due to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we propose to investigate the issue from a processing-oriented standpoint, using data processed by a type of recurrent neural network to quantitatively model the dynamic of word processing and learning in different input conditions. We evaluate the relative complexity of a set of typologically different inflectional systems (Greek, Italian, Spanish, German, English and Standard Modern Arabic) by training a Temporal Self-Organizing Map (TSOM), a recurrent variant of Kohonen's Self-Organizing Maps, on a fixed set of verb forms from top-frequency verb paradigms, with no information about the morphosemantic and morphosyntactic content conveyed by the forms. After training, the behavior of each language-specific TSOM is assessed on different tasks, looking at self-organizing patterns of temporal connectivity and functional responses. Our simulations show that word processing is facilitated by maximally contrastive inflectional systems, where verb forms exhibit the earliest possible point of lexical discrimination. Conversely, word learning is favored by a maximally generalizable system, where forms are inferred from the smallest possible number of their paradigm companions. Based on evidence from the literature and our own data, we conjecture that the resulting balance is the outcome of the interaction between form frequency and morphological regularity. Big families of stem-sharing, regularly inflected forms are the productive core of an inflectional system. Such a core is easier to learn but slower to discriminate. In contrast, less predictable verb forms, based on alternating and possibly suppletive stems, are easier to process but are learned by rote. Inflection systems thus strike a balance between these conflicting processing and communicative requirements, while staying within tight learnability bounds, in line with Ackermann and Malouf's Low Conditional Entropy Conjecture. Our quantitative investigation supports a discriminative view of morphological inflection as a collective, emergent system, whose global self-organization rests on a surprisingly small handful of language-independent principles of word coactivation and competition.
Campo DC Valore Lingua
dc.authority.ancejournal FRONTIERS IN COMMUNICATION en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Marzi Claudia en
dc.authority.people Ferro Marcello en
dc.authority.people Pirrelli Vito en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/02/20 21:18:29 -
dc.date.available 2024/02/20 21:18:29 -
dc.date.firstsubmission 2024/09/25 16:20:32 *
dc.date.issued 2019 -
dc.date.submission 2025/02/04 14:44:04 *
dc.description.abstracteng Due to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we propose to investigate the issue from a processing-oriented standpoint, using data processed by a type of recurrent neural network to quantitatively model the dynamic of word processing and learning in different input conditions. We evaluate the relative complexity of a set of typologically different inflectional systems (Greek, Italian, Spanish, German, English and Standard Modern Arabic) by training a Temporal Self-Organizing Map (TSOM), a recurrent variant of Kohonen's Self-Organizing Maps, on a fixed set of verb forms from top-frequency verb paradigms, with no information about the morphosemantic and morphosyntactic content conveyed by the forms. After training, the behavior of each language-specific TSOM is assessed on different tasks, looking at self-organizing patterns of temporal connectivity and functional responses. Our simulations show that word processing is facilitated by maximally contrastive inflectional systems, where verb forms exhibit the earliest possible point of lexical discrimination. Conversely, word learning is favored by a maximally generalizable system, where forms are inferred from the smallest possible number of their paradigm companions. Based on evidence from the literature and our own data, we conjecture that the resulting balance is the outcome of the interaction between form frequency and morphological regularity. Big families of stem-sharing, regularly inflected forms are the productive core of an inflectional system. Such a core is easier to learn but slower to discriminate. In contrast, less predictable verb forms, based on alternating and possibly suppletive stems, are easier to process but are learned by rote. Inflection systems thus strike a balance between these conflicting processing and communicative requirements, while staying within tight learnability bounds, in line with Ackermann and Malouf's Low Conditional Entropy Conjecture. Our quantitative investigation supports a discriminative view of morphological inflection as a collective, emergent system, whose global self-organization rests on a surprisingly small handful of language-independent principles of word coactivation and competition. -
dc.description.affiliations ILC-CNR -
dc.description.allpeople Marzi, Claudia; Ferro, Marcello; Pirrelli, Vito -
dc.description.allpeopleoriginal Marzi, Claudia; Ferro, Marcello; Pirrelli, Vito en
dc.description.fulltext none en
dc.description.numberofauthors 3 -
dc.identifier.doi 10.3389/fcomm.2019.00048 en
dc.identifier.isi WOS:000677606300001 en
dc.identifier.scopus 2-s2.0-85087505822 en
dc.identifier.uri https://hdl.handle.net/20.500.14243/392957 -
dc.identifier.url https://www.frontiersin.org/articles/10.3389/fcomm.2019.00048/full en
dc.language.iso eng en
dc.miur.last.status.update 2024-09-25T14:20:49Z *
dc.relation.firstpage 1 en
dc.relation.issue 48 en
dc.relation.lastpage 23 en
dc.relation.medium ELETTRONICO en
dc.relation.numberofpages 23 en
dc.relation.volume 4 en
dc.subject.keywordseng Morphological complexity -
dc.subject.keywordseng Discriminative learning -
dc.subject.keywordseng Recurrent neural networks (RNNs) -
dc.subject.keywordseng self-organization -
dc.subject.keywordseng emergence -
dc.subject.keywordseng processing uncertainty -
dc.subject.keywordseng stem-family size -
dc.subject.singlekeyword Morphological complexity *
dc.subject.singlekeyword Discriminative learning *
dc.subject.singlekeyword Recurrent neural networks (RNNs) *
dc.subject.singlekeyword self-organization *
dc.subject.singlekeyword emergence *
dc.subject.singlekeyword processing uncertainty *
dc.subject.singlekeyword stem-family size *
dc.title A processing-oriented investigation of inflectional complexity en
dc.type.circulation Internazionale en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Esperti non anonimi en
dc.ugov.descaux1 406277 -
iris.isi.extIssued 2019 -
iris.isi.extTitle A Processing-Oriented Investigation of Inflectional Complexity -
iris.isi.ideLinkStatusDate 2025/01/13 19:03:38 *
iris.isi.ideLinkStatusMillisecond 1736791418882 *
iris.orcid.lastModifiedDate 2025/02/28 18:07:41 *
iris.orcid.lastModifiedMillisecond 1740762461886 *
iris.scopus.extIssued 2019 -
iris.scopus.extTitle A Processing-Oriented Investigation of Inflectional Complexity -
iris.scopus.ideLinkStatusDate 2024/09/25 16:22:18 *
iris.scopus.ideLinkStatusMillisecond 1727274138742 *
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost publisher *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.3389/fcomm.2019.00048 *
iris.unpaywall.hosttype publisher *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj true *
iris.unpaywall.landingpage https://doi.org/10.3389/fcomm.2019.00048 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 13/06/2025 04:17:26 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1749781046392 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://www.frontiersin.org/articles/10.3389/fcomm.2019.00048/pdf *
isi.category EU *
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Claudia -
isi.contributor.name Marcello -
isi.contributor.name Vito -
isi.contributor.researcherId C-8034-2012 -
isi.contributor.researcherId D-6260-2016 -
isi.contributor.researcherId DXW-8155-2022 -
isi.contributor.subaffiliation Inst Computat Linguist -
isi.contributor.subaffiliation Inst Computat Linguist -
isi.contributor.subaffiliation Inst Computat Linguist -
isi.contributor.surname Marzi -
isi.contributor.surname Ferro -
isi.contributor.surname Pirrelli -
isi.date.issued 2019 *
isi.description.abstracteng Due to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we propose to investigate the issue from a processing-oriented standpoint, using data processed by a type of recurrent neural network to quantitatively model the dynamic of word processing and learning in different input conditions. We evaluate the relative complexity of a set of typologically different inflectional systems (Greek, Italian, Spanish, German, English and Standard Modern Arabic) by training a Temporal Self-Organizing Map (TSOM), a recurrent variant of Kohonen's Self-Organizing Maps, on a fixed set of verb forms from top-frequency verb paradigms, with no information about the morphosemantic and morphosyntactic content conveyed by the forms. After training, the behavior of each language-specific TSOM is assessed on different tasks, looking at self-organizing patterns of temporal connectivity and functional responses. Our simulations show that word processing is facilitated by maximally contrastive inflectional systems, where verb forms exhibit the earliest possible point of lexical discrimination. Conversely, word learning is favored by a maximally generalizable system, where forms are inferred from the smallest possible number of their paradigm companions. Based on evidence from the literature and our own data, we conjecture that the resulting balance is the outcome of the interaction between form frequency and morphological regularity. Big families of stem-sharing, regularly inflected forms are the productive core of an inflectional system. Such a core is easier to learn but slower to discriminate. In contrast, less predictable verb forms, based on alternating and possibly suppletive stems, are easier to process but are learned by rote. Inflection systems thus strike a balance between these conflicting processing and communicative requirements, while staying within tight learnability bounds, in line with Ackermann and Malouf's Low Conditional Entropy Conjecture. Our quantitative investigation supports a discriminative view of morphological inflection as a collective, emergent system, whose global self-organization rests on a surprisingly small handful of language-independent principles of word coactivation and competition. *
isi.description.allpeopleoriginal Marzi, C; Ferro, M; Pirrelli, V; *
isi.document.sourcetype WOS.ESCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.3389/fcomm.2019.00048 *
isi.identifier.eissn 2297-900X *
isi.identifier.isi WOS:000677606300001 *
isi.journal.journaltitle FRONTIERS IN COMMUNICATION *
isi.journal.journaltitleabbrev FRONT COMMUN *
isi.language.original English *
isi.publisher.place AVENUE DU TRIBUNAL FEDERAL 34, LAUSANNE, CH-1015, SWITZERLAND *
isi.relation.volume 4 *
isi.title A Processing-Oriented Investigation of Inflectional Complexity *
scopus.authority.ancejournal FRONTIERS IN COMMUNICATION###2297-900X *
scopus.category 3315 *
scopus.category 3301 *
scopus.contributor.affiliation National Research Council -
scopus.contributor.affiliation National Research Council -
scopus.contributor.affiliation National Research Council -
scopus.contributor.afid 60021199 -
scopus.contributor.afid 60021199 -
scopus.contributor.afid 60021199 -
scopus.contributor.auid 36621334800 -
scopus.contributor.auid 15759406100 -
scopus.contributor.auid 14833305800 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 104078586 -
scopus.contributor.dptid 104078586 -
scopus.contributor.dptid 104078586 -
scopus.contributor.name Claudia -
scopus.contributor.name Marcello -
scopus.contributor.name Vito -
scopus.contributor.subaffiliation Institute for Computational Linguistics; -
scopus.contributor.subaffiliation Institute for Computational Linguistics; -
scopus.contributor.subaffiliation Institute for Computational Linguistics; -
scopus.contributor.surname Marzi -
scopus.contributor.surname Ferro -
scopus.contributor.surname Pirrelli -
scopus.date.issued 2019 *
scopus.description.abstracteng Due to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we propose to investigate the issue from a processing-oriented standpoint, using data processed by a type of recurrent neural network to quantitatively model the dynamic of word processing and learning in different input conditions. We evaluate the relative complexity of a set of typologically different inflectional systems (Greek, Italian, Spanish, German, English and Standard Modern Arabic) by training a Temporal Self-Organizing Map (TSOM), a recurrent variant of Kohonen's Self-Organizing Maps, on a fixed set of verb forms from top-frequency verb paradigms, with no information about the morphosemantic and morphosyntactic content conveyed by the forms. After training, the behavior of each language-specific TSOM is assessed on different tasks, looking at self-organizing patterns of temporal connectivity and functional responses. Our simulations show that word processing is facilitated by maximally contrastive inflectional systems, where verb forms exhibit the earliest possible point of lexical discrimination. Conversely, word learning is favored by a maximally generalizable system, where forms are inferred from the smallest possible number of their paradigm companions. Based on evidence from the literature and our own data, we conjecture that the resulting balance is the outcome of the interaction between form frequency and morphological regularity. Big families of stem-sharing, regularly inflected forms are the productive core of an inflectional system. Such a core is easier to learn but slower to discriminate. In contrast, less predictable verb forms, based on alternating and possibly suppletive stems, are easier to process but are learned by rote. Inflection systems thus strike a balance between these conflicting processing and communicative requirements, while staying within tight learnability bounds, in line with Ackermann and Malouf's Low Conditional Entropy Conjecture. Our quantitative investigation supports a discriminative view of morphological inflection as a collective, emergent system, whose global self-organization rests on a surprisingly small handful of language-independent principles of word coactivation and competition. *
scopus.description.allpeopleoriginal Marzi C.; Ferro M.; Pirrelli V. *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.description.allpeopleoriginal *
scopus.document.type ar *
scopus.document.types ar *
scopus.identifier.doi 10.3389/fcomm.2019.00048 *
scopus.identifier.eissn 2297-900X *
scopus.identifier.pui 636266382 *
scopus.identifier.scopus 2-s2.0-85087505822 *
scopus.journal.sourceid 21101061926 *
scopus.language.iso eng *
scopus.publisher.name Frontiers Media S.A. *
scopus.relation.article 00048 *
scopus.relation.volume 4 *
scopus.subject.keywords discriminative learning; emergence; morphological complexity; processing uncertainty; recurrent neural networks; self-organization; stem-family size; *
scopus.title A Processing-Oriented Investigation of Inflectional Complexity *
scopus.titleeng A Processing-Oriented Investigation of Inflectional Complexity *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/392957
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 13
social impact