Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their recombination. The second approach dispenses with segmentation: lexical representations form part of a network of associatively related inflected forms. Production of a novel form consists in filling in one empty node in the network. Here, we present the results of a task of word inflection by a recurrent LSTM network that learns to fill in paradigm cells of incomplete verb paradigms. Although the task does not require morpheme segmentation, we show that accuracy in carrying out the inflection task is a function of the model's sensitivity to paradigm distribution and morphological structure.
Deep Learning of Inflection and the Cell-Filling Problem
Cardillo FAPrimo
;Ferro MSecondo
;Marzi CPenultimo
;Pirrelli VUltimo
2018
Abstract
Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their recombination. The second approach dispenses with segmentation: lexical representations form part of a network of associatively related inflected forms. Production of a novel form consists in filling in one empty node in the network. Here, we present the results of a task of word inflection by a recurrent LSTM network that learns to fill in paradigm cells of incomplete verb paradigms. Although the task does not require morpheme segmentation, we show that accuracy in carrying out the inflection task is a function of the model's sensitivity to paradigm distribution and morphological structure.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.ancejournal | IJCOL | en |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Cardillo FA | en |
| dc.authority.people | Ferro M | en |
| dc.authority.people | Marzi C | en |
| dc.authority.people | Pirrelli V | en |
| dc.collection.id.s | b3f88f24-048a-4e43-8ab1-6697b90e068e | * |
| dc.collection.name | 01.01 Articolo in rivista | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/16 06:53:04 | - |
| dc.date.available | 2024/02/16 06:53:04 | - |
| dc.date.firstsubmission | 2024/09/26 15:41:06 | * |
| dc.date.issued | 2018 | - |
| dc.date.submission | 2024/09/26 15:41:06 | * |
| dc.description.abstracteng | Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their recombination. The second approach dispenses with segmentation: lexical representations form part of a network of associatively related inflected forms. Production of a novel form consists in filling in one empty node in the network. Here, we present the results of a task of word inflection by a recurrent LSTM network that learns to fill in paradigm cells of incomplete verb paradigms. Although the task does not require morpheme segmentation, we show that accuracy in carrying out the inflection task is a function of the model's sensitivity to paradigm distribution and morphological structure. | - |
| dc.description.affiliations | ILC-CNR; ILC-CNR; ILC-CNR; ILC-CNR; | - |
| dc.description.allpeople | Cardillo, Fa; Ferro, M; Marzi, C; Pirrelli, V | - |
| dc.description.allpeopleoriginal | Cardillo, F.A.; Ferro, M.; Marzi, C.; Pirrelli, V. | en |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 4 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/355603 | - |
| dc.identifier.url | https://publications.cnr.it/doc/396348 | en |
| dc.language.iso | eng | en |
| dc.miur.last.status.update | 2024-09-26T13:41:19Z | * |
| dc.relation.firstpage | 57 | en |
| dc.relation.issue | 1 | en |
| dc.relation.lastpage | 75 | en |
| dc.relation.medium | ELETTRONICO | en |
| dc.relation.numberofpages | 21 | en |
| dc.relation.volume | 4 | en |
| dc.subject.keywordseng | Deep Learning | - |
| dc.subject.keywordseng | LSTM | - |
| dc.subject.keywordseng | Cell-Filling Problem | - |
| dc.subject.singlekeyword | Deep Learning | * |
| dc.subject.singlekeyword | LSTM | * |
| dc.subject.singlekeyword | Cell-Filling Problem | * |
| dc.title | Deep Learning of Inflection and the Cell-Filling Problem | en |
| dc.type.circulation | Internazionale | en |
| dc.type.driver | info:eu-repo/semantics/article | - |
| dc.type.full | 01 Contributo su Rivista::01.01 Articolo in rivista | it |
| dc.type.miur | 262 | - |
| dc.type.referee | Comitato scientifico | en |
| dc.ugov.descaux1 | 396348 | - |
| iris.orcid.lastModifiedDate | 2024/11/29 18:14:37 | * |
| iris.orcid.lastModifiedMillisecond | 1732900477088 | * |
| iris.sitodocente.maxattempts | 1 | - |
| Appare nelle tipologie: | 01.01 Articolo in rivista | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


