CNR Institutional Research Information System

Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their recombination. The second approach dispenses with segmentation: lexical representations form part of a network of associatively related inflected forms. Production of a novel form consists in filling in one empty node in the network. Here, we present the results of a task of word inflection by a recurrent LSTM network that learns to fill in paradigm cells of incomplete verb paradigms. Although the task does not require morpheme segmentation, we show that accuracy in carrying out the inflection task is a function of the model's sensitivity to paradigm distribution and morphological structure.

Deep Learning of Inflection and the Cell-Filling Problem

Cardillo FA^Primo;Ferro M^Secondo;Marzi C^Penultimo;Pirrelli V^Ultimo

2018

Abstract

Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their recombination. The second approach dispenses with segmentation: lexical representations form part of a network of associatively related inflected forms. Production of a novel form consists in filling in one empty node in the network. Here, we present the results of a task of word inflection by a recurrent LSTM network that learns to fill in paradigm cells of incomplete verb paradigms. Although the task does not require morpheme segmentation, we show that accuracy in carrying out the inflection task is a function of the model's sensitivity to paradigm distribution and morphological structure.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.ancejournal	IJCOL	en
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	en
dc.authority.people	Cardillo FA	en
dc.authority.people	Ferro M	en
dc.authority.people	Marzi C	en
dc.authority.people	Pirrelli V	en
dc.collection.id.s	b3f88f24-048a-4e43-8ab1-6697b90e068e	*
dc.collection.name	01.01 Articolo in rivista	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.date.accessioned	2024/02/16 06:53:04	-
dc.date.available	2024/02/16 06:53:04	-
dc.date.firstsubmission	2024/09/26 15:41:06	*
dc.date.issued	2018	-
dc.date.submission	2024/09/26 15:41:06	*
dc.description.abstracteng	Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their recombination. The second approach dispenses with segmentation: lexical representations form part of a network of associatively related inflected forms. Production of a novel form consists in filling in one empty node in the network. Here, we present the results of a task of word inflection by a recurrent LSTM network that learns to fill in paradigm cells of incomplete verb paradigms. Although the task does not require morpheme segmentation, we show that accuracy in carrying out the inflection task is a function of the model's sensitivity to paradigm distribution and morphological structure.	-
dc.description.affiliations	ILC-CNR; ILC-CNR; ILC-CNR; ILC-CNR;	-
dc.description.allpeople	Cardillo, Fa; Ferro, M; Marzi, C; Pirrelli, V	-
dc.description.allpeopleoriginal	Cardillo, F.A.; Ferro, M.; Marzi, C.; Pirrelli, V.	en
dc.description.fulltext	none	en
dc.description.numberofauthors	4	-
dc.identifier.uri	https://hdl.handle.net/20.500.14243/355603	-
dc.identifier.url	https://publications.cnr.it/doc/396348	en
dc.language.iso	eng	en
dc.miur.last.status.update	2024-09-26T13:41:19Z	*
dc.relation.firstpage	57	en
dc.relation.issue	1	en
dc.relation.lastpage	75	en
dc.relation.medium	ELETTRONICO	en
dc.relation.numberofpages	21	en
dc.relation.volume	4	en
dc.subject.keywordseng	Deep Learning	-
dc.subject.keywordseng	LSTM	-
dc.subject.keywordseng	Cell-Filling Problem	-
dc.subject.singlekeyword	Deep Learning	*
dc.subject.singlekeyword	LSTM	*
dc.subject.singlekeyword	Cell-Filling Problem	*
dc.title	Deep Learning of Inflection and the Cell-Filling Problem	en
dc.type.circulation	Internazionale	en
dc.type.driver	info:eu-repo/semantics/article	-
dc.type.full	01 Contributo su Rivista::01.01 Articolo in rivista	it
dc.type.miur	262	-
dc.type.referee	Comitato scientifico	en
dc.ugov.descaux1	396348	-
iris.orcid.lastModifiedDate	2024/11/29 18:14:37	*
iris.orcid.lastModifiedMillisecond	1732900477088	*
iris.sitodocente.maxattempts	1	-
Appare nelle tipologie:	01.01 Articolo in rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/355603

Citazioni

ND

ND

ND

social impact