CNR Institutional Research Information System

We study the influence of context on how humans evaluate the complexity of a sentence in English. We collect a new dataset of sentences, where each sentence is rated for perceived complexity within different contextual windows. We carry out an in-depth analysis to detect which linguistic features correlate more with complexity judgments and with the degree of agreement among annotators. We train several regression models, using either explicit linguistic features or contextualized word embeddings, to predict the mean complexity values assigned to sentences in the different contextual windows, as well as their standard deviation. Results show that models leveraging explicit features capturing morphosyntactic and syntactic phenomena perform always better, especially when they have access to features extracted from all contextual sentences.

Sentence Complexity in Context

Iavarone B;Brunato D;Dell'orletta F

2021

Abstract

We study the influence of context on how humans evaluate the complexity of a sentence in English. We collect a new dataset of sentences, where each sentence is rated for perceived complexity within different contextual windows. We carry out an in-depth analysis to detect which linguistic features correlate more with complexity judgments and with the degree of agreement among annotators. We train several regression models, using either explicit linguistic features or contextualized word embeddings, to predict the mean complexity values assigned to sentences in the different contextual windows, as well as their standard deviation. Results show that models leveraging explicit features capturing morphosyntactic and syntactic phenomena perform always better, especially when they have access to features extracted from all contextual sentences.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.people	Iavarone B	it
dc.authority.people	Brunato D	it
dc.authority.people	Dell'orletta F	it
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.date.accessioned	2024/02/20 13:13:45	-
dc.date.available	2024/02/20 13:13:45	-
dc.date.issued	2021	-
dc.description.abstracteng	We study the influence of context on how humans evaluate the complexity of a sentence in English. We collect a new dataset of sentences, where each sentence is rated for perceived complexity within different contextual windows. We carry out an in-depth analysis to detect which linguistic features correlate more with complexity judgments and with the degree of agreement among annotators. We train several regression models, using either explicit linguistic features or contextualized word embeddings, to predict the mean complexity values assigned to sentences in the different contextual windows, as well as their standard deviation. Results show that models leveraging explicit features capturing morphosyntactic and syntactic phenomena perform always better, especially when they have access to features extracted from all contextual sentences.	-
dc.description.affiliations	Scuola Normale Superiore, Scuola Normale Superiore, Italy; Istituto Di Linguistica Computazionale Antonio Zampolli, Pisa, Istituto di Linguistica Computazionale "Antonio Zampolli", Pisa, , Italy	-
dc.description.allpeople	Iavarone, B; Brunato, D; Dell'Orletta, F	-
dc.description.allpeopleoriginal	Iavarone B ; Brunato D ; Dell'orletta F	-
dc.description.fulltext	none	en
dc.description.numberofauthors	3	-
dc.identifier.doi	10.18653/v1/2021.cmcl-1.23	-
dc.identifier.isbn	978-1-954085-35-0	-
dc.identifier.scopus	2-s2.0-85123172973	-
dc.identifier.uri	https://hdl.handle.net/20.500.14243/440176	-
dc.identifier.url	http://www.scopus.com/record/display.url?eid=2-s2.0-85123172973&origin=inward	-
dc.language.iso	eng	-
dc.relation.conferencedate	10/06/2021	-
dc.relation.conferencename	Proceedings of Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2021)	-
dc.relation.firstpage	186	-
dc.relation.lastpage	199	-
dc.subject.keywords	linguistic complexity	-
dc.subject.keywords	crowdsourcing	-
dc.subject.keywords	human perception	-
dc.subject.singlekeyword	linguistic complexity	*
dc.subject.singlekeyword	crowdsourcing	*
dc.subject.singlekeyword	human perception	*
dc.title	Sentence Complexity in Context	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.miur	273	-
dc.type.referee	Sì, ma tipo non specificato	-
dc.ugov.descaux1	464975	-
iris.orcid.lastModifiedDate	2024/04/05 00:24:57	*
iris.orcid.lastModifiedMillisecond	1712269497991	*
iris.scopus.extIssued	2021	-
iris.scopus.extTitle	Sentence Complexity in Context	-
iris.sitodocente.maxattempts	2	-
iris.unpaywall.bestoaversion	publishedVersion	*
iris.unpaywall.doi	10.18653/v1/2021.cmcl-1.23	*
iris.unpaywall.isoa	true	*
iris.unpaywall.journalisindoaj	false	*
iris.unpaywall.landingpage	https://doi.org/10.18653/v1/2021.cmcl-1.23	*
iris.unpaywall.license	cc-by	*
iris.unpaywall.metadataCallLastModified	16/06/2026 06:01:11	-
iris.unpaywall.metadataCallLastModifiedMillisecond	1781582471957	-
iris.unpaywall.oastatus	gold	*
iris.unpaywall.pdfurl	https://aclanthology.org/2021.cmcl-1.23.pdf	*
scopus.category	1203	*
scopus.category	3616	*
scopus.contributor.affiliation	Istituto Di Linguistica Computazionale Antonio Zampolli	-
scopus.contributor.affiliation	Istituto Di Linguistica Computazionale Antonio Zampolli	-
scopus.contributor.affiliation	Istituto Di Linguistica Computazionale Antonio Zampolli	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.auid	57204819416	-
scopus.contributor.auid	55237740200	-
scopus.contributor.auid	57540567000	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.name	Benedetta	-
scopus.contributor.name	Dominique	-
scopus.contributor.name	Felice	-
scopus.contributor.subaffiliation		-
scopus.contributor.subaffiliation		-
scopus.contributor.subaffiliation		-
scopus.contributor.surname	Iavarone	-
scopus.contributor.surname	Brunato	-
scopus.contributor.surname	Dell'orletta	-
scopus.date.issued	2021	*
scopus.description.abstracteng	We study the influence of context on how humans evaluate the complexity of a sentence in English. We collect a new dataset of sentences, where each sentence is rated for perceived complexity within different contextual windows. We carry out an in-depth analysis to detect which linguistic features correlate more with complexity judgments and with the degree of agreement among annotators. We train several regression models, using either explicit linguistic features or contextualized word embeddings, to predict the mean complexity values assigned to sentences in the different contextual windows, as well as their standard deviation. Results show that models leveraging explicit features capturing morphosyntactic and syntactic phenomena perform always better, especially when they have access to features extracted from all contextual sentences.	*
scopus.description.allpeopleoriginal	Iavarone B.; Brunato D.; Dell'orletta F.	*
scopus.differences	scopus.relation.conferencename	*
scopus.differences	scopus.publisher.name	*
scopus.differences	scopus.relation.conferencedate	*
scopus.differences	scopus.identifier.isbn	*
scopus.differences	scopus.description.allpeopleoriginal	*
scopus.document.type	cp	*
scopus.document.types	cp	*
scopus.identifier.doi	10.18653/v1/2021.cmcl-1.23	*
scopus.identifier.isbn	9781954085350	*
scopus.identifier.pui	636990649	*
scopus.identifier.scopus	2-s2.0-85123172973	*
scopus.journal.sourceid	21101073272	*
scopus.language.iso	eng	*
scopus.publisher.name	Association for Computational Linguistics (ACL)	*
scopus.relation.conferencedate	2021	*
scopus.relation.conferencename	11th Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2021	*
scopus.relation.firstpage	186	*
scopus.relation.lastpage	199	*
scopus.title	Sentence Complexity in Context	*
scopus.titleeng	Sentence Complexity in Context	*
Appare nelle tipologie:	04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/440176

Citazioni

ND

18

ND

social impact