CNR Institutional Research Information System

In this paper we present a comparison between the linguistic knowledge encoded in the internal representations of a contextual Language Model (BERT) and a contextual-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a contextual-independent model. We also find that BERT is able to encode sentence-level properties even within single-word embeddings, obtaining comparable or even superior results than those obtained with sentence representations.

Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation

Miaschi;Alessio;Dell'Orletta;Felice

2020

Abstract

In this paper we present a comparison between the linguistic knowledge encoded in the internal representations of a contextual Language Model (BERT) and a contextual-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a contextual-independent model. We also find that BERT is able to encode sentence-level properties even within single-word embeddings, obtaining comparable or even superior results than those obtained with sentence representations.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	en
dc.authority.people	Miaschi	en
dc.authority.people	Alessio	en
dc.authority.people	Dell'Orletta	en
dc.authority.people	Felice	en
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.date.accessioned	2024/02/21 06:02:12	-
dc.date.available	2024/02/21 06:02:12	-
dc.date.firstsubmission	2024/12/20 10:16:38	*
dc.date.issued	2020	-
dc.date.submission	2024/12/20 10:16:38	*
dc.description.abstracteng	In this paper we present a comparison between the linguistic knowledge encoded in the internal representations of a contextual Language Model (BERT) and a contextual-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a contextual-independent model. We also find that BERT is able to encode sentence-level properties even within single-word embeddings, obtaining comparable or even superior results than those obtained with sentence representations.	-
dc.description.affiliations	Università di Pisa; Istituto di Linguistica Computazionale (ILC-CNR)	-
dc.description.allpeople	Miaschi, Alessio; Miaschi, Alessio; Dell'Orletta, Felice; Dell'Orletta, Felice	-
dc.description.allpeopleoriginal	Miaschi, Alessio and Dell'Orletta, Felice	en
dc.description.fulltext	open	en
dc.description.numberofauthors	4	-
dc.identifier.doi	10.18653/v1/2020.repl4nlp-1.15	en
dc.identifier.isbn	978-1-952148-15-6	en
dc.identifier.uri	https://hdl.handle.net/20.500.14243/421763	-
dc.identifier.url	https://www.aclweb.org/anthology/2020.repl4nlp-1.15	en
dc.language.iso	eng	en
dc.miur.last.status.update	2024-12-20T09:05:47Z	*
dc.relation.conferencedate	09/07/2020	en
dc.relation.conferencename	5th Workshop on Representation Learning for NLP	en
dc.relation.firstpage	110	en
dc.relation.ispartofbook	Proceedings of the 5th Workshop on Representation Learning for NLP	en
dc.relation.lastpage	119	en
dc.relation.numberofpages	10	en
dc.subject.keywords	nlp	-
dc.subject.keywords	interpretability	-
dc.subject.keywords	representation learning	-
dc.subject.singlekeyword	nlp	*
dc.subject.singlekeyword	interpretability	*
dc.subject.singlekeyword	representation learning	*
dc.title	Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.miur	273	-
dc.ugov.descaux1	442036	-
iris.mediafilter.data	2025/04/15 04:24:51	*
iris.orcid.lastModifiedDate	2024/12/20 12:07:24	*
iris.orcid.lastModifiedMillisecond	1734692844990	*
iris.scopus.extIssued	2020	-
iris.scopus.extTitle	Contextual and non-contextual word embeddings: An in-depth linguistic investigation	-
iris.sitodocente.maxattempts	1	-
iris.unpaywall.bestoaversion	publishedVersion	*
iris.unpaywall.doi	10.18653/v1/2020.repl4nlp-1.15	*
iris.unpaywall.isoa	true	*
iris.unpaywall.landingpage	https://doi.org/10.18653/v1/2020.repl4nlp-1.15	*
iris.unpaywall.license	cc-by	*
iris.unpaywall.metadataCallLastModified	01/01/2026 02:45:11	-
iris.unpaywall.metadataCallLastModifiedMillisecond	1767231911785	-
iris.unpaywall.oastatus	gold	*
iris.unpaywall.pdfurl	https://www.aclweb.org/anthology/2020.repl4nlp-1.15.pdf	*
Appare nelle tipologie:	04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
2020.repl4nlp-1.15.pdf accesso aperto Licenza: Creative commons Dimensione 1.56 MB Formato Adobe PDF Visualizza/Apri	1.56 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/421763

Citazioni

ND

ND

ND

social impact