In this paper we present a comparison between the linguistic knowledge encoded in the internal representations of a contextual Language Model (BERT) and a contextual-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a contextual-independent model. We also find that BERT is able to encode sentence-level properties even within single-word embeddings, obtaining comparable or even superior results than those obtained with sentence representations.

Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation

Miaschi;Alessio;Dell'Orletta;Felice
2020

Abstract

In this paper we present a comparison between the linguistic knowledge encoded in the internal representations of a contextual Language Model (BERT) and a contextual-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a contextual-independent model. We also find that BERT is able to encode sentence-level properties even within single-word embeddings, obtaining comparable or even superior results than those obtained with sentence representations.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Miaschi en
dc.authority.people Alessio en
dc.authority.people Dell'Orletta en
dc.authority.people Felice en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 06:02:12 -
dc.date.available 2024/02/21 06:02:12 -
dc.date.firstsubmission 2024/12/20 10:16:38 *
dc.date.issued 2020 -
dc.date.submission 2024/12/20 10:16:38 *
dc.description.abstracteng In this paper we present a comparison between the linguistic knowledge encoded in the internal representations of a contextual Language Model (BERT) and a contextual-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a contextual-independent model. We also find that BERT is able to encode sentence-level properties even within single-word embeddings, obtaining comparable or even superior results than those obtained with sentence representations. -
dc.description.affiliations Università di Pisa; Istituto di Linguistica Computazionale (ILC-CNR) -
dc.description.allpeople Miaschi, Alessio; Miaschi, Alessio; Dell'Orletta, Felice; Dell'Orletta, Felice -
dc.description.allpeopleoriginal Miaschi, Alessio and Dell'Orletta, Felice en
dc.description.fulltext open en
dc.description.numberofauthors 4 -
dc.identifier.doi 10.18653/v1/2020.repl4nlp-1.15 en
dc.identifier.isbn 978-1-952148-15-6 en
dc.identifier.uri https://hdl.handle.net/20.500.14243/421763 -
dc.identifier.url https://www.aclweb.org/anthology/2020.repl4nlp-1.15 en
dc.language.iso eng en
dc.miur.last.status.update 2024-12-20T09:05:47Z *
dc.relation.conferencedate 09/07/2020 en
dc.relation.conferencename 5th Workshop on Representation Learning for NLP en
dc.relation.firstpage 110 en
dc.relation.ispartofbook Proceedings of the 5th Workshop on Representation Learning for NLP en
dc.relation.lastpage 119 en
dc.relation.numberofpages 10 en
dc.subject.keywords nlp -
dc.subject.keywords interpretability -
dc.subject.keywords representation learning -
dc.subject.singlekeyword nlp *
dc.subject.singlekeyword interpretability *
dc.subject.singlekeyword representation learning *
dc.title Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 442036 -
iris.mediafilter.data 2025/04/15 04:24:51 *
iris.orcid.lastModifiedDate 2024/12/20 12:07:24 *
iris.orcid.lastModifiedMillisecond 1734692844990 *
iris.scopus.extIssued 2020 -
iris.scopus.extTitle Contextual and non-contextual word embeddings: An in-depth linguistic investigation -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.18653/v1/2020.repl4nlp-1.15 *
iris.unpaywall.isoa true *
iris.unpaywall.landingpage https://doi.org/10.18653/v1/2020.repl4nlp-1.15 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 01/01/2026 02:45:11 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1767231911785 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://www.aclweb.org/anthology/2020.repl4nlp-1.15.pdf *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
2020.repl4nlp-1.15.pdf

accesso aperto

Licenza: Creative commons
Dimensione 1.56 MB
Formato Adobe PDF
1.56 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/421763
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact