This paper presents an investigation aimed at studying how the linguistic structure of a sentence affects the perplexity of two of the most popular Neural Language Models (NLMs), BERT and GPT-2. We first compare the sentence-level likelihood computed with BERT and the GPT-2's perplexity showing that the two metrics are correlated. In addition, we exploit linguistic features capturing a wide set of morpho-syntactic and syntactic phenomena showing how they contribute to predict the perplexity of the two NLMs.

What Makes My Model Perplexed? A Linguistic Investigation on Neural Language Models Perplexity

Miaschi;Alessio;Brunato;Dominique;Dell'Orletta;Felice;Venturi;Giulia
2021

Abstract

This paper presents an investigation aimed at studying how the linguistic structure of a sentence affects the perplexity of two of the most popular Neural Language Models (NLMs), BERT and GPT-2. We first compare the sentence-level likelihood computed with BERT and the GPT-2's perplexity showing that the two metrics are correlated. In addition, we exploit linguistic features capturing a wide set of morpho-syntactic and syntactic phenomena showing how they contribute to predict the perplexity of the two NLMs.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Miaschi en
dc.authority.people Alessio en
dc.authority.people Brunato en
dc.authority.people Dominique en
dc.authority.people Dell'Orletta en
dc.authority.people Felice en
dc.authority.people Venturi en
dc.authority.people Giulia en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 15:08:20 -
dc.date.available 2024/02/20 15:08:20 -
dc.date.firstsubmission 2024/12/18 16:58:48 *
dc.date.issued 2021 -
dc.date.submission 2024/12/18 16:58:48 *
dc.description.abstracteng This paper presents an investigation aimed at studying how the linguistic structure of a sentence affects the perplexity of two of the most popular Neural Language Models (NLMs), BERT and GPT-2. We first compare the sentence-level likelihood computed with BERT and the GPT-2's perplexity showing that the two metrics are correlated. In addition, we exploit linguistic features capturing a wide set of morpho-syntactic and syntactic phenomena showing how they contribute to predict the perplexity of the two NLMs. -
dc.description.affiliations Università di Pisa; Istituto di Linguistica Computazionale (ILC-CNR) -
dc.description.allpeople Miaschi, Alessio; Miaschi, Alessio; Brunato, DOMINIQUE PIERINA; Brunato, DOMINIQUE PIERINA; Dell'Orletta, Felice; Dell'Orletta, Felice; Venturi, Giulia; Venturi, Giulia -
dc.description.allpeopleoriginal Miaschi, Alessio and Brunato, Dominique and Dell'Orletta, Felice and Venturi, Giulia en
dc.description.fulltext open en
dc.description.numberofauthors 8 -
dc.identifier.isbn 978-1-954085-30-5 en
dc.identifier.uri https://hdl.handle.net/20.500.14243/400474 -
dc.identifier.url https://www.aclweb.org/anthology/2021.deelio-1.5 en
dc.language.iso eng en
dc.miur.last.status.update 2024-12-18T16:10:15Z *
dc.relation.conferencedate 10/06/2021 en
dc.relation.conferencename 2nd Workshop on Knowledge Extraction and Integrationfor Deep Learning Architectures en
dc.relation.firstpage 40 en
dc.relation.ispartofbook Proceedings of the 2nd Workshop on Knowledge Extraction and Integrationfor Deep Learning Architectures en
dc.relation.lastpage 47 en
dc.relation.numberofpages 8 en
dc.subject.keywords nlp -
dc.subject.keywords interpretability -
dc.subject.keywords deep learning -
dc.subject.singlekeyword nlp *
dc.subject.singlekeyword interpretability *
dc.subject.singlekeyword deep learning *
dc.title What Makes My Model Perplexed? A Linguistic Investigation on Neural Language Models Perplexity en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 454441 -
iris.mediafilter.data 2025/04/15 04:23:28 *
iris.orcid.lastModifiedDate 2024/12/19 16:47:25 *
iris.orcid.lastModifiedMillisecond 1734623245281 *
iris.scopus.extIssued 2021 -
iris.scopus.extTitle What Makes My Model Perplexed? A Linguistic Investigation on Neural Language Models Perplexity -
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
2021.deelio-1.5.pdf

accesso aperto

Licenza: Creative commons
Dimensione 619.04 kB
Formato Adobe PDF
619.04 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/400474
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact