In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.

Linguistic Profiling of a Neural Language Model

Miaschi A;Brunato D;Dell'Orletta F;Venturi G
2020

Abstract

In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Miaschi A en
dc.authority.people Brunato D en
dc.authority.people Dell'Orletta F en
dc.authority.people Venturi G en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/02/18 14:24:16 -
dc.date.available 2024/02/18 14:24:16 -
dc.date.firstsubmission 2024/12/16 13:14:32 *
dc.date.issued 2020 -
dc.date.submission 2025/03/03 15:15:30 *
dc.description.abstracteng In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence. -
dc.description.affiliations Università di Pisa; Istituto di Linguistica Computazionale (ILC-CNR) -
dc.description.allpeople Miaschi, A; Brunato, D; Dell'Orletta, F; Venturi, G -
dc.description.allpeopleoriginal Miaschi A., Brunato D., Dell'Orletta F., Venturi G. en
dc.description.fulltext open en
dc.description.numberofauthors 4 -
dc.identifier.doi 10.18653/v1/2020.coling-main.65 en
dc.identifier.isbn 978-1-952148-27-9 en
dc.identifier.scopus 2-s2.0-85108066043 en
dc.identifier.uri https://hdl.handle.net/20.500.14243/379646 -
dc.identifier.url https://www.aclweb.org/anthology/2020.coling-main.65/ en
dc.language.iso eng en
dc.miur.last.status.update 2024-12-20T09:06:10Z *
dc.relation.conferencedate 8-13/12/2020 en
dc.relation.conferencename International Conference on Computational Linguistics (COLING) en
dc.relation.conferenceplace Online en
dc.relation.firstpage 745 en
dc.relation.ispartofbook International Conference on Computational Linguistics (COLING) en
dc.relation.lastpage 756 en
dc.relation.numberofpages 11 en
dc.subject.keywords Linguistic Profiling -
dc.subject.keywords Neural Language Model -
dc.subject.keywords Interpretability -
dc.subject.singlekeyword Linguistic Profiling *
dc.subject.singlekeyword Neural Language Model *
dc.subject.singlekeyword Interpretability *
dc.title Linguistic Profiling of a Neural Language Model en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato en
dc.ugov.descaux1 438491 -
iris.mediafilter.data 2025/03/25 03:38:35 *
iris.orcid.lastModifiedDate 2025/03/03 18:27:56 *
iris.orcid.lastModifiedMillisecond 1741022876810 *
iris.scopus.extIssued 2020 -
iris.scopus.extTitle Linguistic Profiling of a Neural Language Model -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.18653/v1/2020.coling-main.65 *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://doi.org/10.18653/v1/2020.coling-main.65 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 28/04/2026 04:33:56 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1777343636392 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://www.aclweb.org/anthology/2020.coling-main.65.pdf *
scopus.category 2614 *
scopus.category 1706 *
scopus.category 1703 *
scopus.contributor.affiliation University of Pisa -
scopus.contributor.affiliation Pisa ItaliaNLP Lab. -
scopus.contributor.affiliation Pisa ItaliaNLP Lab. -
scopus.contributor.affiliation Pisa ItaliaNLP Lab. -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57211678681 -
scopus.contributor.auid 55237740200 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 27568199800 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 109696702 -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Alessio -
scopus.contributor.name Dominique -
scopus.contributor.name Felice -
scopus.contributor.name Giulia -
scopus.contributor.subaffiliation Department of Computer Science; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli”; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli”; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli”; -
scopus.contributor.surname Miaschi -
scopus.contributor.surname Brunato -
scopus.contributor.surname Dell’Orletta -
scopus.contributor.surname Venturi -
scopus.date.issued 2020 *
scopus.description.abstracteng In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT’s capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence. *
scopus.description.allpeopleoriginal Miaschi A.; Brunato D.; Dell'Orletta F.; Venturi G. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.identifier.isbn *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.doi 10.18653/v1/2020.coling-main.65 *
scopus.identifier.isbn 9781952148279 *
scopus.identifier.pui 640522807 *
scopus.identifier.scopus 2-s2.0-85108066043 *
scopus.journal.sourceid 21101140120 *
scopus.language.iso eng *
scopus.publisher.name Association for Computational Linguistics (ACL) *
scopus.relation.conferencedate 2020 *
scopus.relation.conferencename 28th International Conference on Computational Linguistics, COLING 2020 *
scopus.relation.conferenceplace esp *
scopus.relation.firstpage 745 *
scopus.relation.lastpage 756 *
scopus.title Linguistic Profiling of a Neural Language Model *
scopus.titleeng Linguistic Profiling of a Neural Language Model *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
2020.coling-main.65.pdf

accesso aperto

Licenza: Creative commons
Dimensione 1.77 MB
Formato Adobe PDF
1.77 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/379646
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 46
  • ???jsp.display-item.citation.isi??? ND
social impact