CNR Institutional Research Information System

In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.

Linguistic Profiling of a Neural Language Model

Miaschi A;Brunato D;Dell'Orletta F;Venturi G

2020

Abstract

In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	en
dc.authority.people	Miaschi A	en
dc.authority.people	Brunato D	en
dc.authority.people	Dell'Orletta F	en
dc.authority.people	Venturi G	en
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.contributor.area	Non assegn	*
dc.contributor.area	Non assegn	*
dc.contributor.area	Non assegn	*
dc.contributor.area	Non assegn	*
dc.date.accessioned	2024/02/18 14:24:16	-
dc.date.available	2024/02/18 14:24:16	-
dc.date.firstsubmission	2024/12/16 13:14:32	*
dc.date.issued	2020	-
dc.date.submission	2025/03/03 15:15:30	*
dc.description.abstracteng	In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.	-
dc.description.affiliations	Università di Pisa; Istituto di Linguistica Computazionale (ILC-CNR)	-
dc.description.allpeople	Miaschi, A; Brunato, D; Dell'Orletta, F; Venturi, G	-
dc.description.allpeopleoriginal	Miaschi A., Brunato D., Dell'Orletta F., Venturi G.	en
dc.description.fulltext	open	en
dc.description.numberofauthors	4	-
dc.identifier.doi	10.18653/v1/2020.coling-main.65	en
dc.identifier.isbn	978-1-952148-27-9	en
dc.identifier.scopus	2-s2.0-85108066043	en
dc.identifier.uri	https://hdl.handle.net/20.500.14243/379646	-
dc.identifier.url	https://www.aclweb.org/anthology/2020.coling-main.65/	en
dc.language.iso	eng	en
dc.miur.last.status.update	2024-12-20T09:06:10Z	*
dc.relation.conferencedate	8-13/12/2020	en
dc.relation.conferencename	International Conference on Computational Linguistics (COLING)	en
dc.relation.conferenceplace	Online	en
dc.relation.firstpage	745	en
dc.relation.ispartofbook	International Conference on Computational Linguistics (COLING)	en
dc.relation.lastpage	756	en
dc.relation.numberofpages	11	en
dc.subject.keywords	Linguistic Profiling	-
dc.subject.keywords	Neural Language Model	-
dc.subject.keywords	Interpretability	-
dc.subject.singlekeyword	Linguistic Profiling	*
dc.subject.singlekeyword	Neural Language Model	*
dc.subject.singlekeyword	Interpretability	*
dc.title	Linguistic Profiling of a Neural Language Model	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.miur	273	-
dc.type.referee	Sì, ma tipo non specificato	en
dc.ugov.descaux1	438491	-
iris.mediafilter.data	2025/03/25 03:38:35	*
iris.orcid.lastModifiedDate	2025/03/03 18:27:56	*
iris.orcid.lastModifiedMillisecond	1741022876810	*
iris.scopus.extIssued	2020	-
iris.scopus.extTitle	Linguistic Profiling of a Neural Language Model	-
iris.sitodocente.maxattempts	1	-
iris.unpaywall.bestoaversion	publishedVersion	*
iris.unpaywall.doi	10.18653/v1/2020.coling-main.65	*
iris.unpaywall.isoa	true	*
iris.unpaywall.journalisindoaj	false	*
iris.unpaywall.landingpage	https://doi.org/10.18653/v1/2020.coling-main.65	*
iris.unpaywall.license	cc-by	*
iris.unpaywall.metadataCallLastModified	28/04/2026 04:33:56	-
iris.unpaywall.metadataCallLastModifiedMillisecond	1777343636392	-
iris.unpaywall.oastatus	gold	*
iris.unpaywall.pdfurl	https://www.aclweb.org/anthology/2020.coling-main.65.pdf	*
scopus.category	2614	*
scopus.category	1706	*
scopus.category	1703	*
scopus.contributor.affiliation	University of Pisa	-
scopus.contributor.affiliation	Pisa ItaliaNLP Lab.	-
scopus.contributor.affiliation	Pisa ItaliaNLP Lab.	-
scopus.contributor.affiliation	Pisa ItaliaNLP Lab.	-
scopus.contributor.afid	60028868	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.auid	57211678681	-
scopus.contributor.auid	55237740200	-
scopus.contributor.auid	57540567000	-
scopus.contributor.auid	27568199800	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.dptid	109696702	-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.name	Alessio	-
scopus.contributor.name	Dominique	-
scopus.contributor.name	Felice	-
scopus.contributor.name	Giulia	-
scopus.contributor.subaffiliation	Department of Computer Science;	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale “Antonio Zampolli”;	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale “Antonio Zampolli”;	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale “Antonio Zampolli”;	-
scopus.contributor.surname	Miaschi	-
scopus.contributor.surname	Brunato	-
scopus.contributor.surname	Dell’Orletta	-
scopus.contributor.surname	Venturi	-
scopus.date.issued	2020	*
scopus.description.abstracteng	In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT’s capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.	*
scopus.description.allpeopleoriginal	Miaschi A.; Brunato D.; Dell'Orletta F.; Venturi G.	*
scopus.differences	scopus.relation.conferencename	*
scopus.differences	scopus.publisher.name	*
scopus.differences	scopus.relation.conferencedate	*
scopus.differences	scopus.identifier.isbn	*
scopus.differences	scopus.description.allpeopleoriginal	*
scopus.differences	scopus.description.abstracteng	*
scopus.differences	scopus.relation.conferenceplace	*
scopus.document.type	cp	*
scopus.document.types	cp	*
scopus.identifier.doi	10.18653/v1/2020.coling-main.65	*
scopus.identifier.isbn	9781952148279	*
scopus.identifier.pui	640522807	*
scopus.identifier.scopus	2-s2.0-85108066043	*
scopus.journal.sourceid	21101140120	*
scopus.language.iso	eng	*
scopus.publisher.name	Association for Computational Linguistics (ACL)	*
scopus.relation.conferencedate	2020	*
scopus.relation.conferencename	28th International Conference on Computational Linguistics, COLING 2020	*
scopus.relation.conferenceplace	esp	*
scopus.relation.firstpage	745	*
scopus.relation.lastpage	756	*
scopus.title	Linguistic Profiling of a Neural Language Model	*
scopus.titleeng	Linguistic Profiling of a Neural Language Model	*
Appare nelle tipologie:	04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
2020.coling-main.65.pdf accesso aperto Licenza: Creative commons Dimensione 1.77 MB Formato Adobe PDF Visualizza/Apri	1.77 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/379646

Citazioni

ND

46

ND

social impact