In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.
Linguistic Profiling of a Neural Language Model
Miaschi A;Brunato D;Dell'Orletta F;Venturi G
2020
Abstract
In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Miaschi A | en |
| dc.authority.people | Brunato D | en |
| dc.authority.people | Dell'Orletta F | en |
| dc.authority.people | Venturi G | en |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2024/02/18 14:24:16 | - |
| dc.date.available | 2024/02/18 14:24:16 | - |
| dc.date.firstsubmission | 2024/12/16 13:14:32 | * |
| dc.date.issued | 2020 | - |
| dc.date.submission | 2025/03/03 15:15:30 | * |
| dc.description.abstracteng | In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence. | - |
| dc.description.affiliations | Università di Pisa; Istituto di Linguistica Computazionale (ILC-CNR) | - |
| dc.description.allpeople | Miaschi, A; Brunato, D; Dell'Orletta, F; Venturi, G | - |
| dc.description.allpeopleoriginal | Miaschi A., Brunato D., Dell'Orletta F., Venturi G. | en |
| dc.description.fulltext | open | en |
| dc.description.numberofauthors | 4 | - |
| dc.identifier.doi | 10.18653/v1/2020.coling-main.65 | en |
| dc.identifier.isbn | 978-1-952148-27-9 | en |
| dc.identifier.scopus | 2-s2.0-85108066043 | en |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/379646 | - |
| dc.identifier.url | https://www.aclweb.org/anthology/2020.coling-main.65/ | en |
| dc.language.iso | eng | en |
| dc.miur.last.status.update | 2024-12-20T09:06:10Z | * |
| dc.relation.conferencedate | 8-13/12/2020 | en |
| dc.relation.conferencename | International Conference on Computational Linguistics (COLING) | en |
| dc.relation.conferenceplace | Online | en |
| dc.relation.firstpage | 745 | en |
| dc.relation.ispartofbook | International Conference on Computational Linguistics (COLING) | en |
| dc.relation.lastpage | 756 | en |
| dc.relation.numberofpages | 11 | en |
| dc.subject.keywords | Linguistic Profiling | - |
| dc.subject.keywords | Neural Language Model | - |
| dc.subject.keywords | Interpretability | - |
| dc.subject.singlekeyword | Linguistic Profiling | * |
| dc.subject.singlekeyword | Neural Language Model | * |
| dc.subject.singlekeyword | Interpretability | * |
| dc.title | Linguistic Profiling of a Neural Language Model | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | en |
| dc.ugov.descaux1 | 438491 | - |
| iris.mediafilter.data | 2025/03/25 03:38:35 | * |
| iris.orcid.lastModifiedDate | 2025/03/03 18:27:56 | * |
| iris.orcid.lastModifiedMillisecond | 1741022876810 | * |
| iris.scopus.extIssued | 2020 | - |
| iris.scopus.extTitle | Linguistic Profiling of a Neural Language Model | - |
| iris.sitodocente.maxattempts | 1 | - |
| iris.unpaywall.bestoaversion | publishedVersion | * |
| iris.unpaywall.doi | 10.18653/v1/2020.coling-main.65 | * |
| iris.unpaywall.isoa | true | * |
| iris.unpaywall.journalisindoaj | false | * |
| iris.unpaywall.landingpage | https://doi.org/10.18653/v1/2020.coling-main.65 | * |
| iris.unpaywall.license | cc-by | * |
| iris.unpaywall.metadataCallLastModified | 28/04/2026 04:33:56 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1777343636392 | - |
| iris.unpaywall.oastatus | gold | * |
| iris.unpaywall.pdfurl | https://www.aclweb.org/anthology/2020.coling-main.65.pdf | * |
| scopus.category | 2614 | * |
| scopus.category | 1706 | * |
| scopus.category | 1703 | * |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | Pisa ItaliaNLP Lab. | - |
| scopus.contributor.affiliation | Pisa ItaliaNLP Lab. | - |
| scopus.contributor.affiliation | Pisa ItaliaNLP Lab. | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.auid | 57211678681 | - |
| scopus.contributor.auid | 55237740200 | - |
| scopus.contributor.auid | 57540567000 | - |
| scopus.contributor.auid | 27568199800 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | 109696702 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.name | Alessio | - |
| scopus.contributor.name | Dominique | - |
| scopus.contributor.name | Felice | - |
| scopus.contributor.name | Giulia | - |
| scopus.contributor.subaffiliation | Department of Computer Science; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli”; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli”; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale “Antonio Zampolli”; | - |
| scopus.contributor.surname | Miaschi | - |
| scopus.contributor.surname | Brunato | - |
| scopus.contributor.surname | Dell’Orletta | - |
| scopus.contributor.surname | Venturi | - |
| scopus.date.issued | 2020 | * |
| scopus.description.abstracteng | In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT’s capacity to encode different kind of linguistic properties has a positive influence on its predictions: the more it stores readable linguistic information of a sentence, the higher will be its capacity of predicting the expected label assigned to that sentence. | * |
| scopus.description.allpeopleoriginal | Miaschi A.; Brunato D.; Dell'Orletta F.; Venturi G. | * |
| scopus.differences | scopus.relation.conferencename | * |
| scopus.differences | scopus.publisher.name | * |
| scopus.differences | scopus.relation.conferencedate | * |
| scopus.differences | scopus.identifier.isbn | * |
| scopus.differences | scopus.description.allpeopleoriginal | * |
| scopus.differences | scopus.description.abstracteng | * |
| scopus.differences | scopus.relation.conferenceplace | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.identifier.doi | 10.18653/v1/2020.coling-main.65 | * |
| scopus.identifier.isbn | 9781952148279 | * |
| scopus.identifier.pui | 640522807 | * |
| scopus.identifier.scopus | 2-s2.0-85108066043 | * |
| scopus.journal.sourceid | 21101140120 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | Association for Computational Linguistics (ACL) | * |
| scopus.relation.conferencedate | 2020 | * |
| scopus.relation.conferencename | 28th International Conference on Computational Linguistics, COLING 2020 | * |
| scopus.relation.conferenceplace | esp | * |
| scopus.relation.firstpage | 745 | * |
| scopus.relation.lastpage | 756 | * |
| scopus.title | Linguistic Profiling of a Neural Language Model | * |
| scopus.titleeng | Linguistic Profiling of a Neural Language Model | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
2020.coling-main.65.pdf
accesso aperto
Licenza:
Creative commons
Dimensione
1.77 MB
Formato
Adobe PDF
|
1.77 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


