Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties

Miaschi, Alessio; Sarti, Gabriele; Brunato, Dominique; Dell'Orletta, Felice; Venturi, Giulia
2022

Abstract

In this paper, we present an in-depth investigation of the linguistic knowledge encoded by the transformer models currently available for the Italian language. In particular, we investigate how the complexity of two different architectures of probing models affects the performance of the Transformers in encoding a wide spectrum of linguistic features. Moreover, we explore how this implicit knowledge varies according to different textual genres and language varieties.
DC field Value Language
dc.authority.ancejournal IJCOL en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Miaschi, Alessio en
dc.authority.people Sarti, Gabriele en
dc.authority.people Brunato, Dominique en
dc.authority.people Dell'Orletta, Felice en
dc.authority.people Venturi, Giulia en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Journal article *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.date.accessioned 2024/02/21 03:23:35 -
dc.date.available 2024/02/21 03:23:35 -
dc.date.firstsubmission 2025/01/24 14:53:46 *
dc.date.issued 2022 -
dc.date.submission 2025/01/24 17:35:57 *
dc.description.affiliations Department of Computer Science, Università di Pisa; Center for Language and Cognition, University of Groningen; Istituto di Linguistica Computazionale "Antonio Zampolli", CNR, Pisa - ItaliaNLP Lab -
dc.description.allpeople Miaschi, Alessio; Sarti, Gabriele; Brunato, Dominique Pierina; Dell'Orletta, Felice; Venturi, Giulia -
dc.description.allpeopleoriginal Miaschi, Alessio and Sarti, Gabriele and Brunato, Dominique and Dell'Orletta, Felice and Venturi, Giulia en
dc.description.fulltext open en
dc.description.numberofauthors 5 -
dc.identifier.doi 10.4000/ijcol.965 en
dc.identifier.scopus 2-s2.0-85205875190 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/443057 -
dc.identifier.url http://www.aaccademia.it/ita/scheda-libro?aaref=1518 en
dc.language.iso eng en
dc.miur.last.status.update 2025-01-24T13:54:36Z *
dc.relation.firstpage 25 en
dc.relation.lastpage 44 en
dc.relation.numberofpages 20 en
dc.subject.keywords Neural Language Models -
dc.subject.keywords Interpretability -
dc.subject.keywords Language Varieties -
dc.subject.singlekeyword Neural Language Models *
dc.subject.singlekeyword Interpretability *
dc.subject.singlekeyword Language Varieties *
dc.title Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Journal contribution::01.01 Journal article it
dc.type.miur 262 -
dc.ugov.descaux1 469733 -
iris.mediafilter.data 2025/04/12 03:23:51 *
iris.orcid.lastModifiedDate 2025/02/05 10:19:13 *
iris.orcid.lastModifiedMillisecond 1738747153020 *
iris.scopus.extIssued 2022 -
iris.scopus.extTitle Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost publisher *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.4000/ijcol.965 *
iris.unpaywall.hosttype publisher *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj true *
iris.unpaywall.landingpage https://doi.org/10.4000/ijcol.965 *
iris.unpaywall.license cc-by-nc-nd *
iris.unpaywall.metadataCallLastModified 13/02/2026 03:30:51 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1770949851137 -
iris.unpaywall.oastatus gold *
iris.unpaywall.pdfurl https://journals.openedition.org/ijcol/pdf/965 *
scopus.authority.ancejournal IJCOL###2499-4553 *
scopus.category 1709 *
scopus.category 3310 *
scopus.category 1703 *
scopus.category 1702 *
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation University of Groningen -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60010023 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57211678681 -
scopus.contributor.auid 55237740200 -
scopus.contributor.auid 27568199800 -
scopus.contributor.auid 57220744180 -
scopus.contributor.auid 57540567000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Netherlands -
scopus.contributor.country Italy -
scopus.contributor.dptid 109696702 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 103548396 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Alessio -
scopus.contributor.name Dominique -
scopus.contributor.name Giulia -
scopus.contributor.name Gabriele -
scopus.contributor.name Felice -
scopus.contributor.subaffiliation Department of Computer Science;Università di Pisa;Istituto di Linguistica Computazionale “Antonio Zampolli”;CNR; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli”;CNR; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli”;CNR; -
scopus.contributor.subaffiliation Center for Language and Cognition; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “Antonio Zampolli”;CNR; -
scopus.contributor.surname Miaschi -
scopus.contributor.surname Brunato -
scopus.contributor.surname Venturi -
scopus.contributor.surname Sarti -
scopus.contributor.surname Dell'Orletta -
scopus.date.issued 2022 *
scopus.description.allpeopleoriginal Miaschi A.; Brunato D.; Venturi G.; Sarti G.; Dell'Orletta F. *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.relation.issue *
scopus.differences scopus.relation.volume *
scopus.document.type ar *
scopus.document.types ar *
scopus.identifier.doi 10.4000/ijcol.965 *
scopus.identifier.eissn 2499-4553 *
scopus.identifier.pui 2031364966 *
scopus.identifier.scopus 2-s2.0-85205875190 *
scopus.journal.sourceid 21101252471 *
scopus.language.iso eng *
scopus.publisher.name Accademia University Press *
scopus.relation.firstpage 25 *
scopus.relation.issue 1 *
scopus.relation.lastpage 44 *
scopus.relation.volume 8 *
scopus.title Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties *
scopus.titleeng Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties *
Appears in categories: 01.01 Journal article
Files in this item:
File: prod_469733-doc_190324.pdf (open access)
Description: Probing_Linguistic_Knowledge_in_Italian_Neural_Language_Models_across_Language_Varieties
Type: Publisher's version (PDF)
License: Creative Commons
Size: 1.74 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/443057
Citations
  • Scopus: 4