In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5 [1], on a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of the topic and the style of a set of Italian posts. We assess the model in two different configurations, single- and multi-task classification, and we compare it with a more traditional NLM based on the Transformer architecture (i.e. BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation on the impact of label representations in modeling the classification of the IT5 model. Results show that IT5 could achieve good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the Text-to-text model in a multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts the classification of the topic.

Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts

Miaschi Alessio;Dell'Orletta Felice
2022

Abstract

In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5 [1], on a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of the topic and the style of a set of Italian posts. We assess the model in two different configurations, single- and multi-task classification, and we compare it with a more traditional NLM based on the Transformer architecture (i.e. BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation on the impact of label representations in modeling the classification of the IT5 model. Results show that IT5 could achieve good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the Text-to-text model in a multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts the classification of the topic.
Campo DC Valore Lingua
dc.authority.anceserie CEUR WORKSHOP PROCEEDINGS -
dc.authority.anceserie CEUR Workshop Proceedings -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Papucci Michele it
dc.authority.people De Nigris Chiara it
dc.authority.people Miaschi Alessio it
dc.authority.people Dell'Orletta Felice it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 07:43:50 -
dc.date.available 2024/02/21 07:43:50 -
dc.date.issued 2022 -
dc.description.abstracteng In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5 [1], on a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of the topic and the style of a set of Italian posts. We assess the model in two different configurations, single- and multi-task classification, and we compare it with a more traditional NLM based on the Transformer architecture (i.e. BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation on the impact of label representations in modeling the classification of the IT5 model. Results show that IT5 could achieve good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the Text-to-text model in a multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts the classification of the topic. -
dc.description.affiliations Università di Pisa, Pisa; Istituto Di Linguistica Computazionale "A. Zampolli" ((ILC-CNR), ItaliaNLP Lab, Pisa; TALIA S.r.l. -
dc.description.allpeople Papucci, Michele; De Nigris, Chiara; Miaschi, Alessio; Dell'Orletta, Felice -
dc.description.allpeopleoriginal Papucci, Michele; De Nigris, Chiara; Miaschi, Alessio; Dell'Orletta, Felice -
dc.description.fulltext none en
dc.description.numberofauthors 4 -
dc.identifier.scopus 2-s2.0-85143252156 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/415084 -
dc.identifier.url http://www.scopus.com/record/display.url?eid=2-s2.0-85143252156&origin=inward -
dc.language.iso eng -
dc.miur.last.status.update 2024-12-20T09:02:37Z *
dc.relation.conferencedate 30/11/2022 -
dc.relation.conferencename Sixth Workshop on Natural Language for Artificial Intelligence, NL4AI 2022 -
dc.relation.firstpage 56 -
dc.relation.lastpage 70 -
dc.relation.volume 3287 -
dc.subject.keywords bert -
dc.subject.keywords style classification -
dc.subject.keywords t5 -
dc.subject.keywords text-to-text -
dc.subject.keywords topic classification -
dc.subject.keywords transformers -
dc.subject.singlekeyword bert *
dc.subject.singlekeyword style classification *
dc.subject.singlekeyword t5 *
dc.subject.singlekeyword text-to-text *
dc.subject.singlekeyword topic classification *
dc.subject.singlekeyword transformers *
dc.title Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 474890 -
iris.orcid.lastModifiedDate 2024/04/04 16:17:39 *
iris.orcid.lastModifiedMillisecond 1712240259730 *
iris.scopus.extIssued 2022 -
iris.scopus.extTitle Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts -
iris.sitodocente.maxattempts 1 -
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation TALIA S.r.l. -
scopus.contributor.affiliation Università di Pisa -
scopus.contributor.affiliation NLP Lab -
scopus.contributor.affiliation NLP Lab -
scopus.contributor.afid 128940566 -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57991631200 -
scopus.contributor.auid 57991570000 -
scopus.contributor.auid 57211678681 -
scopus.contributor.auid 57540567000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Michele -
scopus.contributor.name Chiara -
scopus.contributor.name Alessio -
scopus.contributor.name Felice -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR); -
scopus.contributor.surname Papucci -
scopus.contributor.surname De Nigris -
scopus.contributor.surname Miaschi -
scopus.contributor.surname Dell'Orletta -
scopus.date.issued 2022 *
scopus.description.abstracteng In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5 [1], on a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of the topic and the style of a set of Italian posts. We assess the model in two different configurations, single- and multi-task classification, and we compare it with a more traditional NLM based on the Transformer architecture (i.e. BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation on the impact of label representations in modeling the classification of the IT5 model. Results show that IT5 could achieve good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the Text-to-text model in a multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts the classification of the topic. *
scopus.description.allpeopleoriginal Papucci M.; De Nigris C.; Miaschi A.; Dell'Orletta F. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.pui 639690263 *
scopus.identifier.scopus 2-s2.0-85143252156 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2022 *
scopus.relation.conferencename 6th Workshop on Natural Language for Artificial Intelligence, NL4AI 2022 *
scopus.relation.conferenceplace ita *
scopus.relation.firstpage 56 *
scopus.relation.lastpage 70 *
scopus.relation.volume 3287 *
scopus.subject.keywords bert; style classification; t5; text-to-text; topic classification; transformers; *
scopus.title Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts *
scopus.titleeng Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/415084
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact