CNR Institutional Research Information System

Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts

Papucci Michele;De Nigris Chiara;Miaschi Alessio;Dell'Orletta Felice

2022

Abstract

In this paper, we present an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5 [1], in a classification scenario. In particular, we test the performance of IT5 on several tasks involving the classification of both the topic and the style of a set of Italian posts. We assess the model in two configurations, single- and multi-task classification, and compare it with a more traditional NLM based on the Transformer architecture (i.e., BERT). We also test its performance in a few-shot learning scenario and perform a qualitative investigation of how label representations affect the classifications produced by IT5. Results show that IT5 achieves good performance, although generally lower than that of BERT. Nevertheless, we observe a significant performance improvement of the text-to-text model in the multi-task classification scenario. Finally, we find that altering the representation of the labels mainly affects topic classification.

Full record (DC)

DC field	Value	Language
dc.authority.anceserie	CEUR WORKSHOP PROCEEDINGS	-
dc.authority.anceserie	CEUR Workshop Proceedings	-
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	-
dc.authority.people	Papucci Michele	it
dc.authority.people	De Nigris Chiara	it
dc.authority.people	Miaschi Alessio	it
dc.authority.people	Dell'Orletta Felice	it
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.date.accessioned	2024/02/21 07:43:50	-
dc.date.available	2024/02/21 07:43:50	-
dc.date.issued	2022	-
dc.description.abstracteng	In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5 [1], on a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of the topic and the style of a set of Italian posts. We assess the model in two different configurations, single- and multi-task classification, and we compare it with a more traditional NLM based on the Transformer architecture (i.e. BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation on the impact of label representations in modeling the classification of the IT5 model. Results show that IT5 could achieve good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the Text-to-text model in a multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts the classification of the topic.	-
dc.description.affiliations	Università di Pisa, Pisa; Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR), ItaliaNLP Lab, Pisa; TALIA S.r.l.	-
dc.description.allpeople	Papucci, Michele; De Nigris, Chiara; Miaschi, Alessio; Dell'Orletta, Felice	-
dc.description.allpeopleoriginal	Papucci, Michele; De Nigris, Chiara; Miaschi, Alessio; Dell'Orletta, Felice	-
dc.description.fulltext	none	en
dc.description.numberofauthors	4	-
dc.identifier.scopus	2-s2.0-85143252156	-
dc.identifier.uri	https://hdl.handle.net/20.500.14243/415084	-
dc.identifier.url	http://www.scopus.com/record/display.url?eid=2-s2.0-85143252156&origin=inward	-
dc.language.iso	eng	-
dc.miur.last.status.update	2024-12-20T09:02:37Z	*
dc.relation.conferencedate	30/11/2022	-
dc.relation.conferencename	Sixth Workshop on Natural Language for Artificial Intelligence, NL4AI 2022	-
dc.relation.firstpage	56	-
dc.relation.lastpage	70	-
dc.relation.volume	3287	-
dc.subject.keywords	bert	-
dc.subject.keywords	style classification	-
dc.subject.keywords	t5	-
dc.subject.keywords	text-to-text	-
dc.subject.keywords	topic classification	-
dc.subject.keywords	transformers	-
dc.subject.singlekeyword	bert	*
dc.subject.singlekeyword	style classification	*
dc.subject.singlekeyword	t5	*
dc.subject.singlekeyword	text-to-text	*
dc.subject.singlekeyword	topic classification	*
dc.subject.singlekeyword	transformers	*
dc.title	Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.miur	273	-
dc.ugov.descaux1	474890	-
iris.orcid.lastModifiedDate	2024/04/04 16:17:39	*
iris.orcid.lastModifiedMillisecond	1712240259730	*
iris.scopus.extIssued	2022	-
iris.scopus.extTitle	Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts	-
iris.sitodocente.maxattempts	1	-
scopus.authority.anceserie	CEUR WORKSHOP PROCEEDINGS###1613-0073	*
scopus.category	1700	*
scopus.contributor.affiliation	TALIA S.r.l.	-
scopus.contributor.affiliation	Università di Pisa	-
scopus.contributor.affiliation	NLP Lab	-
scopus.contributor.affiliation	NLP Lab	-
scopus.contributor.afid	128940566	-
scopus.contributor.afid	60028868	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.auid	57991631200	-
scopus.contributor.auid	57991570000	-
scopus.contributor.auid	57211678681	-
scopus.contributor.auid	57540567000	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.dptid	114087935	-
scopus.contributor.dptid	114087935	-
scopus.contributor.name	Michele	-
scopus.contributor.name	Chiara	-
scopus.contributor.name	Alessio	-
scopus.contributor.name	Felice	-
scopus.contributor.subaffiliation		-
scopus.contributor.subaffiliation		-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR);	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR);	-
scopus.contributor.surname	Papucci	-
scopus.contributor.surname	De Nigris	-
scopus.contributor.surname	Miaschi	-
scopus.contributor.surname	Dell'Orletta	-
scopus.date.issued	2022	*
scopus.description.abstracteng	In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5 [1], on a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of the topic and the style of a set of Italian posts. We assess the model in two different configurations, single- and multi-task classification, and we compare it with a more traditional NLM based on the Transformer architecture (i.e. BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation on the impact of label representations in modeling the classification of the IT5 model. Results show that IT5 could achieve good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the Text-to-text model in a multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts the classification of the topic.	*
scopus.description.allpeopleoriginal	Papucci M.; De Nigris C.; Miaschi A.; Dell'Orletta F.	*
scopus.differences	scopus.relation.conferencename	*
scopus.differences	scopus.authority.anceserie	*
scopus.differences	scopus.publisher.name	*
scopus.differences	scopus.subject.keywords	*
scopus.differences	scopus.relation.conferencedate	*
scopus.differences	scopus.description.allpeopleoriginal	*
scopus.differences	scopus.relation.conferenceplace	*
scopus.document.type	cp	*
scopus.document.types	cp	*
scopus.identifier.pui	639690263	*
scopus.identifier.scopus	2-s2.0-85143252156	*
scopus.journal.sourceid	21100218356	*
scopus.language.iso	eng	*
scopus.publisher.name	CEUR-WS	*
scopus.relation.conferencedate	2022	*
scopus.relation.conferencename	6th Workshop on Natural Language for Artificial Intelligence, NL4AI 2022	*
scopus.relation.conferenceplace	ita	*
scopus.relation.firstpage	56	*
scopus.relation.lastpage	70	*
scopus.relation.volume	3287	*
scopus.subject.keywords	bert; style classification; t5; text-to-text; topic classification; transformers;	*
scopus.title	Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts	*
scopus.titleeng	Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts	*
Appears in types:	04.01 Contributo in Atti di convegno

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/415084
