Format differences present a significant challenge to the interoperability of Text Analysis tools. It is essential to consider format conversions within a robust theoretical framework that can effectively manage these conversions while ensuring that they adhere to specific properties. This paper presents an approach based on “functors” to address format conversion for electronic textual documents. This method ensures that the properties of text and tools are preserved during the process. Functors are key concepts in Category Theory as they enable us to reformulate problems from a category where they are complicated to solve to another category where solutions are more easily attainable. The main concept of this paper is to model a specific scenario. Within the category of documents that conform to a particular format f, there arises a need to parse a document D using a Text Analysis (TA) tool t that cannot interpret the format f. The challenge can be solved with the help of format conversion from f to f′, where f′ fits with t. However, we propose and discuss a method that uses functors to “transform” D and t so that the transformed t can read D with f′. .

Using Functors as Format Converters

Riccardo Del Gratta
Co-primo
Writing – Original Draft Preparation
;
Angelo Mario Del Grosso
Co-primo
Writing – Original Draft Preparation
2025

Abstract

Format differences present a significant challenge to the interoperability of Text Analysis tools. It is essential to consider format conversions within a robust theoretical framework that can effectively manage these conversions while ensuring that they adhere to specific properties. This paper presents an approach based on “functors” to address format conversion for electronic textual documents. This method ensures that the properties of text and tools are preserved during the process. Functors are key concepts in Category Theory as they enable us to reformulate problems from a category where they are complicated to solve to another category where solutions are more easily attainable. The main concept of this paper is to model a specific scenario. Within the category of documents that conform to a particular format f, there arises a need to parse a document D using a Text Analysis (TA) tool t that cannot interpret the format f. The challenge can be solved with the help of format conversion from f to f′, where f′ fits with t. However, we propose and discuss a method that uses functors to “transform” D and t so that the transformed t can read D with f′. .
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Riccardo Del Gratta en
dc.authority.people Angelo Mario Del Grosso en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2026/03/03 17:33:52 -
dc.date.available 2026/03/03 17:33:52 -
dc.date.firstsubmission 2025/12/17 10:26:10 *
dc.date.issued 2025 -
dc.date.submission 2025/12/17 10:26:10 *
dc.description.abstracteng Format differences present a significant challenge to the interoperability of Text Analysis tools. It is essential to consider format conversions within a robust theoretical framework that can effectively manage these conversions while ensuring that they adhere to specific properties. This paper presents an approach based on “functors” to address format conversion for electronic textual documents. This method ensures that the properties of text and tools are preserved during the process. Functors are key concepts in Category Theory as they enable us to reformulate problems from a category where they are complicated to solve to another category where solutions are more easily attainable. The main concept of this paper is to model a specific scenario. Within the category of documents that conform to a particular format f, there arises a need to parse a document D using a Text Analysis (TA) tool t that cannot interpret the format f. The challenge can be solved with the help of format conversion from f to f′, where f′ fits with t. However, we propose and discuss a method that uses functors to “transform” D and t so that the transformed t can read D with f′. . -
dc.description.allpeople Del Gratta, Riccardo; Del Grosso, Angelo Mario -
dc.description.allpeopleoriginal Riccardo Del Gratta and Angelo Mario Del Grosso en
dc.description.fulltext restricted en
dc.description.numberofauthors 2 -
dc.identifier.doi 10.1109/cist65886.2025.11224237 en
dc.identifier.isbn 979-8-3315-4384-6 en
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/560722 -
dc.identifier.url https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11224237 en
dc.language.iso eng en
dc.relation.conferencedate 4-10 Ottobre 2025 en
dc.relation.conferenceplace Marrakech en
dc.relation.firstpage 488 en
dc.relation.ispartofbook 8th IEEE Congress on Information Science and Technology (CiSt) en
dc.relation.lastpage 493 en
dc.relation.numberofpages 6 en
dc.relation.volume 6 en
dc.subject.keywordseng Format Conversion -
dc.subject.keywordseng Interoperability -
dc.subject.keywordseng Functors -
dc.subject.keywordseng Category Theory -
dc.subject.singlekeyword Format Conversion *
dc.subject.singlekeyword Interoperability *
dc.subject.singlekeyword Functors *
dc.subject.singlekeyword Category Theory *
dc.title Using Functors as Format Converters en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.mediafilter.data 2026/03/04 02:52:17 *
iris.orcid.lastModifiedDate 2026/03/03 17:33:52 *
iris.orcid.lastModifiedMillisecond 1772555632194 *
iris.scopus.extIssued 2025 -
iris.scopus.extTitle Using Functors as Format Converters -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.doi 10.1109/cist65886.2025.11224237 *
iris.unpaywall.isoa false *
iris.unpaywall.metadataCallLastModified 04/03/2026 04:33:18 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1772595198978 -
iris.unpaywall.oastatus closed *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
Using_Functors_as_Format_Converters.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 1.1 MB
Formato Adobe PDF
1.1 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/560722
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact