The development of computational models and tools for philologically curated digital editions poses dual challenges: defining functional specifications for the reference community and ensuring sustainability and adherence to open science principles. Requirements analysis benefits from user stories describing application scenarios, while issues in process management and technologies require solutions for resource accessibility and longevity. The CHROMA model (http://chroma.cnr.it/) offers an integrated approach rooted in projects like "Bellini Digital Correspondence" and "Pirandello Nazionale". It treats text as a complex multidimensional object through an editorial process involving: Creating digital surrogates of primary sources via IIIF protocol; Segmentation and text recognition using HTR/OCR environments by using eScriptorium tool; Separating textual and paratextual planes; Representing structural and semantic phenomena through XML/TEI, RDF, and Domain-Specific Languages; Assisted encoding with software tools for realigning different versions of editions e.g., Bertalign and NormaTEI; Integrating linguistic and lexicographic analyses by using NLP tools; Extracting information and generating knowledge graphs by means of semantic Web technologies and Ontologies; Connecting to authority records via LOD. The edition's fruition involves interactive visualization tools like TEIPublisher or EVT. Long-term technological standards ensure academic sustainability and synergy with the H2IOSC project. This workflow is part of pilot projects of the H2IOSC infrastructure such as the "Text Transcription Environment" and will be included in the initiative's marketplace as possible tool in docker deployment flavor. The CLARIN bazaar has discussed the model's conceptual choices and explored the proposed process in depth.
CHROMA model for H2IOSC
Sichera, Pietro
Primo
Methodology
;Cristofaro, Salvatore
Methodology
;SPAMPINATO, DARIA
Methodology
;Mazzagufo, Laura
Methodology
;DEL GROSSO, ANGELO MARIO
Methodology
2024
Abstract
The development of computational models and tools for philologically curated digital editions poses dual challenges: defining functional specifications for the reference community and ensuring sustainability and adherence to open science principles. Requirements analysis benefits from user stories describing application scenarios, while issues in process management and technologies require solutions for resource accessibility and longevity. The CHROMA model (http://chroma.cnr.it/) offers an integrated approach rooted in projects like "Bellini Digital Correspondence" and "Pirandello Nazionale". It treats text as a complex multidimensional object through an editorial process involving: Creating digital surrogates of primary sources via IIIF protocol; Segmentation and text recognition using HTR/OCR environments by using eScriptorium tool; Separating textual and paratextual planes; Representing structural and semantic phenomena through XML/TEI, RDF, and Domain-Specific Languages; Assisted encoding with software tools for realigning different versions of editions e.g., Bertalign and NormaTEI; Integrating linguistic and lexicographic analyses by using NLP tools; Extracting information and generating knowledge graphs by means of semantic Web technologies and Ontologies; Connecting to authority records via LOD. The edition's fruition involves interactive visualization tools like TEIPublisher or EVT. Long-term technological standards ensure academic sustainability and synergy with the H2IOSC project. This workflow is part of pilot projects of the H2IOSC infrastructure such as the "Text Transcription Environment" and will be included in the initiative's marketplace as possible tool in docker deployment flavor. The CLARIN bazaar has discussed the model's conceptual choices and explored the proposed process in depth.File | Dimensione | Formato | |
---|---|---|---|
CHROMA-model-for-H2IOSC-CLARIN-Conference-2024.pdf
accesso aperto
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
1.84 MB
Formato
Adobe PDF
|
1.84 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.