Automatic extraction of regesta for medieval Latin text summarization
Puccetti G.; Esuli A.
2025
Abstract
We produced a novel dataset of 4,533 medieval Latin regesta (summaries) paired with full texts, extracted through a meticulous pipeline involving manual annotation, custom model training, text extraction, and post-processing to ensure high-quality, structured data for AI-driven summarization tasks.

Files in this product:
| File | Size | Format |
|---|---|---|
| Puccetti_EN 2025_Regesta.pdf (open access) | 3.04 MB | Adobe PDF |

Description: https://ercim-news.ercim.eu/en141/special/automatic-extraction-of-regesta-for-medieval-latin-text-summarization#google_vignette

Type: Publisher's version (PDF)

License: Creative Commons


