Digital analysis of damaged documents by ICA techniques
Tonazzini A; Salerno E
2003
Abstract
Many text documents suffer from reduced legibility caused by specific kinds of physical degradation. In these cases, recovering a clean text pattern may not be the only purpose of digital document restoration, since some of the degradation artifacts may themselves contain significant information. This is the case, for instance, of underwritings in palimpsests. In this paper, we propose a novel approach to this problem: we reformulate it as a blind source separation problem and solve it with independent component analysis techniques. Under appropriate hypotheses, the spectral components of the document, taken at different bands in both the visible and the non-visible ranges, can be used to extract the individual contributions of, say, the text, the bleed-through, and the background patterns. Examples of bleed-through cancellation and recovery of underwriting from palimpsests are provided.

File | Description | Type | Size | Format | Access
---|---|---|---|---|---
prod_91149-doc_123368.pdf | Digital analysis of damaged documents by ICA techniques | Publisher's version (PDF) | 424.84 kB | Adobe PDF | Authorized users only
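The separation idea summarized in the abstract can be illustrated with a minimal sketch. It assumes a linear, instantaneous mixing model in which each spectral channel is a weighted sum of independent source patterns, and it uses a generic symmetric FastICA with a tanh nonlinearity, which is not necessarily the algorithm used in the paper. The two synthetic binary patterns, the mixing matrix `mixing`, and all function names are hypothetical and chosen only for illustration.

```python
import numpy as np


def whiten(x):
    """Center each channel, then decorrelate the channels to unit variance."""
    x = x - x.mean(axis=1, keepdims=True)
    cov = x @ x.T / x.shape[1]
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Symmetric whitening matrix: E * D^(-1/2) * E^T
    return (eigvecs / np.sqrt(eigvals)) @ eigvecs.T @ x


def sym_decorrelate(w):
    """Return the orthogonal matrix closest to w (symmetric decorrelation)."""
    u, _, vt = np.linalg.svd(w)
    return u @ vt


def fast_ica(x, n_iter=200, tol=1e-8, seed=0):
    """Symmetric FastICA with a tanh nonlinearity (generic ICA variant).

    x has shape (channels, pixels); the result has the same shape and holds
    the estimated sources, up to permutation, sign, and scale.
    """
    z = whiten(x)
    n_channels, n_pixels = z.shape
    rng = np.random.default_rng(seed)
    w = sym_decorrelate(rng.standard_normal((n_channels, n_channels)))
    for _ in range(n_iter):
        g = np.tanh(w @ z)
        g_prime = 1.0 - g ** 2
        # Fixed-point update: E[g(y) z^T] - E[g'(y)] * w, row by row
        w_new = g @ z.T / n_pixels - np.diag(g_prime.mean(axis=1)) @ w
        w_new = sym_decorrelate(w_new)
        # Converged when every row is parallel to its previous value
        delta = np.max(np.abs(np.abs(np.sum(w_new * w, axis=1)) - 1.0))
        w = w_new
        if delta < tol:
            break
    return w @ z


# Hypothetical synthetic page: two independent sparse binary patterns
# standing in for the main text and the bleed-through.
rng = np.random.default_rng(42)
height, width = 128, 128
main_text = (rng.random((height, width)) < 0.10).astype(float)
bleed_through = (rng.random((height, width)) < 0.05).astype(float)
sources = np.vstack([main_text.ravel(), bleed_through.ravel()])

# Hypothetical mixing matrix: each row is one observed spectral channel.
mixing = np.array([[1.0, 0.4],
                   [0.5, 1.0]])
mixed = mixing @ sources

recovered = fast_ica(mixed)

# Absolute correlation between each true source and each recovered component.
corr = np.abs(np.corrcoef(np.vstack([sources, recovered]))[:2, 2:])
print(np.round(corr, 3))

# Each recovered component can be reshaped back into an image if needed.
recovered_images = recovered.reshape(2, height, width)
```

In this sketch, each row of `corr` should contain one entry close to 1, which indicates that the corresponding source was recovered up to sign and ordering. Real document channels would replace the synthetic `mixed` array, with one flattened image per spectral band.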
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.