Many text documents show a reduced legibility due to some specific kinds of physical degradation. In these cases, recovering a clean text pattern may be not the only purpose of digital document restoration, since some of the degradation artifacts may contain significant information.This is the case, for instance, of underwritings in palimpsests. In this paper, we propose a novel approach to this problem, by reformulating it as a blind source separation problem and solving it by independent component analysis techniques. Under appropriate hypotheses, the spectral components of the document, taken at different bands both in the visible and in the non-visible ranges, can be used to extract the individual contributions of, say, the text and the bleed-through and background patterns. Examples of bleed-through cancellation and recovery of underwriting from palimpsests are provided.

Digital analysis of damaged documents by ICA techniques

Tonazzini A;Salerno E
2003

Abstract

Many text documents show a reduced legibility due to some specific kinds of physical degradation. In these cases, recovering a clean text pattern may be not the only purpose of digital document restoration, since some of the degradation artifacts may contain significant information.This is the case, for instance, of underwritings in palimpsests. In this paper, we propose a novel approach to this problem, by reformulating it as a blind source separation problem and solving it by independent component analysis techniques. Under appropriate hypotheses, the spectral components of the document, taken at different bands both in the visible and in the non-visible ranges, can be used to extract the individual contributions of, say, the text and the bleed-through and background patterns. Examples of bleed-through cancellation and recovery of underwriting from palimpsests are provided.
2003
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Degraded Documents
Blind Source Separation
Component Analysis
File in questo prodotto:
File Dimensione Formato  
prod_91149-doc_123368.pdf

solo utenti autorizzati

Descrizione: Digital analysis of damaged documents by ICA techniques
Tipologia: Versione Editoriale (PDF)
Dimensione 424.84 kB
Formato Adobe PDF
424.84 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/57604
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact