CNR Institutional Research Information System

Bleed-through is a pervasive degradation in ancient documents, caused by the ink of the opposite side of the sheet that has seeped through the paper fiber, and appears as an extra, interfering text. Bleed-through severely impairs document readability and makes it difficult to decipher the contents. Digital image restoration techniques have been successfully employed to remove or significantly reduce this distortion. The main theme is to identify the bleedthrough pixels and estimate an appropriate replacement for them, in accordance to their surrounding. This paper proposes a two-step image restoration method, exploiting information from the recto and verso images. First, based on a non-stationary linear model of the two texts overlapped in the recto-verso pair, the bleed-through pixels are identified. In the second step, a sparse representation based image inpainting technique, with a non-negative sparsity constraint, is used to find an appropriate replacement for the bleedthough pixels. Thanks to the power of dictionary learning and sparse image reconstruction methods, the natural texture of the background is well reproduced in the bleed-through areas, and even a their possible overestimation is effectively corrected, so that the original appearance of the document is preserved. The experiments are conducted on the images of a popular database of ancient documents, and the results validate the performance of the proposed method compared to the state of the art.

Document bleed-through removal using sparse image inpainting

Hanif M;Tonazzini A;Savino P;Salerno E;Tsagkatakis G

2018

Abstract

Bleed-through is a pervasive degradation in ancient documents, caused by the ink of the opposite side of the sheet that has seeped through the paper fiber, and appears as an extra, interfering text. Bleed-through severely impairs document readability and makes it difficult to decipher the contents. Digital image restoration techniques have been successfully employed to remove or significantly reduce this distortion. The main theme is to identify the bleedthrough pixels and estimate an appropriate replacement for them, in accordance to their surrounding. This paper proposes a two-step image restoration method, exploiting information from the recto and verso images. First, based on a non-stationary linear model of the two texts overlapped in the recto-verso pair, the bleed-through pixels are identified. In the second step, a sparse representation based image inpainting technique, with a non-negative sparsity constraint, is used to find an appropriate replacement for the bleedthough pixels. Thanks to the power of dictionary learning and sparse image reconstruction methods, the natural texture of the background is well reproduced in the bleed-through areas, and even a their possible overestimation is effectively corrected, so that the original appearance of the document is preserved. The experiments are conducted on the images of a popular database of ancient documents, and the results validate the performance of the proposed method compared to the state of the art.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2018
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Codice ISBN
	
				978-1-5386-3346-5
			
	Parole chiave
	
				Ancient document restoration
Image inpainting
Bleed-through removal
Sparse representation
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
prod_388750-doc_133890.pdf solo utenti autorizzati Descrizione: Document Bleed-through Removal using Sparse Image Inpainting Tipologia: Versione Editoriale (PDF) Dimensione 417.65 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	417.65 kB	Adobe PDF	Visualizza/Apri Richiedi una copia
prod_388750-doc_133891.pdf accesso aperto Descrizione: Document Bleed-through Removal using Sparse Image Inpainting Tipologia: Versione Editoriale (PDF) Dimensione 1.11 MB Formato Adobe PDF Visualizza/Apri	1.11 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/373327

Citazioni

ND

3

2

social impact