CNR Institutional Research Information System

This chapter deals with the problem of processing and analyzing digital images of ancient or degraded documents to increase the possibilities of inferring their structures. Classification and recognition are needed to infer structure but, when dealing with degraded documents, they are particularly difficult to apply directly to unprocessed images. This is why an intermediate step is needed that extracts automatically the "perceptual components" of the documents from their appearance. By "appearance" of a document, we mean the "raw" data set, containing the "sensorial components" of the object under study. Ancient documents of historical importance pose specific problems that are now being solved with the help of information technology. As much information as possible should be drawn from the physical documents and should be structured so as to permit specialized searches to be performed in large databases. The tools we use to treat unstructured, low-level information are both mathematical and descriptive. Under a mathematical point of view, we model our appearance as a function of all the perceptual components, or patterns we want to identify. Once the model has been established, its parameters can be learned from the data available and from reasonable assumptions on both the model itself and the patterns. Our descriptive tools form a specialized metadata schema that can help both the storage and the indexing of all the digital objects produced to represent the original document, and provides a complete description of all the processing performed. Suitable links fully interconnect the various descriptions in order to relate the different representations of the physical object and to trace the history of all the processing performed. Inferring structure is much easier by analyzing the patterns and their mutual relationships than by analyzing the appearance.

Low-level document image analysis and description: from appearance to structure

Salerno E;Savino P;Tonazzini A

2011

Abstract

This chapter deals with the problem of processing and analyzing digital images of ancient or degraded documents to increase the possibilities of inferring their structures. Classification and recognition are needed to infer structure but, when dealing with degraded documents, they are particularly difficult to apply directly to unprocessed images. This is why an intermediate step is needed that extracts automatically the "perceptual components" of the documents from their appearance. By "appearance" of a document, we mean the "raw" data set, containing the "sensorial components" of the object under study. Ancient documents of historical importance pose specific problems that are now being solved with the help of information technology. As much information as possible should be drawn from the physical documents and should be structured so as to permit specialized searches to be performed in large databases. The tools we use to treat unstructured, low-level information are both mathematical and descriptive. Under a mathematical point of view, we model our appearance as a function of all the perceptual components, or patterns we want to identify. Once the model has been established, its parameters can be learned from the data available and from reasonable assumptions on both the model itself and the patterns. Our descriptive tools form a specialized metadata schema that can help both the storage and the indexing of all the digital objects produced to represent the original document, and provides a complete description of all the processing performed. Suitable links fully interconnect the various descriptions in order to relate the different representations of the physical object and to trace the history of all the processing performed. Inferring structure is much easier by analyzing the patterns and their mutual relationships than by analyzing the appearance.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2011
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Codice ISBN
	
				978-3-642-22912-1
			
	Parole chiave
	
				Document Image Processing ; Metadata Editor ; Digital Libraries
			
	Appare nelle tipologie:
	
				02.01 Contributo in volume (Capitolo o Saggio)

File in questo prodotto:

File	Dimensione	Formato
prod_202591-doc_46137.pdf solo utenti autorizzati Descrizione: Low-level document image analysis and description: from appearance to structure Tipologia: Versione Editoriale (PDF) Dimensione 1.21 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.21 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/9897

Citazioni

ND

0

0

social impact