In document image analysis the task of segmenting images of ancient printed documents in distinct elements is known to be a very complex problem. In general, these documents are of low quality and can present skew and degradations because of old printing or ink stains. To face these problems we will show and discuss the validity of the Mumford and Shah variational method, based on the ? convergence theory, along with its numerical handling. In particular, we segment and extract the interest regions, constituted by textual and non-textual blocks, from page images of ancient books, combining the variational approach with morphological operations. Study case is the first edition of 'Scienza Nuova' (1725) of Giambattista Vico.

Text Lines and Words Variational Extraction from Ancient Printed Documents

Rossella Cossu;Rosa Maria Spitaleri;Marco Veneziani
2014

Abstract

In document image analysis the task of segmenting images of ancient printed documents in distinct elements is known to be a very complex problem. In general, these documents are of low quality and can present skew and degradations because of old printing or ink stains. To face these problems we will show and discuss the validity of the Mumford and Shah variational method, based on the ? convergence theory, along with its numerical handling. In particular, we segment and extract the interest regions, constituted by textual and non-textual blocks, from page images of ancient books, combining the variational approach with morphological operations. Study case is the first edition of 'Scienza Nuova' (1725) of Giambattista Vico.
2014
Istituto Applicazioni del Calcolo ''Mauro Picone''
Ancient documenta
Variational segmentation
Text extraction
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/263575
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact