Curating the records of an authority file is an activity as important as committing for many organizations, which have to rely on experts equipped with so-called authority control tools, capable of automatically supporting complex disambiguation workflows through user-friendly interfaces. This paper presents PACE, an open source authority control tool which offers user interfaces for (i) customizing the structure (ontology) of authority files, (ii) tune-up probabilistic disambiguation of authority files through a set of similarity functions for detecting record candidates for duplication and overload (iii) curate such authority files by applying record merges and splitting actions, and (iv) expose authority files to third-party consumers in several ways. PACE's back-end is based on Cassandra's "NOSQL" technology to offer (i) read-write performances that scale up linearly with the number of records and (ii) parallel and efficient (MapReduce-based) record sorting and matching algorithms.

PACE: a general-purpose tool for authority control

Manghi P;Mikulicic M
2011

Abstract

Curating the records of an authority file is an activity as important as committing for many organizations, which have to rely on experts equipped with so-called authority control tools, capable of automatically supporting complex disambiguation workflows through user-friendly interfaces. This paper presents PACE, an open source authority control tool which offers user interfaces for (i) customizing the structure (ontology) of authority files, (ii) tune-up probabilistic disambiguation of authority files through a set of similarity functions for detecting record candidates for duplication and overload (iii) curate such authority files by applying record merges and splitting actions, and (iv) expose authority files to third-party consumers in several ways. PACE's back-end is based on Cassandra's "NOSQL" technology to offer (i) read-write performances that scale up linearly with the number of records and (ii) parallel and efficient (MapReduce-based) record sorting and matching algorithms.
2011
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
E. Garcìa-Barriocanal, Z. Cebeci, M.C. Okur, A. Òzturk
Metadata and Semantic Research
Metadata and Semantic Research. 5th International Conference, MTSR 2011
80
92
978-3-642-24730-9
http://www.springerlink.com/content/u628643038530q67/
Springer
Berlin
GERMANIA
Sì, ma tipo non specificato
12-14 October 2011
Izmir, Turkey
Authority files: control
PACE: authority control tool
Name disambiguation
Note: Co-finanziato da European Film Gateway (EFG)EU Project . - Area di valutazione 15a - Scienze e tecnologie per una società dell'informazione e della comunicazione
2
restricted
Manghi, P; Mikulicic, M
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   UNIVERsal open platform and reference Specification for Ambient Assisted Living
   UNIVERSAAL
   FP7
   247950
File in questo prodotto:
File Dimensione Formato  
prod_206294-doc_99757.pdf

solo utenti autorizzati

Descrizione: PACE: a general-purpose tool for authority control
Tipologia: Versione Editoriale (PDF)
Dimensione 1.03 MB
Formato Adobe PDF
1.03 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/174090
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 6
social impact