Automatic keyword extraction is the process of identifying key terms and key phrases from documents that can appropriately represent the subject of the documents. We present here a work-in-progress, an experimentation done on unsupervised keyword extraction, with the aim of automatically associating scored keyphrases to texts, using (standard or innovative) cluster based methods, and integrating word embedding to enhance semantic relatedness of keyphrases. In the paper we present the datasets used, the state-of-the-art for unsupervised automatic extraction algorithms, based on cluster methods, and we describe in details the algorithms implemented and preliminary results obtained. The results obtained are discussed, commented, and compared with those obtained, in previous experimentations, using TextRank, RAKE and Tf-idf.

Cluster-based unsupervised automatic keyphrases extraction algorithms: Experimentations on cultural heritage datasets

MT Artese;I Gagliardi
2019

Abstract

Automatic keyword extraction is the process of identifying key terms and key phrases from documents that can appropriately represent the subject of the documents. We present here a work-in-progress, an experimentation done on unsupervised keyword extraction, with the aim of automatically associating scored keyphrases to texts, using (standard or innovative) cluster based methods, and integrating word embedding to enhance semantic relatedness of keyphrases. In the paper we present the datasets used, the state-of-the-art for unsupervised automatic extraction algorithms, based on cluster methods, and we describe in details the algorithms implemented and preliminary results obtained. The results obtained are discussed, commented, and compared with those obtained, in previous experimentations, using TextRank, RAKE and Tf-idf.
2019
Istituto di Matematica Applicata e Tecnologie Informatiche - IMATI -
Inglese
ARCHIVING 2019: Digitization, Preservation, and Access - Final Program and Proceedings 2019
Archiving2019: Digitization, Preservation, and Access
2019
156
160
https://ist.publisher.ingentaconnect.com/contentone/ist/ac/2019/00002019/00000001/art00036
Society for Imaging Science and Technology
Springfield, VA 22151
STATI UNITI D'AMERICA
Sì, ma tipo non specificato
14-17/05/2019
Lisbona
N/A
2
none
M.T. Artese;I. Gagliardi
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/368581
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact