Embeddings are fundamental resources often reused for building intelligent systems in thebiomedical context. As a result, evaluating the quality of previously trained embeddings andensuring they cover the desired information is critical for the success of applications. Thispaper proposes a new evaluation methodology to test the coverage of embeddings against atargetted domain of interest. It defines measures to assess the terminology, similarity, and analogycoverage, which are core aspects of the embeddings. Then, it discusses the experimentationcarried out on existing biomedical embeddings in the specific context of pulmonary diseases.The proposed methodology and measures are general and may be applied to any applicationdomain.

Quality of word and concept embeddings in targetted biomedical domains

Salvatore Giancani;Riccardo Albertoni;Chiara Eva Catalano
2023

Abstract

Embeddings are fundamental resources often reused for building intelligent systems in thebiomedical context. As a result, evaluating the quality of previously trained embeddings andensuring they cover the desired information is critical for the success of applications. Thispaper proposes a new evaluation methodology to test the coverage of embeddings against atargetted domain of interest. It defines measures to assess the terminology, similarity, and analogycoverage, which are core aspects of the embeddings. Then, it discusses the experimentationcarried out on existing biomedical embeddings in the specific context of pulmonary diseases.The proposed methodology and measures are general and may be applied to any applicationdomain.
2023
Istituto di Matematica Applicata e Tecnologie Informatiche - IMATI - Sede Secondaria Genova
Embedding
Quality
UMLS
Coverage
Chronic obstructive pulmonary disease
File in questo prodotto:
File Dimensione Formato  
prod_492085-doc_205275.pdf

accesso aperto

Descrizione: Quality of word and concept embeddings in targetted biomedical domains
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 1.55 MB
Formato Adobe PDF
1.55 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/454720
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact