Machine learning and neural networks tools to address noisy data issues

MT Artese; I Gagliardi

doi:10.55630/dipp.2021.11.8

In this paper, we present tools for addressing noisy keyword issues in digital libraries. Two tasks, language detection and misspelling detection and correction, are addressed using both machine learning and deep learning techniques. To train and validate the models, different datasets were used/created/scraped. Encouraging preliminary results are presented and discussed.

Machine learning and neural networks tools to address noisy data issues

MT Artese;I Gagliardi

2021

Abstract

In this paper, we present tools for addressing noisy keyword issues in digital libraries. Two tasks, language detection and misspelling detection and correction, are addressed using both machine learning and deep learning techniques. To train and validate the models, different datasets were used/created/scraped. Encouraging preliminary results are presented and discussed.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Strutture organizzative
	
				Istituto di Matematica Applicata e Tecnologie Informatiche - IMATI -
			
	Parole chiave
	
				Content based retrieval
Digital library
Noisy data
Tags
Unsupervised tools

File in questo prodotto:

File	Dimensione	Formato
prod_473982-doc_194009.pdf solo utenti autorizzati Descrizione: Machine Learning and Neural Networks Tools to Address Noisy Data Issues Tipologia: Versione Editoriale (PDF) Dimensione 321.54 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	321.54 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/417755

Citazioni

ND

0

0

CNR Institutional Research Information System

Machine learning and neural networks tools to address noisy data issues

MT Artese;I Gagliardi

2021

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

CNR Institutional Research Information System

Machine learning and neural networks tools to address noisy data issues

MT Artese;I Gagliardi

2021

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)