In this paper, we present a new alignment-free technique for DNA barcode classification. Our method is based on the identification of distinctive words, extracted from the spectral representation of DNA sequences. In particular, we performed an unsupervised clustering using neural gas algorithm, for iteratively calculating those fingerprints that are characteristics of DNA sequences at different taxonomic levels. In order to demonstrate the efficacy of the proposed method, we tested it over 10 real barcode datasets belonging to different animalia species, provided by on-line resource Barcode of Life Database (BOLD).

A preliminary study on Spectral Representation Analysis for Classification of DNA Barcode Sequences

Antonino Fiannaca;Massimo La Rosa;Riccardo Rizzo;Alfonso Urso
2013

Abstract

In this paper, we present a new alignment-free technique for DNA barcode classification. Our method is based on the identification of distinctive words, extracted from the spectral representation of DNA sequences. In particular, we performed an unsupervised clustering using neural gas algorithm, for iteratively calculating those fingerprints that are characteristics of DNA sequences at different taxonomic levels. In order to demonstrate the efficacy of the proposed method, we tested it over 10 real barcode datasets belonging to different animalia species, provided by on-line resource Barcode of Life Database (BOLD).
2013
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
978-88-906437-3-6
Spectral Representation Barcode Sequences
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/278038
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact