Over the last decades, the exuberant development of next-generation sequencing has revolutionized genediscovery. These technologies have boosted the mapping of single nucleotide polymorphisms (SNPs) acrossthe human genome, providing a complex universe of heterogeneity characterizing individuals worldwide.Fractal dimension (FD) measures the degree of geometric irregularity, quantifying how "complex" a selfsimilarnatural phenomenon is. We compared two FD algorithms, box-counting dimension (BCD) andHiguchi's fractal dimension (HFD), to characterize genome-wide patterns of SNPs extracted from theHapMap data set, which includes data from 1184 healthy subjects of eleven populations. In addition, wehave used cluster and classification analysis to relate the genetic distances within chromosomes basedon FD similarities to the geographical distances among the 11 global populations. We found that HFDoutperformed BCD at both grand average clusterization analysis by the cophenetic correlation coefficient,in which the closest value to 1 represents the most accurate clustering solution (0.981 for the HFD and0.956 for the BCD) and classification (79.0% accuracy, 61.7% sensitivity, and 96.4% specificity for theHFD with respect to 69.1% accuracy, 43.2% sensitivity, and 94.9% specificity for the BCD) of the 11 populations present in the HapMap data set. These results support the evidence that HFD is a reliablemeasure helpful in representing individual variations within all chromosomes and categorizing individualsand global populations.

Characterizing Fractal Genetic Variation in the Human Genome from the Hapmap Project

Borri A;Cerasa A;Citrigno L;Porcaro C
2022

Abstract

Over the last decades, the exuberant development of next-generation sequencing has revolutionized genediscovery. These technologies have boosted the mapping of single nucleotide polymorphisms (SNPs) acrossthe human genome, providing a complex universe of heterogeneity characterizing individuals worldwide.Fractal dimension (FD) measures the degree of geometric irregularity, quantifying how "complex" a selfsimilarnatural phenomenon is. We compared two FD algorithms, box-counting dimension (BCD) andHiguchi's fractal dimension (HFD), to characterize genome-wide patterns of SNPs extracted from theHapMap data set, which includes data from 1184 healthy subjects of eleven populations. In addition, wehave used cluster and classification analysis to relate the genetic distances within chromosomes basedon FD similarities to the geographical distances among the 11 global populations. We found that HFDoutperformed BCD at both grand average clusterization analysis by the cophenetic correlation coefficient,in which the closest value to 1 represents the most accurate clustering solution (0.981 for the HFD and0.956 for the BCD) and classification (79.0% accuracy, 61.7% sensitivity, and 96.4% specificity for theHFD with respect to 69.1% accuracy, 43.2% sensitivity, and 94.9% specificity for the BCD) of the 11 populations present in the HapMap data set. These results support the evidence that HFD is a reliablemeasure helpful in representing individual variations within all chromosomes and categorizing individualsand global populations.
2022
Istituto di Analisi dei Sistemi ed Informatica ''Antonio Ruberti'' - IASI
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto per la Ricerca e l'Innovazione Biomedica -IRIB
fractal dimension
human genome
HapMap project
machine learning
File in questo prodotto:
File Dimensione Formato  
[083J037] Characterizing Fractal Genetic Variation in the Human Genome from the Hapmap Project (IJNS 2022).pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 2.91 MB
Formato Adobe PDF
2.91 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/449343
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 8
social impact