The study of statistical fluctuations in DNA sequences can reveal important characteristics of their organization. Particularly, in the last two decades, several studies have focused on the detection of Long Range Correlations (LRC) in DNA sequences, and on the study of particular long range dependence properties of nucleotides. Since protein coding is carried out by codons (nucleotide triplets), we conduct LRC analysis to see whether there is a long range correlation among nucleotide triplets. Our interest is not limited to auto correlations of single codons, but also extends to cross correlations over all possible pairs of nucleotide triplets. LRCs in DNA sequences are studied and quantified using two measures: the mutual information function and the correlation function. The analysis on nucleotide triplets reveal LRCs for the human Chromosomes 20 and 21. Moreover, some triplets that contain only nucleotides Adenine and Thymine are seen to exhibit correlation significantly higher than others.

Long range correlations between nucleotide triplets in human chromosomes

Kuruoglu E E;
2010

Abstract

The study of statistical fluctuations in DNA sequences can reveal important characteristics of their organization. Particularly, in the last two decades, several studies have focused on the detection of Long Range Correlations (LRC) in DNA sequences, and on the study of particular long range dependence properties of nucleotides. Since protein coding is carried out by codons (nucleotide triplets), we conduct LRC analysis to see whether there is a long range correlation among nucleotide triplets. Our interest is not limited to auto correlations of single codons, but also extends to cross correlations over all possible pairs of nucleotide triplets. LRCs in DNA sequences are studied and quantified using two measures: the mutual information function and the correlation function. The analysis on nucleotide triplets reveal LRCs for the human Chromosomes 20 and 21. Moreover, some triplets that contain only nucleotides Adenine and Thymine are seen to exhibit correlation significantly higher than others.
2010
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Life and Medical Sciences. Biology and genetics
Probability and Statistics. Correlation and regression analysis
Coding and Information Theory
92D10 Genetics
62M10 Time series
auto-correlation
regression
etc
File in questo prodotto:
File Dimensione Formato  
prod_161213-doc_132555.pdf

accesso aperto

Descrizione: Long range correlations between nucleotide triplets in human chromosomes
Dimensione 301.83 kB
Formato Adobe PDF
301.83 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/167730
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact