It is a common assumption in various work on modelling mutation dynamics that mutations follow a Poisson dynamics; that is, in a given portion of genome the number of mutations follow a Poisson law. Equivalently, the distance between to mutations follows an exponential distribution. This can actually be verified when Human and Chimpanzee genomes are compared. It is of interest to see if this law generalizes also to somatic mutations which cause cancer. A recent survey published a catalogue of somatic mutations in cancer genome analysing 4,938,362 mutations from 7,042 cancers of 30 different cancer types. We have analysed this data to find the interoccurence time (space) distributions for different types of cancer. It has been found that specific cancer types show a power-law in interoccurrence distances, instead of the expected exponential distribution dictated with the Poisson assumption. Cancer genomes exhibiting power-law interoccurrence distances were enriched in cancer types where the main mutational process is described to be the activity of the APOBEC protein family, which produces a particular pattern of mutations called Kataegis. Therefore, the observation of a power-law in interoccurence distances could be used to identify cancer genomes with Kataegis. It is our objective to develop parametric models to differentiate between cancer and non-cancer mutations and between different cancer types.

Evidence of non-poissonian dynamics in cancer mutations

Kuruoglu E E;
2014

Abstract

It is a common assumption in various work on modelling mutation dynamics that mutations follow a Poisson dynamics; that is, in a given portion of genome the number of mutations follow a Poisson law. Equivalently, the distance between to mutations follows an exponential distribution. This can actually be verified when Human and Chimpanzee genomes are compared. It is of interest to see if this law generalizes also to somatic mutations which cause cancer. A recent survey published a catalogue of somatic mutations in cancer genome analysing 4,938,362 mutations from 7,042 cancers of 30 different cancer types. We have analysed this data to find the interoccurence time (space) distributions for different types of cancer. It has been found that specific cancer types show a power-law in interoccurrence distances, instead of the expected exponential distribution dictated with the Poisson assumption. Cancer genomes exhibiting power-law interoccurrence distances were enriched in cancer types where the main mutational process is described to be the activity of the APOBEC protein family, which produces a particular pattern of mutations called Kataegis. Therefore, the observation of a power-law in interoccurence distances could be used to identify cancer genomes with Kataegis. It is our objective to develop parametric models to differentiate between cancer and non-cancer mutations and between different cancer types.
2014
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
978-88-907318-6-0
Kataegis
Pareto law
Power law
Poisson law
Cancer genome
File in questo prodotto:
File Dimensione Formato  
prod_294357-doc_84495.pdf

accesso aperto

Descrizione: Evidence of non-poissonian dynamics in cancer mutations
Tipologia: Versione Editoriale (PDF)
Dimensione 412.68 kB
Formato Adobe PDF
412.68 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/258569
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact