Copy Number Variants (CNVs) are structural rear- rangements contributing to phenotypic variation that have been proved to be associated with many dis- ease states. Over the last years, the identification of CNVs from whole-exome sequencing (WES) data has become a common practice for research and clinical purpose and, consequently, the demand for more and more efficient and accurate methods has increased. In this paper, we demonstrate that more than 30% of WES data map outside the targeted re- gions and that these reads, usually discarded, can be exploited to enhance the identification of CNVs from WES experiments. Here, we present EXCAVATOR2, the first read count based tool that exploits all the reads produced by WES experiments to detect CNVs with a genome-wide resolution. To evaluate the per- formance of our novel tool we use it for analysing two WES data sets, a population data set sequenced by the 1000 Genomes Project and a tumor data set made of bladder cancer samples. The results obtained from these analyses demonstrate that EXCAVATOR2 out- performs other four state-of-the-art methods and that our combined approach enlarge the spectrum of detectable CNVs from WES data with an unprece- dented resolution. EXCAVATOR2 is freely available at http://sourceforge.net/projects/excavator2tool/.

Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2

D'Aurizio R;Pellegrini M;
2016

Abstract

Copy Number Variants (CNVs) are structural rear- rangements contributing to phenotypic variation that have been proved to be associated with many dis- ease states. Over the last years, the identification of CNVs from whole-exome sequencing (WES) data has become a common practice for research and clinical purpose and, consequently, the demand for more and more efficient and accurate methods has increased. In this paper, we demonstrate that more than 30% of WES data map outside the targeted re- gions and that these reads, usually discarded, can be exploited to enhance the identification of CNVs from WES experiments. Here, we present EXCAVATOR2, the first read count based tool that exploits all the reads produced by WES experiments to detect CNVs with a genome-wide resolution. To evaluate the per- formance of our novel tool we use it for analysing two WES data sets, a population data set sequenced by the 1000 Genomes Project and a tumor data set made of bladder cancer samples. The results obtained from these analyses demonstrate that EXCAVATOR2 out- performs other four state-of-the-art methods and that our combined approach enlarge the spectrum of detectable CNVs from WES data with an unprece- dented resolution. EXCAVATOR2 is freely available at http://sourceforge.net/projects/excavator2tool/.
2016
Istituto di informatica e telematica - IIT
Algorithm/protocol design and analysis
Bioinformatics
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/318642
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 82
  • ???jsp.display-item.citation.isi??? 80
social impact