When the reads obtained from high-throughput sequencing are mapped against a reference database, some of them - known as multireads - can map to more than one reference sequence. This event occurs because genomes contains many repeated portions and reads are generally shorter than reference sequences. Removing the multireads from the mapping results causes an underestimation of the read counts, while estimating the real read count can lead to false positives during the detection of differentially expressed sequences.
Managing NGS differential expression uncertainty with fuzzy sets
Arianna Consiglio;Giorgio Grillo;Liuni Sabino
2015
Abstract
When the reads obtained from high-throughput sequencing are mapped against a reference database, some of them - known as multireads - can map to more than one reference sequence. This event occurs because genomes contains many repeated portions and reads are generally shorter than reference sequences. Removing the multireads from the mapping results causes an underestimation of the read counts, while estimating the real read count can lead to false positives during the detection of differentially expressed sequences.File in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.