CNR Institutional Research Information System

Expression profiles have been successfully determined by using hybridization- and tagbased technologies, even though such approaches suffer from limits and drawbacks and lack information about rare RNA species, emerging as contributors to pathological phenotypes in humans (1-8). The introduction of next generation sequencing (NGS) technologies, revealing mammalian transcriptomes' complexity, has shown that a small fraction of transcribed sequences (<2%) is represented by mRNA (9). However, the unprecedented level of sensitivity in the data produced by NGS platforms brings with it the power to make several biological observations, at the cost of a considerable effort in the development of new bioinformatics tools and computational strategies to deal with these massive data files. Indeed, for these large-scale analyses, data transferring, processing and handling may represent a computational bottleneck. Another issue is the availability of software required to perform one or more downstream analysis (1). To this purpose, in this paper we describe the computational strategies used to analyze different aspects of a wholetranscriptome. In particular, we illustrate the results of the analysis performed on a dataset obtained from a strand-specific RNA sequenicng of ribosomal-depleted samples, isolated from a cell type impaired in the Down syndrome

RNA-seq: from computational challenges to biological insights

Costa V;Angelini C;D'Apice L;Mutarelli M;Casamassimi A;Aprile M;Esposito R;Leone L;Donizetti A;Crispi S;De Berardinis P;Napoli;Baldini A;Ciccodicola A

2010

Abstract

Expression profiles have been successfully determined by using hybridization- and tagbased technologies, even though such approaches suffer from limits and drawbacks and lack information about rare RNA species, emerging as contributors to pathological phenotypes in humans (1-8). The introduction of next generation sequencing (NGS) technologies, revealing mammalian transcriptomes' complexity, has shown that a small fraction of transcribed sequences (<2%) is represented by mRNA (9). However, the unprecedented level of sensitivity in the data produced by NGS platforms brings with it the power to make several biological observations, at the cost of a considerable effort in the development of new bioinformatics tools and computational strategies to deal with these massive data files. Indeed, for these large-scale analyses, data transferring, processing and handling may represent a computational bottleneck. Another issue is the availability of software required to perform one or more downstream analysis (1). To this purpose, in this paper we describe the computational strategies used to analyze different aspects of a wholetranscriptome. In particular, we illustrate the results of the analysis performed on a dataset obtained from a strand-specific RNA sequenicng of ribosomal-depleted samples, isolated from a cell type impaired in the Down syndrome

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2010
			
	Strutture organizzative
	
				Istituto Applicazioni del Calcolo ''Mauro Picone''
			
	Codice ISBN
	
				978-88-548-3658-7
			
	Parole chiave
	
				Bioinformatics
RNA-seq
Next Generation sequencing
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/66159

Citazioni

ND

ND

ND

social impact