CNR Institutional Research Information System

For several years now, there has been an exponential growth of the amount of life science data (e.g., sequenced complete genomes, 3D structures, DNA chips, Mass spectroscopy data) generated by high throughput experiments. Carrying out analyses of complex, voluminous, and heterogeneous data and guiding the analysis of data using a statistical and mathematical sound methodology is thus of paramount importance. Here we make and justify the observation that experimental replicates and phylogenetic data may be combined to strength the evidences on identifying transcriptional motifs, which seems to be quite difficult using other currently used methods. We present a case study considering sequences and microarray data from fungi species. Although we show that our methodology can result of immediate practical utility to bioinformaticians and biologists for annotating new genomes, here the focus is also on discussing the dependent interesting mathematical problems that high throughput data integration poses.

Combining experimental evidences from replicates and nearby species data for annotating novel genomes.

C Angelini;L Cutillo;I De Feis;R van der Wath;P Lio

2008

Abstract

For several years now, there has been an exponential growth of the amount of life science data (e.g., sequenced complete genomes, 3D structures, DNA chips, Mass spectroscopy data) generated by high throughput experiments. Carrying out analyses of complex, voluminous, and heterogeneous data and guiding the analysis of data using a statistical and mathematical sound methodology is thus of paramount importance. Here we make and justify the observation that experimental replicates and phylogenetic data may be combined to strength the evidences on identifying transcriptional motifs, which seems to be quite difficult using other currently used methods. We present a case study considering sequences and microarray data from fungi species. Although we show that our methodology can result of immediate practical utility to bioinformaticians and biologists for annotating new genomes, here the focus is also on discussing the dependent interesting mathematical problems that high throughput data integration poses.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2008
			
	Strutture organizzative
	
				Istituto Applicazioni del Calcolo ''Mauro Picone''
			
	Lingua/e
	
				Inglese
			
	Supervisori e coordinatori esterni
	
				Luigi M. Ricciardi, Aniello Buonocore, Enrica Pirozzi
			
	Titolo del Volume
	
				AIP Conference Proceedings
			
	Da pagina
	
				277
			
	A pagina
	
				291
			
	Numero di pagine
	
				15
			
	Codice ISBN
	
				978-0-7354-0552-3
			
	Codice DOI
	
				https://dx.doi.org/10.1063/1.2965094
			
	URL
	
				http://proceedings.aip.org/resource/2/apcpcs/1028/1?isAuthorized=no
			
	Referee
	
				Sì, ma tipo non specificato
			
	Parole chiave
	
				Bayesian variable selection
MCMC algorithm
Microarray data analysis
			
	Altre informazioni
	
				COLLECTIVE DYNAMICS: TOPICS ON COMPETITION AND COOPERATION IN THE BIOSCIENCES: A Selection of Papers in the Proceedings of the BIOCOMP2007 International Conference
			
	Numero autori
	
				2
			
	Tipologia
	
				02 Contributo in Volume::02.01 Contributo in volume (Capitolo o Saggio)
			
	Tipologia Login Miur
	
				268
			
	Fulltext
	
				none
			
	Tutti gli autori
	
						C. Angelini; L. Cutillo; I. De Feis; R. van der Wath; P. Lio
					
	Tipologia
	
				info:eu-repo/semantics/bookPart
			
	Appare nelle tipologie:
	
				02.01 Contributo in volume (Capitolo o Saggio)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/66114

Citazioni

ND

ND

ND

social impact