CNR Institutional Research Information System

MM-Correction: Meta-analysis-Based Multiple Hypotheses Correction in Omic Studies

Nardini Christine; Wang Lei; Peng Hesen; Benini Luca; Kuo Michael D

2008

Abstract

The post-genomic era is characterized by the proliferation of high-throughput platforms that allow the parallel study of a complete body of molecules in a single run of experiments (the omic approach). The analysis and integration of omic data represent one of the most challenging frontiers for all disciplines related to Systems Biology. From the computational perspective this requires, among other things, the massive use of automated approaches in several steps of the complex analysis pipeline, which often consists of cascades of statistical tests. In this context, the identification of statistical significance has been one of the early challenges in handling omic data, and it remains a critical step because of the multiple hypothesis testing problem: a large number of hypotheses are examined at one time. Two main approaches are currently used: p-values based on random permutations and the False Discovery Rate. Both give meaningful and important results; however, the former is computationally heavy, because of the large amount of data that has to be generated, and the latter is extremely flexible with respect to the definition of the significance threshold, which makes standardization difficult. We present here an approach that is complementary or alternative to these two, and we discuss its performance, properties and limitations.
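For readers unfamiliar with the two existing approaches named in the abstract, the following is a minimal Python sketch of each: a permutation p-value for a difference in means, and the Benjamini-Hochberg step-up procedure, which is one common way of controlling the False Discovery Rate. It does not implement the MM-correction proposed by the authors, whose details are not given in this record. The function names, the choice of test statistic, and the default parameter values are illustrative assumptions.

```python
import random


def permutation_p_value(group_a, group_b, n_permutations=2000, seed=0):
    """Two-sided permutation p-value for a difference in group means.

    Illustrative helper (hypothetical name, not from the paper).
    """
    rng = random.Random(seed)
    n_a = len(group_a)
    observed = abs(sum(group_a) / n_a - sum(group_b) / len(group_b))
    pooled = list(group_a) + list(group_b)
    n_b = len(pooled) - n_a
    hits = 0
    for _ in range(n_permutations):
        # Randomly reassign the group labels and recompute the statistic.
        rng.shuffle(pooled)
        diff = abs(sum(pooled[:n_a]) / n_a - sum(pooled[n_a:]) / n_b)
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_permutations + 1)


def benjamini_hochberg(p_values, alpha=0.05):
    """Return one boolean per hypothesis: True if rejected at FDR level alpha.

    Illustrative helper (hypothetical name, not from the paper).
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    rejected = [False] * m
    # Reject every hypothesis ranked at or below the cutoff.
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[idx] = True
    return rejected
```

The two costs mentioned in the abstract are visible here: `n_permutations` resamplings are needed for every hypothesis tested, while the outcome of `benjamini_hochberg` depends on the analyst's choice of `alpha`.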


Year: 2008
ISBN: 978-3-540-92218-6
Keywords: statistical testing; statistical significance; multiple hypothesis testing; false discovery rate; statistical resampling methods; statistical meta-analysis; omic data
Appears in types: 02.01 Contribution in volume (Chapter or Essay)


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/387214
