CNR Institutional Research Information System

MiRNAs may be biomarkers associated with the progression of several diseases and great eff orts have been made to discover miRNAs, identify miRNA targets and infer miRNA functions. Diff erent bioinformatic tools and algorithms enabled to predict miRNAs binding regions together with several databases that provided information about miRNA sequences. WGS, WES and RNAseq datasets can represent an optimal support to correlate variants with miRNA target sites at no additional cost. Our goal is to link variant calling information with a potential gain or loss of miRNAs target sites. So far, there is no simply logical and sequential procedure to do this. Using already existing software, we developed a simple pipeline that allows us to identify the impact of variants on miRNA target sites with a biological role. We created a unique database of miRNA's sequences that collected all existing miRNA databases (miRBase, tarbase, mirRSNP, etc) and performed prediction analyses of target sites focused on the most performing existing algorithms (MiRanda, TargetScan). For example, starting from about 23,000 genomic variants that lie in 3'UTR region, a simple input command enabled us to select 4637 lost and 4302 gained target sites. The output is a simple tabular text fi le contains genomic variants, target site, variant impact, associated miRNAs, loss and gain sites and annotation data coming from diff erent databases. The output result was designed to be used in subsequent standard software to apply fi lters based on the desired mode of inheritance and on sample’s genotype quality and variant frequencies. This method could be particularly useful on WES data when the identifi cation of the causal variant has failed and the involvement of miRNAs is assumed, without additional costs. This new automated sequential pipeline allows to identify, in silico, the variants involved in the miRNA targets from VCF fi les and to generate results easily interpreted and usable by people without bioinformatic skills.

Identification of microRNA target sites: in-silico screening on NGS data

Rallo Vincenzo;Angius Andrea

2018

Abstract

MiRNAs may be biomarkers associated with the progression of several diseases and great eff orts have been made to discover miRNAs, identify miRNA targets and infer miRNA functions. Diff erent bioinformatic tools and algorithms enabled to predict miRNAs binding regions together with several databases that provided information about miRNA sequences. WGS, WES and RNAseq datasets can represent an optimal support to correlate variants with miRNA target sites at no additional cost. Our goal is to link variant calling information with a potential gain or loss of miRNAs target sites. So far, there is no simply logical and sequential procedure to do this. Using already existing software, we developed a simple pipeline that allows us to identify the impact of variants on miRNA target sites with a biological role. We created a unique database of miRNA's sequences that collected all existing miRNA databases (miRBase, tarbase, mirRSNP, etc) and performed prediction analyses of target sites focused on the most performing existing algorithms (MiRanda, TargetScan). For example, starting from about 23,000 genomic variants that lie in 3'UTR region, a simple input command enabled us to select 4637 lost and 4302 gained target sites. The output is a simple tabular text fi le contains genomic variants, target site, variant impact, associated miRNAs, loss and gain sites and annotation data coming from diff erent databases. The output result was designed to be used in subsequent standard software to apply fi lters based on the desired mode of inheritance and on sample’s genotype quality and variant frequencies. This method could be particularly useful on WES data when the identifi cation of the causal variant has failed and the involvement of miRNAs is assumed, without additional costs. This new automated sequential pipeline allows to identify, in silico, the variants involved in the miRNA targets from VCF fi les and to generate results easily interpreted and usable by people without bioinformatic skills.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2018
			
	Strutture organizzative
	
				Istituto di Ricerca Genetica e Biomedica - IRGB
			
	Parole chiave
	
				Bioinformatics computational biology
Next-Generation Sequencing NGS
			
	Parole chiave
	
				miRNA
			
	Appare nelle tipologie:
	
				04.02 Abstract in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
2018-poster-abstracts_compressed.pdf accesso aperto Tipologia: Versione Editoriale (PDF) Licenza: Altro tipo di licenza Dimensione 8.14 MB Formato Adobe PDF Visualizza/Apri	8.14 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/538539

Citazioni

ND

ND

ND

social impact