MiRNAs may be biomarkers associated with the progression of several diseases and great eff orts have been made to discover miRNAs, identify miRNA targets and infer miRNA functions. Diff erent bioinformatic tools and algorithms enabled to predict miRNAs binding regions together with several databases that provided information about miRNA sequences. WGS, WES and RNAseq datasets can represent an optimal support to correlate variants with miRNA target sites at no additional cost. Our goal is to link variant calling information with a potential gain or loss of miRNAs target sites. So far, there is no simply logical and sequential procedure to do this. Using already existing software, we developed a simple pipeline that allows us to identify the impact of variants on miRNA target sites with a biological role. We created a unique database of miRNA's sequences that collected all existing miRNA databases (miRBase, tarbase, mirRSNP, etc) and performed prediction analyses of target sites focused on the most performing existing algorithms (MiRanda, TargetScan). For example, starting from about 23,000 genomic variants that lie in 3'UTR region, a simple input command enabled us to select 4637 lost and 4302 gained target sites. The output is a simple tabular text fi le contains genomic variants, target site, variant impact, associated miRNAs, loss and gain sites and annotation data coming from diff erent databases. The output result was designed to be used in subsequent standard software to apply fi lters based on the desired mode of inheritance and on sample’s genotype quality and variant frequencies. This method could be particularly useful on WES data when the identifi cation of the causal variant has failed and the involvement of miRNAs is assumed, without additional costs. This new automated sequential pipeline allows to identify, in silico, the variants involved in the miRNA targets from VCF fi les and to generate results easily interpreted and usable by people without bioinformatic skills.

Identification of microRNA target sites: in-silico screening on NGS data

Rallo Vincenzo;Angius Andrea
2018

Abstract

MiRNAs may be biomarkers associated with the progression of several diseases and great eff orts have been made to discover miRNAs, identify miRNA targets and infer miRNA functions. Diff erent bioinformatic tools and algorithms enabled to predict miRNAs binding regions together with several databases that provided information about miRNA sequences. WGS, WES and RNAseq datasets can represent an optimal support to correlate variants with miRNA target sites at no additional cost. Our goal is to link variant calling information with a potential gain or loss of miRNAs target sites. So far, there is no simply logical and sequential procedure to do this. Using already existing software, we developed a simple pipeline that allows us to identify the impact of variants on miRNA target sites with a biological role. We created a unique database of miRNA's sequences that collected all existing miRNA databases (miRBase, tarbase, mirRSNP, etc) and performed prediction analyses of target sites focused on the most performing existing algorithms (MiRanda, TargetScan). For example, starting from about 23,000 genomic variants that lie in 3'UTR region, a simple input command enabled us to select 4637 lost and 4302 gained target sites. The output is a simple tabular text fi le contains genomic variants, target site, variant impact, associated miRNAs, loss and gain sites and annotation data coming from diff erent databases. The output result was designed to be used in subsequent standard software to apply fi lters based on the desired mode of inheritance and on sample’s genotype quality and variant frequencies. This method could be particularly useful on WES data when the identifi cation of the causal variant has failed and the involvement of miRNAs is assumed, without additional costs. This new automated sequential pipeline allows to identify, in silico, the variants involved in the miRNA targets from VCF fi les and to generate results easily interpreted and usable by people without bioinformatic skills.
2018
Istituto di Ricerca Genetica e Biomedica - IRGB
Bioinformatics computational biology
Next-Generation Sequencing NGS
miRNA
File in questo prodotto:
File Dimensione Formato  
2018-poster-abstracts_compressed.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Altro tipo di licenza
Dimensione 8.14 MB
Formato Adobe PDF
8.14 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/538539
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact