Recently, Next-Generation Sequencing (NGS) has emerged as revolutionised technique in the elds of `-omics' research. The Cancer Research Atlas (TCGA) is a great example of it where massive amount of sequencing data is present for miRNA and mRNA. Analysing these data could bring out some potential biological insight. Moreover, developing a prognostic system on this newly available sequencing data will give a greater help to cancer diagnosis. Hence, in this article, we have made an humble attempt to analyse such sequencing data of miRNA for accurate prediction of Breast Cancer. Generally miRNAs are small non-coding RNAs which are shown to participate in several carcinogenic processes either by tumor suppressors or oncogenes. This is the reason clinical treatment of the breast cancer patient has changed nowadays. Thus it is quite interesting to see the role of miRNAs for the prediction of breast cancer. In this regard, we have developed a technique using Gravitation Search Algorithm, which optimizes the underlying classification performance of Support Vector Machine. This proposed technique can select the potential feature, in this case miRNA, in order to achieve better prediction accuracy. In this study, we have achieved the classification accuracy upto 92% as well as found potential miRNAs. Top eight miRNAs are reported, which are hhsa-miR-10b, hsa-miR-107, hsa-miR- 10a, hsa-let-7a-1, hsa-let-7e, hsa-miR-101-2, hsa-let-7c and hsa-miR-100. The performance of the proposed technique is compared with seven other state-of-the-art techniques. Finally, the results have been justied by means of statistical test along with biological signicance analysis of the selected miRNAs.
Analysis of Next-Generation Sequencing data of miRNA for the Prediction of Breast Cancer
Filippo Geraci;Marco Pellegrini
2015
Abstract
Recently, Next-Generation Sequencing (NGS) has emerged as revolutionised technique in the elds of `-omics' research. The Cancer Research Atlas (TCGA) is a great example of it where massive amount of sequencing data is present for miRNA and mRNA. Analysing these data could bring out some potential biological insight. Moreover, developing a prognostic system on this newly available sequencing data will give a greater help to cancer diagnosis. Hence, in this article, we have made an humble attempt to analyse such sequencing data of miRNA for accurate prediction of Breast Cancer. Generally miRNAs are small non-coding RNAs which are shown to participate in several carcinogenic processes either by tumor suppressors or oncogenes. This is the reason clinical treatment of the breast cancer patient has changed nowadays. Thus it is quite interesting to see the role of miRNAs for the prediction of breast cancer. In this regard, we have developed a technique using Gravitation Search Algorithm, which optimizes the underlying classification performance of Support Vector Machine. This proposed technique can select the potential feature, in this case miRNA, in order to achieve better prediction accuracy. In this study, we have achieved the classification accuracy upto 92% as well as found potential miRNAs. Top eight miRNAs are reported, which are hhsa-miR-10b, hsa-miR-107, hsa-miR- 10a, hsa-let-7a-1, hsa-let-7e, hsa-miR-101-2, hsa-let-7c and hsa-miR-100. The performance of the proposed technique is compared with seven other state-of-the-art techniques. Finally, the results have been justied by means of statistical test along with biological signicance analysis of the selected miRNAs.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.