Evint Leovonzko, Callixta F. Cahyaningrum, Rachmania Ulwani
{"title":"深度学习算法与高斯模糊数据预处理在环状 RNA 分类与检测中的应用","authors":"Evint Leovonzko, Callixta F. Cahyaningrum, Rachmania Ulwani","doi":"10.26685/urncst.601","DOIUrl":null,"url":null,"abstract":"Introduction: Circular RNAs (circRNAs) are increasingly recognized as key regulators of gene expression due to their unique closed-loop structure and involvement in various cellular processes. This study investigates the utilization of machine learning algorithms in predicting circRNA-disease associations. Methods: This study proposes a novel deep learning approach leveraging artificial neural networks (ANN) for circRNA classification. The methodology involves data collection from circRNA databases, k-mers counting for feature extraction, Gaussian blur implementation for data smoothing, and ANN-based model training. Results: Evaluation of the trained models based on precision, recall, and f1-score metrics shows an overall accuracy of 0.7511, with an average precision score of 0.7982, recall of 0.7511, and f1-score of 0.7637. Discussion: The results indicate that our ANN-based algorithm effectively detects and classifies circRNA datasets with considerable accuracy. Compared to the algorithm from past research, our algorithm is also shown to have less computational power. Conclusion: Comparative analysis demonstrates improved performance compared to previous algorithms, suggesting its potential for widespread implementation due to reduced computational requirements and simpler implementation.","PeriodicalId":245521,"journal":{"name":"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal","volume":" 8","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Implementation of Deep Learning Algorithm with Gaussian Blur Data Preprocessing in Circular RNA Classification and Detection\",\"authors\":\"Evint Leovonzko, Callixta F. Cahyaningrum, Rachmania Ulwani\",\"doi\":\"10.26685/urncst.601\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Circular RNAs (circRNAs) are increasingly recognized as key regulators of gene expression due to their unique closed-loop structure and involvement in various cellular processes. This study investigates the utilization of machine learning algorithms in predicting circRNA-disease associations. Methods: This study proposes a novel deep learning approach leveraging artificial neural networks (ANN) for circRNA classification. The methodology involves data collection from circRNA databases, k-mers counting for feature extraction, Gaussian blur implementation for data smoothing, and ANN-based model training. Results: Evaluation of the trained models based on precision, recall, and f1-score metrics shows an overall accuracy of 0.7511, with an average precision score of 0.7982, recall of 0.7511, and f1-score of 0.7637. Discussion: The results indicate that our ANN-based algorithm effectively detects and classifies circRNA datasets with considerable accuracy. Compared to the algorithm from past research, our algorithm is also shown to have less computational power. Conclusion: Comparative analysis demonstrates improved performance compared to previous algorithms, suggesting its potential for widespread implementation due to reduced computational requirements and simpler implementation.\",\"PeriodicalId\":245521,\"journal\":{\"name\":\"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal\",\"volume\":\" 8\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26685/urncst.601\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26685/urncst.601","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
导言:环状 RNA(circRNA)因其独特的闭环结构和参与各种细胞过程,越来越被认为是基因表达的关键调控因子。本研究探讨了如何利用机器学习算法预测 circRNA 与疾病的关联。方法:本研究提出了一种利用人工神经网络(ANN)进行 circRNA 分类的新型深度学习方法。该方法涉及从 circRNA 数据库中收集数据、用于特征提取的 k-mers 计数、用于数据平滑的高斯模糊实现以及基于 ANN 的模型训练。结果:根据精确度、召回率和 f1 分数指标对训练的模型进行评估,结果显示总体精确度为 0.7511,平均精确度为 0.7982,召回率为 0.7511,f1 分数为 0.7637。讨论结果表明,我们基于 ANN 的算法能有效地对 circRNA 数据集进行检测和分类,且准确率相当高。与过去研究的算法相比,我们的算法的计算能力也更低。结论对比分析表明,与以前的算法相比,我们的算法性能有所提高,而且由于计算要求降低、实施更简单,因此有望得到广泛应用。
The Implementation of Deep Learning Algorithm with Gaussian Blur Data Preprocessing in Circular RNA Classification and Detection
Introduction: Circular RNAs (circRNAs) are increasingly recognized as key regulators of gene expression due to their unique closed-loop structure and involvement in various cellular processes. This study investigates the utilization of machine learning algorithms in predicting circRNA-disease associations. Methods: This study proposes a novel deep learning approach leveraging artificial neural networks (ANN) for circRNA classification. The methodology involves data collection from circRNA databases, k-mers counting for feature extraction, Gaussian blur implementation for data smoothing, and ANN-based model training. Results: Evaluation of the trained models based on precision, recall, and f1-score metrics shows an overall accuracy of 0.7511, with an average precision score of 0.7982, recall of 0.7511, and f1-score of 0.7637. Discussion: The results indicate that our ANN-based algorithm effectively detects and classifies circRNA datasets with considerable accuracy. Compared to the algorithm from past research, our algorithm is also shown to have less computational power. Conclusion: Comparative analysis demonstrates improved performance compared to previous algorithms, suggesting its potential for widespread implementation due to reduced computational requirements and simpler implementation.