利用生成式对抗网络整合多组学数据并进行急性髓细胞白血病癌症药物筛选

IF 4.2 3区生物学 Q1 BIOCHEMICAL RESEARCH METHODS Methods Pub Date : 2024-04-24 DOI:10.1016/j.ymeth.2024.04.017

Sabrin Afroz , Nadira Islam , Md Ahsan Habib , Md Selim Reza , Md Ashad Alam

{"title":"利用生成式对抗网络整合多组学数据并进行急性髓细胞白血病癌症药物筛选","authors":"Sabrin Afroz , Nadira Islam , Md Ahsan Habib , Md Selim Reza , Md Ashad Alam","doi":"10.1016/j.ymeth.2024.04.017","DOIUrl":null,"url":null,"abstract":"<div><p>In the era of precision medicine, accurate disease phenotype prediction for heterogeneous diseases, such as cancer, is emerging due to advanced technologies that link genotypes and phenotypes. However, it is difficult to integrate different types of biological data because they are so varied. In this study, we focused on predicting the traits of a blood cancer called Acute Myeloid Leukemia (AML) by combining different kinds of biological data. We used a recently developed method called Omics Generative Adversarial Network (GAN) to better classify cancer outcomes. The primary advantages of a GAN include its ability to create synthetic data that is nearly indistinguishable from real data, its high flexibility, and its wide range of applications, including multi-omics data analysis. In addition, the GAN was effective at combining two types of biological data. We created synthetic datasets for gene activity and DNA methylation. Our method was more accurate in predicting disease traits than using the original data alone. The experimental results provided evidence that the creation of synthetic data through interacting multi-omics data analysis using GANs improves the overall prediction quality. Furthermore, we identified the top-ranked significant genes through statistical methods and pinpointed potential candidate drug agents through in-silico studies. The proposed drugs, also supported by other independent studies, might play a crucial role in the treatment of AML cancer. The code is available on GitHub; <span>https://github.com/SabrinAfroz/omicsGAN_codes?fbclid=IwAR1-/stuffmlE0hyWgSu2wlXo6dYlKUei3faLdlvpxTOOUPVlmYCloXf4Uk9ejK4I</span><svg><path></path></svg></p></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"226 ","pages":"Pages 138-150"},"PeriodicalIF":4.2000,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-omics data integration and drug screening of AML cancer using Generative Adversarial Network\",\"authors\":\"Sabrin Afroz , Nadira Islam , Md Ahsan Habib , Md Selim Reza , Md Ashad Alam\",\"doi\":\"10.1016/j.ymeth.2024.04.017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In the era of precision medicine, accurate disease phenotype prediction for heterogeneous diseases, such as cancer, is emerging due to advanced technologies that link genotypes and phenotypes. However, it is difficult to integrate different types of biological data because they are so varied. In this study, we focused on predicting the traits of a blood cancer called Acute Myeloid Leukemia (AML) by combining different kinds of biological data. We used a recently developed method called Omics Generative Adversarial Network (GAN) to better classify cancer outcomes. The primary advantages of a GAN include its ability to create synthetic data that is nearly indistinguishable from real data, its high flexibility, and its wide range of applications, including multi-omics data analysis. In addition, the GAN was effective at combining two types of biological data. We created synthetic datasets for gene activity and DNA methylation. Our method was more accurate in predicting disease traits than using the original data alone. The experimental results provided evidence that the creation of synthetic data through interacting multi-omics data analysis using GANs improves the overall prediction quality. Furthermore, we identified the top-ranked significant genes through statistical methods and pinpointed potential candidate drug agents through in-silico studies. The proposed drugs, also supported by other independent studies, might play a crucial role in the treatment of AML cancer. The code is available on GitHub; <span>https://github.com/SabrinAfroz/omicsGAN_codes?fbclid=IwAR1-/stuffmlE0hyWgSu2wlXo6dYlKUei3faLdlvpxTOOUPVlmYCloXf4Uk9ejK4I</span><svg><path></path></svg></p></div>\",\"PeriodicalId\":390,\"journal\":{\"name\":\"Methods\",\"volume\":\"226 \",\"pages\":\"Pages 138-150\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1046202324001075\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202324001075","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

摘要

在精准医疗时代，由于先进技术将基因型和表型联系起来，针对癌症等异质性疾病的精确疾病表型预测正在兴起。然而，由于不同类型的生物数据千差万别，因此很难对其进行整合。在本研究中，我们主要通过结合不同类型的生物数据来预测一种名为急性髓性白血病（AML）的血癌的性状。我们使用了最近开发的一种名为 Omics 生成对抗网络（GAN）的方法来更好地对癌症结果进行分类。GAN 的主要优点包括：能够创建与真实数据几乎无异的合成数据；灵活性高；应用范围广，包括多组学数据分析。此外，GAN 还能有效结合两种类型的生物数据。我们创建了基因活性和 DNA 甲基化的合成数据集。与单独使用原始数据相比，我们的方法在预测疾病特征方面更为准确。实验结果证明，通过使用 GANs 进行交互式多组学数据分析来创建合成数据，可以提高整体预测质量。此外，我们还通过统计方法确定了排名靠前的重要基因，并通过内嵌研究确定了潜在的候选药物。这些候选药物也得到了其他独立研究的支持，可能会在急性髓细胞白血病癌症的治疗中发挥重要作用。代码可在 GitHub 上获取；https://github.com/SabrinAfroz/omicsGAN_codes?fbclid=IwAR1-/stuffmlE0hyWgSu2wlXo6dYlKUei3faLdlvpxTOOUPVlmYCloXf4Uk9ejK4I

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Multi-omics data integration and drug screening of AML cancer using Generative Adversarial Network

In the era of precision medicine, accurate disease phenotype prediction for heterogeneous diseases, such as cancer, is emerging due to advanced technologies that link genotypes and phenotypes. However, it is difficult to integrate different types of biological data because they are so varied. In this study, we focused on predicting the traits of a blood cancer called Acute Myeloid Leukemia (AML) by combining different kinds of biological data. We used a recently developed method called Omics Generative Adversarial Network (GAN) to better classify cancer outcomes. The primary advantages of a GAN include its ability to create synthetic data that is nearly indistinguishable from real data, its high flexibility, and its wide range of applications, including multi-omics data analysis. In addition, the GAN was effective at combining two types of biological data. We created synthetic datasets for gene activity and DNA methylation. Our method was more accurate in predicting disease traits than using the original data alone. The experimental results provided evidence that the creation of synthetic data through interacting multi-omics data analysis using GANs improves the overall prediction quality. Furthermore, we identified the top-ranked significant genes through statistical methods and pinpointed potential candidate drug agents through in-silico studies. The proposed drugs, also supported by other independent studies, might play a crucial role in the treatment of AML cancer. The code is available on GitHub; https://github.com/SabrinAfroz/omicsGAN_codes?fbclid=IwAR1-/stuffmlE0hyWgSu2wlXo6dYlKUei3faLdlvpxTOOUPVlmYCloXf4Uk9ejK4I

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Methods 生物-生化研究方法

CiteScore

9.80

自引率

2.10%

发文量

222

审稿时长

11.3 weeks

期刊介绍： Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.