{"title":"通过网络模型整合异质生物学数据预测基因-药物-疾病相互作用","authors":"H. Hanaf, B. Hassani, M. Kbir","doi":"10.15849/ijasca.220328.03","DOIUrl":null,"url":null,"abstract":"Abstract Prediction of gene-drug-disease interactions have talented new insights in biology. Discovering unknown interactions will provide new therapeutic approaches to explore gene expressions. Recent improvements in machine learning techniques have gotten considerable interest due to higher efficiency, accurate results, and their lower cost. However, most of the studies were ignoring relevant associations, by representing only drug-disease interactions on a network while public available data offers a large variety of interactions. Additionally, some computational techniques used in this domain are faced with new challenges, related to the organization of heterogeneous data which suffer from a high imbalance rate since there are extensively more non-interacting gene-drug-disease triplets than interacting ones. In this paper we present integration of heterogeneous biological data about genes, drugs, and diseases to build a model, and building a new graph representation relating genedrug-disease interactions. Using extreme gradient boosting (XGBoost) algorithm, we have been able to extract a list of valid interactions about gene-drug-disease triplets, and a list of gene-drug pairs related to lung cancer. Keywords: Biological heterogeneous data, Data integration, Gene-DrugDisease interactions, Machine learning.","PeriodicalId":38638,"journal":{"name":"International Journal of Advances in Soft Computing and its Applications","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Predicting Gene-Drug-Disease Interactions by integrating Heterogeneous Biological Data Through a Network Model\",\"authors\":\"H. Hanaf, B. Hassani, M. Kbir\",\"doi\":\"10.15849/ijasca.220328.03\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Prediction of gene-drug-disease interactions have talented new insights in biology. Discovering unknown interactions will provide new therapeutic approaches to explore gene expressions. Recent improvements in machine learning techniques have gotten considerable interest due to higher efficiency, accurate results, and their lower cost. However, most of the studies were ignoring relevant associations, by representing only drug-disease interactions on a network while public available data offers a large variety of interactions. Additionally, some computational techniques used in this domain are faced with new challenges, related to the organization of heterogeneous data which suffer from a high imbalance rate since there are extensively more non-interacting gene-drug-disease triplets than interacting ones. In this paper we present integration of heterogeneous biological data about genes, drugs, and diseases to build a model, and building a new graph representation relating genedrug-disease interactions. Using extreme gradient boosting (XGBoost) algorithm, we have been able to extract a list of valid interactions about gene-drug-disease triplets, and a list of gene-drug pairs related to lung cancer. Keywords: Biological heterogeneous data, Data integration, Gene-DrugDisease interactions, Machine learning.\",\"PeriodicalId\":38638,\"journal\":{\"name\":\"International Journal of Advances in Soft Computing and its Applications\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Advances in Soft Computing and its Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15849/ijasca.220328.03\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Advances in Soft Computing and its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15849/ijasca.220328.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
Predicting Gene-Drug-Disease Interactions by integrating Heterogeneous Biological Data Through a Network Model
Abstract Prediction of gene-drug-disease interactions have talented new insights in biology. Discovering unknown interactions will provide new therapeutic approaches to explore gene expressions. Recent improvements in machine learning techniques have gotten considerable interest due to higher efficiency, accurate results, and their lower cost. However, most of the studies were ignoring relevant associations, by representing only drug-disease interactions on a network while public available data offers a large variety of interactions. Additionally, some computational techniques used in this domain are faced with new challenges, related to the organization of heterogeneous data which suffer from a high imbalance rate since there are extensively more non-interacting gene-drug-disease triplets than interacting ones. In this paper we present integration of heterogeneous biological data about genes, drugs, and diseases to build a model, and building a new graph representation relating genedrug-disease interactions. Using extreme gradient boosting (XGBoost) algorithm, we have been able to extract a list of valid interactions about gene-drug-disease triplets, and a list of gene-drug pairs related to lung cancer. Keywords: Biological heterogeneous data, Data integration, Gene-DrugDisease interactions, Machine learning.
期刊介绍:
The aim of this journal is to provide a lively forum for the communication of original research papers and timely review articles on Advances in Soft Computing and Its Applications. IJASCA will publish only articles of the highest quality. Submissions will be evaluated on their originality and significance. IJASCA invites submissions in all areas of Soft Computing and Its Applications. The scope of the journal includes, but is not limited to: √ Soft Computing Fundamental and Optimization √ Soft Computing for Big Data Era √ GPU Computing for Machine Learning √ Soft Computing Modeling for Perception and Spiritual Intelligence √ Soft Computing and Agents Technology √ Soft Computing in Computer Graphics √ Soft Computing and Pattern Recognition √ Soft Computing in Biomimetic Pattern Recognition √ Data mining for Social Network Data √ Spatial Data Mining & Information Retrieval √ Intelligent Software Agent Systems and Architectures √ Advanced Soft Computing and Multi-Objective Evolutionary Computation √ Perception-Based Intelligent Decision Systems √ Spiritual-Based Intelligent Systems √ Soft Computing in Industry ApplicationsOther issues related to the Advances of Soft Computing in various applications.