Identification and molecular modelling of potential drugs targeting the genes involved in the progression of lung cancer in patients with idiopathic pulmonary fibrosis
{"title":"Identification and molecular modelling of potential drugs targeting the genes involved in the progression of lung cancer in patients with idiopathic pulmonary fibrosis","authors":"Sanjukta Dasgupta","doi":"10.1016/j.genrep.2024.102067","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Idiopathic pulmonary fibrosis (IPF) is characterized by progressive fibrosis in the lung parenchyma. Given the fact that IPF patients are at significant risk of developing lung cancer (LC), the overlapping gene signatures between IPF and LC need to be explored.</div></div><div><h3>Methods</h3><div>Two datasets (GSE79544 and GSE103888) were procured from the Gene Expression Omnibus repository and used to determine the overlapping genes between IPF and LC. Next, the prediction ability of these genes in differentiating the diseased group from controls was explored using two machine learning (ML) models (random forest and k-nearest neighbor). Potential drugs targeting the candidate genes were identified, and advanced structural analysis was conducted to determine the binding affinity between the candidate drug and target receptor.</div></div><div><h3>Result</h3><div>A total of ten common genes (<em>CCL13</em>, <em>CXCL2</em>, <em>MALT1</em>, <em>MARCKS</em>, <em>PLA2G7</em>, <em>SEMA6B</em>, <em>SFTPB</em>, <em>SPARC</em>, <em>SPP1</em>, and <em>TLCD2</em>) are differentially expressed in IPF and LC as compared to the controls. <em>PLA2G7</em> demonstrated promising potential in differentiating between IPF, LC, and controls. The increased expression correlated with poor survival in patients with LC. The expression of <em>PLA2G7</em> indicated a similar trend in the validation dataset. Darapladib, a selective inhibitor that belongs to toxicity class 4 and lethal dose50 value of 800 mg/kg exhibited maximum potential in targeting <em>PLA2G7</em> with a binding affinity score of −9.2 kcal/mol (chain A) and −9.3 kcal/mol (chain B), respectively.</div></div><div><h3>Conclusion</h3><div>The present study is the first of its kind that combines in-silico and ML algorithms to identify the gene signatures and promising drugs for treating the progression of LC in patients with IPF.</div></div>","PeriodicalId":12673,"journal":{"name":"Gene Reports","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gene Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452014424001900","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Idiopathic pulmonary fibrosis (IPF) is characterized by progressive fibrosis in the lung parenchyma. Given the fact that IPF patients are at significant risk of developing lung cancer (LC), the overlapping gene signatures between IPF and LC need to be explored.
Methods
Two datasets (GSE79544 and GSE103888) were procured from the Gene Expression Omnibus repository and used to determine the overlapping genes between IPF and LC. Next, the prediction ability of these genes in differentiating the diseased group from controls was explored using two machine learning (ML) models (random forest and k-nearest neighbor). Potential drugs targeting the candidate genes were identified, and advanced structural analysis was conducted to determine the binding affinity between the candidate drug and target receptor.
Result
A total of ten common genes (CCL13, CXCL2, MALT1, MARCKS, PLA2G7, SEMA6B, SFTPB, SPARC, SPP1, and TLCD2) are differentially expressed in IPF and LC as compared to the controls. PLA2G7 demonstrated promising potential in differentiating between IPF, LC, and controls. The increased expression correlated with poor survival in patients with LC. The expression of PLA2G7 indicated a similar trend in the validation dataset. Darapladib, a selective inhibitor that belongs to toxicity class 4 and lethal dose50 value of 800 mg/kg exhibited maximum potential in targeting PLA2G7 with a binding affinity score of −9.2 kcal/mol (chain A) and −9.3 kcal/mol (chain B), respectively.
Conclusion
The present study is the first of its kind that combines in-silico and ML algorithms to identify the gene signatures and promising drugs for treating the progression of LC in patients with IPF.
Gene ReportsBiochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.30
自引率
7.70%
发文量
246
审稿时长
49 days
期刊介绍:
Gene Reports publishes papers that focus on the regulation, expression, function and evolution of genes in all biological contexts, including all prokaryotic and eukaryotic organisms, as well as viruses. Gene Reports strives to be a very diverse journal and topics in all fields will be considered for publication. Although not limited to the following, some general topics include: DNA Organization, Replication & Evolution -Focus on genomic DNA (chromosomal organization, comparative genomics, DNA replication, DNA repair, mobile DNA, mitochondrial DNA, chloroplast DNA). Expression & Function - Focus on functional RNAs (microRNAs, tRNAs, rRNAs, mRNA splicing, alternative polyadenylation) Regulation - Focus on processes that mediate gene-read out (epigenetics, chromatin, histone code, transcription, translation, protein degradation). Cell Signaling - Focus on mechanisms that control information flow into the nucleus to control gene expression (kinase and phosphatase pathways controlled by extra-cellular ligands, Wnt, Notch, TGFbeta/BMPs, FGFs, IGFs etc.) Profiling of gene expression and genetic variation - Focus on high throughput approaches (e.g., DeepSeq, ChIP-Seq, Affymetrix microarrays, proteomics) that define gene regulatory circuitry, molecular pathways and protein/protein networks. Genetics - Focus on development in model organisms (e.g., mouse, frog, fruit fly, worm), human genetic variation, population genetics, as well as agricultural and veterinary genetics. Molecular Pathology & Regenerative Medicine - Focus on the deregulation of molecular processes in human diseases and mechanisms supporting regeneration of tissues through pluripotent or multipotent stem cells.