Circular RNAs (circRNAs) are a class of noncoding RNA molecules featuring a closed circular structure. They have been proved to play a significant role in the reduction of many diseases. Besides, many researches in clinical diagnosis and treatment of disease have revealed that circRNA can be considered as a potential biomarker. Therefore, understanding the association of circRNA and diseases can help to forecast some disorders of life activities. However, traditional biological experimental methods are time-consuming. The most common method for circRNA-disease association prediction on the basis of machine learning can avoid this, which relies on diverse data. Nevertheless, topological information of circRNA and disease usually is not involved in these methods. Moreover, circRNAs can be associated with diseases through miRNAs. With these considerations, we proposed a novel method, named THGNCDA, to predict the association between circRNAs and diseases. Specifically, for a certain pair of circRNA and disease, we employ a graph neural network with attention to learn the importance of its each neighbor. In addition, we use a multilayer convolutional neural network to explore the relationship of a circRNA-disease pair based on their attributes. When calculating embeddings, we introduce the information of miRNAs. The results of experiments show that THGNCDA outperformed the SOTA methods. In addition, it can be observed that our method gives a better recall rate. To confirm the significance of attention, we conducted extensive ablation studies. Case studies on Urinary Bladder and Prostatic Neoplasms further show THGNCDA's ability in discovering known relationships between circRNA candidates and diseases.
{"title":"THGNCDA: circRNA-disease association prediction based on triple heterogeneous graph network.","authors":"Yuwei Guo, Ming Yi","doi":"10.1093/bfgp/elad042","DOIUrl":"10.1093/bfgp/elad042","url":null,"abstract":"<p><p>Circular RNAs (circRNAs) are a class of noncoding RNA molecules featuring a closed circular structure. They have been proved to play a significant role in the reduction of many diseases. Besides, many researches in clinical diagnosis and treatment of disease have revealed that circRNA can be considered as a potential biomarker. Therefore, understanding the association of circRNA and diseases can help to forecast some disorders of life activities. However, traditional biological experimental methods are time-consuming. The most common method for circRNA-disease association prediction on the basis of machine learning can avoid this, which relies on diverse data. Nevertheless, topological information of circRNA and disease usually is not involved in these methods. Moreover, circRNAs can be associated with diseases through miRNAs. With these considerations, we proposed a novel method, named THGNCDA, to predict the association between circRNAs and diseases. Specifically, for a certain pair of circRNA and disease, we employ a graph neural network with attention to learn the importance of its each neighbor. In addition, we use a multilayer convolutional neural network to explore the relationship of a circRNA-disease pair based on their attributes. When calculating embeddings, we introduce the information of miRNAs. The results of experiments show that THGNCDA outperformed the SOTA methods. In addition, it can be observed that our method gives a better recall rate. To confirm the significance of attention, we conducted extensive ablation studies. Case studies on Urinary Bladder and Prostatic Neoplasms further show THGNCDA's ability in discovering known relationships between circRNA candidates and diseases.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"384-394"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41152051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Rania Velissari, Mirolyuba Ilieva, James Dao, Henry E Miller, Jens Hedelund Madsen, Jan Gorodkin, Masanori Aikawa, Hideshi Ishii, Shizuka Uchida
The cases of inflammatory bowel disease (IBD) are increasing rapidly around the world. Due to the multifactorial causes of IBD, there is an urgent need to understand the pathogenesis of IBD. As such, the usage of high-throughput techniques to profile genetic mutations, microbiome environments, transcriptome and proteome (e.g. lipidome) is increasing to understand the molecular changes associated with IBD, including two major etiologies of IBD: Crohn disease (CD) and ulcerative colitis (UC). In the case of transcriptome data, RNA sequencing (RNA-seq) technique is used frequently. However, only protein-coding genes are analyzed, leaving behind all other RNAs, including non-coding RNAs (ncRNAs) to be unexplored. Among these ncRNAs, long non-coding RNAs (lncRNAs) may hold keys to understand the pathogenesis of IBD as lncRNAs are expressed in a cell/tissue-specific manner and dysregulated in a disease, such as IBD. However, it is rare that RNA-seq data are analyzed for lncRNAs. To fill this gap in knowledge, we re-analyzed RNA-seq data of CD and UC patients compared with the healthy donors to dissect the expression profiles of lncRNA genes. As inflammation plays key roles in the pathogenesis of IBD, we conducted loss-of-function experiments to provide functional data of IBD-specific lncRNA, lung cancer associated transcript 1 (LUCAT1), in an in vitro model of macrophage polarization. To further facilitate the lncRNA research in IBD, we built a web database, IBDB (https://ibd-db.shinyapps.io/IBDB/), to provide a one-stop-shop for expression profiling of protein-coding and lncRNA genes in IBD patients compared with healthy donors.
{"title":"Systematic analysis and characterization of long non-coding RNA genes in inflammatory bowel disease.","authors":"Rania Velissari, Mirolyuba Ilieva, James Dao, Henry E Miller, Jens Hedelund Madsen, Jan Gorodkin, Masanori Aikawa, Hideshi Ishii, Shizuka Uchida","doi":"10.1093/bfgp/elad044","DOIUrl":"10.1093/bfgp/elad044","url":null,"abstract":"<p><p>The cases of inflammatory bowel disease (IBD) are increasing rapidly around the world. Due to the multifactorial causes of IBD, there is an urgent need to understand the pathogenesis of IBD. As such, the usage of high-throughput techniques to profile genetic mutations, microbiome environments, transcriptome and proteome (e.g. lipidome) is increasing to understand the molecular changes associated with IBD, including two major etiologies of IBD: Crohn disease (CD) and ulcerative colitis (UC). In the case of transcriptome data, RNA sequencing (RNA-seq) technique is used frequently. However, only protein-coding genes are analyzed, leaving behind all other RNAs, including non-coding RNAs (ncRNAs) to be unexplored. Among these ncRNAs, long non-coding RNAs (lncRNAs) may hold keys to understand the pathogenesis of IBD as lncRNAs are expressed in a cell/tissue-specific manner and dysregulated in a disease, such as IBD. However, it is rare that RNA-seq data are analyzed for lncRNAs. To fill this gap in knowledge, we re-analyzed RNA-seq data of CD and UC patients compared with the healthy donors to dissect the expression profiles of lncRNA genes. As inflammation plays key roles in the pathogenesis of IBD, we conducted loss-of-function experiments to provide functional data of IBD-specific lncRNA, lung cancer associated transcript 1 (LUCAT1), in an in vitro model of macrophage polarization. To further facilitate the lncRNA research in IBD, we built a web database, IBDB (https://ibd-db.shinyapps.io/IBDB/), to provide a one-stop-shop for expression profiling of protein-coding and lncRNA genes in IBD patients compared with healthy donors.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"395-405"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11260263/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41157627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ren Junjun, Zhang Zhengqian, Wu Ying, Wang Jialiang, Liu Yongzhuang
Genome sequencing data have become increasingly important in the field of personalized medicine and diagnosis. However, accurately detecting genomic variations remains a challenging task. Traditional variation detection methods rely on manual inspection or predefined rules, which can be time-consuming and prone to errors. Consequently, deep learning-based approaches for variation detection have gained attention due to their ability to automatically learn genomic features that distinguish between variants. In our review, we discuss the recent advancements in deep learning-based algorithms for detecting small variations and structural variations in genomic data, as well as their advantages and limitations.
{"title":"A comprehensive review of deep learning-based variant calling methods.","authors":"Ren Junjun, Zhang Zhengqian, Wu Ying, Wang Jialiang, Liu Yongzhuang","doi":"10.1093/bfgp/elae003","DOIUrl":"10.1093/bfgp/elae003","url":null,"abstract":"<p><p>Genome sequencing data have become increasingly important in the field of personalized medicine and diagnosis. However, accurately detecting genomic variations remains a challenging task. Traditional variation detection methods rely on manual inspection or predefined rules, which can be time-consuming and prone to errors. Consequently, deep learning-based approaches for variation detection have gained attention due to their ability to automatically learn genomic features that distinguish between variants. In our review, we discuss the recent advancements in deep learning-based algorithms for detecting small variations and structural variations in genomic data, as well as their advantages and limitations.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"303-313"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139747852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Combination therapy is a promising strategy for cancers, increasing therapeutic options and reducing drug resistance. Yet, systematic identification of efficacious drug combinations is limited by the combinatorial explosion caused by a large number of possible drug pairs and diseases. At present, machine learning techniques have been widely applied to predict drug combinations, but most studies rely on the response of drug combinations to specific cell lines and are not entirely satisfactory in terms of mechanism interpretability and model scalability. Here, we proposed a novel network propagation-based machine learning framework to predict synergistic drug combinations. Based on the topological information of a comprehensive drug-drug association network, we innovatively introduced an affinity score between drug pairs as one of the features to train machine learning models. We applied network-based strategy to evaluate their therapeutic potential to different cancer types. Finally, we identified 17 specific-, 21 general- and 40 broad-spectrum antitumor drug combinations, in which 69% drug combinations were validated by vitro cellular experiments, 83% drug combinations were validated by literature reports and 100% drug combinations were validated by biological function analyses. By quantifying the network relationships between drug targets and cancer-related driver genes in the human protein-protein interactome, we show the existence of four distinct patterns of drug-drug-disease relationships. We also revealed that 32 biological pathways were correlated with the synergistic mechanism of broad-spectrum antitumor drug combinations. Overall, our model offers a powerful scalable screening framework for cancer treatments.
{"title":"Predicting drug synergy using a network propagation inspired machine learning framework.","authors":"Qing Jin, Xianze Zhang, Diwei Huo, Hongbo Xie, Denan Zhang, Lei Liu, Yashuang Zhao, Xiujie Chen","doi":"10.1093/bfgp/elad056","DOIUrl":"10.1093/bfgp/elad056","url":null,"abstract":"<p><p>Combination therapy is a promising strategy for cancers, increasing therapeutic options and reducing drug resistance. Yet, systematic identification of efficacious drug combinations is limited by the combinatorial explosion caused by a large number of possible drug pairs and diseases. At present, machine learning techniques have been widely applied to predict drug combinations, but most studies rely on the response of drug combinations to specific cell lines and are not entirely satisfactory in terms of mechanism interpretability and model scalability. Here, we proposed a novel network propagation-based machine learning framework to predict synergistic drug combinations. Based on the topological information of a comprehensive drug-drug association network, we innovatively introduced an affinity score between drug pairs as one of the features to train machine learning models. We applied network-based strategy to evaluate their therapeutic potential to different cancer types. Finally, we identified 17 specific-, 21 general- and 40 broad-spectrum antitumor drug combinations, in which 69% drug combinations were validated by vitro cellular experiments, 83% drug combinations were validated by literature reports and 100% drug combinations were validated by biological function analyses. By quantifying the network relationships between drug targets and cancer-related driver genes in the human protein-protein interactome, we show the existence of four distinct patterns of drug-drug-disease relationships. We also revealed that 32 biological pathways were correlated with the synergistic mechanism of broad-spectrum antitumor drug combinations. Overall, our model offers a powerful scalable screening framework for cancer treatments.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"429-440"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139106947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Our understanding of RNA biology has evolved with recent advances in research from it being a non-functional product to molecules of the genome with specific regulatory functions. Competitive endogenous RNA (ceRNA), which has gained prominence over time as an essential part of post-transcriptional regulatory mechanism, is one such example. The ceRNA biology hypothesis states that coding RNA and non-coding RNA co-regulate each other using microRNA (miRNA) response elements. The ceRNA components include long non-coding RNAs, pseudogene and circular RNAs that exert their effect by interacting with miRNA and regulate the expression level of its target genes. Emerging evidence has revealed that the dysregulation of the ceRNA network is attributed to the pathogenesis of various cancers, including the head and neck squamous cell carcinoma (HNSCC). This is the most prevalent cancer developed from the mucosal epithelium in the lip, oral cavity, larynx and pharynx. Although many efforts have been made to comprehend the cause and subsequent treatment of HNSCC, the morbidity and mortality rate remains high. Hence, there is an urgent need to understand the holistic progression of HNSCC, mediated by ceRNA, that can have immense relevance in identifying novel biomarkers with a defined therapeutic intervention. In this review, we have made an effort to highlight the ceRNA biology hypothesis with a focus on its involvement in the progression of HNSCC. For the identification of such ceRNAs, we have additionally highlighted a number of databases and tools.
{"title":"Competing endogenous RNAs in head and neck squamous cell carcinoma: a review.","authors":"Avantika Agrawal, Vaibhav Vindal","doi":"10.1093/bfgp/elad049","DOIUrl":"10.1093/bfgp/elad049","url":null,"abstract":"<p><p>Our understanding of RNA biology has evolved with recent advances in research from it being a non-functional product to molecules of the genome with specific regulatory functions. Competitive endogenous RNA (ceRNA), which has gained prominence over time as an essential part of post-transcriptional regulatory mechanism, is one such example. The ceRNA biology hypothesis states that coding RNA and non-coding RNA co-regulate each other using microRNA (miRNA) response elements. The ceRNA components include long non-coding RNAs, pseudogene and circular RNAs that exert their effect by interacting with miRNA and regulate the expression level of its target genes. Emerging evidence has revealed that the dysregulation of the ceRNA network is attributed to the pathogenesis of various cancers, including the head and neck squamous cell carcinoma (HNSCC). This is the most prevalent cancer developed from the mucosal epithelium in the lip, oral cavity, larynx and pharynx. Although many efforts have been made to comprehend the cause and subsequent treatment of HNSCC, the morbidity and mortality rate remains high. Hence, there is an urgent need to understand the holistic progression of HNSCC, mediated by ceRNA, that can have immense relevance in identifying novel biomarkers with a defined therapeutic intervention. In this review, we have made an effort to highlight the ceRNA biology hypothesis with a focus on its involvement in the progression of HNSCC. For the identification of such ceRNAs, we have additionally highlighted a number of databases and tools.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"335-348"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71523465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tumor mutational burden (TMB) is a significant predictive biomarker for selecting patients that may benefit from immune checkpoint inhibitor therapy. Whole exome sequencing is a common method for measuring TMB; however, its clinical application is limited by the high cost and time-consuming wet-laboratory experiments and bioinformatics analysis. To address this challenge, we downloaded multimodal data of 326 gastric cancer patients from The Cancer Genome Atlas, including histopathological images, clinical data and various molecular data. Using these data, we conducted a comprehensive analysis to investigate the relationship between TMB, clinical factors, gene expression and image features extracted from hematoxylin and eosin images. We further explored the feasibility of predicting TMB levels, i.e. high and low TMB, by utilizing a residual network (Resnet)-based deep learning algorithm for histopathological image analysis. Moreover, we developed a multimodal fusion deep learning model that combines histopathological images with omics data to predict TMB levels. We evaluated the performance of our models against various state-of-the-art methods using different TMB thresholds and obtained promising results. Specifically, our histopathological image analysis model achieved an area under curve (AUC) of 0.749. Notably, the multimodal fusion model significantly outperformed the model that relied only on histopathological images, with the highest AUC of 0.971. Our findings suggest that histopathological images could be used with reasonable accuracy to predict TMB levels in gastric cancer patients, while multimodal deep learning could achieve even higher levels of accuracy. This study sheds new light on predicting TMB in gastric cancer patients.
{"title":"Predicting gastric cancer tumor mutational burden from histopathological images using multimodal deep learning.","authors":"Jing Li, Haiyan Liu, Wei Liu, Peijun Zong, Kaimei Huang, Zibo Li, Haigang Li, Ting Xiong, Geng Tian, Chun Li, Jialiang Yang","doi":"10.1093/bfgp/elad032","DOIUrl":"10.1093/bfgp/elad032","url":null,"abstract":"<p><p>Tumor mutational burden (TMB) is a significant predictive biomarker for selecting patients that may benefit from immune checkpoint inhibitor therapy. Whole exome sequencing is a common method for measuring TMB; however, its clinical application is limited by the high cost and time-consuming wet-laboratory experiments and bioinformatics analysis. To address this challenge, we downloaded multimodal data of 326 gastric cancer patients from The Cancer Genome Atlas, including histopathological images, clinical data and various molecular data. Using these data, we conducted a comprehensive analysis to investigate the relationship between TMB, clinical factors, gene expression and image features extracted from hematoxylin and eosin images. We further explored the feasibility of predicting TMB levels, i.e. high and low TMB, by utilizing a residual network (Resnet)-based deep learning algorithm for histopathological image analysis. Moreover, we developed a multimodal fusion deep learning model that combines histopathological images with omics data to predict TMB levels. We evaluated the performance of our models against various state-of-the-art methods using different TMB thresholds and obtained promising results. Specifically, our histopathological image analysis model achieved an area under curve (AUC) of 0.749. Notably, the multimodal fusion model significantly outperformed the model that relied only on histopathological images, with the highest AUC of 0.971. Our findings suggest that histopathological images could be used with reasonable accuracy to predict TMB levels in gastric cancer patients, while multimodal deep learning could achieve even higher levels of accuracy. This study sheds new light on predicting TMB in gastric cancer patients.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"228-238"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9965530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lung cancer has been the most common and the leading cause of cancer deaths globally. Besides clinicopathological observations and traditional molecular tests, the advent of robust and scalable techniques for nucleic acid analysis has revolutionized biological research and medicinal practice in lung cancer treatment. In response to the demands for minimally invasive procedures and technology development over the past decade, many types of multi-omics data at various genome levels have been generated. As omics data grow, artificial intelligence models, particularly deep learning, are prominent in developing more rapid and effective methods to potentially improve lung cancer patient diagnosis, prognosis and treatment strategy. This decade has seen genome-based deep learning models thriving in various lung cancer tasks, including cancer prediction, subtype classification, prognosis estimation, cancer molecular signatures identification, treatment response prediction and biomarker development. In this study, we summarized available data sources for deep-learning-based lung cancer mining and provided an update on recent deep learning models in lung cancer genomics. Subsequently, we reviewed the current issues and discussed future research directions of deep-learning-based lung cancer genomics research.
{"title":"Omics-based deep learning approaches for lung cancer decision-making and therapeutics development.","authors":"Thi-Oanh Tran, Thanh Hoa Vo, Nguyen Quoc Khanh Le","doi":"10.1093/bfgp/elad031","DOIUrl":"10.1093/bfgp/elad031","url":null,"abstract":"<p><p>Lung cancer has been the most common and the leading cause of cancer deaths globally. Besides clinicopathological observations and traditional molecular tests, the advent of robust and scalable techniques for nucleic acid analysis has revolutionized biological research and medicinal practice in lung cancer treatment. In response to the demands for minimally invasive procedures and technology development over the past decade, many types of multi-omics data at various genome levels have been generated. As omics data grow, artificial intelligence models, particularly deep learning, are prominent in developing more rapid and effective methods to potentially improve lung cancer patient diagnosis, prognosis and treatment strategy. This decade has seen genome-based deep learning models thriving in various lung cancer tasks, including cancer prediction, subtype classification, prognosis estimation, cancer molecular signatures identification, treatment response prediction and biomarker development. In this study, we summarized available data sources for deep-learning-based lung cancer mining and provided an update on recent deep learning models in lung cancer genomics. Subsequently, we reviewed the current issues and discussed future research directions of deep-learning-based lung cancer genomics research.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"181-192"},"PeriodicalIF":2.5,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10281428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tao Pan, Yueying Gao, Gang Xu, Lei Yu, Qi Xu, Jinyang Yu, Meng Liu, Can Zhang, Yanlin Ma, Yongsheng Li
Ion channels, in particular transient-receptor potential (TRP) channels, are essential genes that play important roles in many physiological processes. Emerging evidence has demonstrated that TRP genes are involved in a number of diseases, including various cancer types. However, we still lack knowledge about the expression alterations landscape of TRP genes across cancer types. In this review, we comprehensively reviewed and summarised the transcriptomes from more than 10 000 samples in 33 cancer types. We found that TRP genes were widespreadly transcriptomic dysregulated in cancer, which was associated with clinical survival of cancer patients. Perturbations of TRP genes were associated with a number of cancer pathways across cancer types. Moreover, we reviewed the functions of TRP family gene alterations in a number of diseases reported in recent studies. Taken together, our study comprehensively reviewed TRP genes with extensive transcriptomic alterations and their functions will directly contribute to cancer therapy and precision medicine.
{"title":"Widespread transcriptomic alterations of transient receptor potential channel genes in cancer.","authors":"Tao Pan, Yueying Gao, Gang Xu, Lei Yu, Qi Xu, Jinyang Yu, Meng Liu, Can Zhang, Yanlin Ma, Yongsheng Li","doi":"10.1093/bfgp/elad023","DOIUrl":"10.1093/bfgp/elad023","url":null,"abstract":"<p><p>Ion channels, in particular transient-receptor potential (TRP) channels, are essential genes that play important roles in many physiological processes. Emerging evidence has demonstrated that TRP genes are involved in a number of diseases, including various cancer types. However, we still lack knowledge about the expression alterations landscape of TRP genes across cancer types. In this review, we comprehensively reviewed and summarised the transcriptomes from more than 10 000 samples in 33 cancer types. We found that TRP genes were widespreadly transcriptomic dysregulated in cancer, which was associated with clinical survival of cancer patients. Perturbations of TRP genes were associated with a number of cancer pathways across cancer types. Moreover, we reviewed the functions of TRP family gene alterations in a number of diseases reported in recent studies. Taken together, our study comprehensively reviewed TRP genes with extensive transcriptomic alterations and their functions will directly contribute to cancer therapy and precision medicine.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"214-227"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9948253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Akhil Kumar, Rishika Kaushal, Himanshi Sharma, Khushboo Sharma, Manoj B Menon, Vivekanandan P
We identified 11 conserved stretches in over 6.3 million SARS-CoV-2 genomes including all the major variants of concerns. Each conserved stretch is ≥100 nucleotides in length with ≥99.9% conservation at each nucleotide position. Interestingly, six of the eight conserved stretches in ORF1ab overlapped significantly with well-folded experimentally verified RNA secondary structures. Furthermore, two of the conserved stretches were mapped to regions within the S2-subunit that undergo dynamic structural rearrangements during viral fusion. In addition, the conserved stretches were significantly depleted for zinc-finger antiviral protein (ZAP) binding sites, which facilitated the recognition and degradation of viral RNA. These highly conserved stretches in the SARS-CoV-2 genome were poorly conserved at the nucleotide level among closely related β-coronaviruses, thus representing ideal targets for highly specific and discriminatory diagnostic assays. Our findings highlight the role of structural constraints at both RNA and protein levels that contribute to the sequence conservation of specific genomic regions in SARS-CoV-2.
{"title":"Mapping of long stretches of highly conserved sequences in over 6 million SARS-CoV-2 genomes.","authors":"Akhil Kumar, Rishika Kaushal, Himanshi Sharma, Khushboo Sharma, Manoj B Menon, Vivekanandan P","doi":"10.1093/bfgp/elad027","DOIUrl":"10.1093/bfgp/elad027","url":null,"abstract":"<p><p>We identified 11 conserved stretches in over 6.3 million SARS-CoV-2 genomes including all the major variants of concerns. Each conserved stretch is ≥100 nucleotides in length with ≥99.9% conservation at each nucleotide position. Interestingly, six of the eight conserved stretches in ORF1ab overlapped significantly with well-folded experimentally verified RNA secondary structures. Furthermore, two of the conserved stretches were mapped to regions within the S2-subunit that undergo dynamic structural rearrangements during viral fusion. In addition, the conserved stretches were significantly depleted for zinc-finger antiviral protein (ZAP) binding sites, which facilitated the recognition and degradation of viral RNA. These highly conserved stretches in the SARS-CoV-2 genome were poorly conserved at the nucleotide level among closely related β-coronaviruses, thus representing ideal targets for highly specific and discriminatory diagnostic assays. Our findings highlight the role of structural constraints at both RNA and protein levels that contribute to the sequence conservation of specific genomic regions in SARS-CoV-2.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"256-264"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9824704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tao Tang, Yiping Liu, Binshuang Zheng, Rong Li, Xiaocai Zhang, Yuansheng Liu
Third-generation sequencing (TGS) technologies have revolutionized genome science in the past decade. However, the long-read data produced by TGS platforms suffer from a much higher error rate than that of the previous technologies, thus complicating the downstream analysis. Several error correction tools for long-read data have been developed; these tools can be categorized into hybrid and self-correction tools. So far, these two types of tools are separately investigated, and their interplay remains understudied. Here, we integrate hybrid and self-correction methods for high-quality error correction. Our procedure leverages the inter-similarity between long-read data and high-accuracy information from short reads. We compare the performance of our method and state-of-the-art error correction tools on Escherichia coli and Arabidopsis thaliana datasets. The result shows that the integration approach outperformed the existing error correction methods and holds promise for improving the quality of downstream analyses in genomic research.
{"title":"Integration of hybrid and self-correction method improves the quality of long-read sequencing data.","authors":"Tao Tang, Yiping Liu, Binshuang Zheng, Rong Li, Xiaocai Zhang, Yuansheng Liu","doi":"10.1093/bfgp/elad026","DOIUrl":"10.1093/bfgp/elad026","url":null,"abstract":"<p><p>Third-generation sequencing (TGS) technologies have revolutionized genome science in the past decade. However, the long-read data produced by TGS platforms suffer from a much higher error rate than that of the previous technologies, thus complicating the downstream analysis. Several error correction tools for long-read data have been developed; these tools can be categorized into hybrid and self-correction tools. So far, these two types of tools are separately investigated, and their interplay remains understudied. Here, we integrate hybrid and self-correction methods for high-quality error correction. Our procedure leverages the inter-similarity between long-read data and high-accuracy information from short reads. We compare the performance of our method and state-of-the-art error correction tools on Escherichia coli and Arabidopsis thaliana datasets. The result shows that the integration approach outperformed the existing error correction methods and holds promise for improving the quality of downstream analyses in genomic research.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"249-255"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9669190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}