Lung cancer has been the most common and the leading cause of cancer deaths globally. Besides clinicopathological observations and traditional molecular tests, the advent of robust and scalable techniques for nucleic acid analysis has revolutionized biological research and medicinal practice in lung cancer treatment. In response to the demands for minimally invasive procedures and technology development over the past decade, many types of multi-omics data at various genome levels have been generated. As omics data grow, artificial intelligence models, particularly deep learning, are prominent in developing more rapid and effective methods to potentially improve lung cancer patient diagnosis, prognosis and treatment strategy. This decade has seen genome-based deep learning models thriving in various lung cancer tasks, including cancer prediction, subtype classification, prognosis estimation, cancer molecular signatures identification, treatment response prediction and biomarker development. In this study, we summarized available data sources for deep-learning-based lung cancer mining and provided an update on recent deep learning models in lung cancer genomics. Subsequently, we reviewed the current issues and discussed future research directions of deep-learning-based lung cancer genomics research.
{"title":"Omics-based deep learning approaches for lung cancer decision-making and therapeutics development.","authors":"Thi-Oanh Tran, Thanh Hoa Vo, Nguyen Quoc Khanh Le","doi":"10.1093/bfgp/elad031","DOIUrl":"10.1093/bfgp/elad031","url":null,"abstract":"<p><p>Lung cancer has been the most common and the leading cause of cancer deaths globally. Besides clinicopathological observations and traditional molecular tests, the advent of robust and scalable techniques for nucleic acid analysis has revolutionized biological research and medicinal practice in lung cancer treatment. In response to the demands for minimally invasive procedures and technology development over the past decade, many types of multi-omics data at various genome levels have been generated. As omics data grow, artificial intelligence models, particularly deep learning, are prominent in developing more rapid and effective methods to potentially improve lung cancer patient diagnosis, prognosis and treatment strategy. This decade has seen genome-based deep learning models thriving in various lung cancer tasks, including cancer prediction, subtype classification, prognosis estimation, cancer molecular signatures identification, treatment response prediction and biomarker development. In this study, we summarized available data sources for deep-learning-based lung cancer mining and provided an update on recent deep learning models in lung cancer genomics. Subsequently, we reviewed the current issues and discussed future research directions of deep-learning-based lung cancer genomics research.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"181-192"},"PeriodicalIF":2.5,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10281428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tao Pan, Yueying Gao, Gang Xu, Lei Yu, Qi Xu, Jinyang Yu, Meng Liu, Can Zhang, Yanlin Ma, Yongsheng Li
Ion channels, in particular transient-receptor potential (TRP) channels, are essential genes that play important roles in many physiological processes. Emerging evidence has demonstrated that TRP genes are involved in a number of diseases, including various cancer types. However, we still lack knowledge about the expression alterations landscape of TRP genes across cancer types. In this review, we comprehensively reviewed and summarised the transcriptomes from more than 10 000 samples in 33 cancer types. We found that TRP genes were widespreadly transcriptomic dysregulated in cancer, which was associated with clinical survival of cancer patients. Perturbations of TRP genes were associated with a number of cancer pathways across cancer types. Moreover, we reviewed the functions of TRP family gene alterations in a number of diseases reported in recent studies. Taken together, our study comprehensively reviewed TRP genes with extensive transcriptomic alterations and their functions will directly contribute to cancer therapy and precision medicine.
{"title":"Widespread transcriptomic alterations of transient receptor potential channel genes in cancer.","authors":"Tao Pan, Yueying Gao, Gang Xu, Lei Yu, Qi Xu, Jinyang Yu, Meng Liu, Can Zhang, Yanlin Ma, Yongsheng Li","doi":"10.1093/bfgp/elad023","DOIUrl":"10.1093/bfgp/elad023","url":null,"abstract":"<p><p>Ion channels, in particular transient-receptor potential (TRP) channels, are essential genes that play important roles in many physiological processes. Emerging evidence has demonstrated that TRP genes are involved in a number of diseases, including various cancer types. However, we still lack knowledge about the expression alterations landscape of TRP genes across cancer types. In this review, we comprehensively reviewed and summarised the transcriptomes from more than 10 000 samples in 33 cancer types. We found that TRP genes were widespreadly transcriptomic dysregulated in cancer, which was associated with clinical survival of cancer patients. Perturbations of TRP genes were associated with a number of cancer pathways across cancer types. Moreover, we reviewed the functions of TRP family gene alterations in a number of diseases reported in recent studies. Taken together, our study comprehensively reviewed TRP genes with extensive transcriptomic alterations and their functions will directly contribute to cancer therapy and precision medicine.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"214-227"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9948253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Akhil Kumar, Rishika Kaushal, Himanshi Sharma, Khushboo Sharma, Manoj B Menon, Vivekanandan P
We identified 11 conserved stretches in over 6.3 million SARS-CoV-2 genomes including all the major variants of concerns. Each conserved stretch is ≥100 nucleotides in length with ≥99.9% conservation at each nucleotide position. Interestingly, six of the eight conserved stretches in ORF1ab overlapped significantly with well-folded experimentally verified RNA secondary structures. Furthermore, two of the conserved stretches were mapped to regions within the S2-subunit that undergo dynamic structural rearrangements during viral fusion. In addition, the conserved stretches were significantly depleted for zinc-finger antiviral protein (ZAP) binding sites, which facilitated the recognition and degradation of viral RNA. These highly conserved stretches in the SARS-CoV-2 genome were poorly conserved at the nucleotide level among closely related β-coronaviruses, thus representing ideal targets for highly specific and discriminatory diagnostic assays. Our findings highlight the role of structural constraints at both RNA and protein levels that contribute to the sequence conservation of specific genomic regions in SARS-CoV-2.
{"title":"Mapping of long stretches of highly conserved sequences in over 6 million SARS-CoV-2 genomes.","authors":"Akhil Kumar, Rishika Kaushal, Himanshi Sharma, Khushboo Sharma, Manoj B Menon, Vivekanandan P","doi":"10.1093/bfgp/elad027","DOIUrl":"10.1093/bfgp/elad027","url":null,"abstract":"<p><p>We identified 11 conserved stretches in over 6.3 million SARS-CoV-2 genomes including all the major variants of concerns. Each conserved stretch is ≥100 nucleotides in length with ≥99.9% conservation at each nucleotide position. Interestingly, six of the eight conserved stretches in ORF1ab overlapped significantly with well-folded experimentally verified RNA secondary structures. Furthermore, two of the conserved stretches were mapped to regions within the S2-subunit that undergo dynamic structural rearrangements during viral fusion. In addition, the conserved stretches were significantly depleted for zinc-finger antiviral protein (ZAP) binding sites, which facilitated the recognition and degradation of viral RNA. These highly conserved stretches in the SARS-CoV-2 genome were poorly conserved at the nucleotide level among closely related β-coronaviruses, thus representing ideal targets for highly specific and discriminatory diagnostic assays. Our findings highlight the role of structural constraints at both RNA and protein levels that contribute to the sequence conservation of specific genomic regions in SARS-CoV-2.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"256-264"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9824704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tao Tang, Yiping Liu, Binshuang Zheng, Rong Li, Xiaocai Zhang, Yuansheng Liu
Third-generation sequencing (TGS) technologies have revolutionized genome science in the past decade. However, the long-read data produced by TGS platforms suffer from a much higher error rate than that of the previous technologies, thus complicating the downstream analysis. Several error correction tools for long-read data have been developed; these tools can be categorized into hybrid and self-correction tools. So far, these two types of tools are separately investigated, and their interplay remains understudied. Here, we integrate hybrid and self-correction methods for high-quality error correction. Our procedure leverages the inter-similarity between long-read data and high-accuracy information from short reads. We compare the performance of our method and state-of-the-art error correction tools on Escherichia coli and Arabidopsis thaliana datasets. The result shows that the integration approach outperformed the existing error correction methods and holds promise for improving the quality of downstream analyses in genomic research.
{"title":"Integration of hybrid and self-correction method improves the quality of long-read sequencing data.","authors":"Tao Tang, Yiping Liu, Binshuang Zheng, Rong Li, Xiaocai Zhang, Yuansheng Liu","doi":"10.1093/bfgp/elad026","DOIUrl":"10.1093/bfgp/elad026","url":null,"abstract":"<p><p>Third-generation sequencing (TGS) technologies have revolutionized genome science in the past decade. However, the long-read data produced by TGS platforms suffer from a much higher error rate than that of the previous technologies, thus complicating the downstream analysis. Several error correction tools for long-read data have been developed; these tools can be categorized into hybrid and self-correction tools. So far, these two types of tools are separately investigated, and their interplay remains understudied. Here, we integrate hybrid and self-correction methods for high-quality error correction. Our procedure leverages the inter-similarity between long-read data and high-accuracy information from short reads. We compare the performance of our method and state-of-the-art error correction tools on Escherichia coli and Arabidopsis thaliana datasets. The result shows that the integration approach outperformed the existing error correction methods and holds promise for improving the quality of downstream analyses in genomic research.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"249-255"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9669190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yue-Chao Li, Zhu-Hong You, Chang-Qing Yu, Lei Wang, Lun Hu, Peng-Wei Hu, Yan Qiao, Xin-Fei Wang, Yu-An Huang
Recently, the role of competing endogenous RNAs in regulating gene expression through the interaction of microRNAs has been closely associated with the expression of circular RNAs (circRNAs) in various biological processes such as reproduction and apoptosis. While the number of confirmed circRNA-miRNA interactions (CMIs) continues to increase, the conventional in vitro approaches for discovery are expensive, labor intensive, and time consuming. Therefore, there is an urgent need for effective prediction of potential CMIs through appropriate data modeling and prediction based on known information. In this study, we proposed a novel model, called DeepCMI, that utilizes multi-source information on circRNA/miRNA to predict potential CMIs. Comprehensive evaluations on the CMI-9905 and CMI-9589 datasets demonstrated that DeepCMI successfully infers potential CMIs. Specifically, DeepCMI achieved AUC values of 90.54% and 94.8% on the CMI-9905 and CMI-9589 datasets, respectively. These results suggest that DeepCMI is an effective model for predicting potential CMIs and has the potential to significantly reduce the need for downstream in vitro studies. To facilitate the use of our trained model and data, we have constructed a computational platform, which is available at http://120.77.11.78/DeepCMI/. The source code and datasets used in this work are available at https://github.com/LiYuechao1998/DeepCMI.
{"title":"DeepCMI: a graph-based model for accurate prediction of circRNA-miRNA interactions with multiple information.","authors":"Yue-Chao Li, Zhu-Hong You, Chang-Qing Yu, Lei Wang, Lun Hu, Peng-Wei Hu, Yan Qiao, Xin-Fei Wang, Yu-An Huang","doi":"10.1093/bfgp/elad030","DOIUrl":"10.1093/bfgp/elad030","url":null,"abstract":"<p><p>Recently, the role of competing endogenous RNAs in regulating gene expression through the interaction of microRNAs has been closely associated with the expression of circular RNAs (circRNAs) in various biological processes such as reproduction and apoptosis. While the number of confirmed circRNA-miRNA interactions (CMIs) continues to increase, the conventional in vitro approaches for discovery are expensive, labor intensive, and time consuming. Therefore, there is an urgent need for effective prediction of potential CMIs through appropriate data modeling and prediction based on known information. In this study, we proposed a novel model, called DeepCMI, that utilizes multi-source information on circRNA/miRNA to predict potential CMIs. Comprehensive evaluations on the CMI-9905 and CMI-9589 datasets demonstrated that DeepCMI successfully infers potential CMIs. Specifically, DeepCMI achieved AUC values of 90.54% and 94.8% on the CMI-9905 and CMI-9589 datasets, respectively. These results suggest that DeepCMI is an effective model for predicting potential CMIs and has the potential to significantly reduce the need for downstream in vitro studies. To facilitate the use of our trained model and data, we have constructed a computational platform, which is available at http://120.77.11.78/DeepCMI/. The source code and datasets used in this work are available at https://github.com/LiYuechao1998/DeepCMI.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"276-285"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10291543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yizhi Cui, Hongzhi Liu, Yutong Ming, Zheng Zhang, Li Liu, Ruijun Liu
G-quadruplex (G4), a non-classical deoxyribonucleic acid structure, is widely distributed in the genome and involved in various biological processes. In vivo, high-throughput sequencing has indicated that G4s are significantly enriched at functional regions in a cell-type-specific manner. Therefore, the prediction of G4s based on computational methods is necessary instead of the time-consuming and laborious experimental methods. Recently, G4 CUT&Tag has been developed to generate higher-resolution sequencing data than ChIP-seq, which provides more accurate training samples for model construction. In this paper, we present a new dataset construction method based on G4 CUT&Tag sequencing data and an XGBoost prediction model based on the machine learning boost method. The results show that our model performs well within and across cell types. Furthermore, sequence analysis indicates that the formation of G4 structure is greatly affected by the flanking sequences, and the GC content of the G4 flanking sequences is higher than non-G4. Moreover, we also identified G4 motifs in the high-resolution dataset, among which we found several motifs for known transcription factors (TFs), such as SP2 and BPC. These TFs may directly or indirectly affect the formation of the G4 structure.
{"title":"Prediction of strand-specific and cell-type-specific G-quadruplexes based on high-resolution CUT&Tag data.","authors":"Yizhi Cui, Hongzhi Liu, Yutong Ming, Zheng Zhang, Li Liu, Ruijun Liu","doi":"10.1093/bfgp/elad024","DOIUrl":"10.1093/bfgp/elad024","url":null,"abstract":"<p><p>G-quadruplex (G4), a non-classical deoxyribonucleic acid structure, is widely distributed in the genome and involved in various biological processes. In vivo, high-throughput sequencing has indicated that G4s are significantly enriched at functional regions in a cell-type-specific manner. Therefore, the prediction of G4s based on computational methods is necessary instead of the time-consuming and laborious experimental methods. Recently, G4 CUT&Tag has been developed to generate higher-resolution sequencing data than ChIP-seq, which provides more accurate training samples for model construction. In this paper, we present a new dataset construction method based on G4 CUT&Tag sequencing data and an XGBoost prediction model based on the machine learning boost method. The results show that our model performs well within and across cell types. Furthermore, sequence analysis indicates that the formation of G4 structure is greatly affected by the flanking sequences, and the GC content of the G4 flanking sequences is higher than non-G4. Moreover, we also identified G4 motifs in the high-resolution dataset, among which we found several motifs for known transcription factors (TFs), such as SP2 and BPC. These TFs may directly or indirectly affect the formation of the G4 structure.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"265-275"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9683854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ting He, Zhipeng Gao, Ling Lin, Xu Zhang, Quan Zou
Esophageal cancer (ESCA) has a bad prognosis. Long non-coding RNA (lncRNA) impacts on cell proliferation. However, the prognosis function of N6-methyladenosine (m6A)-associated lncRNAs (m6A-lncRNAs) in ESCA remains unknown. Univariate Cox analysis was applied to investigate prognosis related m6A-lncRNAs, based on which the samples were clustered. Wilcoxon rank and Chi-square tests were adopted to compare the clinical traits, survival, pathway activity and immune infiltration in different clusters where overall survival, clinical traits (N stage), tumor-invasive immune cells and pathway activity were found significantly different. Through least absolute shrinkage and selection operator and proportional hazard (Lasso-Cox) model, five m6A-lncRNAs were selected to construct the prognostic signature (m6A-lncSig) and risk score. To investigate the link between risk score and clinical traits or immunological microenvironments, Chi-square test and Spearman correlation analysis were utilized. Risk score was found connected with N stage, tumor stage, different clusters, macrophages M2, B cells naive and T cells CD4 memory resting. Risk score and tumor stage were found as independent prognostic variables. And the constructed nomogram model had high accuracy in predicting prognosis. The obtained m6A-lncSig could be taken as potential prognostic biomarker for ESCA patients. This study offers a theoretical foundation for clinical diagnosis and prognosis of ESCA.
{"title":"Prognostic signature analysis and survival prediction of esophageal cancer based on N6-methyladenosine associated lncRNAs.","authors":"Ting He, Zhipeng Gao, Ling Lin, Xu Zhang, Quan Zou","doi":"10.1093/bfgp/elad028","DOIUrl":"10.1093/bfgp/elad028","url":null,"abstract":"<p><p>Esophageal cancer (ESCA) has a bad prognosis. Long non-coding RNA (lncRNA) impacts on cell proliferation. However, the prognosis function of N6-methyladenosine (m6A)-associated lncRNAs (m6A-lncRNAs) in ESCA remains unknown. Univariate Cox analysis was applied to investigate prognosis related m6A-lncRNAs, based on which the samples were clustered. Wilcoxon rank and Chi-square tests were adopted to compare the clinical traits, survival, pathway activity and immune infiltration in different clusters where overall survival, clinical traits (N stage), tumor-invasive immune cells and pathway activity were found significantly different. Through least absolute shrinkage and selection operator and proportional hazard (Lasso-Cox) model, five m6A-lncRNAs were selected to construct the prognostic signature (m6A-lncSig) and risk score. To investigate the link between risk score and clinical traits or immunological microenvironments, Chi-square test and Spearman correlation analysis were utilized. Risk score was found connected with N stage, tumor stage, different clusters, macrophages M2, B cells naive and T cells CD4 memory resting. Risk score and tumor stage were found as independent prognostic variables. And the constructed nomogram model had high accuracy in predicting prognosis. The obtained m6A-lncSig could be taken as potential prognostic biomarker for ESCA patients. This study offers a theoretical foundation for clinical diagnosis and prognosis of ESCA.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"239-248"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9886829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dayu Tan, Haijun Jiang, Haitao Li, Ying Xie, Yansen Su
The precise identification of drug-protein inter action (DPI) can significantly speed up the drug discovery process. Bioassay methods are time-consuming and expensive to screen for each pair of drug proteins. Machine-learning-based methods cannot accurately predict a large number of DPIs. Compared with traditional computing methods, deep learning methods need less domain knowledge and have strong data learning ability. In this study, we construct a DPI prediction model based on dual channel neural networks with an efficient path attention mechanism, called DCA-DPI. The drug molecular graph and protein sequence are used as the data input of the model, and the residual graph neural network and the residual convolution network are used to learn the feature representation of the drug and protein, respectively, to obtain the feature vector of the drug and the hidden vector of protein. To get a more accurate protein feature vector, the weighted sum of the hidden vector of protein is applied using the neural attention mechanism. In the end, drug and protein vectors are concatenated and input into the full connection layer for classification. In order to evaluate the performance of DCA-DPI, three widely used public data, Human, C.elegans and DUD-E, are used in the experiment. The evaluation metrics values in the experiment are superior to other relevant methods. Experiments show that our model is efficient for DPI prediction.
{"title":"Prediction of drug-protein interaction based on dual channel neural networks with attention mechanism.","authors":"Dayu Tan, Haijun Jiang, Haitao Li, Ying Xie, Yansen Su","doi":"10.1093/bfgp/elad037","DOIUrl":"10.1093/bfgp/elad037","url":null,"abstract":"<p><p>The precise identification of drug-protein inter action (DPI) can significantly speed up the drug discovery process. Bioassay methods are time-consuming and expensive to screen for each pair of drug proteins. Machine-learning-based methods cannot accurately predict a large number of DPIs. Compared with traditional computing methods, deep learning methods need less domain knowledge and have strong data learning ability. In this study, we construct a DPI prediction model based on dual channel neural networks with an efficient path attention mechanism, called DCA-DPI. The drug molecular graph and protein sequence are used as the data input of the model, and the residual graph neural network and the residual convolution network are used to learn the feature representation of the drug and protein, respectively, to obtain the feature vector of the drug and the hidden vector of protein. To get a more accurate protein feature vector, the weighted sum of the hidden vector of protein is applied using the neural attention mechanism. In the end, drug and protein vectors are concatenated and input into the full connection layer for classification. In order to evaluate the performance of DCA-DPI, three widely used public data, Human, C.elegans and DUD-E, are used in the experiment. The evaluation metrics values in the experiment are superior to other relevant methods. Experiments show that our model is efficient for DPI prediction.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"286-294"},"PeriodicalIF":4.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10112268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Archana Mathur, Nikhilanand Arya, Kitsuchart Pasupa, Sriparna Saha, Sudeepa Roy Dey, Snehanshu Saha
We present a survey of the current state-of-the-art in breast cancer detection and prognosis. We analyze the evolution of Artificial Intelligence-based approaches from using just uni-modal information to multi-modality for detection and how such paradigm shift facilitates the efficacy of detection, consistent with clinical observations. We conclude that interpretable AI-based predictions and ability to handle class imbalance should be considered priority.
{"title":"Breast cancer prognosis through the use of multi-modal classifiers: current state of the art and the way forward","authors":"Archana Mathur, Nikhilanand Arya, Kitsuchart Pasupa, Sriparna Saha, Sudeepa Roy Dey, Snehanshu Saha","doi":"10.1093/bfgp/elae015","DOIUrl":"https://doi.org/10.1093/bfgp/elae015","url":null,"abstract":"We present a survey of the current state-of-the-art in breast cancer detection and prognosis. We analyze the evolution of Artificial Intelligence-based approaches from using just uni-modal information to multi-modality for detection and how such paradigm shift facilitates the efficacy of detection, consistent with clinical observations. We conclude that interpretable AI-based predictions and ability to handle class imbalance should be considered priority.","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"124 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140828431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Schizosaccharomyces pombe is a commonly utilized model organism for studying various aspects of eukaryotic cell physiology. One reason for its widespread use as an experimental system is the ease of genetic manipulations, leveraging the natural homology-targeted repair mechanism to accurately modify the genome. We conducted a study to assess the feasibility and efficiency of directly introducing exogenous genes into the fission yeast S. pombe using Polymerase Chain Reaction (PCR) with short-homology flanking sequences. Specifically, we amplified the NatMX6 gene (which provides resistance to nourseothricin) using PCR with oligonucleotides that had short flanking regions of 20 bp, 40 bp, 60 bp and 80 bp to the target gene. By using this purified PCR product, we successfully introduced the NatMX6 gene at position 171 385 on chromosome III in S. pombe. We have made a simple modification to the transformation procedure, resulting in a significant increase in transformation efficiency by at least 5-fold. The success rate of gene integration at the target position varied between 20% and 50% depending on the length of the flanking regions. Additionally, we discovered that the addition of dimethyl sulfoxide and boiled carrier DNA increased the number of transformants by ~60- and 3-fold, respectively. Furthermore, we found that the removal of the pku70+ gene improved the transformation efficiency to ~5% and reduced the formation of small background colonies. Overall, our results demonstrate that with this modified method, even very short stretches of homologous regions (as short as 20 bp) can be used to effectively target genes at a high frequency in S. pombe. This finding greatly facilitates the introduction of exogenous genes in this organism.
在研究真核细胞生理学的各方面问题时,鼠李糖核酶是一种常用的模式生物。它被广泛用作实验系统的原因之一是其易于进行基因操作,利用天然的同源性靶向修复机制来精确地修改基因组。我们进行了一项研究,评估利用聚合酶链式反应(PCR)和短同源侧翼序列将外源基因直接导入裂殖酵母 S. pombe 的可行性和效率。具体来说,我们使用与目标基因侧翼区分别为 20 bp、40 bp、60 bp 和 80 bp 的寡核苷酸进行 PCR 扩增 NatMX6 基因(该基因对诺索三嗪具有抗性)。通过使用这种纯化的 PCR 产物,我们成功地将 NatMX6 基因导入了 S. pombe 的 III 号染色体 171 385 位。我们对转化程序进行了简单修改,使转化效率显著提高了至少 5 倍。根据侧翼区域的长度,目标位置的基因整合成功率在 20% 到 50% 之间。此外,我们还发现,加入二甲基亚砜和煮沸的载体 DNA 可使转化子的数量分别增加约 60 倍和 3 倍。此外,我们还发现去除 pku70+ 基因可将转化效率提高到约 5%,并减少小背景菌落的形成。总之,我们的研究结果表明,使用这种改进的方法,即使是很短的同源区段(短至 20 bp)也能有效地高频率靶向 S. pombe 中的基因。这一发现极大地促进了外源基因在该生物体内的引入。
{"title":"Short-homology-mediated PCR-based method for gene introduction in the fission yeast Schizosaccharomyces pombe","authors":"Cai-Xia Zhang, Ying-Chun Hou","doi":"10.1093/bfgp/elae016","DOIUrl":"https://doi.org/10.1093/bfgp/elae016","url":null,"abstract":"Schizosaccharomyces pombe is a commonly utilized model organism for studying various aspects of eukaryotic cell physiology. One reason for its widespread use as an experimental system is the ease of genetic manipulations, leveraging the natural homology-targeted repair mechanism to accurately modify the genome. We conducted a study to assess the feasibility and efficiency of directly introducing exogenous genes into the fission yeast S. pombe using Polymerase Chain Reaction (PCR) with short-homology flanking sequences. Specifically, we amplified the NatMX6 gene (which provides resistance to nourseothricin) using PCR with oligonucleotides that had short flanking regions of 20 bp, 40 bp, 60 bp and 80 bp to the target gene. By using this purified PCR product, we successfully introduced the NatMX6 gene at position 171 385 on chromosome III in S. pombe. We have made a simple modification to the transformation procedure, resulting in a significant increase in transformation efficiency by at least 5-fold. The success rate of gene integration at the target position varied between 20% and 50% depending on the length of the flanking regions. Additionally, we discovered that the addition of dimethyl sulfoxide and boiled carrier DNA increased the number of transformants by ~60- and 3-fold, respectively. Furthermore, we found that the removal of the pku70+ gene improved the transformation efficiency to ~5% and reduced the formation of small background colonies. Overall, our results demonstrate that with this modified method, even very short stretches of homologous regions (as short as 20 bp) can be used to effectively target genes at a high frequency in S. pombe. This finding greatly facilitates the introduction of exogenous genes in this organism.","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"58 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140798721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}