Zhao-Yue Zhang, Yue-Er Fan, Cheng-Bing Huang, Meng-Ze Du
Essential genes are necessary to sustain the life of a species under adequate nutritional conditions. These genes have attracted significant attention for their potential as drug targets, especially in developing broad-spectrum antibacterial drugs. However, studying essential genes remains challenging due to their variability in specific environmental conditions. In this study, the authors aim to develop a powerful prediction model for identifying essential genes in humans. The authors first obtained the essential gene data from human cancer cell lines and characterised gene sequences using 7 feature encoding methods such as Kmer, the Composition of K-spaced Nucleic Acid Pairs, and Z-curve. Subsequently, feature fusion and feature optimisation strategies were employed to select the impactful features. Finally, machine learning algorithms were applied to construct the prediction models and evaluate their performance. The single-feature-based model achieved the highest area under the Receiver Operating Characteristic curve (AUC) of 0.830. After fusing and filtering these features, the classical machine learning models achieved the highest AUC at 0.823 while the deep learning model reached 0.860. Results obtained by the authors show that compared to using individual features, feature fusion and feature optimisation strategies significantly improved model performance. Moreover, the study provided an advantageous method for essential gene identification compared to other methods.
{"title":"Human essential gene identification based on feature fusion and feature screening","authors":"Zhao-Yue Zhang, Yue-Er Fan, Cheng-Bing Huang, Meng-Ze Du","doi":"10.1049/syb2.12105","DOIUrl":"10.1049/syb2.12105","url":null,"abstract":"<p>Essential genes are necessary to sustain the life of a species under adequate nutritional conditions. These genes have attracted significant attention for their potential as drug targets, especially in developing broad-spectrum antibacterial drugs. However, studying essential genes remains challenging due to their variability in specific environmental conditions. In this study, the authors aim to develop a powerful prediction model for identifying essential genes in humans. The authors first obtained the essential gene data from human cancer cell lines and characterised gene sequences using 7 feature encoding methods such as Kmer, the Composition of K-spaced Nucleic Acid Pairs, and Z-curve. Subsequently, feature fusion and feature optimisation strategies were employed to select the impactful features. Finally, machine learning algorithms were applied to construct the prediction models and evaluate their performance. The single-feature-based model achieved the highest area under the Receiver Operating Characteristic curve (AUC) of 0.830. After fusing and filtering these features, the classical machine learning models achieved the highest AUC at 0.823 while the deep learning model reached 0.860. Results obtained by the authors show that compared to using individual features, feature fusion and feature optimisation strategies significantly improved model performance. Moreover, the study provided an advantageous method for essential gene identification compared to other methods.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"18 6","pages":"227-237"},"PeriodicalIF":1.9,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665845/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142693464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sepsis is a severe systemic inflammatory syndrome triggered by infection and is a leading cause of morbidity and mortality in intensive care units (ICUs). Immune dysfunction is a hallmark of sepsis. In this study, the authors investigated cell-cell communication among lymphoid-derived leucocytes using single-cell RNA sequencing (scRNA-seq) to gain a deeper understanding of the underlying mechanisms in late-stage sepsis. The authors’ findings revealed that both the number and strength of cellular interactions were elevated in septic patients compared to healthy individuals, with several pathways showing significant alterations, particularly in conventional dendritic cells (cDCs) and plasmacytoid dendritic cells (pDCs). Notably, pathways such as CD6-ALCAM were more activated in sepsis, potentially due to T cell suppression. This study offers new insights into the mechanisms of immunosuppression and provides potential avenues for clinical intervention in sepsis.
败血症是由感染引发的严重全身炎症综合征,是重症监护病房(ICU)发病率和死亡率的主要原因。免疫功能障碍是败血症的标志。在这项研究中,作者利用单细胞 RNA 测序(scRNA-seq)研究了淋巴源性白细胞之间的细胞-细胞通讯,以深入了解晚期败血症的潜在机制。作者的研究结果表明,与健康人相比,脓毒症患者细胞间相互作用的数量和强度都有所增加,其中有几种通路发生了显著变化,尤其是在传统树突状细胞(cDCs)和浆细胞树突状细胞(pDCs)中。值得注意的是,脓毒症患者的 CD6-ALCAM 等通路更为活化,这可能是由于 T 细胞抑制所致。这项研究为了解免疫抑制的机制提供了新的视角,并为脓毒症的临床干预提供了潜在的途径。
{"title":"Inference and analysis of cell-cell communication of non-myeloid circulating cells in late sepsis based on single-cell RNA-seq","authors":"Yanyan Tao, Miaomiao Li, Cheng Liu","doi":"10.1049/syb2.12109","DOIUrl":"10.1049/syb2.12109","url":null,"abstract":"<p>Sepsis is a severe systemic inflammatory syndrome triggered by infection and is a leading cause of morbidity and mortality in intensive care units (ICUs). Immune dysfunction is a hallmark of sepsis. In this study, the authors investigated cell-cell communication among lymphoid-derived leucocytes using single-cell RNA sequencing (scRNA-seq) to gain a deeper understanding of the underlying mechanisms in late-stage sepsis. The authors’ findings revealed that both the number and strength of cellular interactions were elevated in septic patients compared to healthy individuals, with several pathways showing significant alterations, particularly in conventional dendritic cells (cDCs) and plasmacytoid dendritic cells (pDCs). Notably, pathways such as CD6-ALCAM were more activated in sepsis, potentially due to T cell suppression. This study offers new insights into the mechanisms of immunosuppression and provides potential avenues for clinical intervention in sepsis.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"18 6","pages":"218-226"},"PeriodicalIF":1.9,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665843/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142693570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yang Zhang, Ting Yang, Yu Yang, Dongsheng Xu, Yucheng Hu, Shuo Zhang, Nanchao Luo, Lin Ning, Liping Ren
Small interfering RNA (siRNA) has revolutionised biomedical research and drug development through precise post-transcriptional gene silencing technology. Despite its immense potential, siRNA therapy still faces technical challenges, such as delivery efficiency, targeting specificity, and molecular stability. To address these challenges and facilitate siRNA drug development, we have developed siRNAEfficacyDB, a comprehensive database that integrates experimentally validated siRNA efficacy data. This database contains 3544 siRNA records, covering 42 target genes and 5 cell lines. It provides detailed information on siRNA sequences, target genes, inhibition efficiencies, experimental techniques, cell lines, siRNA concentrations, and incubation times. siRNAEfficacyDB offers a user-friendly web interface that makes it easy to query, browse and analyse data, enabling efficient access to siRNA-related information. In summary, siRNAEfficacyDB provides a useful data foundation for siRNA drug design and optimisation, serving as a valuable resource for advancing computer-aided siRNA design research and nucleic acid drug development. siRNAEfficacyDB is freely available at https://cellknowledge.com.cn/siRNAEfficacy for non-commercial use.
{"title":"siRNAEfficacyDB: An experimentally supported small interfering RNA efficacy database","authors":"Yang Zhang, Ting Yang, Yu Yang, Dongsheng Xu, Yucheng Hu, Shuo Zhang, Nanchao Luo, Lin Ning, Liping Ren","doi":"10.1049/syb2.12102","DOIUrl":"10.1049/syb2.12102","url":null,"abstract":"<p>Small interfering RNA (siRNA) has revolutionised biomedical research and drug development through precise post-transcriptional gene silencing technology. Despite its immense potential, siRNA therapy still faces technical challenges, such as delivery efficiency, targeting specificity, and molecular stability. To address these challenges and facilitate siRNA drug development, we have developed siRNAEfficacyDB, a comprehensive database that integrates experimentally validated siRNA efficacy data. This database contains 3544 siRNA records, covering 42 target genes and 5 cell lines. It provides detailed information on siRNA sequences, target genes, inhibition efficiencies, experimental techniques, cell lines, siRNA concentrations, and incubation times. siRNAEfficacyDB offers a user-friendly web interface that makes it easy to query, browse and analyse data, enabling efficient access to siRNA-related information. In summary, siRNAEfficacyDB provides a useful data foundation for siRNA drug design and optimisation, serving as a valuable resource for advancing computer-aided siRNA design research and nucleic acid drug development. siRNAEfficacyDB is freely available at https://cellknowledge.com.cn/siRNAEfficacy for non-commercial use.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"18 6","pages":"199-207"},"PeriodicalIF":1.9,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665841/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sonia Zouari, Farman Ali, Atef Masmoudi, Sarah Abu Ghazalah, Wajdi Alghamdi, Faris A. Kateb, Nouf Ibrahim
Globular proteins (GPs) play vital roles in a wide range of biological processes, encompassing enzymatic catalysis and immune responses. Enzymes, among these globular proteins, facilitate biochemical reactions, while others, such as haemoglobin, contribute to essential physiological functions such as oxygen transport. Given the importance of these considerations, accurately identifying Globular proteins is essential. To address the need for precise GP identification, this research introduces an innovative approach that employs a hybrid-based deep learning model called Deep-GP. We generated two datasets based on primary sequences and developed a novel feature descriptor called, Consensus Sequence-based Trisection-Position Specific Scoring Matrix (CST-PSSM). The model training phase involved the application of deep learning techniques, including the bidirectional long short-term memory network (BiLSTM), gated recurrent unit (GRU), and convolutional neural network (CNN). The BiLSTM and CNN were hybridised for ensemble learning. The CST-PSSM-based ensemble model achieved the most accurate predictive outcomes, outperforming other competitive predictors across both training and testing datasets. This demonstrates the potential of harnessing deep learning for precise GB prediction as a robust tool to expedite research, streamline drug discovery, and unveil novel therapeutic targets.
{"title":"Deep-GB: A novel deep learning model for globular protein prediction using CNN-BiLSTM architecture and enhanced PSSM with trisection strategy","authors":"Sonia Zouari, Farman Ali, Atef Masmoudi, Sarah Abu Ghazalah, Wajdi Alghamdi, Faris A. Kateb, Nouf Ibrahim","doi":"10.1049/syb2.12108","DOIUrl":"10.1049/syb2.12108","url":null,"abstract":"<p>Globular proteins (GPs) play vital roles in a wide range of biological processes, encompassing enzymatic catalysis and immune responses. Enzymes, among these globular proteins, facilitate biochemical reactions, while others, such as haemoglobin, contribute to essential physiological functions such as oxygen transport. Given the importance of these considerations, accurately identifying Globular proteins is essential. To address the need for precise GP identification, this research introduces an innovative approach that employs a hybrid-based deep learning model called Deep-GP. We generated two datasets based on primary sequences and developed a novel feature descriptor called, Consensus Sequence-based Trisection-Position Specific Scoring Matrix (CST-PSSM). The model training phase involved the application of deep learning techniques, including the bidirectional long short-term memory network (BiLSTM), gated recurrent unit (GRU), and convolutional neural network (CNN). The BiLSTM and CNN were hybridised for ensemble learning. The CST-PSSM-based ensemble model achieved the most accurate predictive outcomes, outperforming other competitive predictors across both training and testing datasets. This demonstrates the potential of harnessing deep learning for precise GB prediction as a robust tool to expedite research, streamline drug discovery, and unveil novel therapeutic targets.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"18 6","pages":"208-217"},"PeriodicalIF":1.9,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665838/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Maha Mesfer Alghamdi, Naael H. Alazwary, Waleed A. Alsowayan, Mohmmed Algamdi, Ahmed F. Alohali, Mustafa A. Yasawy, Abeer M. Alghamdi, Abdullah M. Alassaf, Mohammed R. Alshehri, Hussein A. Aljaziri, Nujoud H. Almoqati, Shatha S. Alghamdi, Norah A. Bin Magbel, Tareq A. AlMazeedi, Nashaat K. Neyazi, Mona M. Alghamdi, Mohammed N. Alazwary