Interdisciplinary Sciences: Computational Life Sciences最新文献_第10页

AlzhiNet: Traversing from 2D-CNN to 3D-CNN, Towards Early Detection and Diagnosis of Alzheimer's Disease. 阿尔茨海默病：从2D-CNN到3D-CNN，迈向阿尔茨海默病的早期检测和诊断。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-22 DOI: 10.1007/s12539-025-00764-w

Romoke Grace Akindele, Samuel Adebayo, Ming Yu, Paul Shekonya Kanda

Alzheimer's disease (AD) is a progressive neurodegenerative disorder with increasing prevalence among the ageing population, necessitating early and accurate diagnosis for effective disease management. In this study, we present a novel hybrid deep learning framework, AlzhiNet, that integrates both 2D convolutional neural networks (2D-CNNs) and 3D convolutional neural networks (3D-CNNs), along with a custom loss function and volumetric data augmentation, to enhance feature extraction and improve classification performance in AD diagnosis. According to extensive experiments, AlzhiNet outperforms standalone 2D and 3D models, highlighting the importance of combining these complementary representations of data. The depth and quality of 3D volumes derived from the augmented 2D slices also significantly influence the model's performance. The results indicate that carefully selecting weighting factors in hybrid predictions is imperative for achieving optimal results. Our framework has been validated on the magnetic resonance imaging (MRI) from Kaggle and MIRIAD datasets, obtaining accuracies of 98.9% and 99.99%, respectively, with an AUC of 100%. Furthermore, AlzhiNet was studied under a variety of perturbation scenarios on the Alzheimer's Kaggle dataset, including Gaussian noise, brightness, contrast, salt and pepper noise, color jitter, and occlusion. The results obtained show that AlzhiNet is more robust to perturbations than ResNet-18, making it an excellent choice for real-world applications. This approach represents a promising advancement in the early diagnosis and treatment planning for AD.

阿尔茨海默病（AD）是一种进行性神经退行性疾病，在老龄化人群中患病率越来越高，需要早期准确诊断以有效治疗疾病。在这项研究中，我们提出了一种新的混合深度学习框架AlzhiNet，它集成了2D卷积神经网络（2D- cnn）和3D卷积神经网络（3D- cnn），以及自定义损失函数和体积数据增强，以增强特征提取并提高AD诊断中的分类性能。根据大量的实验，AlzhiNet优于独立的2D和3D模型，强调了将这些互补的数据表示结合起来的重要性。从增强的二维切片中获得的三维体的深度和质量也显著影响模型的性能。结果表明，在混合预测中，仔细选择权重因子是获得最佳结果的必要条件。我们的框架已经在来自Kaggle和MIRIAD数据集的磁共振成像（MRI）上进行了验证，分别获得了98.9%和99.99%的准确率，AUC为100%。此外，在老年痴呆症Kaggle数据集上研究了各种扰动场景下的AlzhiNet，包括高斯噪声、亮度、对比度、盐和胡椒噪声、颜色抖动和遮挡。结果表明，AlzhiNet比ResNet-18对扰动的鲁棒性更强，使其成为现实应用的绝佳选择。这种方法在阿尔茨海默病的早期诊断和治疗计划方面取得了有希望的进展。

{"title":"AlzhiNet: Traversing from 2D-CNN to 3D-CNN, Towards Early Detection and Diagnosis of Alzheimer's Disease.","authors":"Romoke Grace Akindele, Samuel Adebayo, Ming Yu, Paul Shekonya Kanda","doi":"10.1007/s12539-025-00764-w","DOIUrl":"https://doi.org/10.1007/s12539-025-00764-w","url":null,"abstract":"Alzheimer's disease (AD) is a progressive neurodegenerative disorder with increasing prevalence among the ageing population, necessitating early and accurate diagnosis for effective disease management. In this study, we present a novel hybrid deep learning framework, AlzhiNet, that integrates both 2D convolutional neural networks (2D-CNNs) and 3D convolutional neural networks (3D-CNNs), along with a custom loss function and volumetric data augmentation, to enhance feature extraction and improve classification performance in AD diagnosis. According to extensive experiments, AlzhiNet outperforms standalone 2D and 3D models, highlighting the importance of combining these complementary representations of data. The depth and quality of 3D volumes derived from the augmented 2D slices also significantly influence the model's performance. The results indicate that carefully selecting weighting factors in hybrid predictions is imperative for achieving optimal results. Our framework has been validated on the magnetic resonance imaging (MRI) from Kaggle and MIRIAD datasets, obtaining accuracies of 98.9% and 99.99%, respectively, with an AUC of 100%. Furthermore, AlzhiNet was studied under a variety of perturbation scenarios on the Alzheimer's Kaggle dataset, including Gaussian noise, brightness, contrast, salt and pepper noise, color jitter, and occlusion. The results obtained show that AlzhiNet is more robust to perturbations than ResNet-18, making it an excellent choice for real-world applications. This approach represents a promising advancement in the early diagnosis and treatment planning for AD.","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144952961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

VHGAE: Drug-Target Interaction Prediction Model Based on Heterogeneous Graph Variational Autoencoder. 基于异构图变分自编码器的药物-靶标相互作用预测模型。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-21 DOI: 10.1007/s12539-025-00758-8

Chen Zhang, Jiaqi Sun, Linlin Xing, Longbo Zhang, Hongzhen Cai, Kai Che

Identifying drug-target interaction (DTI) is crucial for drug discovery and repositioning. Biological identification of DTI is costly and time-consuming, so heterogeneous network methods for DTI prediction, which can speed up drug development, have drawn much attention. However, the scarcity of known drug-target pairs leads to an overwhelmingly sparse network structure. This sparsity makes it difficult for graph convolutional networks to fully capture the potential relationships between nodes during information transmission and feature extraction, thus challenging the prediction performance of DTI. Therefore, we propose a method, called VHGAE, based on a heterogeneous graph variational autoencoder to predict drug-target interactions. The VHGAE model integrates a wealth of prior knowledge from various sources related to drugs and targets to construct a heterogeneous network. To address the sparsity issue within the constructed heterogeneous network, first, the weighted k-nearest neighbor algorithm is used to densify the DTI network to increase the connectivity of the sparse network. Second, the weighted graph convolutional network within a variational graph autoencoder framework is employed to bolster the edge weights with limited connections. Third, a variational expectation maximization algorithm is introduced between the encoder and the decoder to recover potential relationships in the sparse network. The experimental results on two datasets show that the performance of VHGAE outperforms nine state-of-the-art DTI prediction methods, which suggests that the design of VHGAE in multi-source data fusion and sparse network processing plays an important role in improving prediction performance.

确定药物-靶标相互作用（DTI）对药物发现和重新定位至关重要。DTI的生物学鉴定成本高、耗时长，因此基于异构网络的DTI预测方法因能加快药物开发速度而备受关注。然而，已知药物靶标对的稀缺性导致了绝大多数的稀疏网络结构。这种稀疏性使得图卷积网络在信息传输和特征提取过程中难以充分捕捉节点之间的潜在关系，从而对DTI的预测性能提出了挑战。因此，我们提出了一种基于异构图变分自编码器的VHGAE方法来预测药物-靶标相互作用。VHGAE模型集成了来自各种来源的与药物和靶标相关的丰富先验知识，构建了一个异构网络。为了解决构建的异构网络内部的稀疏性问题，首先采用加权k近邻算法对DTI网络进行密集化，增加稀疏网络的连通性；其次，利用变分图自编码器框架中的加权图卷积网络增强连接数有限的边权。第三，在编码器和解码器之间引入变分期望最大化算法来恢复稀疏网络中的潜在关系。在两个数据集上的实验结果表明，VHGAE的预测性能优于9种最先进的DTI预测方法，这表明VHGAE在多源数据融合和稀疏网络处理中的设计对提高预测性能具有重要作用。

{"title":"VHGAE: Drug-Target Interaction Prediction Model Based on Heterogeneous Graph Variational Autoencoder.","authors":"Chen Zhang, Jiaqi Sun, Linlin Xing, Longbo Zhang, Hongzhen Cai, Kai Che","doi":"10.1007/s12539-025-00758-8","DOIUrl":"https://doi.org/10.1007/s12539-025-00758-8","url":null,"abstract":"Identifying drug-target interaction (DTI) is crucial for drug discovery and repositioning. Biological identification of DTI is costly and time-consuming, so heterogeneous network methods for DTI prediction, which can speed up drug development, have drawn much attention. However, the scarcity of known drug-target pairs leads to an overwhelmingly sparse network structure. This sparsity makes it difficult for graph convolutional networks to fully capture the potential relationships between nodes during information transmission and feature extraction, thus challenging the prediction performance of DTI. Therefore, we propose a method, called VHGAE, based on a heterogeneous graph variational autoencoder to predict drug-target interactions. The VHGAE model integrates a wealth of prior knowledge from various sources related to drugs and targets to construct a heterogeneous network. To address the sparsity issue within the constructed heterogeneous network, first, the weighted k-nearest neighbor algorithm is used to densify the DTI network to increase the connectivity of the sparse network. Second, the weighted graph convolutional network within a variational graph autoencoder framework is employed to bolster the edge weights with limited connections. Third, a variational expectation maximization algorithm is introduced between the encoder and the decoder to recover potential relationships in the sparse network. The experimental results on two datasets show that the performance of VHGAE outperforms nine state-of-the-art DTI prediction methods, which suggests that the design of VHGAE in multi-source data fusion and sparse network processing plays an important role in improving prediction performance.","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144953073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Novel Dual-Level Momentum Distillation Method with Extreme Thresholding for Imputing Single-Cell RNA Sequencing Data. 一种新的双能级动量精馏极值法用于单细胞RNA测序数据的输入。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-21 DOI: 10.1007/s12539-025-00754-y

Binhua Tang, Xinyu Gao, Guowei Cheng

Single-cell RNA sequencing (scRNA-seq) plays a vital role in studying cellular heterogeneity and gene expression patterns. However, the sequencing dropout phenomena still pose a significant challenge. Genes with low expression levels may be misidentified as exhibiting zero expression owing to limitations in sequencing depth and technical noise. This results in increased data sparsity and compromises the accuracy of subsequent analyses. Thus, a novel method, MoDET (Dual-level Momentum Distillation Method with Extreme Thresholding), has been proposed. MoDET employs a label-guided model and an extreme threshold mechanism to enhance cellular representation learning. Experiments demonstrate that MoDET significantly improves clustering performance of the gene expression matrix, with enhancements ranging from 3% to 20% across seven real-world datasets. Cross-batch training and evaluation experiments demonstrated that MoDET effectively mitigates batch effects, achieving an average performance improvement of 5%-7%. Concurrently, it exhibits superior accuracy in identifying rare cell types, outperforming other methods by 3%-20%. Ablation studies confirm that the dual-level momentum distillation boosts performance by 4%-20%, and the extreme threshold mechanism adds an additional 2%-15% improvement. Interpretability analysis shows that the extreme threshold makes the model's decision-making process more transparent. Moreover, MoDET surpasses methods incorporating advanced modules, thereby demonstrating its efficacy in addressing the sparsity challenges inherent in scRNA-seq datasets. The compiled source codes are accessible at https://github.com/gladex/MoDET.

单细胞RNA测序（scRNA-seq）在研究细胞异质性和基因表达模式方面发挥着重要作用。然而，排序退序现象仍然构成重大挑战。由于测序深度和技术噪声的限制，低表达水平的基因可能被误认为是零表达。这将导致数据稀疏性增加，并损害后续分析的准确性。为此，提出了一种新的方法——MoDET （Dual-level Momentum Distillation method with Extreme thresholds）。MoDET采用标签引导模型和极限阈值机制来增强细胞表示学习。实验表明，MoDET显著提高了基因表达矩阵的聚类性能，在7个真实数据集上的增强幅度在3%到20%之间。跨批训练和评估实验表明，模型有效地缓解了批效应，平均性能提高了5%-7%。同时，它在识别稀有细胞类型方面表现出优越的准确性，比其他方法高出3%-20%。烧蚀研究证实，双级动量蒸馏可提高4%-20%的性能，极端阈值机制可额外提高2%-15%的性能。可解释性分析表明，极值阈值使模型的决策过程更加透明。此外，MoDET超越了包含高级模块的方法，从而证明了其在解决scRNA-seq数据集固有的稀疏性挑战方面的有效性。编译后的源代码可在https://github.com/gladex/MoDET上访问。

{"title":"A Novel Dual-Level Momentum Distillation Method with Extreme Thresholding for Imputing Single-Cell RNA Sequencing Data.","authors":"Binhua Tang, Xinyu Gao, Guowei Cheng","doi":"10.1007/s12539-025-00754-y","DOIUrl":"https://doi.org/10.1007/s12539-025-00754-y","url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) plays a vital role in studying cellular heterogeneity and gene expression patterns. However, the sequencing dropout phenomena still pose a significant challenge. Genes with low expression levels may be misidentified as exhibiting zero expression owing to limitations in sequencing depth and technical noise. This results in increased data sparsity and compromises the accuracy of subsequent analyses. Thus, a novel method, MoDET (Dual-level Momentum Distillation Method with Extreme Thresholding), has been proposed. MoDET employs a label-guided model and an extreme threshold mechanism to enhance cellular representation learning. Experiments demonstrate that MoDET significantly improves clustering performance of the gene expression matrix, with enhancements ranging from 3% to 20% across seven real-world datasets. Cross-batch training and evaluation experiments demonstrated that MoDET effectively mitigates batch effects, achieving an average performance improvement of 5%-7%. Concurrently, it exhibits superior accuracy in identifying rare cell types, outperforming other methods by 3%-20%. Ablation studies confirm that the dual-level momentum distillation boosts performance by 4%-20%, and the extreme threshold mechanism adds an additional 2%-15% improvement. Interpretability analysis shows that the extreme threshold makes the model's decision-making process more transparent. Moreover, MoDET surpasses methods incorporating advanced modules, thereby demonstrating its efficacy in addressing the sparsity challenges inherent in scRNA-seq datasets. The compiled source codes are accessible at https://github.com/gladex/MoDET.","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144952932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

AIP-TranLAC: A Transformer-Based Method Integrating LSTM and Attention Mechanism for Predicting Anti-inflammatory Peptides. AIP-TranLAC：一种整合LSTM和注意机制的基于转换器的抗炎肽预测方法。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-19 DOI: 10.1007/s12539-025-00761-z

Shengli Zhang, Jingyi Ren

Anti-inflammatory peptides (AIPs) have emerged as potential therapeutic candidates for managing various inflammatory disorders, but their computational identification remains challenging. We propose AIP-TranLAC, a novel deep learning framework that integrates Transformer-based embedding, bidirectional long short-term memory (Bi-LSTM), multi-head attention, and convolutional neural network (CNN) to classify AIPs accurately. Our model achieves superior performance on benchmark and independent test datasets, demonstrating significant improvements over existing methods. The hybrid architecture effectively captures local and global sequence patterns, while interpretability analyses reveal critical amino acid residues. With robust performance on imbalanced data and open-source availability, AIP-TranLAC provides a powerful tool for accelerating therapeutic peptide discovery and inflammation research. For reproducibility purposes, we have released the codebase, trained models, and all supporting data on GitHub ( https://github.com/Renjingyi123/AIP-TranLAC ).

抗炎肽（AIPs）已成为治疗各种炎症性疾病的潜在候选药物，但它们的计算识别仍然具有挑战性。我们提出了一种新的深度学习框架AIP-TranLAC，该框架集成了基于transformer的嵌入，双向长短期记忆（Bi-LSTM），多头注意和卷积神经网络（CNN）来准确分类aip。我们的模型在基准测试和独立测试数据集上取得了卓越的性能，比现有方法有了显著的改进。混合结构有效地捕获局部和全局序列模式，而可解释性分析揭示了关键的氨基酸残基。AIP-TranLAC具有强大的非平衡数据性能和开源可用性，为加速治疗性肽发现和炎症研究提供了强大的工具。出于可重复性的考虑，我们已经在GitHub （https://github.com/Renjingyi123/AIP-TranLAC）上发布了代码库、训练模型和所有支持数据。

引用次数: 0

Relevance of 3D Rotationally Equivariant Neural Networks for Predicting Protein-Ligand Binding Affinities. 三维旋转等变神经网络预测蛋白质-配体结合亲和力的相关性。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-14 DOI: 10.1007/s12539-025-00745-z

Gaili Li, Yongna Yuan, Ruisheng Zhang

Proteins are fundamental to biological processes, mediating critical functions through precise molecular interactions. The rotational dynamics between ligand atoms and protein binding sites can significantly influence interaction efficacy by modifying spatial relationships. In our research, we present the PLAe (three-dimensional (3D) rotationally equivariant neural networks for predicting protein-ligand binding affinities) methodology. This novel model synergizes radial basis functions with e3nn networks to encapsulate the radial and angular dimensions of molecular features. Radial basis functions effectively measure interatomic distances, while e3nn-an advanced neural network utilizing spherical harmonics-maintains invariance under rotational and translational transformations. The Clebsch-Gordan coefficients are employed to integrate angular and atomic properties seamlessly. By merging radial basis and spherical harmonic elements with Clebsch-Gordan representations, our approach adeptly captures molecular rotational symmetries and interatomic interactions. The inclusion of an attention mechanism further refines the affinity predictions, ensuring a high level of precision. This integrative and sophisticated model sets a new benchmark to accurately predict protein-ligand binding affinities, leveraging intricate molecular details to enhance predictive performance.

蛋白质是生物过程的基础，通过精确的分子相互作用介导关键功能。配体原子与蛋白质结合位点之间的旋转动力学可以通过改变空间关系来显著影响相互作用的效果。在我们的研究中，我们提出了PLAe（用于预测蛋白质配体结合亲和力的三维（3D）旋转等变神经网络）方法。该模型将径向基函数与e3nn网络相结合，封装了分子特征的径向和角尺寸。径向基函数有效地测量原子间距离，而e3nn-一种利用球面谐波的先进神经网络-在旋转和平移变换下保持不变性。Clebsch-Gordan系数被用来无缝地整合角和原子性质。通过将径向基和球谐元与Clebsch-Gordan表示合并，我们的方法熟练地捕获了分子旋转对称性和原子间相互作用。注意机制的加入进一步完善了亲和预测，确保了高水平的精度。这种综合和复杂的模型为准确预测蛋白质-配体结合亲和力设定了新的基准，利用复杂的分子细节来提高预测性能。

{"title":"Relevance of 3D Rotationally Equivariant Neural Networks for Predicting Protein-Ligand Binding Affinities.","authors":"Gaili Li, Yongna Yuan, Ruisheng Zhang","doi":"10.1007/s12539-025-00745-z","DOIUrl":"https://doi.org/10.1007/s12539-025-00745-z","url":null,"abstract":"Proteins are fundamental to biological processes, mediating critical functions through precise molecular interactions. The rotational dynamics between ligand atoms and protein binding sites can significantly influence interaction efficacy by modifying spatial relationships. In our research, we present the PLAe (three-dimensional (3D) rotationally equivariant neural networks for predicting protein-ligand binding affinities) methodology. This novel model synergizes radial basis functions with e3nn networks to encapsulate the radial and angular dimensions of molecular features. Radial basis functions effectively measure interatomic distances, while e3nn-an advanced neural network utilizing spherical harmonics-maintains invariance under rotational and translational transformations. The Clebsch-Gordan coefficients are employed to integrate angular and atomic properties seamlessly. By merging radial basis and spherical harmonic elements with Clebsch-Gordan representations, our approach adeptly captures molecular rotational symmetries and interatomic interactions. The inclusion of an attention mechanism further refines the affinity predictions, ensuring a high level of precision. This integrative and sophisticated model sets a new benchmark to accurately predict protein-ligand binding affinities, leveraging intricate molecular details to enhance predictive performance.","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144855175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ABEEM/MM Magnesium Force Field for Proteins and Aqueous Solutions. 蛋白质和水溶液的ABEEM/MM镁力场。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-14 DOI: 10.1007/s12539-025-00746-y

Jing Zhang, Linan Lu, Runqiang Yu, Linlin Liu, Lei Wang, Cui Liu, Lidong Gong, Zhongzhi Yang

Magnesium is an essential element involved in diverse life activities. The strong polarization and significant charge transfer effects pose challenges to the traditional fixed charge force fields. Here we establish the ABEEM/MM magnesium force field for proteins and aqueous solutions. The interaction potentials of magnesium with water and proteins are treated as the ABEEM/MM bonded model (ABEEM-BM) in the Morse potential function form. Based on quantum mechanical (QM) results, the related parameters are optimized and determined. The charge distributions of model molecules from ABEEM-BM and the ABEEM/MM nonbonded model (ABEEM-NBM) agree well with the QM results. The potential energy surfaces (PESs) for bond stretching and angle bending between magnesium and ligands by ABEEM-BM have a good consistency with those from QM. Molecular dynamics (MD) simulations of 40 aqueous magnesium protein segments are carried out using ABEEM-BM, ABEEM-NBM, OPLS-AA, AMBER99, and CHARMM22 force fields. The root mean square deviations (RMSDs) for bond length and angle by ABEEM-BM are 0.088 Å and 5.99°, respectively, which are smaller than those from the others. MD simulations of aqueous magnesium solutions are carried out using ABEEM-BM and ABEEM-NBM. The radial and angular distribution functions from ABEEM-BM reproduce the best structural properties, and the rate constant is 4.7 × 10⁵ s^- 1. Moreover, the dynamic changing picture of charge transfer and the coordination number (CN) during water exchange processes is presented by ABEEM model. The overall performance of ABEEM models is evidently better than those from fixed charge force fields.

镁是多种生命活动中必不可少的元素。强极化和显著的电荷转移效应对传统的固定电荷力场提出了挑战。本文建立了蛋白质和水溶液的ABEEM/MM镁力场。将镁与水和蛋白质的相互作用势处理为具有摩尔斯势函数形式的ABEEM/MM键合模型（ABEEM- bm）。基于量子力学（QM）结果，优化并确定了相关参数。ABEEM- bm和ABEEM/MM非键模型（ABEEM- nbm）模型分子的电荷分布与QM结果吻合较好。abem - bm得到的镁与配体之间键拉伸和角度弯曲的势能面与QM得到的结果具有较好的一致性。采用ABEEM-BM、ABEEM-NBM、OPLS-AA、AMBER99和CHARMM22力场对40个水溶液镁蛋白片段进行了分子动力学（MD）模拟。ABEEM-BM得到的键长和键角的均方根偏差（rmsd）分别为0.088 Å和5.99°，均小于其他方法。采用ABEEM-BM和ABEEM-NBM对镁水溶液进行了MD模拟。abem - bm的径向和角度分布函数再现了最佳的结构性能，速率常数为4.7 × 105 s- 1。利用ABEEM模型，给出了水交换过程中电荷转移和配位数（CN）的动态变化图。ABEEM模型的综合性能明显优于固定电荷力场模型。

{"title":"ABEEM/MM Magnesium Force Field for Proteins and Aqueous Solutions.","authors":"Jing Zhang, Linan Lu, Runqiang Yu, Linlin Liu, Lei Wang, Cui Liu, Lidong Gong, Zhongzhi Yang","doi":"10.1007/s12539-025-00746-y","DOIUrl":"https://doi.org/10.1007/s12539-025-00746-y","url":null,"abstract":"Magnesium is an essential element involved in diverse life activities. The strong polarization and significant charge transfer effects pose challenges to the traditional fixed charge force fields. Here we establish the ABEEM/MM magnesium force field for proteins and aqueous solutions. The interaction potentials of magnesium with water and proteins are treated as the ABEEM/MM bonded model (ABEEM-BM) in the Morse potential function form. Based on quantum mechanical (QM) results, the related parameters are optimized and determined. The charge distributions of model molecules from ABEEM-BM and the ABEEM/MM nonbonded model (ABEEM-NBM) agree well with the QM results. The potential energy surfaces (PESs) for bond stretching and angle bending between magnesium and ligands by ABEEM-BM have a good consistency with those from QM. Molecular dynamics (MD) simulations of 40 aqueous magnesium protein segments are carried out using ABEEM-BM, ABEEM-NBM, OPLS-AA, AMBER99, and CHARMM22 force fields. The root mean square deviations (RMSDs) for bond length and angle by ABEEM-BM are 0.088 Å and 5.99°, respectively, which are smaller than those from the others. MD simulations of aqueous magnesium solutions are carried out using ABEEM-BM and ABEEM-NBM. The radial and angular distribution functions from ABEEM-BM reproduce the best structural properties, and the rate constant is 4.7 × 105 s- 1. Moreover, the dynamic changing picture of charge transfer and the coordination number (CN) during water exchange processes is presented by ABEEM model. The overall performance of ABEEM models is evidently better than those from fixed charge force fields.","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144855173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Adaptive Multi-Stage and Adjacent-Level Feature Integration Network for Brain Tumor Image Segmentation. 一种用于脑肿瘤图像分割的自适应多阶段邻接层特征集成网络。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-14 DOI: 10.1007/s12539-025-00748-w

Jiwen Zhou, Yulun Wu, Yue Xu, Wanyu Liu

The segmentation of brain tumor magnetic resonance imaging (MRI) plays a crucial role in assisting diagnosis, treatment planning, and disease progression evaluation. Convolutional neural networks (CNNs) and transformer-based methods have achieved significant progress due to their local and global feature extraction capabilities. However, similar to other medical image segmentation tasks, challenges remain in addressing issues such as blurred boundaries, small lesion volumes, and interwoven regions. General CNN and transformer approaches struggle to effectively resolve these issues. Therefore, a new multi-stage and adjacent-level feature integration network (MAI-Net) is introduced to overcome these challenges, thereby improving the overall segmentation accuracy. MAI-Net consists of dual-branch, multi-level structures and three innovative modules. The stage-level multi-scale feature extraction (SMFE) module focuses on capturing feature details from fine to coarse scales, improving detection of blurred edges and small lesions. The adjacent-level feature fusion (AFF) module facilitates information exchange across different levels, enhancing segmentation accuracy in complex regions as well as small volume lesions. Finally, the multi-stage feature fusion (MFF) module further integrates features from various levels to improve segmentation performance in complex regions. Extensive experiments on BraTS2020 and BraTS2021 datasets demonstrate that MAI-Net significantly outperforms existing methods in Dice and HD95 metrics. Furthermore, generalization experiments on a public ischemic stroke dataset confirm its robustness across different segmentation tasks. These results highlight the significant advantages of MAI-Net in addressing domain-specific challenges while maintaining strong generalization capabilities.

脑肿瘤磁共振成像（MRI）的分割在辅助诊断、治疗计划和疾病进展评估中起着至关重要的作用。卷积神经网络（cnn）和基于变压器的方法由于其局部和全局特征提取能力而取得了重大进展。然而，与其他医学图像分割任务类似，在解决诸如边界模糊、病灶体积小和交织区域等问题方面仍然存在挑战。一般的CNN和变压器方法很难有效地解决这些问题。为此，提出了一种新的多阶段邻接层特征集成网络（MAI-Net）来克服这些挑战，从而提高整体分割精度。MAI-Net由双分支、多层次结构和三个创新模块组成。阶段级多尺度特征提取（SMFE）模块侧重于从细到粗的尺度捕获特征细节，提高对模糊边缘和小病灶的检测。邻接层特征融合（AFF）模块促进了不同层次的信息交换，提高了复杂区域和小体积病灶的分割精度。最后，多阶段特征融合（MFF）模块进一步整合各个层次的特征，提高复杂区域的分割性能。在BraTS2020和BraTS2021数据集上进行的大量实验表明，MAI-Net在Dice和HD95指标方面明显优于现有方法。此外，在公共缺血性脑卒中数据集上进行了泛化实验，验证了该方法在不同分割任务中的鲁棒性。这些结果突出了MAI-Net在解决特定领域挑战的同时保持强大的泛化能力方面的显著优势。

{"title":"An Adaptive Multi-Stage and Adjacent-Level Feature Integration Network for Brain Tumor Image Segmentation.","authors":"Jiwen Zhou, Yulun Wu, Yue Xu, Wanyu Liu","doi":"10.1007/s12539-025-00748-w","DOIUrl":"https://doi.org/10.1007/s12539-025-00748-w","url":null,"abstract":"The segmentation of brain tumor magnetic resonance imaging (MRI) plays a crucial role in assisting diagnosis, treatment planning, and disease progression evaluation. Convolutional neural networks (CNNs) and transformer-based methods have achieved significant progress due to their local and global feature extraction capabilities. However, similar to other medical image segmentation tasks, challenges remain in addressing issues such as blurred boundaries, small lesion volumes, and interwoven regions. General CNN and transformer approaches struggle to effectively resolve these issues. Therefore, a new multi-stage and adjacent-level feature integration network (MAI-Net) is introduced to overcome these challenges, thereby improving the overall segmentation accuracy. MAI-Net consists of dual-branch, multi-level structures and three innovative modules. The stage-level multi-scale feature extraction (SMFE) module focuses on capturing feature details from fine to coarse scales, improving detection of blurred edges and small lesions. The adjacent-level feature fusion (AFF) module facilitates information exchange across different levels, enhancing segmentation accuracy in complex regions as well as small volume lesions. Finally, the multi-stage feature fusion (MFF) module further integrates features from various levels to improve segmentation performance in complex regions. Extensive experiments on BraTS2020 and BraTS2021 datasets demonstrate that MAI-Net significantly outperforms existing methods in Dice and HD95 metrics. Furthermore, generalization experiments on a public ischemic stroke dataset confirm its robustness across different segmentation tasks. These results highlight the significant advantages of MAI-Net in addressing domain-specific challenges while maintaining strong generalization capabilities.","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144855174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Diffusion Model-Based Multi-Channel EEG Representation and Forecasting for Early Epileptic Seizure Warning. 基于扩散模型的多通道脑电图表征及早期癫痫发作预警预测。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-11 DOI: 10.1007/s12539-025-00750-2

Zekun Jiang, Wei Dai, Qu Wei, Ziyuan Qin, Rui Wei, Mianyang Li, Xiaolong Chen, Ying Huo, Jingyun Liu, Kang Li, Le Zhang

引用次数: 0

BEST: Basic Embedding Search Tool Enhancing Discovery of Novel Enzyme. BEST：增强新酶发现的基本嵌入搜索工具。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-11 DOI: 10.1007/s12539-025-00753-z

Yuxuan Wu, Xiao Yi, Yang Tan, Huiqun Yu, Guisheng Fan, Gaowei Zheng

The identification of protein homologs in large databases is critical for biological advancements. Traditional methods, such as protein sequence alignment, often miss remote homologs. To address this limitation, we present the Basic Embedding Search Tool (BEST), a fast and sensitive approach that employs protein language models to create sequence embeddings enriched with evolutionary and structural information. Besides, we introduce a segmented distillation pruning technique to accelerate sequence encoding and develop a multi-layer acceleration structure to achieve a 4290.86-fold speedup in swift access and retrieval of dense vectors. Extensive experiments on real datasets demonstrate that BEST increases sensitivity by over 20% compared to prior methods while maintaining precision and recall. It operates 23.41 times faster than traditional tools like PSI-BLAST and 3.92 times faster than Foldseek, while also detecting homologous sequences that conventional methods miss. BEST and its open-access web server ( http://pm2s.cpolar.top/best1/ ) are poised to significantly aid enzyme mining and advance biological research. The code is publicly available at https://github.com/SkyTai-W/ProteinMiningEvaluator .

在大型数据库中鉴定蛋白质同源物对生物学进步至关重要。传统的方法，如蛋白质序列比对，往往会遗漏远程同源物。为了解决这一限制，我们提出了基本嵌入搜索工具（BEST），这是一种快速敏感的方法，利用蛋白质语言模型来创建富含进化和结构信息的序列嵌入。此外，我们引入了分段蒸馏剪枝技术来加速序列编码，并开发了多层加速结构，使密集向量的快速访问和检索速度提高了4290.86倍。在真实数据集上的大量实验表明，BEST在保持精度和召回率的同时，比先前的方法提高了20%以上的灵敏度。它的运行速度比PSI-BLAST等传统工具快23.41倍，比Foldseek快3.92倍，同时还能检测到传统方法无法检测到的同源序列。BEST及其开放访问网络服务器（http://pm2s.cpolar.top/best1/）将极大地帮助酶挖掘和推进生物学研究。该代码可在https://github.com/SkyTai-W/ProteinMiningEvaluator上公开获得。

{"title":"BEST: Basic Embedding Search Tool Enhancing Discovery of Novel Enzyme.","authors":"Yuxuan Wu, Xiao Yi, Yang Tan, Huiqun Yu, Guisheng Fan, Gaowei Zheng","doi":"10.1007/s12539-025-00753-z","DOIUrl":"https://doi.org/10.1007/s12539-025-00753-z","url":null,"abstract":"The identification of protein homologs in large databases is critical for biological advancements. Traditional methods, such as protein sequence alignment, often miss remote homologs. To address this limitation, we present the Basic Embedding Search Tool (BEST), a fast and sensitive approach that employs protein language models to create sequence embeddings enriched with evolutionary and structural information. Besides, we introduce a segmented distillation pruning technique to accelerate sequence encoding and develop a multi-layer acceleration structure to achieve a 4290.86-fold speedup in swift access and retrieval of dense vectors. Extensive experiments on real datasets demonstrate that BEST increases sensitivity by over 20% compared to prior methods while maintaining precision and recall. It operates 23.41 times faster than traditional tools like PSI-BLAST and 3.92 times faster than Foldseek, while also detecting homologous sequences that conventional methods miss. BEST and its open-access web server ( http://pm2s.cpolar.top/best1/ ) are poised to significantly aid enzyme mining and advance biological research. The code is publicly available at https://github.com/SkyTai-W/ProteinMiningEvaluator .","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144821313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SANNO: A Graph-Transformer Enhanced Optimal Transport Tool for Spatial Transcriptomic Annotation. SANNO：一个图形转换器增强的最佳传输工具，用于空间转录组注释。

IF 3.9 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Interdisciplinary Sciences: Computational Life Sciences

Pub Date : 2025-08-11 DOI: 10.1007/s12539-025-00752-0

Yuansong Zeng, Yuanze Chen, Ningyuan Shangguan, Wenbing Li, Xiaoming Cai, Hongyu Zhang, Zheng Wang, Huiying Zhao

The latest progress in spatial transcriptomics has empowered scientists to investigate spatial heterogeneity with single-cell precision. A pivotal yet demanding aspect of spatial transcriptomics data analysis is cell type annotation. However, current methods exhibit limited performance as they are primarily designed for scRNA-seq data. Especially, these approaches often neglect spatial coordinate information and encounter challenges in identifying novel cell types. Here, we introduce SANNO, a novel approach that employs Optimal Transport (OT) to concurrently identify both known and novel cell types in spatially resolved single-cell data. Specifically, SANNO leverages a graph-Transformer module to model spatial coordinates and gene expression. This produces unified representations for both reference and query data. Building on this, SANNO employs a dual-strategy classifier. The first is an Unbalanced Optimal Transport (UOT) module that aligns query data with reference prototypes. The second is a self-supervised OT-based module that enhances global cluster separation and local cellular consistency, effectively eliminating batch effects. To further improve prediction accuracy, SANNO integrates an entropy-based re-weighted loss function. This significantly boosts the confidence of query cell predictions. Comprehensive experiments reveal that SANNO surpasses state-of-the-art techniques across both intra- and cross-spatial datasets, particularly in the identification of novel cell types. Additionally, SANNO demonstrates commendable performance in annotating cells within single-cell data, underscoring its potential as a versatile tool for cell annotation across single-cell and spatial transcriptomics datasets.

空间转录组学的最新进展使科学家能够以单细胞精度研究空间异质性。空间转录组学数据分析的一个关键但要求很高的方面是细胞类型注释。然而，目前的方法表现出有限的性能，因为它们主要是为scRNA-seq数据设计的。特别是，这些方法往往忽略了空间坐标信息，并且在识别新的细胞类型方面遇到了挑战。在这里，我们介绍SANNO，这是一种利用最佳传输（OT）在空间分辨的单细胞数据中同时识别已知和新的细胞类型的新方法。具体来说，SANNO利用图形转换器模块来模拟空间坐标和基因表达。这为引用和查询数据产生了统一的表示。在此基础上，SANNO采用了双策略分类器。第一个是不平衡最优传输（UOT）模块，它将查询数据与参考原型对齐。二是基于自监督的ot模块，增强了全局聚类分离和局部元一致性，有效消除了批处理效应。为了进一步提高预测精度，SANNO集成了一个基于熵的重加权损失函数。这大大提高了查询单元预测的可信度。综合实验表明，SANNO在内部和跨空间数据集上都超越了最先进的技术，特别是在鉴定新细胞类型方面。此外，SANNO在单细胞数据中注释细胞方面表现出值得称赞的性能，强调了其作为跨单细胞和空间转录组学数据集注释细胞的多功能工具的潜力。

{"title":"SANNO: A Graph-Transformer Enhanced Optimal Transport Tool for Spatial Transcriptomic Annotation.","authors":"Yuansong Zeng, Yuanze Chen, Ningyuan Shangguan, Wenbing Li, Xiaoming Cai, Hongyu Zhang, Zheng Wang, Huiying Zhao","doi":"10.1007/s12539-025-00752-0","DOIUrl":"https://doi.org/10.1007/s12539-025-00752-0","url":null,"abstract":"The latest progress in spatial transcriptomics has empowered scientists to investigate spatial heterogeneity with single-cell precision. A pivotal yet demanding aspect of spatial transcriptomics data analysis is cell type annotation. However, current methods exhibit limited performance as they are primarily designed for scRNA-seq data. Especially, these approaches often neglect spatial coordinate information and encounter challenges in identifying novel cell types. Here, we introduce SANNO, a novel approach that employs Optimal Transport (OT) to concurrently identify both known and novel cell types in spatially resolved single-cell data. Specifically, SANNO leverages a graph-Transformer module to model spatial coordinates and gene expression. This produces unified representations for both reference and query data. Building on this, SANNO employs a dual-strategy classifier. The first is an Unbalanced Optimal Transport (UOT) module that aligns query data with reference prototypes. The second is a self-supervised OT-based module that enhances global cluster separation and local cellular consistency, effectively eliminating batch effects. To further improve prediction accuracy, SANNO integrates an entropy-based re-weighted loss function. This significantly boosts the confidence of query cell predictions. Comprehensive experiments reveal that SANNO surpasses state-of-the-art techniques across both intra- and cross-spatial datasets, particularly in the identification of novel cell types. Additionally, SANNO demonstrates commendable performance in annotating cells within single-cell data, underscoring its potential as a versatile tool for cell annotation across single-cell and spatial transcriptomics datasets.","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144821315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0