首页 > 最新文献

Biology Methods and Protocols最新文献

英文 中文
Enhancing leptospirosis screening using a deep convolutional neural network with microscopic agglutination test images. 利用显微镜凝集试验图像的深度卷积神经网络增强钩端螺旋体病筛查。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-09 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf047
Murnihayati Hassan, Siti Nur Zawani Rosli, Natasya Amirah Mohamed Tahir, Nurul Azmawati Mohamed, Khairunnisa Mohd Sukri, Liyana Azmi, Norhasmira Mohammad

Leptospirosis poses substantial challenges to global public health. In Malaysia, leptospirosis is endemic, with annual cases peaking during the monsoon season. The microscopic agglutination test (MAT) is the gold-standard serological method for confirmation of leptospirosis. However, it is labor-intensive and time-consuming, as it relies on the subjective interpretation of medical lab technicians. This study describes the development of a semiautomated workflow for Leptospira screening by integrating a TensorFlow and custom-designed Keras-based Deep Convolutional Neural Network (DCNN) with conventional MAT. We used a dataset of 442 positive and 442 negative MAT images, which consisted of a mixture of Leptospira serovars from Malaysia to train the model. The model was subjected to hyperparameter tuning, which modulated the number of convolutional layers, filters, kernel sizes, units in dense layers, activation functions, and learning rate. Verification of our tested model compared to the verified patient MAT results achieved the following metrics: a Precision score of 0.8125, a Recall of 0.9286, and an F1-Score of 0.8667. Combining our model with the current Malaysia Leptospira workflow can significantly speed up, reduce inaccuracies, and improve the management of leptospirosis. Furthermore, the application of this model is practical and adaptable, making it suitable for other labs that observe MAT as their Leptospira diagnosis. To our knowledge, this approach is Malaysia's first hybrid diagnostic approach for Leptospira diagnosis. Scaling up the dataset would enhance the model's accuracy, making it adaptable in other regions where leptospirosis is endemic.

钩端螺旋体病对全球公共卫生构成重大挑战。在马来西亚,钩端螺旋体病是地方性疾病,每年的病例在季风季节达到高峰。显微镜凝集试验(MAT)是确认钩端螺旋体病的金标准血清学方法。然而,它是劳动密集型和耗时的,因为它依赖于医学实验室技术人员的主观解释。本研究通过将TensorFlow和定制设计的基于keras的深度卷积神经网络(DCNN)与传统MAT集成,描述了钩端螺旋体筛选的半自动化工作流程的开发。我们使用了442张阳性和442张阴性MAT图像的数据集,其中包括来自马来西亚的钩端螺旋体血清型的混合物来训练模型。该模型进行了超参数调整,调整了卷积层数、滤波器、核大小、密集层中的单元、激活函数和学习率。将我们测试的模型与经过验证的患者MAT结果进行验证,获得以下指标:Precision得分为0.8125,Recall得分为0.9286,f1得分为0.8667。将我们的模型与当前马来西亚钩端螺旋体工作流程相结合,可以显著加快,减少不准确性,并改善钩端螺旋体病的管理。此外,该模型的应用具有实用性和适应性,适用于其他实验室观察MAT作为钩端螺旋体诊断。据我们所知,该方法是马来西亚首个钩端螺旋体混合诊断方法。扩大数据集将提高模型的准确性,使其适用于其他钩端螺旋体病流行的地区。
{"title":"Enhancing leptospirosis screening using a deep convolutional neural network with microscopic agglutination test images.","authors":"Murnihayati Hassan, Siti Nur Zawani Rosli, Natasya Amirah Mohamed Tahir, Nurul Azmawati Mohamed, Khairunnisa Mohd Sukri, Liyana Azmi, Norhasmira Mohammad","doi":"10.1093/biomethods/bpaf047","DOIUrl":"10.1093/biomethods/bpaf047","url":null,"abstract":"<p><p>Leptospirosis poses substantial challenges to global public health. In Malaysia, leptospirosis is endemic, with annual cases peaking during the monsoon season. The microscopic agglutination test (MAT) is the gold-standard serological method for confirmation of leptospirosis. However, it is labor-intensive and time-consuming, as it relies on the subjective interpretation of medical lab technicians. This study describes the development of a semiautomated workflow for <i>Leptospira</i> screening by integrating a TensorFlow and custom-designed Keras-based Deep Convolutional Neural Network (DCNN) with conventional MAT. We used a dataset of 442 positive and 442 negative MAT images, which consisted of a mixture of <i>Leptospira</i> serovars from Malaysia to train the model. The model was subjected to hyperparameter tuning, which modulated the number of convolutional layers, filters, kernel sizes, units in dense layers, activation functions, and learning rate. Verification of our tested model compared to the verified patient MAT results achieved the following metrics: a Precision score of 0.8125, a Recall of 0.9286, and an F1-Score of 0.8667. Combining our model with the current Malaysia <i>Leptospira</i> workflow can significantly speed up, reduce inaccuracies, and improve the management of leptospirosis. Furthermore, the application of this model is practical and adaptable, making it suitable for other labs that observe MAT as their <i>Leptospira</i> diagnosis. To our knowledge, this approach is Malaysia's first hybrid diagnostic approach for <i>Leptospira</i> diagnosis. Scaling up the dataset would enhance the model's accuracy, making it adaptable in other regions where leptospirosis is endemic.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf047"},"PeriodicalIF":2.5,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12188207/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144498261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Harnessing multi-output machine learning approach and dynamical observables from network structure to optimize COVID-19 intervention strategies. 利用多输出机器学习方法和网络结构的动态观测值优化COVID-19干预策略。
IF 1.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-05 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf039
Caroline L Alves, Katharina Kuhnert, Francisco Aparecido Rodrigues, Michael Moeckel

The coronavirus disease 2019 (COVID-19) pandemic has necessitated the development of accurate models to predict disease dynamics and guide public health interventions. This study leverages the COVASIM agent-based model to simulate 1331 scenarios of COVID-19 transmission across various social settings, focusing on the school, community, and work contact layers. We extracted complex network measures from these simulations and applied deep learning algorithms to predict key epidemiological outcomes, such as infected, severe, and critical cases. Our approach achieved an R 2 value exceeding 95%, demonstrating the model's robust predictive capability. Additionally, we identified optimal intervention strategies using spline interpolation, revealing the critical roles of community and workplace interventions in minimizing the pandemic's impact. The findings underscore the value of integrating network analytics with deep learning to streamline epidemic modeling, reduce computational costs, and enhance public health decision-making. This research offers a novel framework for effectively managing infectious disease outbreaks through targeted, data-driven interventions.

2019冠状病毒病(COVID-19)大流行要求开发准确的模型来预测疾病动态并指导公共卫生干预措施。本研究利用COVASIM基于代理的模型,模拟了不同社会环境中COVID-19传播的1331种场景,重点关注学校、社区和工作接触层。我们从这些模拟中提取了复杂的网络测量,并应用深度学习算法来预测关键的流行病学结果,如感染、严重和危重病例。我们的方法获得了超过95%的r2值,证明了模型的鲁棒预测能力。此外,我们利用样条插值确定了最佳干预策略,揭示了社区和工作场所干预在最大限度地减少大流行影响方面的关键作用。研究结果强调了将网络分析与深度学习相结合,以简化流行病建模、降低计算成本和增强公共卫生决策的价值。这项研究为通过有针对性的、数据驱动的干预措施有效管理传染病暴发提供了一个新的框架。
{"title":"Harnessing multi-output machine learning approach and dynamical observables from network structure to optimize COVID-19 intervention strategies.","authors":"Caroline L Alves, Katharina Kuhnert, Francisco Aparecido Rodrigues, Michael Moeckel","doi":"10.1093/biomethods/bpaf039","DOIUrl":"10.1093/biomethods/bpaf039","url":null,"abstract":"<p><p>The coronavirus disease 2019 (COVID-19) pandemic has necessitated the development of accurate models to predict disease dynamics and guide public health interventions. This study leverages the COVASIM agent-based model to simulate 1331 scenarios of COVID-19 transmission across various social settings, focusing on the school, community, and work contact layers. We extracted complex network measures from these simulations and applied deep learning algorithms to predict key epidemiological outcomes, such as infected, severe, and critical cases. Our approach achieved an <math> <mrow> <mrow> <msup><mrow><mi>R</mi></mrow> <mn>2</mn></msup> </mrow> </mrow> </math> value exceeding 95%, demonstrating the model's robust predictive capability. Additionally, we identified optimal intervention strategies using spline interpolation, revealing the critical roles of community and workplace interventions in minimizing the pandemic's impact. The findings underscore the value of integrating network analytics with deep learning to streamline epidemic modeling, reduce computational costs, and enhance public health decision-making. This research offers a novel framework for effectively managing infectious disease outbreaks through targeted, data-driven interventions.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf039"},"PeriodicalIF":1.3,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12377909/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144972637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances and future directions of aptamer-functionalized nanoparticles for point-of-care diseases diagnosis. 适体功能化纳米颗粒在即时疾病诊断中的研究进展及未来发展方向。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-05 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf046
Ilemobayo Victor Fasogbon, Erick Nyakundi Ondari, Deusdedit Tusubira, Tonny Kabuuka, Ibrahim Babangida Abubakar, Wusa Makena, Angela Mumbua Musyoka, Patrick Maduabuchi Aja

Point-of-care (POC) diagnostics have revolutionized disease detection by enabling rapid, on-site testing without the need for centralized laboratory infrastructure. This review presents recent advances in aptamer-functionalized nanoparticles (AFNs) as promising tools for enhancing POC diagnostics, particularly in infectious diseases and cancer. Aptamers, with their high specificity, stability, and modifiability, offer significant advantages over antibodies, while nanoparticles contribute multifunctionality, including signal amplification and targeting capabilities. AFNs have demonstrated up to a 2-10 times increase in detection sensitivity and significant reductions in diagnostic timeframes. We discuss various nanoparticle types, conjugation strategies, real-world applications, and highlight innovative developments such as AI-assisted aptamer design, wearable diagnostic devices, and green nanoparticle synthesis. Challenges related to stability, manufacturing scalability, regulatory hurdles, and clinical translation are critically examined. By merging aptamer precision with nanoparticle versatility, AFNs hold transformative potential to deliver rapid, affordable, and decentralized healthcare solutions, especially in resource-limited settings. Future interdisciplinary research and sustainable practices will be pivotal in translating AFN-based diagnostics from promising prototypes to global healthcare standards.

即时诊断(POC)通过无需集中实验室基础设施即可进行快速现场检测,彻底改变了疾病检测。本文综述了适配体功能化纳米颗粒(afn)作为增强POC诊断的有前途的工具的最新进展,特别是在传染病和癌症中。适配体具有高特异性、稳定性和可修饰性,与抗体相比具有显著优势,而纳米颗粒具有多功能性,包括信号放大和靶向能力。afn的检测灵敏度提高了2-10倍,诊断时间显著缩短。我们讨论了各种纳米颗粒类型、偶联策略、现实世界的应用,并强调了人工智能辅助适配体设计、可穿戴诊断设备和绿色纳米颗粒合成等创新发展。与稳定性、制造可扩展性、监管障碍和临床翻译相关的挑战被严格审查。通过将合适的精度与纳米颗粒的多功能性相结合,afn具有革命性的潜力,可以提供快速、负担得起的分散式医疗保健解决方案,特别是在资源有限的环境中。未来的跨学科研究和可持续实践将是将基于afn的诊断从有希望的原型转化为全球医疗保健标准的关键。
{"title":"Advances and future directions of aptamer-functionalized nanoparticles for point-of-care diseases diagnosis.","authors":"Ilemobayo Victor Fasogbon, Erick Nyakundi Ondari, Deusdedit Tusubira, Tonny Kabuuka, Ibrahim Babangida Abubakar, Wusa Makena, Angela Mumbua Musyoka, Patrick Maduabuchi Aja","doi":"10.1093/biomethods/bpaf046","DOIUrl":"10.1093/biomethods/bpaf046","url":null,"abstract":"<p><p>Point-of-care (POC) diagnostics have revolutionized disease detection by enabling rapid, on-site testing without the need for centralized laboratory infrastructure. This review presents recent advances in aptamer-functionalized nanoparticles (AFNs) as promising tools for enhancing POC diagnostics, particularly in infectious diseases and cancer. Aptamers, with their high specificity, stability, and modifiability, offer significant advantages over antibodies, while nanoparticles contribute multifunctionality, including signal amplification and targeting capabilities. AFNs have demonstrated up to a 2-10 times increase in detection sensitivity and significant reductions in diagnostic timeframes. We discuss various nanoparticle types, conjugation strategies, real-world applications, and highlight innovative developments such as AI-assisted aptamer design, wearable diagnostic devices, and green nanoparticle synthesis. Challenges related to stability, manufacturing scalability, regulatory hurdles, and clinical translation are critically examined. By merging aptamer precision with nanoparticle versatility, AFNs hold transformative potential to deliver rapid, affordable, and decentralized healthcare solutions, especially in resource-limited settings. Future interdisciplinary research and sustainable practices will be pivotal in translating AFN-based diagnostics from promising prototypes to global healthcare standards.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf046"},"PeriodicalIF":2.5,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12212641/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144545181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rough microsomes isolated from snap-frozen canine pancreatic tissue retain their co-translational translocation functionality. 从速冻犬胰腺组织中分离的粗粒体保留了其共翻译易位功能。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-02 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf044
Marianne Croonenborghs, Marijke Verhaegen, Eva Pauwels, Becky Provinciael, Kurt Vermeire

Proteins are essential for life in all organisms: they mediate cell signaling and cell division and provide structure/motility to cells and tissues. All proteins are synthesized on cytoplasmic ribosomes as unfolded precursors that need to find their correct location in the compartmentalized cell. In eukaryotes, ∼30% of the proteome is translocated across or integrated into the endoplasmic reticulum (ER) membrane, a process mostly mediated by the heterotrimeric Sec61 complex that spans the ER membrane. There is significant interest in identifying small-molecule inhibitors of the Sec61 translocon channel that hold great promise as putative anticancer, immunosuppressive, or antiviral drugs. Hence, representative models are needed to study Sec61-dependent protein import into the ER. Microsomal membranes (or microsomes) isolated from dog pancreatic tissue are the primary source of mammalian ER for cell-free in vitro protein translocation research. Here, we demonstrate that for the isolation of microsomal membranes, snap-frozen canine pancreatic tissue can serve as a valuable alternative to freshly isolated organ tissue from euthanized animals. For 17 out of 20 animals, a sufficient yield of microsomes was extracted from defrosted pancreatic tissue. The isolated microsomes contained the essential proteins of the translocation machinery, and proved to be intact as verified by the detection of ER lumenal chaperones. Importantly, 13 out of the 17 microsome samples retained their translocation competence, as reflected by successful in vitro co-translational translocation of wild-type bovine preprolactin. The microsomes supported post-translational modifications of the tested substrates such as signal peptide cleavage and N-linked glycosylation. Furthermore, the tested microsome samples responded well to the translocation inhibitor cyclotriazadisulfonamide in suppressing human CD4 protein translocation into the ER. In conclusion, microsomes isolated from frozen canine pancreatic tissue proved to retain their co-translational translocation functionality that can contribute to our research of Sec61-dependent protein translocation and selective inhibition thereof.

蛋白质对所有生物体的生命都至关重要:它们介导细胞信号传导和细胞分裂,并为细胞和组织提供结构/运动性。所有蛋白质都是作为未折叠的前体在细胞质核糖体上合成的,这些前体需要在区隔化的细胞中找到正确的位置。在真核生物中,约30%的蛋白质组易位或整合到内质网(ER)膜上,这一过程主要由跨越内质网膜的异三聚体Sec61复合物介导。人们对鉴定Sec61易位通道的小分子抑制剂非常感兴趣,这些小分子抑制剂有望成为潜在的抗癌、免疫抑制或抗病毒药物。因此,需要有代表性的模型来研究sec61依赖性蛋白进入内质网的情况。从狗胰腺组织中分离的微粒体膜(或微粒体)是哺乳动物内质网的主要来源,用于体外无细胞蛋白易位研究。在这里,我们证明了对于微粒体膜的分离,快速冷冻的犬胰腺组织可以作为一种有价值的替代从安乐死动物身上新鲜分离的器官组织。对于20只动物中的17只,从解冻的胰腺组织中提取了足够产量的微粒体。分离的微粒体含有易位机制的必需蛋白质,并且通过内质网腔伴侣的检测证明是完整的。重要的是,17个微粒体样本中有13个保留了其易位能力,这反映在野生型牛泌乳素的体外共翻译易位中。这些微粒体支持被测底物的翻译后修饰,如信号肽切割和n链糖基化。此外,所测试的微粒体样品对易位抑制剂环三氮二磺酰胺在抑制人CD4蛋白易位到内质网方面反应良好。总之,从冷冻犬胰腺组织中分离的微粒体被证明保留了它们的共翻译易位功能,这有助于我们对sec61依赖性蛋白易位及其选择性抑制的研究。
{"title":"Rough microsomes isolated from snap-frozen canine pancreatic tissue retain their co-translational translocation functionality.","authors":"Marianne Croonenborghs, Marijke Verhaegen, Eva Pauwels, Becky Provinciael, Kurt Vermeire","doi":"10.1093/biomethods/bpaf044","DOIUrl":"10.1093/biomethods/bpaf044","url":null,"abstract":"<p><p>Proteins are essential for life in all organisms: they mediate cell signaling and cell division and provide structure/motility to cells and tissues. All proteins are synthesized on cytoplasmic ribosomes as unfolded precursors that need to find their correct location in the compartmentalized cell. In eukaryotes, ∼30% of the proteome is translocated across or integrated into the endoplasmic reticulum (ER) membrane, a process mostly mediated by the heterotrimeric Sec61 complex that spans the ER membrane. There is significant interest in identifying small-molecule inhibitors of the Sec61 translocon channel that hold great promise as putative anticancer, immunosuppressive, or antiviral drugs. Hence, representative models are needed to study Sec61-dependent protein import into the ER. Microsomal membranes (or microsomes) isolated from dog pancreatic tissue are the primary source of mammalian ER for cell-free <i>in vitro</i> protein translocation research. Here, we demonstrate that for the isolation of microsomal membranes, snap-frozen canine pancreatic tissue can serve as a valuable alternative to freshly isolated organ tissue from euthanized animals. For 17 out of 20 animals, a sufficient yield of microsomes was extracted from defrosted pancreatic tissue. The isolated microsomes contained the essential proteins of the translocation machinery, and proved to be intact as verified by the detection of ER lumenal chaperones. Importantly, 13 out of the 17 microsome samples retained their translocation competence, as reflected by successful <i>in vitro</i> co-translational translocation of wild-type bovine preprolactin. The microsomes supported post-translational modifications of the tested substrates such as signal peptide cleavage and N-linked glycosylation. Furthermore, the tested microsome samples responded well to the translocation inhibitor cyclotriazadisulfonamide in suppressing human CD4 protein translocation into the ER. In conclusion, microsomes isolated from frozen canine pancreatic tissue proved to retain their co-translational translocation functionality that can contribute to our research of Sec61-dependent protein translocation and selective inhibition thereof.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf044"},"PeriodicalIF":2.5,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12203168/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convolutional-LSTM approach for temporal catch hotspots (CATCH): an AI-driven model for spatiotemporal forecasting of fisheries catch probability densities. 时序捕捞热点(catch)的卷积- lstm方法:一种人工智能驱动的渔业捕捞概率密度时空预测模型。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-02 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf045
Altair Agmata, Svanur Guðmundsson

Efficient fisheries management is crucial for sustaining both marine ecosystems and the economies that heavily depend on them, such as Iceland. Current fishing practices involve decisions informed by a combination of personal experience, current data on environmental and oceanographic conditions, reports from other captains, and target species within the constraints of the fishing quota. However, the intricate spatiotemporal dynamics of fish behaviour make it difficult to predict fish stock distributions. Despite technological breakthroughs in fishing vessel data collection, much of the decision-making still relies heavily on subjective judgement, highlighting the need for more robust, data-driven predictive methods. This paper presents CATCH, a convolutional long short-term memory neural network model that forecasts fish stock probability densities over time and space in Icelandic waters to support operational planning and adaptive strategy in fisheries. The framework represents the first utilization of large-scale Icelandic fishing fleet data integrating multidimensional inputs, particularly depth, bottom temperature, salinity, dissolved oxygen and catch data, to produce accurate, multivariate forecasts. The model achieves favourable performance with average RMSE, MAE, WD, and SSI of 4.71 × 10-3, 1.16 × 10-3, 0.94 × 10-3, and 0.955, respectively, for cod, while 6.13 × 10-3, 1.25 × 10-3, 1.04 × 10-3, and 0.949, respectively, across other target species (haddock, saithe, golden redfish, and Greenland halibut). Additionally, Syrjala's test yielded nonsignificant P-values (P > .05) in most cases across lags and forecast horizons, indicating that the predicted and observed distributions are statistically indistinguishable. Its promising results suggest deep learning models have the potential to optimize fisheries operations, enhance sustainability, and support data-driven decision-making.

有效的渔业管理对于维持海洋生态系统和严重依赖海洋生态系统的经济(如冰岛)至关重要。目前的捕鱼做法涉及综合考虑个人经验、当前环境和海洋学条件数据、其他船长的报告以及捕捞配额限制内的目标鱼种所作出的决定。然而,鱼类行为的复杂时空动态使得预测鱼类种群分布变得困难。尽管渔船数据收集技术取得了突破,但大部分决策仍然严重依赖主观判断,这凸显了对更强大、数据驱动的预测方法的需求。本文介绍了CATCH,这是一个卷积长短期记忆神经网络模型,可以预测冰岛水域随时间和空间变化的鱼类种群概率密度,以支持渔业的运营规划和适应策略。该框架是第一次利用大型冰岛渔船队数据,整合多方面的投入,特别是深度、海底温度、盐度、溶解氧和渔获量数据,以产生准确的多元预测。该模型对鳕鱼的平均RMSE、MAE、WD和SSI分别为4.71 × 10-3、1.16 × 10-3、0.94 × 10-3和0.955,对其他目标鱼种(黑线鳕、塞氏、金红鱼和格陵兰大比目鱼)的平均RMSE、MAE、WD和SSI分别为6.13 × 10-3、1.25 × 10-3、1.04 × 10-3和0.949,取得了较好的效果。此外,Syrjala的检验在大多数情况下产生了不显著的P值(P >.05),这表明预测和观测的分布在统计上无法区分。其令人鼓舞的结果表明,深度学习模型有潜力优化渔业运营,提高可持续性,并支持数据驱动的决策。
{"title":"Convolutional-LSTM approach for temporal catch hotspots (CATCH): an AI-driven model for spatiotemporal forecasting of fisheries catch probability densities.","authors":"Altair Agmata, Svanur Guðmundsson","doi":"10.1093/biomethods/bpaf045","DOIUrl":"10.1093/biomethods/bpaf045","url":null,"abstract":"<p><p>Efficient fisheries management is crucial for sustaining both marine ecosystems and the economies that heavily depend on them, such as Iceland. Current fishing practices involve decisions informed by a combination of personal experience, current data on environmental and oceanographic conditions, reports from other captains, and target species within the constraints of the fishing quota. However, the intricate spatiotemporal dynamics of fish behaviour make it difficult to predict fish stock distributions. Despite technological breakthroughs in fishing vessel data collection, much of the decision-making still relies heavily on subjective judgement, highlighting the need for more robust, data-driven predictive methods. This paper presents CATCH, a convolutional long short-term memory neural network model that forecasts fish stock probability densities over time and space in Icelandic waters to support operational planning and adaptive strategy in fisheries. The framework represents the first utilization of large-scale Icelandic fishing fleet data integrating multidimensional inputs, particularly depth, bottom temperature, salinity, dissolved oxygen and catch data, to produce accurate, multivariate forecasts. The model achieves favourable performance with average RMSE, MAE, WD, and SSI of 4.71 × 10<sup>-3</sup>, 1.16 × 10<sup>-3</sup>, 0.94 × 10<sup>-3</sup>, and 0.955, respectively, for cod, while 6.13 × 10<sup>-3</sup>, 1.25 × 10<sup>-3</sup>, 1.04 × 10<sup>-3</sup>, and 0.949, respectively, across other target species (haddock, saithe, golden redfish, and Greenland halibut). Additionally, Syrjala's test yielded nonsignificant <i>P</i>-values (<i>P</i> > .05) in most cases across lags and forecast horizons, indicating that the predicted and observed distributions are statistically indistinguishable. Its promising results suggest deep learning models have the potential to optimize fisheries operations, enhance sustainability, and support data-driven decision-making.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf045"},"PeriodicalIF":2.5,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12203189/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimization of computational ancestry inference for use in cancer cell lines. 用于癌细胞系的计算祖先推断的优化。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-02 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf043
Matthew S Chang, Katherine A Martinez, Chayil C Lattimore, Christina M Gobin, Kimberly J Newsom, Kristianna M Fredenburg

Cancer cell lines have provided invaluable preclinical mechanistic data for cancer health disparities research. Although there are several studies that detail ancestry inference methods using microarray data, there are none that provide investigators with documentation of ancestry inference methods using sequencing data. Here, we describe our computational workflow for inferring genetic ancestry using either whole genome sequencing (WGS) or RNA-sequencing (RNA-seq) data from cancer cell lines. RNA-seq and WGS datasets were generated from four head and neck cancer cell lines with self-identified race/ethnicity (SIRE) as either White or Black. Our workflow included variant calling and genotype imputation via Illumina DRAGEN pipelines, merging genotyping datasets with the 1000 Genomes Project (1KGP), single nucleotide polymorphism (SNP) filtering via PLINK, and ancestry inference with ADMIXTURE. We encountered challenges in workflow development with SNP filtering and clustering of 1KGP superpopulations. Adjusting stringency of filtering parameters to a window size of 100 kb and r 2 threshold of 0.8 resulted in 312,821 SNPs remaining for the RNA-seq dataset and 1,569,578 SNPs remaining for the WGS dataset. Clustering with 1KGP improved with a panel of 291 ancestry informative markers. To estimate proportions of genetic ancestry, we used all filtered SNPs. For the WGS dataset, both clustering and genetic ancestry proportions for each cancer cell line showed concurrence with SIRE. In conclusion, our optimized workflow offers investigators a robust approach for transforming cancer cell line sequencing data to infer genetic ancestry and suggests that WGS datasets are superior to RNA-seq datasets in clustering superpopulations and more accurately estimating genetic ancestry.

癌细胞系为癌症健康差异研究提供了宝贵的临床前机制数据。尽管有几项研究详细介绍了使用微阵列数据的祖先推断方法,但没有一项研究为研究人员提供了使用测序数据的祖先推断方法的文档。在这里,我们描述了使用来自癌细胞系的全基因组测序(WGS)或rna测序(RNA-seq)数据推断遗传祖先的计算工作流程。RNA-seq和WGS数据集来自四种头颈癌细胞系,这些细胞系自我鉴定的种族/民族(SIRE)为白人或黑人。我们的工作流程包括通过Illumina DRAGEN管道进行变异调用和基因型插入,通过1000基因组计划(1KGP)合并基因分型数据集,通过PLINK进行单核苷酸多态性(SNP)过滤,以及使用admix进行祖先推断。我们在工作流程开发中遇到了SNP过滤和1KGP超种群聚类的挑战。将过滤参数的严格程度调整为窗口大小为100 kb, r2阈值为0.8,结果导致RNA-seq数据集保留312,821个snp, WGS数据集保留1,569,578个snp。用291个祖先信息标记改进了1KGP聚类。为了估计遗传祖先的比例,我们使用了所有过滤过的snp。对于WGS数据集,每个癌细胞系的聚类和遗传祖先比例都与SIRE一致。总之,我们优化的工作流程为研究人员提供了一种强大的方法来转化癌细胞系测序数据来推断遗传祖先,并表明WGS数据集在聚类超群体中优于RNA-seq数据集,并且更准确地估计遗传祖先。
{"title":"Optimization of computational ancestry inference for use in cancer cell lines.","authors":"Matthew S Chang, Katherine A Martinez, Chayil C Lattimore, Christina M Gobin, Kimberly J Newsom, Kristianna M Fredenburg","doi":"10.1093/biomethods/bpaf043","DOIUrl":"10.1093/biomethods/bpaf043","url":null,"abstract":"<p><p>Cancer cell lines have provided invaluable preclinical mechanistic data for cancer health disparities research. Although there are several studies that detail ancestry inference methods using microarray data, there are none that provide investigators with documentation of ancestry inference methods using sequencing data. Here, we describe our computational workflow for inferring genetic ancestry using either whole genome sequencing (WGS) or RNA-sequencing (RNA-seq) data from cancer cell lines. RNA-seq and WGS datasets were generated from four head and neck cancer cell lines with self-identified race/ethnicity (SIRE) as either White or Black. Our workflow included variant calling and genotype imputation via Illumina DRAGEN pipelines, merging genotyping datasets with the 1000 Genomes Project (1KGP), single nucleotide polymorphism (SNP) filtering via PLINK, and ancestry inference with ADMIXTURE. We encountered challenges in workflow development with SNP filtering and clustering of 1KGP superpopulations. Adjusting stringency of filtering parameters to a window size of 100 kb and <i>r</i> <sup>2</sup> threshold of 0.8 resulted in 312,821 SNPs remaining for the RNA-seq dataset and 1,569,578 SNPs remaining for the WGS dataset. Clustering with 1KGP improved with a panel of 291 ancestry informative markers. To estimate proportions of genetic ancestry, we used all filtered SNPs. For the WGS dataset, both clustering and genetic ancestry proportions for each cancer cell line showed concurrence with SIRE. In conclusion, our optimized workflow offers investigators a robust approach for transforming cancer cell line sequencing data to infer genetic ancestry and suggests that WGS datasets are superior to RNA-seq datasets in clustering superpopulations and more accurately estimating genetic ancestry.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf043"},"PeriodicalIF":2.5,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12203193/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DrugPipe: Generative artificial intelligence-assisted virtual screening pipeline for generalizable and efficient drug repurposing. 药物管道:生成人工智能辅助的虚拟筛选管道,用于通用和有效的药物再利用。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-05-30 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf038
Phuc Pham, Viet Thanh Duy Nguyen, Kyu Hong Cho, Truong-Son Hy

Drug repurposing presents a promising strategy to accelerate drug discovery by identifying new therapeutic uses for existing compounds, particularly for diseases with limited or no effective treatment options. We introduce DrugPipe, a 'Generative AI-Assisted Virtual Screening Pipeline' developed within the target-centric paradigm of drug repurposing, which aims to discover new indications by identifying compounds that interact with a specific protein target. 'DrugPipe' integrates generative modeling, binding pocket prediction, and similarity-based retrieval from drug databases to enable a scalable and generalizable in silico repurposing workflow. It supports blind virtual screening for any protein target without requiring prior structural or functional annotations, making it especially suited for novel or understudied targets and emerging health threats. By efficiently generating candidate ligands and rapidly retrieving structurally similar approved drugs, 'DrugPipe' accelerates the identification and prioritization of repurposable compounds. In comparative evaluations, it achieves hit rate performance comparable to QVina-W, a widely used blind docking tool, while significantly reducing computational time, highlighting its practical value for large-scale virtual screening and data-scarce repurposing scenarios. The full implementation and evaluation details are available at https://github.com/HySonLab/DrugPipe.

药物再利用是一种很有前景的战略,通过确定现有化合物的新治疗用途,特别是对于治疗方案有限或没有有效治疗方案的疾病,可以加速药物的发现。我们介绍了DrugPipe,这是一种在以靶标为中心的药物再利用范例中开发的“生成式人工智能辅助虚拟筛选管道”,旨在通过识别与特定蛋白质靶标相互作用的化合物来发现新的适应症。“DrugPipe”集成了生成建模、绑定口袋预测和基于相似性的药物数据库检索,以实现可扩展和通用的计算机再利用工作流程。它支持对任何蛋白质靶标进行盲虚拟筛选,而不需要事先进行结构或功能注释,使其特别适合于新的或未充分研究的靶标和新出现的健康威胁。通过高效生成候选配体和快速检索结构相似的已批准药物,“DrugPipe”加速了可重复利用化合物的识别和优先排序。在对比评估中,该方法的命中率可与广泛使用的盲对接工具QVina-W相媲美,同时显著减少了计算时间,突出了其在大规模虚拟筛选和数据稀缺再利用场景中的实用价值。完整的实施和评估细节可在https://github.com/HySonLab/DrugPipe上获得。
{"title":"DrugPipe: Generative artificial intelligence-assisted virtual screening pipeline for generalizable and efficient drug repurposing.","authors":"Phuc Pham, Viet Thanh Duy Nguyen, Kyu Hong Cho, Truong-Son Hy","doi":"10.1093/biomethods/bpaf038","DOIUrl":"10.1093/biomethods/bpaf038","url":null,"abstract":"<p><p>Drug repurposing presents a promising strategy to accelerate drug discovery by identifying new therapeutic uses for existing compounds, particularly for diseases with limited or no effective treatment options. We introduce <b>DrugPipe</b>, a 'Generative AI-Assisted Virtual Screening Pipeline' developed within the target-centric paradigm of drug repurposing, which aims to discover new indications by identifying compounds that interact with a specific protein target. 'DrugPipe' integrates generative modeling, binding pocket prediction, and similarity-based retrieval from drug databases to enable a scalable and generalizable <i>in silico</i> repurposing workflow. It supports blind virtual screening for any protein target without requiring prior structural or functional annotations, making it especially suited for novel or understudied targets and emerging health threats. By efficiently generating candidate ligands and rapidly retrieving structurally similar approved drugs, 'DrugPipe' accelerates the identification and prioritization of repurposable compounds. In comparative evaluations, it achieves hit rate performance comparable to QVina-W, a widely used blind docking tool, while significantly reducing computational time, highlighting its practical value for large-scale virtual screening and data-scarce repurposing scenarios. The full implementation and evaluation details are available at https://github.com/HySonLab/DrugPipe.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf038"},"PeriodicalIF":2.5,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12141201/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144250087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tissue-specific DNA isolation from dissected millipedes for nanopore sequencing. 从解剖千足虫中分离组织特异性DNA用于纳米孔测序。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-05-28 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf042
Elena Cruz, William Wittstock, Bruce A Snyder, Arnab Sengupta

There are approximately 12,000 described species within the class Diplopoda. Only five species, falling within 4 of 16 described orders, have fully sequenced genomes. No whole genomes are available for incredibly diverse families like Xystodesmidae. Furthermore, genetic information attributed to key functions in these species is very limited. There is a growing interest in characterizing genomes of non-model organisms, however, extracting high-quality DNA for organisms with complex morphology can be challenging. Here we describe a detailed methodology for obtaining high-purity DNA from legs, head, and body tissues from wild-caught specimens of the millipede species Cherokia georgiana. Our dissection protocol separates the digestive tract minimizing microbial abundance in the extracted DNA sample. We describe sample homogenization steps that improve total DNA yield. To assess sample quality, concentration, and size we use spectrophotometry, fluorometry, and automated electrophoresis, respectively. We consistently obtain average DNA length upwards of 12-25 kb. We applied Oxford Nanopore Technologies MinION long-read sequencing, an affordable and accessible option with potential for field-based applications. Here we present tissue-specific DNA sequencing metrics, alignment and assembly of mitochondrial DNA consensus sequence, and phylogenetic analysis. While noting the limitations of our nanopore-based sequencing methodology, we provide a framework to process field specimens for PCR-free DNA sequencing data that can be used for gene-specific alignment and analysis.

在双足纲中大约有12000种已被描述的物种。在16个已描述目中的4个目中,只有5个物种完成了基因组测序。没有完整的基因组可以用于像木蝇科这样难以置信的多样化家庭。此外,与这些物种的关键功能有关的遗传信息非常有限。人们对表征非模式生物的基因组越来越感兴趣,然而,为具有复杂形态的生物体提取高质量的DNA可能具有挑战性。在这里,我们描述了一种详细的方法,从腿部、头部和身体组织中获得高纯度的DNA,这些DNA来自野生捕获的千足虫物种Cherokia georgiana。我们的解剖方案分离消化道,最大限度地减少提取DNA样本中的微生物丰度。我们描述了样品均质步骤,提高总DNA产量。为了评估样品的质量、浓度和大小,我们分别使用分光光度法、荧光法和自动电泳法。我们一直得到平均DNA长度在12-25 kb以上。我们采用了Oxford Nanopore Technologies的MinION长读测序技术,这是一种价格合理且易于使用的选择,具有现场应用的潜力。在这里,我们提出了组织特异性DNA测序指标,线粒体DNA一致序列的比对和组装,以及系统发育分析。虽然注意到我们基于纳米孔的测序方法的局限性,但我们提供了一个框架来处理现场标本,以获得可用于基因特异性比对和分析的无pcr DNA测序数据。
{"title":"Tissue-specific DNA isolation from dissected millipedes for nanopore sequencing.","authors":"Elena Cruz, William Wittstock, Bruce A Snyder, Arnab Sengupta","doi":"10.1093/biomethods/bpaf042","DOIUrl":"10.1093/biomethods/bpaf042","url":null,"abstract":"<p><p>There are approximately 12,000 described species within the class Diplopoda. Only five species, falling within 4 of 16 described orders, have fully sequenced genomes. No whole genomes are available for incredibly diverse families like Xystodesmidae. Furthermore, genetic information attributed to key functions in these species is very limited. There is a growing interest in characterizing genomes of non-model organisms, however, extracting high-quality DNA for organisms with complex morphology can be challenging. Here we describe a detailed methodology for obtaining high-purity DNA from legs, head, and body tissues from wild-caught specimens of the millipede species <i>Cherokia georgiana</i>. Our dissection protocol separates the digestive tract minimizing microbial abundance in the extracted DNA sample. We describe sample homogenization steps that improve total DNA yield. To assess sample quality, concentration, and size we use spectrophotometry, fluorometry, and automated electrophoresis, respectively. We consistently obtain average DNA length upwards of 12-25 kb. We applied Oxford Nanopore Technologies MinION long-read sequencing, an affordable and accessible option with potential for field-based applications. Here we present tissue-specific DNA sequencing metrics, alignment and assembly of mitochondrial DNA consensus sequence, and phylogenetic analysis. While noting the limitations of our nanopore-based sequencing methodology, we provide a framework to process field specimens for PCR-free DNA sequencing data that can be used for gene-specific alignment and analysis.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf042"},"PeriodicalIF":2.5,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12202871/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KD_MultiSucc: incorporating multi-teacher knowledge distillation and word embeddings for cross-species prediction of protein succinylation sites. kd_multisuc:结合多教师知识蒸馏和词嵌入跨物种预测蛋白质琥珀酰化位点。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-05-28 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf041
Thi-Xuan Tran, Thi-Tuyen Nguyen, Nguyen-Quoc-Khanh Le, Van-Nui Nguyen

Protein succinylation is a vital post-translational modification (PTM) that involves the covalent attachment of a succinyl group (-CO-CH2-CH2-CO-) to the lysine residue of a protein molecule. The mechanism underlying the succinylation process plays a critical role in regulating protein structure, stability, and function, contributing to various biological processes, including metabolism, gene expression, and signal transduction. Succinylation has also been associated with numerous diseases, such as cancer, neurodegenerative disorders, and metabolic syndromes. Due to its important roles, the accurate prediction of succinylation sites is essential for a comprehensive understanding of the mechanisms underlying succinylation. Although research on the identification of protein succinylation sites has been increasing, experimental methods remain time-consuming and costly, underscoring the need for efficient computational approaches. In this study, we present KD_MultiSucc, a model for cross-species prediction of succinylation sites using Multi-Teacher Knowledge Distillation and Word Embedding. The proposed method leverages the strengths of both Knowledge Distillation and Word Embedding techniques to reduce computational complexity while maintaining high accuracy in predicting protein succinylation sites across species. Experimental results demonstrate that the proposed predictor outperforms existing predictors, providing a valuable contribution to PTM research and biomedical applications. To assist readers and researchers, the codes and resources related to this work have been made freely accessible on GitHub at https://github.com/nuinvtnu/KD_MultiSucc/.

蛋白质琥珀酰化是一种重要的翻译后修饰(PTM),涉及琥珀酰基(- co - ch2 - ch2 - co -)与蛋白质分子赖氨酸残基的共价连接。琥珀酰化过程的机制在调节蛋白质结构、稳定性和功能方面起着关键作用,参与多种生物过程,包括代谢、基因表达和信号转导。琥珀酰化也与许多疾病有关,如癌症、神经退行性疾病和代谢综合征。由于其重要作用,准确预测琥珀酰化位点对于全面了解琥珀酰化的机制至关重要。尽管对蛋白质琥珀酰化位点鉴定的研究一直在增加,但实验方法仍然耗时且昂贵,强调需要有效的计算方法。在这项研究中,我们提出了kd_multisuc,一个使用多教师知识蒸馏和词嵌入的跨物种琥珀酰化位点预测模型。该方法利用知识蒸馏和词嵌入技术的优势,降低了计算复杂度,同时保持了跨物种蛋白质琥珀酰化位点预测的高精度。实验结果表明,所提出的预测器优于现有的预测器,为PTM研究和生物医学应用提供了宝贵的贡献。为了帮助读者和研究人员,与这项工作相关的代码和资源已在GitHub上免费提供,网址为https://github.com/nuinvtnu/KD_MultiSucc/。
{"title":"KD_MultiSucc: incorporating multi-teacher knowledge distillation and word embeddings for cross-species prediction of protein succinylation sites.","authors":"Thi-Xuan Tran, Thi-Tuyen Nguyen, Nguyen-Quoc-Khanh Le, Van-Nui Nguyen","doi":"10.1093/biomethods/bpaf041","DOIUrl":"10.1093/biomethods/bpaf041","url":null,"abstract":"<p><p>Protein succinylation is a vital post-translational modification (PTM) that involves the covalent attachment of a succinyl group (-CO-CH2-CH2-CO-) to the lysine residue of a protein molecule. The mechanism underlying the succinylation process plays a critical role in regulating protein structure, stability, and function, contributing to various biological processes, including metabolism, gene expression, and signal transduction. Succinylation has also been associated with numerous diseases, such as cancer, neurodegenerative disorders, and metabolic syndromes. Due to its important roles, the accurate prediction of succinylation sites is essential for a comprehensive understanding of the mechanisms underlying succinylation. Although research on the identification of protein succinylation sites has been increasing, experimental methods remain time-consuming and costly, underscoring the need for efficient computational approaches. In this study, we present KD_MultiSucc, a model for cross-species prediction of succinylation sites using Multi-Teacher Knowledge Distillation and Word Embedding. The proposed method leverages the strengths of both Knowledge Distillation and Word Embedding techniques to reduce computational complexity while maintaining high accuracy in predicting protein succinylation sites across species. Experimental results demonstrate that the proposed predictor outperforms existing predictors, providing a valuable contribution to PTM research and biomedical applications. To assist readers and researchers, the codes and resources related to this work have been made freely accessible on GitHub at https://github.com/nuinvtnu/KD_MultiSucc/.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf041"},"PeriodicalIF":2.5,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12202750/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimized protein extraction protocol from human skin samples. 优化人体皮肤样品蛋白质提取方案。
IF 2.5 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-05-10 eCollection Date: 2025-01-01 DOI: 10.1093/biomethods/bpaf035
Ana Paula Carvalho Reis, Giovanna Azevedo Celestrino, Talita Souza Siqueira, Milena De Melo Scarano Coelho, Juliana Carreiro Avila, Isabela De Oliveira Cavalcante Pimentel, Leo Kei Iwai, Pritesh Jaychand Lalwani, Vitor Manoel Silva Dos Reis, Kaique Arriel, José Ângelo Lindoso, Gil Benard, Maria Gloria Teixeira Sousa

The skin is the largest organ in the body and is the site for a diverse set of diseases. Yet, given the complexity of the cutaneous tissue, there is a limited availability of data in the literature on skin proteomics. Here, we proposed an adapted and optimized protocol for the extraction of proteins from human skin, using a combination of chemical and mechanical lysis approaches. For this, we used of a lysis buffer containing 2% SDS, 50 mM TEAB, and a 1% protease and phosphatase inhibitor cocktail, in addition to Matrix A beads and a FastPrep-24 5G homogenizer. For the characterization of the samples, the obtained proteins were purified and digested using the SP3 method (Single-pot, solid phase, sample preparation), and analyzed by nano liquid chromatography coupled with tandem mass spectrometry. In this way, we were able to identify around 6000 proteins in the skin samples from healthy individuals and patients with the fungal infection sporotrichosis. Our improved methodology could significantly enrich our understanding of skin biology and provide new perspectives for the discovery of biomarkers and therapeutic targets for cutaneous diseases.

皮肤是人体最大的器官,也是多种疾病的发病部位。然而,考虑到皮肤组织的复杂性,关于皮肤蛋白质组学的文献数据有限。在这里,我们提出了一种适应和优化的方案,用于从人体皮肤中提取蛋白质,使用化学和机械裂解相结合的方法。为此,我们使用了含有2% SDS, 50 mM TEAB, 1%蛋白酶和磷酸酶抑制剂混合物的裂解缓冲液,以及Matrix a珠和FastPrep-24 5G均质机。为了对样品进行表征,采用SP3法(单锅,固相,样品制备)对所得蛋白质进行纯化和消化,并采用纳米液相色谱-串联质谱法进行分析。通过这种方式,我们能够从健康个体和真菌感染孢子菌病患者的皮肤样本中识别出大约6000种蛋白质。我们改进的方法可以显著丰富我们对皮肤生物学的理解,并为发现皮肤疾病的生物标志物和治疗靶点提供新的视角。
{"title":"Optimized protein extraction protocol from human skin samples.","authors":"Ana Paula Carvalho Reis, Giovanna Azevedo Celestrino, Talita Souza Siqueira, Milena De Melo Scarano Coelho, Juliana Carreiro Avila, Isabela De Oliveira Cavalcante Pimentel, Leo Kei Iwai, Pritesh Jaychand Lalwani, Vitor Manoel Silva Dos Reis, Kaique Arriel, José Ângelo Lindoso, Gil Benard, Maria Gloria Teixeira Sousa","doi":"10.1093/biomethods/bpaf035","DOIUrl":"10.1093/biomethods/bpaf035","url":null,"abstract":"<p><p>The skin is the largest organ in the body and is the site for a diverse set of diseases. Yet, given the complexity of the cutaneous tissue, there is a limited availability of data in the literature on skin proteomics. Here, we proposed an adapted and optimized protocol for the extraction of proteins from human skin, using a combination of chemical and mechanical lysis approaches. For this, we used of a lysis buffer containing 2% SDS, 50 mM TEAB, and a 1% protease and phosphatase inhibitor cocktail, in addition to Matrix A beads and a FastPrep-24 5G homogenizer. For the characterization of the samples, the obtained proteins were purified and digested using the SP3 method (Single-pot, solid phase, sample preparation), and analyzed by nano liquid chromatography coupled with tandem mass spectrometry. In this way, we were able to identify around 6000 proteins in the skin samples from healthy individuals and patients with the fungal infection sporotrichosis. Our improved methodology could significantly enrich our understanding of skin biology and provide new perspectives for the discovery of biomarkers and therapeutic targets for cutaneous diseases.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":"10 1","pages":"bpaf035"},"PeriodicalIF":2.5,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12202028/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144508692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biology Methods and Protocols
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1