Background: Accurate assessment of left ventricular function is essential for diagnosing and managing cardiovascular disease. Gated myocardial perfusion SPECT (MPS) enables simultaneous evaluation of perfusion and function, but reliable contour extraction is challenged by image noise, resolution limits, and anatomical variability. Multi-center validation is further restricted by data privacy concerns, underscoring the need for robust and privacy-preserving contouring methods.
Methods: In this study, we propose a novel approach, FedDA-TSformer, which integrates Federated Domain Adaptation with the TimeSformer model for the task of left ventricle segmentation using MPS images. The proposed model captures spatial and temporal features through a Divide-Space-Time-Attention mechanism, which ensures spatial-temporal consistency in predictions across multi-center datasets. To facilitate domain adaptation, we employ a local maximum mean discrepancy (LMMD) loss to align model outputs across data from three different institutions. This strategy effectively combines federated learning and domain adaptation to enhance model generalization while ensuring data security.
Results: We evaluated FedDA-TSformer on a dataset comprising 150 subjects collected from three hospitals, with each cardiac cycle divided into eight gates. The model achieved Dice Similarity Coefficients (DSC) of 0.842 and 0.907 for left ventricular endocardium and epicardium segmentation, respectively.
Discussion: FedDA-TSformer provides a robust, privacy-preserving solution for multi-center left ventricular segmentation, outperforming traditional FedAvg in handling domain shifts. By leveraging the TimeSformer architecture and domain adaptation mechanisms, the framework ensures spatial-temporal consistency and data security across heterogeneous clinical sites. Despite current limitations regarding communication overhead and its focus on a small SPECT-only dataset, this study establishes a scalable foundation for collaborative cardiac diagnosis. Future work will prioritize model compression, asynchronous updates, and cross-modality generalization to CT and MRI to enhance its practicality in resource-constrained environments.
{"title":"FedDA-TSformer: Federated Domain Adaptation with vision TimeSformer for left ventricle segmentation on gated myocardial perfusion SPECT image.","authors":"Yehong Huang, Chen Zhao, Rochak Dhakal, Min Zhao, Guang-Uei Hung, Zhixin Jiang, Weihua Zhou","doi":"10.1186/s44330-026-00057-8","DOIUrl":"10.1186/s44330-026-00057-8","url":null,"abstract":"<p><strong>Background: </strong>Accurate assessment of left ventricular function is essential for diagnosing and managing cardiovascular disease. Gated myocardial perfusion SPECT (MPS) enables simultaneous evaluation of perfusion and function, but reliable contour extraction is challenged by image noise, resolution limits, and anatomical variability. Multi-center validation is further restricted by data privacy concerns, underscoring the need for robust and privacy-preserving contouring methods.</p><p><strong>Methods: </strong>In this study, we propose a novel approach, FedDA-TSformer, which integrates Federated Domain Adaptation with the TimeSformer model for the task of left ventricle segmentation using MPS images. The proposed model captures spatial and temporal features through a Divide-Space-Time-Attention mechanism, which ensures spatial-temporal consistency in predictions across multi-center datasets. To facilitate domain adaptation, we employ a local maximum mean discrepancy (LMMD) loss to align model outputs across data from three different institutions. This strategy effectively combines federated learning and domain adaptation to enhance model generalization while ensuring data security.</p><p><strong>Results: </strong>We evaluated FedDA-TSformer on a dataset comprising 150 subjects collected from three hospitals, with each cardiac cycle divided into eight gates. The model achieved Dice Similarity Coefficients (DSC) of 0.842 and 0.907 for left ventricular endocardium and epicardium segmentation, respectively.</p><p><strong>Discussion: </strong>FedDA-TSformer provides a robust, privacy-preserving solution for multi-center left ventricular segmentation, outperforming traditional FedAvg in handling domain shifts. By leveraging the TimeSformer architecture and domain adaptation mechanisms, the framework ensures spatial-temporal consistency and data security across heterogeneous clinical sites. Despite current limitations regarding communication overhead and its focus on a small SPECT-only dataset, this study establishes a scalable foundation for collaborative cardiac diagnosis. Future work will prioritize model compression, asynchronous updates, and cross-modality generalization to CT and MRI to enhance its practicality in resource-constrained environments.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"3 1","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12862020/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146115397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-02DOI: 10.1186/s44330-024-00020-5
Hayden C Stites, Uri Manor
Background: To address the limitations of large-scale high quality microscopy image acquisition, PSSR (Point-Scanning Super-Resolution) was introduced to enhance easily acquired low quality microscopy data to a higher quality using deep learning-based methods. However, while PSSR was released as open-source, it was difficult for users to implement into their workflows due to an outdated codebase, limiting its usage by prospective users. Additionally, while the data enhancements provided by PSSR were significant, there was still potential for further improvement.
Methods: To overcome this, we introduce PSSR2, a redesigned implementation of PSSR workflows and methods built to put state-of-the-art technology into the hands of the general microscopy and biology research community. PSSR2 enables user-friendly implementation of super-resolution workflows for simultaneous super-resolution and denoising of undersampled microscopy data, especially through its integrated Command Line Interface and Napari plugin. PSSR2 improves and expands upon previously established PSSR algorithms, mainly through improvements in the semi-synthetic data generation ("crappification") and training processes.
Results: In benchmarking PSSR2 on a test dataset of paired high and low resolution electron microscopy images, PSSR2 super-resolves high-resolution images from low-resolution images to a significantly higher accuracy than PSSR. The super-resolved images are also more visually representative of real-world high-resolution images.
Discussion: The improvements in PSSR2, in providing higher quality images, should improve the performance of downstream analyses. We note that for accurate super-resolution, PSSR2 models should only be applied to super-resolve data sufficiently similar to training data and should be validated against real-world ground truth data.
{"title":"PSSR2: a user-friendly Python package for democratizing deep learning-based point-scanning super-resolution microscopy.","authors":"Hayden C Stites, Uri Manor","doi":"10.1186/s44330-024-00020-5","DOIUrl":"10.1186/s44330-024-00020-5","url":null,"abstract":"<p><strong>Background: </strong>To address the limitations of large-scale high quality microscopy image acquisition, PSSR (Point-Scanning Super-Resolution) was introduced to enhance easily acquired low quality microscopy data to a higher quality using deep learning-based methods. However, while PSSR was released as open-source, it was difficult for users to implement into their workflows due to an outdated codebase, limiting its usage by prospective users. Additionally, while the data enhancements provided by PSSR were significant, there was still potential for further improvement.</p><p><strong>Methods: </strong>To overcome this, we introduce PSSR2, a redesigned implementation of PSSR workflows and methods built to put state-of-the-art technology into the hands of the general microscopy and biology research community. PSSR2 enables user-friendly implementation of super-resolution workflows for simultaneous super-resolution and denoising of undersampled microscopy data, especially through its integrated Command Line Interface and Napari plugin. PSSR2 improves and expands upon previously established PSSR algorithms, mainly through improvements in the semi-synthetic data generation (\"crappification\") and training processes.</p><p><strong>Results: </strong>In benchmarking PSSR2 on a test dataset of paired high and low resolution electron microscopy images, PSSR2 super-resolves high-resolution images from low-resolution images to a significantly higher accuracy than PSSR. The super-resolved images are also more visually representative of real-world high-resolution images.</p><p><strong>Discussion: </strong>The improvements in PSSR2, in providing higher quality images, should improve the performance of downstream analyses. We note that for accurate super-resolution, PSSR2 models should only be applied to super-resolve data sufficiently similar to training data and should be validated against real-world ground truth data.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12263091/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144644595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-11-03DOI: 10.1186/s44330-025-00048-1
Irene Silvernail, Andi N Morgan, Kenya Gordon, Alexandria N Kerr, Kanda Borgognoni, Andrew M Atisa, Benjamin S Clark, Jose F Castaneda, Robin E Stanley, Sharonda J LeBlanc
Background: The structural dynamics of proteins and nucleic acids are critical for their function in many biological processes but investigating these dynamics is often challenging with traditional techniques. Time-correlated single photon counting (TCSPC) coupled with confocal microscopy is a versatile biophysical tool that enables real-time monitoring of biomolecular dynamics in a variety of systems, across many timescales. Quantitative single-molecule time-resolved fluorescence methods are uniquely positioned to investigate transient interactions and structural changes, yet application in complex biological systems remains limited by technical and analytical challenges. Combining fluorescence lifetime imaging microscopy (FLIM) with pulsed interleaved excitation Förster resonance energy transfer (PIE-FRET) offers a robust approach to overcome these barriers, enabling accurate distance measurements and dynamic studies across diverse sample types.
Methods: We describe practical workflows for implementing FLIM/PIE-FRET for quantitative measurements of nanoscale distances and dynamic processes in various biomolecular systems on a commercial microscope. Benchmark DNA constructs, RNA/DNA hybrids, liposome-encapsulated enzymes, and live Saccharomyces cerevisiae strains were prepared and imaged. Correction factors for FRET efficiency recovery were determined from diffusion-based experiments, and results were validated by direct comparison of intensity- and lifetime-based analyses.
Results: FRET efficiencies from both intensity- and lifetime-based analyses were consistent across systems. DNA standards reproduced expected values, RNA/DNA hybrids reported on substrate dynamics, liposome encapsulation enabled single-enzyme conformational probing, and live-cell imaging revealed transient protein-protein interactions during ribosome biogenesis.
Discussion: This work establishes guidelines for implementing FLIM/PIE-FRET as an accessible method to interrogate nanoscale distances, conformational dynamics, and protein-protein interactions both in vitro and in live cells. The strategies outlined here facilitate broader adoption of quantitative single-molecule time-resolved fluorescence in structural and cell biology.
Supplementary information: The online version contains supplementary material available at 10.1186/s44330-025-00048-1.
{"title":"Quantitative single-molecule FLIM and PIE-FRET imaging of biomolecular systems.","authors":"Irene Silvernail, Andi N Morgan, Kenya Gordon, Alexandria N Kerr, Kanda Borgognoni, Andrew M Atisa, Benjamin S Clark, Jose F Castaneda, Robin E Stanley, Sharonda J LeBlanc","doi":"10.1186/s44330-025-00048-1","DOIUrl":"10.1186/s44330-025-00048-1","url":null,"abstract":"<p><strong>Background: </strong>The structural dynamics of proteins and nucleic acids are critical for their function in many biological processes but investigating these dynamics is often challenging with traditional techniques. Time-correlated single photon counting (TCSPC) coupled with confocal microscopy is a versatile biophysical tool that enables real-time monitoring of biomolecular dynamics in a variety of systems, across many timescales. Quantitative single-molecule time-resolved fluorescence methods are uniquely positioned to investigate transient interactions and structural changes, yet application in complex biological systems remains limited by technical and analytical challenges. Combining fluorescence lifetime imaging microscopy (FLIM) with pulsed interleaved excitation Förster resonance energy transfer (PIE-FRET) offers a robust approach to overcome these barriers, enabling accurate distance measurements and dynamic studies across diverse sample types.</p><p><strong>Methods: </strong>We describe practical workflows for implementing FLIM/PIE-FRET for quantitative measurements of nanoscale distances and dynamic processes in various biomolecular systems on a commercial microscope. Benchmark DNA constructs, RNA/DNA hybrids, liposome-encapsulated enzymes, and live <i>Saccharomyces cerevisiae</i> strains were prepared and imaged. Correction factors for FRET efficiency recovery were determined from diffusion-based experiments, and results were validated by direct comparison of intensity- and lifetime-based analyses.</p><p><strong>Results: </strong>FRET efficiencies from both intensity- and lifetime-based analyses were consistent across systems. DNA standards reproduced expected values, RNA/DNA hybrids reported on substrate dynamics, liposome encapsulation enabled single-enzyme conformational probing, and live-cell imaging revealed transient protein-protein interactions during ribosome biogenesis.</p><p><strong>Discussion: </strong>This work establishes guidelines for implementing FLIM/PIE-FRET as an accessible method to interrogate nanoscale distances, conformational dynamics, and protein-protein interactions both in vitro and in live cells. The strategies outlined here facilitate broader adoption of quantitative single-molecule time-resolved fluorescence in structural and cell biology.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1186/s44330-025-00048-1.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"27"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12580441/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-05-08DOI: 10.1186/s44330-025-00030-x
Allie Ivy, Shelby N Bess, Shilpi Agrawal, Varun Kochar, Abbey L Stokes, Timothy J Muldoon, Christopher E Nelson
Background: Macrophages are a promising target for therapeutics in various applications such as regenerative medicine and immunotherapy for cancer. Due to their plastic nature, macrophages can switch from a non-activated state to activated with the smallest environmental change. For macrophages to be effective in their respective applications, screening for phenotypic changes is necessary to elucidate the cell response to different delivery vehicles, vaccines, small molecules, and other stimuli.
Methods: We created a sensitive and dynamic high-throughput screening method for macrophages based on the activation of NF-κB. For this reporter, we placed an mRFP1 fluorescence gene under the control of an inflammatory promoter, which recruits NF-κB response elements to promote expression during the inflammatory response in macrophages. We characterized the inflammatory reporter based on key markers of an inflammatory response in macrophages including TNF-α cytokine release and immunostaining for inflammatory and non-inflammatory cell surface markers. We compared gene delivery and inflammation of several clinically relevant viral vehicles and commercially available non-viral vehicles. Statistical analysis between groups was performed with a one-way ANOVA with post-hoc Tukey's test.
Results: The reporter macrophages demonstrated a dynamic range after LPS stimulation with an EC50 of 0.61 ng/mL that was highly predictive of TNF-α release. Flow cytometry revealed heterogeneity between groups but confirmed population level shifts in pro-inflammatory markers. Finally, we demonstrated utility of the reporter by showing divergent effects with various leading gene delivery vehicles.
Discussion: This screening technique developed here provides a dynamic, high-throughput screening technique for determining inflammatory response by mouse macrophages to specific stimuli. The method presented here provides insight into the inflammatory response in mouse macrophages to different viral and non-viral gene delivery methods and provides a tool for high-throughput screening of novel vehicles.
Supplementary information: The online version contains supplementary material available at 10.1186/s44330-025-00030-x.
{"title":"A dual-fluorescence assay for gene delivery vehicle screening in macrophages with an inflammation-inducible reporter construct.","authors":"Allie Ivy, Shelby N Bess, Shilpi Agrawal, Varun Kochar, Abbey L Stokes, Timothy J Muldoon, Christopher E Nelson","doi":"10.1186/s44330-025-00030-x","DOIUrl":"10.1186/s44330-025-00030-x","url":null,"abstract":"<p><strong>Background: </strong>Macrophages are a promising target for therapeutics in various applications such as regenerative medicine and immunotherapy for cancer. Due to their plastic nature, macrophages can switch from a non-activated state to activated with the smallest environmental change. For macrophages to be effective in their respective applications, screening for phenotypic changes is necessary to elucidate the cell response to different delivery vehicles, vaccines, small molecules, and other stimuli.</p><p><strong>Methods: </strong>We created a sensitive and dynamic high-throughput screening method for macrophages based on the activation of NF-κB. For this reporter, we placed an mRFP1 fluorescence gene under the control of an inflammatory promoter, which recruits NF-κB response elements to promote expression during the inflammatory response in macrophages. We characterized the inflammatory reporter based on key markers of an inflammatory response in macrophages including TNF-α cytokine release and immunostaining for inflammatory and non-inflammatory cell surface markers. We compared gene delivery and inflammation of several clinically relevant viral vehicles and commercially available non-viral vehicles. Statistical analysis between groups was performed with a one-way ANOVA with post-hoc Tukey's test.</p><p><strong>Results: </strong>The reporter macrophages demonstrated a dynamic range after LPS stimulation with an EC50 of 0.61 ng/mL that was highly predictive of TNF-α release. Flow cytometry revealed heterogeneity between groups but confirmed population level shifts in pro-inflammatory markers. Finally, we demonstrated utility of the reporter by showing divergent effects with various leading gene delivery vehicles.</p><p><strong>Discussion: </strong>This screening technique developed here provides a dynamic, high-throughput screening technique for determining inflammatory response by mouse macrophages to specific stimuli. The method presented here provides insight into the inflammatory response in mouse macrophages to different viral and non-viral gene delivery methods and provides a tool for high-throughput screening of novel vehicles.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1186/s44330-025-00030-x.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12062070/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144056919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-08-01DOI: 10.1186/s44330-025-00037-4
Brendan Miller, Eduardo Vieira de Souza, Victor J Pai, Hosung Kim, Joan M Vaughan, Calvin J Lau, Jolene K Diedrich, Alan Saghatelian
<p><strong>Background: </strong>The human genome contains over 3 million small open reading frames (smORFs, <i>≤</i> 150 codons). Ribosome profiling and proteogenomics transformed our understanding of these sequences by showing that thousands are actively translated, and hundreds produce detectable peptides by mass spectrometry. However, the random arrangement of codons across the 3-gigabase human genome naturally generates smORFs by chance, suggesting many may represent translational noise or regulatory elements rather than functional proteins. This is supported by the fact that most translating smORFs occur in upstream open reading frames (uORFs), which typically regulate translation of canonical coding sequences rather than encode bioactive microproteins. As interest grows in uncovering biologically meaningful microproteins, a key challenge remains: distinguishing functional smORFs from non-functional or regulatory translation products. Although empirical methods such as individual microprotein studies or large-scale screens can help, these approaches are time-consuming, expensive, and come with technical limitations. New complementary strategies are needed.</p><p><strong>Methods: </strong>To address this challenge, we developed ShortStop, a computational framework based on the idea that not all translating smORFs produce functional proteins, but the ones that do may resemble experimentally characterized microproteins. ShortStop classifies smORFs into two reference groups: Swiss-Prot Analog Microproteins (SAMs), which resemble known microproteins, and PRISMs (Physicochemically Resembling In Silico Microproteins), which are synthetic sequences designed to match the composition of translating smORFs but lacking sequence order or evolutionary selection, and therefore serving as a proxy for non-functional peptides. This two-class system enables machine learning to help prioritize smORFs for downstream study.</p><p><strong>Results: </strong>ShortStop achieved high precision (90-94%), recall (87-96%), and F1 scores (90-93%) across all classes. When applied to a published dataset of translating smORFs, ShortStop classified about 8% as candidates with biochemical properties resembling Swiss-Prot microproteins (i.e., called SAMs). The remaining 92% resembled in silico generated sequences (i.e., called PRISMs), representing noncanonical proteins, non-functional peptides, or regulatory translation events. SAMs showed lower C-terminal hydrophobicity-linked to reduced proteasomal degradation-and greater N-terminal hydrophilicity at neutral pH, suggesting improved solubility and intracellular stability. ShortStop also identified microproteins overlooked by other methods, including one encoded by an upstream overlapping smORF in the StAR gene, which was detectable in human cells and steroid-producing tissues. In a clinical lung cancer dataset, ShortStop uncovered differentially expressed microprotein candidates, several of which were validated by mass spectr
背景:人类基因组包含超过300万个小开放阅读框(smorf,≤150个密码子)。核糖体分析和蛋白质基因组学改变了我们对这些序列的理解,通过质谱分析显示,数千个被积极翻译,数百个产生可检测的肽。然而,在人类基因组中,密码子的随机排列自然会偶然产生smorf,这表明许多smorf可能代表翻译噪声或调节元件,而不是功能蛋白。大多数翻译smorf发生在上游开放阅读框(uorf)中,这一事实支持了这一观点,uorf通常调节规范编码序列的翻译,而不是编码生物活性微蛋白。随着人们对发现具有生物学意义的微蛋白的兴趣日益增长,一个关键的挑战仍然存在:区分功能性smorf与非功能性或调节性翻译产物。尽管个体微蛋白研究或大规模筛选等经验方法可以提供帮助,但这些方法耗时、昂贵,并且存在技术限制。需要新的补充战略。方法:为了解决这一挑战,我们开发了ShortStop,这是一个基于并非所有翻译smorf都产生功能蛋白的想法的计算框架,但那些具有功能的smorf可能类似于实验表征的微蛋白。ShortStop将smorf分为两个参考组:类似于已知微蛋白的Swiss-Prot Analog Microproteins (SAMs)和类似于硅微蛋白的PRISMs(物理化学上类似于硅微蛋白),这是一种合成序列,旨在匹配翻译smorf的组成,但缺乏序列顺序或进化选择,因此作为非功能肽的替代品。这种两级系统使机器学习能够帮助优先考虑下游研究的smorf。结果:游击手在所有类别中具有较高的准确率(90-94%)、召回率(87-96%)和F1分数(90-93%)。当应用于已发表的翻译smorf数据集时,ShortStop将大约8%的候选基因分类为具有类似Swiss-Prot微蛋白(即SAMs)的生化特性。剩下的92%类似于计算机生成的序列(即称为PRISMs),代表非规范蛋白、非功能肽或调节翻译事件。在中性pH下,SAMs表现出较低的c端疏水性(与降低蛋白酶体降解有关)和较强的n端亲水性,表明其溶解度和细胞内稳定性得到改善。ShortStop还发现了被其他方法忽略的微蛋白,包括由StAR基因上游重叠的smORF编码的一种蛋白,这种蛋白在人类细胞和产生类固醇的组织中可以检测到。在临床肺癌数据集中,ShortStop发现了差异表达的候选微蛋白,其中一些通过质谱验证。讨论:ShortStop解决了微蛋白研究中的一个关键空白-缺乏可扩展的工具来表征微蛋白和标准化的负训练数据来训练微蛋白的机器学习模型。通过提供基于生化特征的分类框架,ShortStop为功能研究中的smorf靶向提供了实用的解决方案,为新发现工具的基准测试和推进微蛋白研究提供了实用的解决方案。补充资料:在线版本包含补充资料,下载地址:10.1186/s44330-025-00037-4。
{"title":"ShortStop: a machine learning framework for microprotein discovery.","authors":"Brendan Miller, Eduardo Vieira de Souza, Victor J Pai, Hosung Kim, Joan M Vaughan, Calvin J Lau, Jolene K Diedrich, Alan Saghatelian","doi":"10.1186/s44330-025-00037-4","DOIUrl":"10.1186/s44330-025-00037-4","url":null,"abstract":"<p><strong>Background: </strong>The human genome contains over 3 million small open reading frames (smORFs, <i>≤</i> 150 codons). Ribosome profiling and proteogenomics transformed our understanding of these sequences by showing that thousands are actively translated, and hundreds produce detectable peptides by mass spectrometry. However, the random arrangement of codons across the 3-gigabase human genome naturally generates smORFs by chance, suggesting many may represent translational noise or regulatory elements rather than functional proteins. This is supported by the fact that most translating smORFs occur in upstream open reading frames (uORFs), which typically regulate translation of canonical coding sequences rather than encode bioactive microproteins. As interest grows in uncovering biologically meaningful microproteins, a key challenge remains: distinguishing functional smORFs from non-functional or regulatory translation products. Although empirical methods such as individual microprotein studies or large-scale screens can help, these approaches are time-consuming, expensive, and come with technical limitations. New complementary strategies are needed.</p><p><strong>Methods: </strong>To address this challenge, we developed ShortStop, a computational framework based on the idea that not all translating smORFs produce functional proteins, but the ones that do may resemble experimentally characterized microproteins. ShortStop classifies smORFs into two reference groups: Swiss-Prot Analog Microproteins (SAMs), which resemble known microproteins, and PRISMs (Physicochemically Resembling In Silico Microproteins), which are synthetic sequences designed to match the composition of translating smORFs but lacking sequence order or evolutionary selection, and therefore serving as a proxy for non-functional peptides. This two-class system enables machine learning to help prioritize smORFs for downstream study.</p><p><strong>Results: </strong>ShortStop achieved high precision (90-94%), recall (87-96%), and F1 scores (90-93%) across all classes. When applied to a published dataset of translating smORFs, ShortStop classified about 8% as candidates with biochemical properties resembling Swiss-Prot microproteins (i.e., called SAMs). The remaining 92% resembled in silico generated sequences (i.e., called PRISMs), representing noncanonical proteins, non-functional peptides, or regulatory translation events. SAMs showed lower C-terminal hydrophobicity-linked to reduced proteasomal degradation-and greater N-terminal hydrophilicity at neutral pH, suggesting improved solubility and intracellular stability. ShortStop also identified microproteins overlooked by other methods, including one encoded by an upstream overlapping smORF in the StAR gene, which was detectable in human cells and steroid-producing tissues. In a clinical lung cancer dataset, ShortStop uncovered differentially expressed microprotein candidates, several of which were validated by mass spectr","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"16"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313729/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144777561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Collaborative efforts, such as the Human Cell Atlas, are rapidly accumulating large amounts of single-cell data. To ensure that single-cell atlases are representative of human genetic diversity, we need to determine the ancestry of the donors from whom single-cell data are generated. Self-reporting of race and ethnicity, although important, can be biased and is not always available for the datasets already collected.
Methods: Here, we introduce scAI-SNP, a tool to infer ancestry directly from single-cell genomics data. To train scAI-SNP, we identified 4.5 million ancestry-informative single-nucleotide polymorphisms (SNPs) in the 1000 Genomes Project dataset across 3201 individuals from 26 population groups. For a query single-cell dataset, scAI-SNP uses these ancestry-informative SNPs to compute the contribution of each of the 26 population groups to the ancestry of the donor from whom the cells were obtained.
Results: Using diverse single-cell datasets with matched whole-genome sequencing data, we show that scAI-SNP is robust to the sparsity of single-cell data, can accurately and consistently infer ancestry from samples derived from diverse types of tissues and cancer cells, and can be applied to different modalities of single-cell profiling assays, such as single-cell RNA-seq and single-cell ATAC-seq.
Discussion: Finally, we argue that ensuring that single-cell atlases represent diverse ancestry, ideally alongside race and ethnicity, is ultimately important for improved and equitable health outcomes by accounting for human diversity.
Supplementary information: The online version contains supplementary material available at 10.1186/s44330-025-00029-4.
{"title":"scAI-SNP: a method for inferring ancestry from single-cell data.","authors":"Sung Chul Hong, Francesc Muyas, Isidro Cortés-Ciriano, Sahand Hormoz","doi":"10.1186/s44330-025-00029-4","DOIUrl":"10.1186/s44330-025-00029-4","url":null,"abstract":"<p><strong>Background: </strong>Collaborative efforts, such as the Human Cell Atlas, are rapidly accumulating large amounts of single-cell data. To ensure that single-cell atlases are representative of human genetic diversity, we need to determine the ancestry of the donors from whom single-cell data are generated. Self-reporting of race and ethnicity, although important, can be biased and is not always available for the datasets already collected.</p><p><strong>Methods: </strong>Here, we introduce scAI-SNP, a tool to infer ancestry directly from single-cell genomics data. To train scAI-SNP, we identified 4.5 million ancestry-informative single-nucleotide polymorphisms (SNPs) in the 1000 Genomes Project dataset across 3201 individuals from 26 population groups. For a query single-cell dataset, scAI-SNP uses these ancestry-informative SNPs to compute the contribution of each of the 26 population groups to the ancestry of the donor from whom the cells were obtained.</p><p><strong>Results: </strong>Using diverse single-cell datasets with matched whole-genome sequencing data, we show that scAI-SNP is robust to the sparsity of single-cell data, can accurately and consistently infer ancestry from samples derived from diverse types of tissues and cancer cells, and can be applied to different modalities of single-cell profiling assays, such as single-cell RNA-seq and single-cell ATAC-seq.</p><p><strong>Discussion: </strong>Finally, we argue that ensuring that single-cell atlases represent diverse ancestry, ideally alongside race and ethnicity, is ultimately important for improved and equitable health outcomes by accounting for human diversity.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1186/s44330-025-00029-4.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"10"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12089154/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144121847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Blood proteome analysis is becoming increasingly popular in veterinary research because many animal models have been used to study a range of human diseases. Most of the commercial high-abundance protein (HAP) depletion kits are optimized for human serum, whereas proteins in animal serum may not be present in human serum or may be present at different abundances. There are no previous studies investigating the efficacy of using various HAP kits for proteome analysis of animal serums.
Method: We used three commercial serum abundant protein depletion (SAPD) kits (i.e., ion exchange-based Norgen kit (ProteoSpin™), antibody-based Thermo Albumin Depletion Kit (Pierce™), solubility-based kit (Minute™), and a cost-effective method (i.e., perchloric acid (PerCA) HAP precipitation) to assess their effectiveness to process serums from five different species (i.e., mouse, chicken, dog, goat, and guinea pig). Protocols of the commercial kits were adopted from manufacturers' guidelines with minor modifications for optimized performance. Following HAP depletion, proteins from all species were digested using a Trypsin/Lys-C enzyme mix, desalted, and subjected to label-free quantitative bottom-up proteomics analysis via liquid chromatography-tandem mass spectrometry (LC-MS/MS). The raw data were processed using the Andromeda search engine integrated into MaxQuant, and peptide identification was performed by searching against the UniProt-reviewed protein database. Advanced bioinformatics tools were employed to facilitate data analysis and visualization, ensuring comprehensive interpretation of the depletion efficiency and comparative performance of the methods across species.
Result: We determined their capabilities of protein identification (Norgen kit > Minute kit > PerCA precipitation > Thermo kit), depletion efficiencies of HAPs (Minute kit > Norgen kit > PerCA precipitation > Thermo kit), and cost-effectiveness (PerCA precipitation > Minute kit > Norgen kit > Thermo kit). Our results show that the PerCA precipitation method, which is > 20 times cheaper than commercial kits, outperforms other methods in depleting HAPs, especially in mouse serum. While Norgen kit excels in mouse and goat serum, the PerCA precipitation method offers broader applicability and reveals unique low abundant proteins. Protein pathway analysis highlights distinct biological processes affected by different depletion methods.
Discussion: Overall, our studies provide valuable insights into protein depletion techniques, with the PerCA depletion method emerging as a cost-effective and versatile option for proteomics research across various serums.
{"title":"A cross-species proteomic assessment of cost-effective platforms for depleting high-abundant proteins from blood serum.","authors":"Zongkai Peng, Shakya Wije Munige, Deepti Bhusal, Isabella L Yang, Zhibo Yang, Nagib Ahsan","doi":"10.1186/s44330-025-00042-7","DOIUrl":"10.1186/s44330-025-00042-7","url":null,"abstract":"<p><strong>Background: </strong>Blood proteome analysis is becoming increasingly popular in veterinary research because many animal models have been used to study a range of human diseases. Most of the commercial high-abundance protein (HAP) depletion kits are optimized for human serum, whereas proteins in animal serum may not be present in human serum or may be present at different abundances. There are no previous studies investigating the efficacy of using various HAP kits for proteome analysis of animal serums.</p><p><strong>Method: </strong>We used three commercial serum abundant protein depletion (SAPD) kits (i.e., ion exchange-based Norgen kit (ProteoSpin™), antibody-based Thermo Albumin Depletion Kit (Pierce™), solubility-based kit (Minute™), and a cost-effective method (i.e., perchloric acid (PerCA) HAP precipitation) to assess their effectiveness to process serums from five different species (i.e., mouse, chicken, dog, goat, and guinea pig). Protocols of the commercial kits were adopted from manufacturers' guidelines with minor modifications for optimized performance. Following HAP depletion, proteins from all species were digested using a Trypsin/Lys-C enzyme mix, desalted, and subjected to label-free quantitative bottom-up proteomics analysis via liquid chromatography-tandem mass spectrometry (LC-MS/MS). The raw data were processed using the Andromeda search engine integrated into MaxQuant, and peptide identification was performed by searching against the UniProt-reviewed protein database. Advanced bioinformatics tools were employed to facilitate data analysis and visualization, ensuring comprehensive interpretation of the depletion efficiency and comparative performance of the methods across species.</p><p><strong>Result: </strong>We determined their capabilities of protein identification (Norgen kit > Minute kit > PerCA precipitation > Thermo kit), depletion efficiencies of HAPs (Minute kit > Norgen kit > PerCA precipitation > Thermo kit), and cost-effectiveness (PerCA precipitation > Minute kit > Norgen kit > Thermo kit). Our results show that the PerCA precipitation method, which is > 20 times cheaper than commercial kits, outperforms other methods in depleting HAPs, especially in mouse serum. While Norgen kit excels in mouse and goat serum, the PerCA precipitation method offers broader applicability and reveals unique low abundant proteins. Protein pathway analysis highlights distinct biological processes affected by different depletion methods.</p><p><strong>Discussion: </strong>Overall, our studies provide valuable insights into protein depletion techniques, with the PerCA depletion method emerging as a cost-effective and versatile option for proteomics research across various serums.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"21"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12450807/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145133345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-07-01DOI: 10.1186/s44330-025-00034-7
Mary E W Collier, Natalie Allcock, Nicolas Sylvius, Jordan Cassidy, Flaviano Giorgini
Background: The isolation of neuron-derived extracellular vesicles (nEVs) from biofluids offers the potential to discover novel biomarkers to aid in diagnosis and treatment of psychiatric and neurodegenerative diseases. A few studies have used anti-NCAM antibody-bead-based immunocapture to enrich nEVs from plasma, some with little method validation. We therefore examined in detail this method for nEV enrichment.
Methods: EVs were isolated from SH-SY5Y cell-conditioned media by precipitation, or from plasma using size exclusion chromatography. EVs were characterised using nanoparticle tracking analysis (NTA), transmission electron microscopy (TEM) and immunoblot analysis. SH-SY5Y-EVs were incubated with anti-NCAM immunocapture beads and examined by flow cytometry, immunoblot analysis and scanning electron microscopy (SEM). Immunocaptured plasma-derived EVs were examined using a sensitive NCAM ELISA, SEM and qPCR for miRNAs.
Results: Characterisation of SH-SY5Y-derived and plasma-derived EVs revealed the expected size distributions of EVs using NTA, the presence of EV markers using immunoblot analysis, and a cup-shaped morphology using TEM. Anti-NCAM beads, but not anti-L1CAM or IgG beads, captured NCAM-positive SH-SY5Y-EVs as shown by flow cytometry and immunoblot analysis. Both SH-SY5Y and plasma-derived EVs were visualised on the surface of anti-NCAM immunocapture beads using SEM. A sensitive NCAM ELISA detected NCAM antigen in plasma-derived EVs immunocaptured on anti-NCAM beads. qPCR analysis of plasma-derived EVs detected many miRNAs in total plasma-EVs with high expression of hsa-miR-16-5p, hsa-miR-451a and hsa-miR-126-3p. However, only between two and seven miRNAs were detected in EVs captured on anti-NCAM-beads from three blood donors. Finally, tissue distribution analysis of miRNAs from plasma-derived EVs on anti-NCAM beads revealed that these miRNAs are enriched in tissues or organs such as blood vessels, lung, bone, thyroid and heart, but were not enriched for brain-derived miRNAs.
Discussion: This study indicates that anti-NCAM beads can efficiently enrich NCAM-positive EVs from cell culture conditioned media. However, nEV levels in small volumes of plasma are possibly too low to enable efficient anti-NCAM immunocapture for subsequent miRNA analysis. Other neuron-specific markers with high expression levels on nEVs are therefore required for processing patient samples where plasma volumes are low, and to allow efficient isolation of nEVs in clinical studies for subsequent cargo analysis.
Supplementary information: The online version contains supplementary material available at 10.1186/s44330-025-00034-7.
{"title":"Examination of the enrichment of neuronal extracellular vesicles from cell conditioned media and human plasma using an anti-NCAM immunocapture bead approach.","authors":"Mary E W Collier, Natalie Allcock, Nicolas Sylvius, Jordan Cassidy, Flaviano Giorgini","doi":"10.1186/s44330-025-00034-7","DOIUrl":"10.1186/s44330-025-00034-7","url":null,"abstract":"<p><strong>Background: </strong>The isolation of neuron-derived extracellular vesicles (nEVs) from biofluids offers the potential to discover novel biomarkers to aid in diagnosis and treatment of psychiatric and neurodegenerative diseases. A few studies have used anti-NCAM antibody-bead-based immunocapture to enrich nEVs from plasma, some with little method validation. We therefore examined in detail this method for nEV enrichment.</p><p><strong>Methods: </strong>EVs were isolated from SH-SY5Y cell-conditioned media by precipitation, or from plasma using size exclusion chromatography. EVs were characterised using nanoparticle tracking analysis (NTA), transmission electron microscopy (TEM) and immunoblot analysis. SH-SY5Y-EVs were incubated with anti-NCAM immunocapture beads and examined by flow cytometry, immunoblot analysis and scanning electron microscopy (SEM). Immunocaptured plasma-derived EVs were examined using a sensitive NCAM ELISA, SEM and qPCR for miRNAs.</p><p><strong>Results: </strong>Characterisation of SH-SY5Y-derived and plasma-derived EVs revealed the expected size distributions of EVs using NTA, the presence of EV markers using immunoblot analysis, and a cup-shaped morphology using TEM. Anti-NCAM beads, but not anti-L1CAM or IgG beads, captured NCAM-positive SH-SY5Y-EVs as shown by flow cytometry and immunoblot analysis. Both SH-SY5Y and plasma-derived EVs were visualised on the surface of anti-NCAM immunocapture beads using SEM. A sensitive NCAM ELISA detected NCAM antigen in plasma-derived EVs immunocaptured on anti-NCAM beads. qPCR analysis of plasma-derived EVs detected many miRNAs in total plasma-EVs with high expression of hsa-miR-16-5p, hsa-miR-451a and hsa-miR-126-3p. However, only between two and seven miRNAs were detected in EVs captured on anti-NCAM-beads from three blood donors. Finally, tissue distribution analysis of miRNAs from plasma-derived EVs on anti-NCAM beads revealed that these miRNAs are enriched in tissues or organs such as blood vessels, lung, bone, thyroid and heart, but were not enriched for brain-derived miRNAs.</p><p><strong>Discussion: </strong>This study indicates that anti-NCAM beads can efficiently enrich NCAM-positive EVs from cell culture conditioned media. However, nEV levels in small volumes of plasma are possibly too low to enable efficient anti-NCAM immunocapture for subsequent miRNA analysis. Other neuron-specific markers with high expression levels on nEVs are therefore required for processing patient samples where plasma volumes are low, and to allow efficient isolation of nEVs in clinical studies for subsequent cargo analysis.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1186/s44330-025-00034-7.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"12"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12209398/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144556422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-04-01DOI: 10.1186/s44330-025-00023-w
Sarah Lecinski, Jamieson A L Howard, Chris MacDonald, Mark C Leake
Background: Cells employ myriad regulatory mechanisms to maintain protein homeostasis, termed proteostasis, to ensure correct cellular function. Dysregulation of proteostasis, which is often induced by physiological stress and ageing, often results in protein aggregation in cells. These aggregated structures can perturb normal physiological function, compromising cell integrity and viability, a prime example being early onset of several neurodegenerative diseases. Understanding aggregate dynamics in vivo is therefore of strong interest for biomedicine and pharmacology. However, factors involved in formation, distribution and clearance of intracellular aggregates are not fully understood.
Methods: Here, we report an improved methodology for production of fluorescent aggregates in model budding yeast which can be detected, tracked and quantified using fluorescence microscopy in live cells. This new openly-available technology, iPAR (inducible Protein Aggregation Reporter), involves monomeric fluorescent protein reporters fused to a ∆ssCPY* aggregation biomarker, with expression controlled under the copper-regulated CUP1 promoter.
Results: Monomeric tags overcome challenges associated with non-physiological reporter aggregation, whilst CUP1 provides more precise control of protein production. We show that iPAR and the associated bioimaging methodology enables quantitative study of cytoplasmic aggregate kinetics and inheritance features in vivo. We demonstrate that iPAR can be used with traditional epifluorescence and confocal microscopy as well as single-molecule precise Slimfield millisecond microscopy. Our results indicate that cytoplasmic aggregates are mobile and contain a broad range of number of iPAR molecules, from tens to several hundred per aggregate, whose mean value increases with extracellular hyperosmotic stress.
Discussion: Time lapse imaging shows that although larger iPAR aggregates associate with nuclear and vacuolar compartments, we show directly, for the first time, that these proteotoxic accumulations are not inherited by daughter cells, unlike nuclei and vacuoles. If suitably adapted, iPAR offers new potential for studying diseases relating to protein oligomerization processes in other model cellular systems.
Supplementary information: The online version contains supplementary material available at 10.1186/s44330-025-00023-w.
背景:细胞采用多种调节机制来维持蛋白质稳态,称为蛋白质稳态,以确保正确的细胞功能。蛋白质平衡失调通常由生理应激和衰老引起,通常导致细胞内蛋白质聚集。这些聚集的结构可以扰乱正常的生理功能,损害细胞的完整性和活力,一个主要的例子是一些神经退行性疾病的早期发病。因此,生物医学和药理学对体内聚合动力学有着浓厚的兴趣。然而,与细胞内聚集体的形成、分布和清除有关的因素尚不完全清楚。方法:在这里,我们报告了一种改进的方法,用于在模型出芽酵母中生产荧光聚集体,可以在活细胞中使用荧光显微镜检测,跟踪和定量。这项新技术名为iPAR (inducible Protein Aggregation Reporter,诱导型蛋白聚集报告因子),将单体荧光蛋白报告因子融合到一个∆ssCPY*聚集的生物标志物上,在铜调控的CUP1启动子下控制其表达。结果:单体标签克服了与非生理性报告聚集相关的挑战,而CUP1提供了更精确的蛋白质生产控制。我们表明iPAR和相关的生物成像方法能够定量研究体内细胞质聚集动力学和遗传特征。我们证明了iPAR可以与传统的会聚荧光和共聚焦显微镜以及单分子精确细场毫秒显微镜一起使用。我们的研究结果表明,细胞质聚集体是可移动的,并且含有广泛数量的iPAR分子,每个聚集体从几十到几百个,其平均值随着细胞外高渗胁迫而增加。讨论:延时成像显示,尽管较大的iPAR聚集体与细胞核和液泡室有关,但我们首次直接显示,这些蛋白质毒性聚集体不像细胞核和液泡那样由子细胞遗传。如果适当调整,iPAR为研究其他模型细胞系统中与蛋白质寡聚化过程相关的疾病提供了新的潜力。补充信息:在线版本包含补充信息,获取地址:10.1186/s44330-025-00023-w。
{"title":"iPAR: a new reporter for eukaryotic cytoplasmic protein aggregation.","authors":"Sarah Lecinski, Jamieson A L Howard, Chris MacDonald, Mark C Leake","doi":"10.1186/s44330-025-00023-w","DOIUrl":"10.1186/s44330-025-00023-w","url":null,"abstract":"<p><strong>Background: </strong>Cells employ myriad regulatory mechanisms to maintain protein homeostasis, termed proteostasis, to ensure correct cellular function. Dysregulation of proteostasis, which is often induced by physiological stress and ageing, often results in protein aggregation in cells. These aggregated structures can perturb normal physiological function, compromising cell integrity and viability, a prime example being early onset of several neurodegenerative diseases. Understanding aggregate dynamics <i>in vivo</i> is therefore of strong interest for biomedicine and pharmacology. However, factors involved in formation, distribution and clearance of intracellular aggregates are not fully understood.</p><p><strong>Methods: </strong>Here, we report an improved methodology for production of fluorescent aggregates in model budding yeast which can be detected, tracked and quantified using fluorescence microscopy in live cells. This new openly-available technology, iPAR (inducible Protein Aggregation Reporter), involves monomeric fluorescent protein reporters fused to a ∆ssCPY* aggregation biomarker, with expression controlled under the copper-regulated <i>CUP1</i> promoter.</p><p><strong>Results: </strong>Monomeric tags overcome challenges associated with non-physiological reporter aggregation, whilst <i>CUP1</i> provides more precise control of protein production. We show that iPAR and the associated bioimaging methodology enables quantitative study of cytoplasmic aggregate kinetics and inheritance features <i>in vivo</i>. We demonstrate that iPAR can be used with traditional epifluorescence and confocal microscopy as well as single-molecule precise Slimfield millisecond microscopy. Our results indicate that cytoplasmic aggregates are mobile and contain a broad range of number of iPAR molecules, from tens to several hundred per aggregate, whose mean value increases with extracellular hyperosmotic stress.</p><p><strong>Discussion: </strong>Time lapse imaging shows that although larger iPAR aggregates associate with nuclear and vacuolar compartments, we show directly, for the first time, that these proteotoxic accumulations are not inherited by daughter cells, unlike nuclei and vacuoles. If suitably adapted, iPAR offers new potential for studying diseases relating to protein oligomerization processes in other model cellular systems.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1186/s44330-025-00023-w.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11958454/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143775259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-10-06DOI: 10.1186/s44330-025-00041-8
Heming Zhang, Dekang Cao, Tim Xu, Emily Chen, Guangfu Li, Yixin Chen, Philip Payne, Michael Province, Fuhai Li
Multi-omic dataset can better characterize complex cellular signaling pathways from multiple views compared to individual omic data. However, integrative multi-omic data analysis to rank key disease biomarkers and infer core signaling pathways remains an open problem. In this study, we developed a novel graph AI model, mosGraphFlow, for analyzing multi-omic signaling graphs (mosGraphs), 2) analyzed multi-omic mosGraph datasets of Alzheimers' Disease (AD), and 3) developed a visualization tool to facilitate the visualization of identified disease associated signaling biomarkers and network. The comparison results show that the proposed model not only achieves the best classification accuracy but also identifies important AD disease biomarkers and signaling interactions. In the visualization, the signaling sources are highlighted at specific omic levels to facilitate the understanding of disease pathogenesis. The proposed model can also be applied and expanded for other multi-omic data-driven studies. The code of the model is publicly accessible via GitHub: https://github.com/FuhaiLiAiLab/mosGraphFlow.
Supplementary information: The online version contains supplementary material available at 10.1186/s44330-025-00041-8.
{"title":"MosGraphFlow: a novel integrative graph AI model mining signaling targets from multi-omic data.","authors":"Heming Zhang, Dekang Cao, Tim Xu, Emily Chen, Guangfu Li, Yixin Chen, Philip Payne, Michael Province, Fuhai Li","doi":"10.1186/s44330-025-00041-8","DOIUrl":"10.1186/s44330-025-00041-8","url":null,"abstract":"<p><p>Multi-omic dataset can better characterize complex cellular signaling pathways from multiple views compared to individual omic data. However, integrative multi-omic data analysis to rank key disease biomarkers and infer core signaling pathways remains an open problem. In this study, we developed a novel graph AI model, mosGraphFlow, for analyzing multi-omic signaling graphs (mosGraphs), 2) analyzed multi-omic mosGraph datasets of Alzheimers' Disease (AD), and 3) developed a visualization tool to facilitate the visualization of identified disease associated signaling biomarkers and network. The comparison results show that the proposed model not only achieves the best classification accuracy but also identifies important AD disease biomarkers and signaling interactions. In the visualization, the signaling sources are highlighted at specific omic levels to facilitate the understanding of disease pathogenesis. The proposed model can also be applied and expanded for other multi-omic data-driven studies. The code of the model is publicly accessible via GitHub: https://github.com/FuhaiLiAiLab/mosGraphFlow.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1186/s44330-025-00041-8.</p>","PeriodicalId":519945,"journal":{"name":"BMC methods","volume":"2 1","pages":"23"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12497674/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145246323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}