Integrating non-Euclidean brain imaging data with Euclidean tabular data, such as clinical and demographic information, poses a substantial challenge for medical imaging analysis, particularly in forecasting future outcomes. While machine learning and deep learning techniques have been applied successfully to cross-sectional classification and prediction tasks, effectively forecasting outcomes in longitudinal imaging studies remains challenging. To address this challenge, we introduce a time-aware graph neural network model with transformer fusion (GNN-TF). This model flexibly integrates both tabular data and dynamic brain connectivity data, leveraging the temporal order of these variables within a coherent framework. By incorporating non-Euclidean and Euclidean sources of information from a longitudinal resting-state fMRI dataset from the National Consortium on Alcohol and Neurodevelopment in Adolescence (NCANDA), the GNN-TF enables a comprehensive analysis that captures critical aspects of longitudinal imaging data. Comparative analyses against a variety of established machine learning and deep learning models demonstrate that GNN-TF outperforms these state-of-the-art methods, delivering superior predictive accuracy for predicting future tobacco usage. The end-to-end, time-aware transformer fusion structure of the proposed GNN-TF model successfully integrates multiple data modalities and leverages temporal dynamics, making it a valuable analytic tool for functional brain imaging studies focused on clinical outcome prediction.
{"title":"Graph Neural Networks with Transformer Fusion of Brain Connectivity Dynamics and Tabular Data for Forecasting Future Tobacco Use.","authors":"Runzhi Zhou, Xi Luo","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Integrating non-Euclidean brain imaging data with Euclidean tabular data, such as clinical and demographic information, poses a substantial challenge for medical imaging analysis, particularly in forecasting future outcomes. While machine learning and deep learning techniques have been applied successfully to cross-sectional classification and prediction tasks, effectively forecasting outcomes in longitudinal imaging studies remains challenging. To address this challenge, we introduce a time-aware graph neural network model with transformer fusion (GNN-TF). This model flexibly integrates both tabular data and dynamic brain connectivity data, leveraging the temporal order of these variables within a coherent framework. By incorporating non-Euclidean and Euclidean sources of information from a longitudinal resting-state fMRI dataset from the National Consortium on Alcohol and Neurodevelopment in Adolescence (NCANDA), the GNN-TF enables a comprehensive analysis that captures critical aspects of longitudinal imaging data. Comparative analyses against a variety of established machine learning and deep learning models demonstrate that GNN-TF outperforms these state-of-the-art methods, delivering superior predictive accuracy for predicting future tobacco usage. The end-to-end, time-aware transformer fusion structure of the proposed GNN-TF model successfully integrates multiple data modalities and leverages temporal dynamics, making it a valuable analytic tool for functional brain imaging studies focused on clinical outcome prediction.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12772699/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145919009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Measurements of cell size dynamics have established the adder principle as a robust mechanism of cell size homeostasis. In this framework, cells add a nearly constant amount of size during each cell cycle, independent of their size at birth. Theoretical studies have shown that the adder principle can be achieved when cell-cycle progression is coupled to cell size. Here, we extend this framework by considering a general growth law modeled as a Hill-type function of cell size. This assumption introduces growth saturation to the model, such that very large cells grow approximately linearly rather than exponentially. Additionally, to capture the sequential nature of division, we implement a stochastic multi-step adder model in which cells progress through internal regulatory stages before dividing. From this model, we derive exact analytical expressions for the moments of cell size distributions. Our results show that stronger growth saturation increases the mean cell size in steady state, while slightly reducing fluctuations compared to exponential growth. Importantly, despite these changes, the adder property is preserved. This emphasizes that the reduction in size variability is a consequence of~the growth law rather than simple scaling with mean size. Finally, we analyze stochastic clonal proliferation and find that growth saturation influences both single-cell size statistics and variability across populations. Our results provide a generalized framework for connecting multi-step adder mechanisms with proliferation dynamics, extending size control theory beyond exponential growth.
{"title":"Stochastic multi-step cell size homeostasis model for cycling human cells.","authors":"Sayeh Rezaee, Cesar Nieto, Abhyudai Singh","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Measurements of cell size dynamics have established the adder principle as a robust mechanism of cell size homeostasis. In this framework, cells add a nearly constant amount of size during each cell cycle, independent of their size at birth. Theoretical studies have shown that the adder principle can be achieved when cell-cycle progression is coupled to cell size. Here, we extend this framework by considering a general growth law modeled as a Hill-type function of cell size. This assumption introduces growth saturation to the model, such that very large cells grow approximately linearly rather than exponentially. Additionally, to capture the sequential nature of division, we implement a stochastic multi-step adder model in which cells progress through internal regulatory stages before dividing. From this model, we derive exact analytical expressions for the moments of cell size distributions. Our results show that stronger growth saturation increases the mean cell size in steady state, while slightly reducing fluctuations compared to exponential growth. Importantly, despite these changes, the adder property is preserved. This emphasizes that the reduction in size variability is a consequence of~the growth law rather than simple scaling with mean size. Finally, we analyze stochastic clonal proliferation and find that growth saturation influences both single-cell size statistics and variability across populations. Our results provide a generalized framework for connecting multi-step adder mechanisms with proliferation dynamics, extending size control theory beyond exponential growth.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12772695/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145919191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Miguel E Wimbish, Nicole K Guittari, Victoria A Rose, Jorge L Rivera, Patricia K Rivlin, Mark A Hinton, Jordan K Matelsky, Nicole E Stock, Brock A Wester, Erik C Johnson, William R Gray-Roncal
High resolution volumetric neuroimaging datasets from electron microscopy (EM) and x-ray micro and holographic-nano tomography (XRM/XHN) are being generated at an increasing rate and by a growing number of research teams. These datasets are derived from an increasing number of species, in an increasing number of brain regions, and with an increasing number of techniques. Each of these large-scale datasets, often surpassing petascale levels, is typically accompanied by a unique and varied set of metadata. These datasets can be used to derive connectomes, or neuron-synapse level connectivity diagrams, to investigate the fundamental organization of neural circuitry, neuronal development, and neurodegenerative disease. Standardization is essential to facilitate comparative connectomics analysis and enhance data utilization. Although the neuroinformatics community has successfully established and adopted data standards for many modalities, this effort has not yet encompassed EM and XRM/ XHN connectomics data. This lack of standardization isolates these datasets, hindering their integration and comparison with other research performed in the field. Towards this end, our team formed a working group consisting of community stakeholders to develop Image and Experimental Metadata Standards for EM and XRM/XHN data to ensure the scientific impact and further motivate the generation and sharing of these data. This document addresses version 1.1 of these standards, aiming to support metadata services and future software designs for community collaboration. Standards for derived annotations are described in a companion document. Standards definitions are available on a community github page. We hope these standards will enable comparative analysis, improve interoperability between connectomics software tools, and continue to be refined and improved by the neuroinformatics community.
{"title":"EM and XRM Connectomics Imaging and Experimental Metadata Standards.","authors":"Miguel E Wimbish, Nicole K Guittari, Victoria A Rose, Jorge L Rivera, Patricia K Rivlin, Mark A Hinton, Jordan K Matelsky, Nicole E Stock, Brock A Wester, Erik C Johnson, William R Gray-Roncal","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>High resolution volumetric neuroimaging datasets from electron microscopy (EM) and x-ray micro and holographic-nano tomography (XRM/XHN) are being generated at an increasing rate and by a growing number of research teams. These datasets are derived from an increasing number of species, in an increasing number of brain regions, and with an increasing number of techniques. Each of these large-scale datasets, often surpassing petascale levels, is typically accompanied by a unique and varied set of metadata. These datasets can be used to derive connectomes, or neuron-synapse level connectivity diagrams, to investigate the fundamental organization of neural circuitry, neuronal development, and neurodegenerative disease. Standardization is essential to facilitate comparative connectomics analysis and enhance data utilization. Although the neuroinformatics community has successfully established and adopted data standards for many modalities, this effort has not yet encompassed EM and XRM/ XHN connectomics data. This lack of standardization isolates these datasets, hindering their integration and comparison with other research performed in the field. Towards this end, our team formed a working group consisting of community stakeholders to develop Image and Experimental Metadata Standards for EM and XRM/XHN data to ensure the scientific impact and further motivate the generation and sharing of these data. This document addresses version 1.1 of these standards, aiming to support metadata services and future software designs for community collaboration. Standards for derived annotations are described in a companion document. Standards definitions are available on a community github page. We hope these standards will enable comparative analysis, improve interoperability between connectomics software tools, and continue to be refined and improved by the neuroinformatics community.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12772707/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145918950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sean Campbell, Courtney C White, Amanda M Alexander, William Ott
Delay is an inherent feature of genetic regulatory networks. It represents the time required for the assembly of functional regulator proteins. The protein production process is complex, as it includes transcription, translocation, translation, folding, and oligomerization. Because these steps are noisy, the resulting delay associated with protein production is distributed (random). We here consider how distributed delay impacts the dynamics of bistable genetic circuits. We show that for a variety of genetic circuits that exhibit bistability, increasing the noise level in the delay distribution dramatically stabilizes the metastable states. By this we mean that mean residence times in the metastable states dramatically increase.
Relevance to life sciences: Bistable genetic regulatory networks are ubiquitous in living organisms. Evolutionary processes seem to have tuned such networks so that they switch between metastable states when it is important to do so, but small fluctuations do not cause unwanted switching. Understanding how evolution has tuned the stability of biological switches is an important problem. In particular, such understanding can guide the design of forward-engineered synthetic bistable genetic regulatory networks.
Mathematical content: We use two methods to explain this stabilization phenomenon. First, we introduce and simulate stochastic hybrid models that depend on a switching-rate parameter. These stochastic hybrid models allow us to unfold the distributed-delay models in the sense that, in certain cases, the distributed-delay model can be viewed as a fast-switching limit of the corresponding stochastic hybrid model. Second, we generalize the three-states model, a symbolic model of bistability, and analyze this extension.
{"title":"DISTRIBUTED DELAY STABILIZES BISTABLE GENETIC NETWORKS.","authors":"Sean Campbell, Courtney C White, Amanda M Alexander, William Ott","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Delay is an inherent feature of genetic regulatory networks. It represents the time required for the assembly of functional regulator proteins. The protein production process is complex, as it includes transcription, translocation, translation, folding, and oligomerization. Because these steps are noisy, the resulting delay associated with protein production is distributed (random). We here consider how distributed delay impacts the dynamics of bistable genetic circuits. We show that for a variety of genetic circuits that exhibit bistability, increasing the noise level in the delay distribution dramatically stabilizes the metastable states. By this we mean that mean residence times in the metastable states dramatically increase.</p><p><strong>Relevance to life sciences: </strong>Bistable genetic regulatory networks are ubiquitous in living organisms. Evolutionary processes seem to have tuned such networks so that they switch between metastable states when it is important to do so, but small fluctuations do not cause unwanted switching. Understanding how evolution has tuned the stability of biological switches is an important problem. In particular, such understanding can guide the design of forward-engineered synthetic bistable genetic regulatory networks.</p><p><strong>Mathematical content: </strong>We use two methods to explain this stabilization phenomenon. First, we introduce and simulate stochastic hybrid models that depend on a switching-rate parameter. These stochastic hybrid models allow us to unfold the distributed-delay models in the sense that, in certain cases, the distributed-delay model can be viewed as a fast-switching limit of the corresponding stochastic hybrid model. Second, we generalize the three-states model, a symbolic model of bistability, and analyze this extension.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12755252/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mojtaba Safari, Shansong Wang, Vanessa L Wildman, Mingzhe Hu, Zach Eidex, Chih-Wei Chang, Erik H Middlebrooks, Richard L J Qiu, Pretesh Patel, Ashesh B Jani, Hui Mao, Zhen Tian, Xiaofeng Yang
<p><strong>Background: </strong>High-resolution MRI is essential for accurate diagnosis and treatment planning, but its clinical acquisition is often constrained by long scanning times, which increase patient discomfort and reduce scanner throughput. While super-resolution (SR) techniques offer a post-acquisition solution to enhance resolution, existing deep learning approaches face trade-offs between reconstruction fidelity and computational efficiency, limiting their clinical applicability.</p><p><strong>Purpose: </strong>This study aims to develop an efficient and accurate deep learning framework for MRI super-resolution that preserves fine anatomical detail while maintaining low computational overhead, enabling practical integration into clinical workflows.</p><p><strong>Materials and methods: </strong>We propose a novel SR framework based on multi-head selective state-space models (MHSSM) integrated with a lightweight channel multilayer perceptron (MLP). The model employs 2D patch extraction with hybrid scanning strategies (vertical, horizontal, and diagonal) to capture long-range dependencies while mitigating pixel forgetting. Each MambaFormer block combines MHSSM, depthwise convolutions, and gated channel mixing to balance local and global feature representation. The framework was trained and evaluated on two distinct datasets: 7T brain T1 MP2RAGE maps (142 subjects) and 1.5T prostate T2w MRI (334 subjects). Performance was compared against multiple baselines including Bicubic interpolation, GAN-based (CycleGAN, Pix2pix, SPSR), transformer-based (SwinIR), Mamba-based (MambaIR), and diffusion-based (I<sup>2</sup>SB, Res-SRDiff) methods.</p><p><strong>Results: </strong>The proposed model demonstrated superior performance across all evaluation metrics while maintaining exceptional computational efficiency. On the 7T brain dataset, our method achieved the highest structural similarity (SSIM: 0.951 ± 0.021) and peak signal-to-noise ratio (PSNR: 26.90 ± 1.41 dB), along with the best perceptual quality scores (LPIPS: 0.076 ± 0.022; GMSD: 0.083 ± 0.017). These results represented statistically significant improvements over all baselines (<i>p <</i> 0.001), including a 2.1% SSIM gain over SPSR and a 2.4% PSNR improvement over Res-SRDiff. For the prostate dataset, the model similarly outperformed competing approaches, achieving SSIM of 0.770 ± 0.049, PSNR of 27.15 ± 2.19 dB, LPIPS of 0.190 ± 0.095, and GMSD of 0.087 ± 0.013. Notably, our framework accomplished these results with only 0.9 million parameters and 57 GFLOPs, representing reductions of 99.8% in parameters and 97.5% in computational operations compared to Res-SRDiff, while also substantially outperforming SwinIR and MambaIR in both accuracy and efficiency metrics.</p><p><strong>Conclusion: </strong>The proposed framework provides a computationally efficient yet accurate solution for MRI super-resolution, delivering well-defined anatomical details and improved perceptual fidelity across an
{"title":"Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning.","authors":"Mojtaba Safari, Shansong Wang, Vanessa L Wildman, Mingzhe Hu, Zach Eidex, Chih-Wei Chang, Erik H Middlebrooks, Richard L J Qiu, Pretesh Patel, Ashesh B Jani, Hui Mao, Zhen Tian, Xiaofeng Yang","doi":"","DOIUrl":"","url":null,"abstract":"<p><strong>Background: </strong>High-resolution MRI is essential for accurate diagnosis and treatment planning, but its clinical acquisition is often constrained by long scanning times, which increase patient discomfort and reduce scanner throughput. While super-resolution (SR) techniques offer a post-acquisition solution to enhance resolution, existing deep learning approaches face trade-offs between reconstruction fidelity and computational efficiency, limiting their clinical applicability.</p><p><strong>Purpose: </strong>This study aims to develop an efficient and accurate deep learning framework for MRI super-resolution that preserves fine anatomical detail while maintaining low computational overhead, enabling practical integration into clinical workflows.</p><p><strong>Materials and methods: </strong>We propose a novel SR framework based on multi-head selective state-space models (MHSSM) integrated with a lightweight channel multilayer perceptron (MLP). The model employs 2D patch extraction with hybrid scanning strategies (vertical, horizontal, and diagonal) to capture long-range dependencies while mitigating pixel forgetting. Each MambaFormer block combines MHSSM, depthwise convolutions, and gated channel mixing to balance local and global feature representation. The framework was trained and evaluated on two distinct datasets: 7T brain T1 MP2RAGE maps (142 subjects) and 1.5T prostate T2w MRI (334 subjects). Performance was compared against multiple baselines including Bicubic interpolation, GAN-based (CycleGAN, Pix2pix, SPSR), transformer-based (SwinIR), Mamba-based (MambaIR), and diffusion-based (I<sup>2</sup>SB, Res-SRDiff) methods.</p><p><strong>Results: </strong>The proposed model demonstrated superior performance across all evaluation metrics while maintaining exceptional computational efficiency. On the 7T brain dataset, our method achieved the highest structural similarity (SSIM: 0.951 ± 0.021) and peak signal-to-noise ratio (PSNR: 26.90 ± 1.41 dB), along with the best perceptual quality scores (LPIPS: 0.076 ± 0.022; GMSD: 0.083 ± 0.017). These results represented statistically significant improvements over all baselines (<i>p <</i> 0.001), including a 2.1% SSIM gain over SPSR and a 2.4% PSNR improvement over Res-SRDiff. For the prostate dataset, the model similarly outperformed competing approaches, achieving SSIM of 0.770 ± 0.049, PSNR of 27.15 ± 2.19 dB, LPIPS of 0.190 ± 0.095, and GMSD of 0.087 ± 0.013. Notably, our framework accomplished these results with only 0.9 million parameters and 57 GFLOPs, representing reductions of 99.8% in parameters and 97.5% in computational operations compared to Res-SRDiff, while also substantially outperforming SwinIR and MambaIR in both accuracy and efficiency metrics.</p><p><strong>Conclusion: </strong>The proposed framework provides a computationally efficient yet accurate solution for MRI super-resolution, delivering well-defined anatomical details and improved perceptual fidelity across an","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12755253/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Eugene H Lin, Yishun Zhou, Hsin-Yi Hung, Luke Moon, Andrew Gordus, Chen Li
One of the key functions of organisms is to sense their physical environment so that they can react upon the sensed information appropriately. All spiders can perceive their environment through vibration sensors in their legs, and most spiders rely on substrate-born vibration sensing to detect prey. Orb-weaving spiders primarily sense leg vibrations to detect and locate prey caught on their wheel-shaped webs. Biological experiments and computational modeling elucidated the physics of how these spiders use long-timescale web-building behaviors, which occur before prey capture, to modulate vibration sensing of prey by controlling web geometry, materials, and tension distribution. By contrast, the physics of how spiders use short-timescale leg behaviors to modulate vibration sensing on a web during prey capture is less known. This is in part due to challenges in biological experiments (e.g., having little control over spider behavior, difficulty measuring the whole spider-web-prey system vibrations) and theoretical/computation modeling (e.g., close-form equations intractable for a complex web, high computation cost for simulating vibrations with behaving animals). Here, we use robophysical modeling as a complementary approach to address these challenges and study how dynamic leg crouching behavior common in orb-weaving spiders contributes to vibration sensing of prey on a web. Following observations in the orb-weaver Uloborus diversus from a parallel biological study, we created a robophysical model consisting of a spider robot that can dynamically crouch its legs and sense its leg vibrations and a prey robot that can shake both on a horizontal physical wheel-shaped web. Without the prey robot, after each dynamic crouch, the spider robot sensed leg vibrations with only one dominant frequency-the natural frequency of itself passively vibrating on the web. With the prey robot, after each dynamic crouch, the spider robot sensed leg vibrations with two dominant frequencies-the additional higher frequency being the natural frequency of itself passively vibrating on its spiral thread induced by the spider robot's dynamic crouch. This additional frequency increased as the prey robot became closer from the web center where the spider robot was. These features allowed the spider robot to detect prey presence and distance. We developed a minimalistic physics model that decoupled the spider-web-prey system into two subsystems to explain these observations. Guided by both these results, we found evidence of the same physical mechanism appearing in the web of the U. diversus spider during prey capture in the data from the parallel biological study. Our work demonstrated that robophysical modeling is a useful approach for discovering physical mechanisms of how spiders use short-time scale leg behaviors to enhance vibration sensing of objects on a web and providing new biological hypotheses.
{"title":"Why orb-weaving spiders use leg crouching behavior in vibration sensing of prey on a web: A physical mechanism from robophysical modeling.","authors":"Eugene H Lin, Yishun Zhou, Hsin-Yi Hung, Luke Moon, Andrew Gordus, Chen Li","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>One of the key functions of organisms is to sense their physical environment so that they can react upon the sensed information appropriately. All spiders can perceive their environment through vibration sensors in their legs, and most spiders rely on substrate-born vibration sensing to detect prey. Orb-weaving spiders primarily sense leg vibrations to detect and locate prey caught on their wheel-shaped webs. Biological experiments and computational modeling elucidated the physics of how these spiders use long-timescale web-building behaviors, which occur <i>before</i> prey capture, to modulate vibration sensing of prey by controlling web geometry, materials, and tension distribution. By contrast, the physics of how spiders use short-timescale leg behaviors to modulate vibration sensing on a web <i>during</i> prey capture is less known. This is in part due to challenges in biological experiments (e.g., having little control over spider behavior, difficulty measuring the whole spider-web-prey system vibrations) and theoretical/computation modeling (e.g., close-form equations intractable for a complex web, high computation cost for simulating vibrations with behaving animals). Here, we use robophysical modeling as a complementary approach to address these challenges and study how dynamic leg crouching behavior common in orb-weaving spiders contributes to vibration sensing of prey on a web. Following observations in the orb-weaver <i>Uloborus diversus</i> from a parallel biological study, we created a robophysical model consisting of a spider robot that can dynamically crouch its legs and sense its leg vibrations and a prey robot that can shake both on a horizontal physical wheel-shaped web. Without the prey robot, after each dynamic crouch, the spider robot sensed leg vibrations with only one dominant frequency-the natural frequency of itself passively vibrating on the web. With the prey robot, after each dynamic crouch, the spider robot sensed leg vibrations with two dominant frequencies-the additional higher frequency being the natural frequency of itself passively vibrating on its spiral thread induced by the spider robot's dynamic crouch. This additional frequency increased as the prey robot became closer from the web center where the spider robot was. These features allowed the spider robot to detect prey presence and distance. We developed a minimalistic physics model that decoupled the spider-web-prey system into two subsystems to explain these observations. Guided by both these results, we found evidence of the same physical mechanism appearing in the web of the <i>U. diversus</i> spider during prey capture in the data from the parallel biological study. Our work demonstrated that robophysical modeling is a useful approach for discovering physical mechanisms of how spiders use short-time scale leg behaviors to enhance vibration sensing of objects on a web and providing new biological hypotheses.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12755255/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mary Elena An, Paul Griffin, Jonathan G Stine, Ramakrishna Balakrishnan, Soundar Kumara
Background: Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD) affects ~33% of U.S. adults and is the most common chronic liver disease. Although often asymptomatic, progression can lead to cirrhosis. Early detection is important, as lifestyle interventions can prevent disease progression. We developed a fair, rigorous, and reproducible MASLD prediction model and compared it to prior methods using a large electronic health record database.
Methods: We evaluated LASSO logistic regression, random forest, XGBoost, and a neural network for MASLD prediction using clinical feature subsets, including the top 10 SHAP-ranked features. To reduce disparities in true positive rates across racial and ethnic subgroups, we applied an equal opportunity postprocessing method.
Results: This study included 59,492 patients in the training data, 24,198 in the validating data, and 25,188 in the testing data. The LASSO logistic regression model with the top 10 features was selected for its interpretability and comparable performance. Before fairness adjustment, the model achieved AUROC of 0.84, accuracy of 78%, sensitivity of 72%, specificity of 79%, and F1-score of 0.617. After equal opportunity postprocessing, accuracy modestly increased to 81% and specificity to 94%, while sensitivity decreased to 41% and F1-score to 0.515, reflecting the fairness trade-off.
Conclusions: We developed the MASER prediction model (MASLD Static EHR Risk Prediction), a LASSO logistic regression model which achieved competitive performance for MASLD prediction (AUROC 0.836, accuracy 77.6%), comparable to previously reported ensemble and tree-based models. Overall, this approach demonstrates that interpretable models can achieve a balance of predictive performance and fairness in diverse patient populations.
{"title":"Predicting Metabolic Dysfunction-Associated Steatotic Liver Disease using Machine Learning Methods.","authors":"Mary Elena An, Paul Griffin, Jonathan G Stine, Ramakrishna Balakrishnan, Soundar Kumara","doi":"","DOIUrl":"","url":null,"abstract":"<p><strong>Background: </strong>Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD) affects ~33% of U.S. adults and is the most common chronic liver disease. Although often asymptomatic, progression can lead to cirrhosis. Early detection is important, as lifestyle interventions can prevent disease progression. We developed a fair, rigorous, and reproducible MASLD prediction model and compared it to prior methods using a large electronic health record database.</p><p><strong>Methods: </strong>We evaluated LASSO logistic regression, random forest, XGBoost, and a neural network for MASLD prediction using clinical feature subsets, including the top 10 SHAP-ranked features. To reduce disparities in true positive rates across racial and ethnic subgroups, we applied an equal opportunity postprocessing method.</p><p><strong>Results: </strong>This study included 59,492 patients in the training data, 24,198 in the validating data, and 25,188 in the testing data. The LASSO logistic regression model with the top 10 features was selected for its interpretability and comparable performance. Before fairness adjustment, the model achieved AUROC of 0.84, accuracy of 78%, sensitivity of 72%, specificity of 79%, and F1-score of 0.617. After equal opportunity postprocessing, accuracy modestly increased to 81% and specificity to 94%, while sensitivity decreased to 41% and F1-score to 0.515, reflecting the fairness trade-off.</p><p><strong>Conclusions: </strong>We developed the MASER prediction model (MASLD Static EHR Risk Prediction), a LASSO logistic regression model which achieved competitive performance for MASLD prediction (AUROC 0.836, accuracy 77.6%), comparable to previously reported ensemble and tree-based models. Overall, this approach demonstrates that interpretable models can achieve a balance of predictive performance and fairness in diverse patient populations.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12755246/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tree rearrangements such as Nearest Neighbor Interchange (NNI) and Subtree Prune and Regraft (SPR) are commonly used to explore phylogenetic treespace. Computing distances based on them, however, is often intractable, so the efficiently computable Robinson-Foulds (RF) distance is used in practice. We investigate how the RF distance behaves along paths in the NNI and SPR graphs, where trees are nodes, edges represent single rearrangements. We show that any two trees are connected by a path along which the RF distance to the target decreases monotonically in the NNI graph and strictly in the SPR graph; we also exhibit trees for which no strictly decreasing NNI path exists.
{"title":"Tree rearrangement graphs admit paths of decreasing Robinson-Foulds distance.","authors":"Lena Collienne, Frederick A Matsen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Tree rearrangements such as Nearest Neighbor Interchange (NNI) and Subtree Prune and Regraft (SPR) are commonly used to explore phylogenetic treespace. Computing distances based on them, however, is often intractable, so the efficiently computable Robinson-Foulds (RF) distance is used in practice. We investigate how the RF distance behaves along paths in the NNI and SPR graphs, where trees are nodes, edges represent single rearrangements. We show that any two trees are connected by a path along which the RF distance to the target decreases monotonically in the NNI graph and strictly in the SPR graph; we also exhibit trees for which no strictly decreasing NNI path exists.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12755248/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Juan Sebastián Hernández, César Nieto, Juan Manuel Pedraza, Abhyudai Singh
Multiple cellular processes are triggered when the concentration of a regulatory protein reaches a critical threshold. Previous analyses have characterized timing statistics for single-gene systems. However, many biological timers are based on cascades of genes that activate each other sequentially. Here, we develop an analytical framework to describe the timing precision of such cascades using a burst-dilution hybrid stochastic model. We first revisit the single-gene case and recover the known result of an optimal activation threshold that minimizes first-passage-time (FPT) variability. Extending this concept to two-gene cascades, we identify three distinct optimization regimes determined by the ratio of intrinsic noise levels and the protein dilution rate, defining when coupling improves or worsens timing precision compared to a single-gene strategy. Generalizing to cascades of arbitrary gene length, we obtain a simple mathematical condition that determines when a new gene in the cascade can decrease the timing noise based on its intrinsic noise and protein dilution rate. In the specific case of a cascade of identical genes, our analytical results predict suppression of FPT noise with increasing cascade length and the existence of a mean time that decreases relative timing fluctuations. Together, these results define the intrinsic limits of timekeeping precision in gene regulatory cascades and provide a minimal analytical framework to explore timing control in biological systems.
{"title":"Intrinsic limits of timekeeping precision in gene regulatory cascades.","authors":"Juan Sebastián Hernández, César Nieto, Juan Manuel Pedraza, Abhyudai Singh","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Multiple cellular processes are triggered when the concentration of a regulatory protein reaches a critical threshold. Previous analyses have characterized timing statistics for single-gene systems. However, many biological timers are based on cascades of genes that activate each other sequentially. Here, we develop an analytical framework to describe the timing precision of such cascades using a burst-dilution hybrid stochastic model. We first revisit the single-gene case and recover the known result of an optimal activation threshold that minimizes first-passage-time (FPT) variability. Extending this concept to two-gene cascades, we identify three distinct optimization regimes determined by the ratio of intrinsic noise levels and the protein dilution rate, defining when coupling improves or worsens timing precision compared to a single-gene strategy. Generalizing to cascades of arbitrary gene length, we obtain a simple mathematical condition that determines when a new gene in the cascade can decrease the timing noise based on its intrinsic noise and protein dilution rate. In the specific case of a cascade of identical genes, our analytical results predict suppression of FPT noise with increasing cascade length and the existence of a mean time that decreases relative timing fluctuations. Together, these results define the intrinsic limits of timekeeping precision in gene regulatory cascades and provide a minimal analytical framework to explore timing control in biological systems.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12755261/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aging includes both continuous gradual decline from microscopic mechanisms together with major deficit onset events such as morbidity, disability and ultimately death. These deficit events are stochastic, obscuring the connection between aging mechanisms and overall health. We propose a framework for modelling both the gradual effects of aging together with health deficit onset events, as reflected in the frailty index (FI) - a quantitative measure of overall age-related health. We model damage and repair dynamics of the FI from individual health transitions within two large longitudinal studies of aging health, the Health and Retirement Study (HRS) and the English Longitudinal Study of Ageing (ELSA), which together included N=47592 individuals. We find that both damage resistance (robustness) and damage recovery (resilience) rates decline smoothly with both increasing age and with increasing FI, for both sexes. This leads to two distinct dynamical states: a robust and resilient young state of stable good health (low FI) and an older state that drifts towards poor health (high FI). These two health states are separated by a sharp transition near age 75. Since FI accumulation risk accelerates dramatically across this tipping point, ages 70-80 are crucial for understanding and managing late-life decline in health.
{"title":"Aging health dynamics cross a tipping point near age 75.","authors":"Glen Pridham, Kenneth Rockwood, Andrew Rutenberg","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Aging includes both continuous gradual decline from microscopic mechanisms together with major deficit onset events such as morbidity, disability and ultimately death. These deficit events are stochastic, obscuring the connection between aging mechanisms and overall health. We propose a framework for modelling both the gradual effects of aging together with health deficit onset events, as reflected in the frailty index (FI) - a quantitative measure of overall age-related health. We model damage and repair dynamics of the FI from individual health transitions within two large longitudinal studies of aging health, the Health and Retirement Study (HRS) and the English Longitudinal Study of Ageing (ELSA), which together included N=47592 individuals. We find that both damage resistance (robustness) and damage recovery (resilience) rates decline smoothly with both increasing age and with increasing FI, for both sexes. This leads to two distinct dynamical states: a robust and resilient young state of stable good health (low FI) and an older state that drifts towards poor health (high FI). These two health states are separated by a sharp transition near age 75. Since FI accumulation risk accelerates dramatically across this tipping point, ages 70-80 are crucial for understanding and managing late-life decline in health.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12755256/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}