首页 > 最新文献

Patterns最新文献

英文 中文
MetaGate: Interactive analysis of high-dimensional cytometry data with metadata integration MetaGate:利用元数据集成对高维细胞测量数据进行交互式分析
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-13 DOI: 10.1016/j.patter.2024.100989
Eivind Heggernes Ask, Astrid Tschan-Plessl, Hanna Julie Hoel, Arne Kolstad, Harald Holte, Karl-Johan Malmberg

Flow cytometry is a powerful technology for high-throughput protein quantification at the single-cell level. Technical advances have substantially increased data complexity, but novel bioinformatical tools often show limitations in statistical testing, data sharing, cross-experiment comparability, or clinical data integration. We developed MetaGate as a platform for interactive statistical analysis and visualization of manually gated high-dimensional cytometry data with integration of metadata. MetaGate provides a data reduction algorithm based on a combinatorial gating system that produces a small, portable, and standardized data file. This is subsequently used to produce figures and statistical analyses through a fast web-based user interface. We demonstrate the utility of MetaGate through a comprehensive mass cytometry analysis of peripheral blood immune cells from 28 patients with diffuse large B cell lymphoma along with 17 healthy controls. Through MetaGate analysis, our study identifies key immune cell population changes associated with disease progression.

流式细胞术是一种在单细胞水平上进行高通量蛋白质定量的强大技术。技术的进步大大提高了数据的复杂性,但新型生物信息学工具在统计测试、数据共享、跨实验可比性或临床数据整合方面往往存在局限性。我们开发的 MetaGate 是一个平台,用于对人工选通的高维细胞计量数据进行交互式统计分析和可视化,并整合元数据。MetaGate 提供了一种基于组合门控系统的数据缩减算法,可生成小巧、便携和标准化的数据文件。随后,通过一个基于网络的快速用户界面,就能生成图表并进行统计分析。我们通过对 28 名弥漫大 B 细胞淋巴瘤患者和 17 名健康对照者的外周血免疫细胞进行全面的质谱分析,证明了 MetaGate 的实用性。通过 MetaGate 分析,我们的研究确定了与疾病进展相关的关键免疫细胞群变化。
{"title":"MetaGate: Interactive analysis of high-dimensional cytometry data with metadata integration","authors":"Eivind Heggernes Ask, Astrid Tschan-Plessl, Hanna Julie Hoel, Arne Kolstad, Harald Holte, Karl-Johan Malmberg","doi":"10.1016/j.patter.2024.100989","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100989","url":null,"abstract":"<p>Flow cytometry is a powerful technology for high-throughput protein quantification at the single-cell level. Technical advances have substantially increased data complexity, but novel bioinformatical tools often show limitations in statistical testing, data sharing, cross-experiment comparability, or clinical data integration. We developed MetaGate as a platform for interactive statistical analysis and visualization of manually gated high-dimensional cytometry data with integration of metadata. MetaGate provides a data reduction algorithm based on a combinatorial gating system that produces a small, portable, and standardized data file. This is subsequently used to produce figures and statistical analyses through a fast web-based user interface. We demonstrate the utility of MetaGate through a comprehensive mass cytometry analysis of peripheral blood immune cells from 28 patients with diffuse large B cell lymphoma along with 17 healthy controls. Through MetaGate analysis, our study identifies key immune cell population changes associated with disease progression.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"109 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140933481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated learning for privacy-preserving depression detection with multilingual language models in social media posts 利用社交媒体帖子中的多语言语言模型,为保护隐私的抑郁检测提供联合学习
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-13 DOI: 10.1016/j.patter.2024.100990
Samar Samir Khalil, Noha S. Tawfik, Marco Spruit

The incidences of mental health illnesses, such as suicidal ideation and depression, are increasing, which highlights the urgent need for early detection methods. There is a growing interest in using natural language processing (NLP) models to analyze textual data from patients, but accessing patients’ data for research purposes can be challenging due to privacy concerns. Federated learning (FL) is a promising approach that can balance the need for centralized learning with data ownership sensitivity. In this study, we examine the effectiveness of FL models in detecting depression by using a simulated multilingual dataset. We analyzed social media posts in five different languages with varying sample sizes. Our findings indicate that FL achieves strong performance in most cases while maintaining clients’ privacy for both independent and non-independent client partitioning.

自杀意念和抑郁症等精神疾病的发病率不断上升,这凸显了对早期检测方法的迫切需求。人们对使用自然语言处理(NLP)模型分析患者文本数据的兴趣与日俱增,但由于隐私问题,为研究目的访问患者数据可能具有挑战性。联合学习(FL)是一种很有前景的方法,它能在集中学习需求与数据所有权敏感性之间取得平衡。在本研究中,我们使用一个模拟的多语言数据集来检验 FL 模型在检测抑郁症方面的有效性。我们分析了五种不同语言的社交媒体帖子,样本量各不相同。我们的研究结果表明,在大多数情况下,FL 都能取得很好的性能,同时在独立和非独立客户分区中都能维护客户的隐私。
{"title":"Federated learning for privacy-preserving depression detection with multilingual language models in social media posts","authors":"Samar Samir Khalil, Noha S. Tawfik, Marco Spruit","doi":"10.1016/j.patter.2024.100990","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100990","url":null,"abstract":"<p>The incidences of mental health illnesses, such as suicidal ideation and depression, are increasing, which highlights the urgent need for early detection methods. There is a growing interest in using natural language processing (NLP) models to analyze textual data from patients, but accessing patients’ data for research purposes can be challenging due to privacy concerns. Federated learning (FL) is a promising approach that can balance the need for centralized learning with data ownership sensitivity. In this study, we examine the effectiveness of FL models in detecting depression by using a simulated multilingual dataset. We analyzed social media posts in five different languages with varying sample sizes. Our findings indicate that FL achieves strong performance in most cases while maintaining clients’ privacy for both independent and non-independent client partitioning.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"9 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140934296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI deception: A survey of examples, risks, and potential solutions 人工智能欺骗:实例、风险和潜在解决方案调查
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-10 DOI: 10.1016/j.patter.2024.100988
Peter S. Park, Simon Goldstein, Aidan O’Gara, Michael Chen, Dan Hendrycks

This paper argues that a range of current AI systems have learned how to deceive humans. We define deception as the systematic inducement of false beliefs in the pursuit of some outcome other than the truth. We first survey empirical examples of AI deception, discussing both special-use AI systems (including Meta’s CICERO) and general-purpose AI systems (including large language models). Next, we detail several risks from AI deception, such as fraud, election tampering, and losing control of AI. Finally, we outline several potential solutions: first, regulatory frameworks should subject AI systems that are capable of deception to robust risk-assessment requirements; second, policymakers should implement bot-or-not laws; and finally, policymakers should prioritize the funding of relevant research, including tools to detect AI deception and to make AI systems less deceptive. Policymakers, researchers, and the broader public should work proactively to prevent AI deception from destabilizing the shared foundations of our society.

本文认为,当前一系列人工智能系统已经学会了如何欺骗人类。我们将欺骗定义为系统性地诱导错误信念,以追求某种非真相的结果。我们首先调查了人工智能欺骗的实证案例,讨论了特殊用途人工智能系统(包括 Meta 的 CICERO)和通用人工智能系统(包括大型语言模型)。接下来,我们详细介绍了人工智能欺骗的几种风险,如欺诈、篡改选举和失去对人工智能的控制。最后,我们概述了几种潜在的解决方案:首先,监管框架应该对能够进行欺骗的人工智能系统提出严格的风险评估要求;其次,政策制定者应该实施 "要么机器人,要么不机器人 "的法律;最后,政策制定者应该优先资助相关研究,包括检测人工智能欺骗行为和减少人工智能系统欺骗性的工具。政策制定者、研究人员和广大公众应积极努力,防止人工智能欺骗行为破坏我们社会的共同基础。
{"title":"AI deception: A survey of examples, risks, and potential solutions","authors":"Peter S. Park, Simon Goldstein, Aidan O’Gara, Michael Chen, Dan Hendrycks","doi":"10.1016/j.patter.2024.100988","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100988","url":null,"abstract":"<p>This paper argues that a range of current AI systems have learned how to deceive humans. We define deception as the systematic inducement of false beliefs in the pursuit of some outcome other than the truth. We first survey empirical examples of AI deception, discussing both special-use AI systems (including Meta’s CICERO) and general-purpose AI systems (including large language models). Next, we detail several risks from AI deception, such as fraud, election tampering, and losing control of AI. Finally, we outline several potential solutions: first, regulatory frameworks should subject AI systems that are capable of deception to robust risk-assessment requirements; second, policymakers should implement bot-or-not laws; and finally, policymakers should prioritize the funding of relevant research, including tools to detect AI deception and to make AI systems less deceptive. Policymakers, researchers, and the broader public should work proactively to prevent AI deception from destabilizing the shared foundations of our society.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"253 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140933222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigating color integrity in data visualization 数据可视化中的色彩完整性导航
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-10 DOI: 10.1016/j.patter.2024.100972
Fabio Crameri, Sari Hason

Color is crucial in scientific visualization, yet it is often misused. Addressing this, we think accessible and accurate techniques, such as color-blind friendly palettes and perceptually even gradients, are vital. Accountability and basic knowledge in data visualization are key in fostering a culture of color integrity, ensuring accurate and inclusive data representation.

色彩在科学可视化中至关重要,但却经常被滥用。针对这一问题,我们认为使用方便、准确的技术至关重要,例如色盲友好调色板和感知均匀的梯度。数据可视化方面的责任感和基础知识是培养色彩完整性文化的关键,可确保准确、包容的数据表示。
{"title":"Navigating color integrity in data visualization","authors":"Fabio Crameri, Sari Hason","doi":"10.1016/j.patter.2024.100972","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100972","url":null,"abstract":"<p>Color is crucial in scientific visualization, yet it is often misused. Addressing this, we think accessible and accurate techniques, such as color-blind friendly palettes and perceptually even gradients, are vital. Accountability and basic knowledge in data visualization are key in fostering a culture of color integrity, ensuring accurate and inclusive data representation.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"66 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140933308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reducing overconfident errors in molecular property classification using Posterior Network 利用后验网络减少分子特性分类中的过度自信误差
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-08 DOI: 10.1016/j.patter.2024.100991
Zhehuan Fan, Jie Yu, Xiang Zhang, Yijie Chen, Shihui Sun, Yuanyuan Zhang, Mingan Chen, Fu Xiao, Wenyong Wu, Xutong Li, Mingyue Zheng, Xiaomin Luo, Dingyan Wang

Deep-learning-based classification models are increasingly used for predicting molecular properties in drug development. However, traditional classification models using the Softmax function often give overconfident mispredictions for out-of-distribution samples, highlighting a critical lack of accurate uncertainty estimation. Such limitations can result in substantial costs and should be avoided during drug development. Inspired by advances in evidential deep learning and Posterior Network, we replaced the Softmax function with a normalizing flow to enhance the uncertainty estimation ability of the model in molecular property classification. The proposed strategy was evaluated across diverse scenarios, including simulated experiments based on a synthetic dataset, ADMET predictions, and ligand-based virtual screening. The results demonstrate that compared with the vanilla model, the proposed strategy effectively alleviates the problem of giving overconfident but incorrect predictions. Our findings support the promising application of evidential deep learning in drug development and offer a valuable framework for further research.

基于深度学习的分类模型越来越多地用于预测药物开发中的分子特性。然而,使用 Softmax 函数的传统分类模型往往会对分布外样本做出过于自信的错误预测,这凸显了准确不确定性估计的严重不足。这种局限性会导致巨大的成本,在药物开发过程中应该避免。受证据深度学习和后验网络的启发,我们用归一化流取代了 Softmax 函数,以增强模型在分子性质分类中的不确定性估计能力。我们在不同的场景中评估了所提出的策略,包括基于合成数据集的模拟实验、ADMET 预测和基于配体的虚拟筛选。结果表明,与 vanilla 模型相比,所提出的策略有效地缓解了预测过于自信但不正确的问题。我们的研究结果支持了证据深度学习在药物开发中的应用前景,并为进一步研究提供了有价值的框架。
{"title":"Reducing overconfident errors in molecular property classification using Posterior Network","authors":"Zhehuan Fan, Jie Yu, Xiang Zhang, Yijie Chen, Shihui Sun, Yuanyuan Zhang, Mingan Chen, Fu Xiao, Wenyong Wu, Xutong Li, Mingyue Zheng, Xiaomin Luo, Dingyan Wang","doi":"10.1016/j.patter.2024.100991","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100991","url":null,"abstract":"<p>Deep-learning-based classification models are increasingly used for predicting molecular properties in drug development. However, traditional classification models using the Softmax function often give overconfident mispredictions for out-of-distribution samples, highlighting a critical lack of accurate uncertainty estimation. Such limitations can result in substantial costs and should be avoided during drug development. Inspired by advances in evidential deep learning and Posterior Network, we replaced the Softmax function with a normalizing flow to enhance the uncertainty estimation ability of the model in molecular property classification. The proposed strategy was evaluated across diverse scenarios, including simulated experiments based on a synthetic dataset, ADMET predictions, and ligand-based virtual screening. The results demonstrate that compared with the vanilla model, the proposed strategy effectively alleviates the problem of giving overconfident but incorrect predictions. Our findings support the promising application of evidential deep learning in drug development and offer a valuable framework for further research.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"29 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140933408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active sensing with predictive coding and uncertainty minimization 带有预测编码和不确定性最小化功能的主动传感
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-03 DOI: 10.1016/j.patter.2024.100983
Abdelrahman Sharafeldin, Nabil Imam, Hannah Choi

We present an end-to-end architecture for embodied exploration inspired by two biological computations: predictive coding and uncertainty minimization. The architecture can be applied to any exploration setting in a task-independent and intrinsically driven manner. We first demonstrate our approach in a maze navigation task and show that it can discover the underlying transition distributions and spatial features of the environment. Second, we apply our model to a more complex active vision task, whereby an agent actively samples its visual environment to gather information. We show that our model builds unsupervised representations through exploration that allow it to efficiently categorize visual scenes. We further show that using these representations for downstream classification leads to superior data efficiency and learning speed compared to other baselines while maintaining lower parameter complexity. Finally, the modular structure of our model facilitates interpretability, allowing us to probe its internal mechanisms and representations during exploration.

我们提出了一种端到端架构,用于体现式探索,其灵感来自两种生物计算:预测编码和不确定性最小化。该架构可以独立于任务和内在驱动的方式应用于任何探索环境。我们首先在迷宫导航任务中演示了我们的方法,并证明它能发现环境的潜在过渡分布和空间特征。其次,我们将模型应用于更复杂的主动视觉任务,即代理主动采样其视觉环境以收集信息。我们的研究表明,我们的模型通过探索建立了无监督表征,使其能够有效地对视觉场景进行分类。我们进一步证明,与其他基线相比,使用这些表征进行下游分类能带来更高的数据效率和学习速度,同时保持较低的参数复杂度。最后,我们模型的模块化结构有利于解释性,使我们能够在探索过程中探究其内部机制和表征。
{"title":"Active sensing with predictive coding and uncertainty minimization","authors":"Abdelrahman Sharafeldin, Nabil Imam, Hannah Choi","doi":"10.1016/j.patter.2024.100983","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100983","url":null,"abstract":"<p>We present an end-to-end architecture for embodied exploration inspired by two biological computations: predictive coding and uncertainty minimization. The architecture can be applied to any exploration setting in a task-independent and intrinsically driven manner. We first demonstrate our approach in a maze navigation task and show that it can discover the underlying transition distributions and spatial features of the environment. Second, we apply our model to a more complex active vision task, whereby an agent actively samples its visual environment to gather information. We show that our model builds unsupervised representations through exploration that allow it to efficiently categorize visual scenes. We further show that using these representations for downstream classification leads to superior data efficiency and learning speed compared to other baselines while maintaining lower parameter complexity. Finally, the modular structure of our model facilitates interpretability, allowing us to probe its internal mechanisms and representations during exploration.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"9 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140832974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cortical similarities in psychiatric and mood disorders identified in federated VBM analysis via COINSTAC 通过 COINSTAC 联合 VBM 分析发现精神病和情绪障碍的皮质相似性
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-02 DOI: 10.1016/j.patter.2024.100987
Kelly Rootes-Murdy, Sandeep Panta, Ross Kelly, Javier Romero, Yann Quidé, Murray J. Cairns, Carmel Loughland, Vaughan J. Carr, Stanley V. Catts, Assen Jablensky, Melissa J. Green, Frans Henskens, Dylan Kiltschewskij, Patricia T. Michie, Bryan Mowry, Christos Pantelis, Paul E. Rasser, William R. Reay, Ulrich Schall, Rodney J. Scott, Vince D. Calhoun

Structural neuroimaging studies have identified a combination of shared and disorder-specific patterns of gray matter (GM) deficits across psychiatric disorders. Pooling large data allows for examination of a possible common neuroanatomical basis that may identify a certain vulnerability for mental illness. Large-scale collaborative research is already facilitated by data repositories, institutionally supported databases, and data archives. However, these data-sharing methodologies can suffer from significant barriers. Federated approaches augment these approaches by enabling access or more sophisticated, shareable and scaled-up analyses of large-scale data. We examined GM alterations using Collaborative Informatics and Neuroimaging Suite Toolkit for Anonymous Computation, an open-source, decentralized analysis application. Through federated analysis of eight sites, we identified significant overlap in the GM patterns (n = 4,102) of individuals with schizophrenia, major depressive disorder, and autism spectrum disorder. These results show cortical and subcortical regions that may indicate a shared vulnerability to psychiatric disorders.

结构神经影像学研究发现,精神疾病的灰质(GM)缺陷既有共同的模式,也有特定疾病的模式。将大量数据汇集在一起,可以对可能存在的共同神经解剖学基础进行研究,从而确定精神疾病的某种易感性。数据存储库、机构支持的数据库和数据档案已经为大规模合作研究提供了便利。然而,这些数据共享方法可能存在重大障碍。联盟式方法可以对大规模数据进行访问或更复杂、可共享和可扩展的分析,从而增强了这些方法。我们使用匿名计算的协作信息学和神经成像套件工具包(一种开源、分散的分析应用程序)研究了基因改变。通过对八个站点的联合分析,我们发现精神分裂症、重度抑郁障碍和自闭症谱系障碍患者的基因组模式(n = 4,102 个)存在显著重叠。这些结果表明,皮层和皮层下区域可能预示着精神疾病的共同易感性。
{"title":"Cortical similarities in psychiatric and mood disorders identified in federated VBM analysis via COINSTAC","authors":"Kelly Rootes-Murdy, Sandeep Panta, Ross Kelly, Javier Romero, Yann Quidé, Murray J. Cairns, Carmel Loughland, Vaughan J. Carr, Stanley V. Catts, Assen Jablensky, Melissa J. Green, Frans Henskens, Dylan Kiltschewskij, Patricia T. Michie, Bryan Mowry, Christos Pantelis, Paul E. Rasser, William R. Reay, Ulrich Schall, Rodney J. Scott, Vince D. Calhoun","doi":"10.1016/j.patter.2024.100987","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100987","url":null,"abstract":"<p>Structural neuroimaging studies have identified a combination of shared and disorder-specific patterns of gray matter (GM) deficits across psychiatric disorders. Pooling large data allows for examination of a possible common neuroanatomical basis that may identify a certain vulnerability for mental illness. Large-scale collaborative research is already facilitated by data repositories, institutionally supported databases, and data archives. However, these data-sharing methodologies can suffer from significant barriers. Federated approaches augment these approaches by enabling access or more sophisticated, shareable and scaled-up analyses of large-scale data. We examined GM alterations using Collaborative Informatics and Neuroimaging Suite Toolkit for Anonymous Computation, an open-source, decentralized analysis application. Through federated analysis of eight sites, we identified significant overlap in the GM patterns (<em>n</em> = 4,102) of individuals with schizophrenia, major depressive disorder, and autism spectrum disorder. These results show cortical and subcortical regions that may indicate a shared vulnerability to psychiatric disorders.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"9 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140832823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MUSTANG: Multi-sample spatial transcriptomics data analysis with cross-sample transcriptional similarity guidance MUSTANG:利用跨样本转录相似性指导进行多样本空间转录组学数据分析
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-02 DOI: 10.1016/j.patter.2024.100986
Seyednami Niyakan, Jianting Sheng, Yuliang Cao, Xiang Zhang, Zhan Xu, Ling Wu, Stephen T.C. Wong, Xiaoning Qian

Spatially resolved transcriptomics has revolutionized genome-scale transcriptomic profiling by providing high-resolution characterization of transcriptional patterns. Here, we present our spatial transcriptomics analysis framework, MUSTANG (MUlti-sample Spatial Transcriptomics data ANalysis with cross-sample transcriptional similarity Guidance), which is capable of performing multi-sample spatial transcriptomics spot cellular deconvolution by allowing both cross-sample expression-based similarity information sharing as well as spatial correlation in gene expression patterns within samples. Experiments on a semi-synthetic spatial transcriptomics dataset and three real-world spatial transcriptomics datasets demonstrate the effectiveness of MUSTANG in revealing biological insights inherent in the cellular characterization of tissue samples under study.

空间解析转录组学通过提供高分辨率的转录模式表征,彻底改变了基因组规模的转录组学分析。在这里,我们介绍了我们的空间转录组学分析框架 MUSTANG(MUlti-sample Spatial Transcriptomics data ANalysis with cross-sample transcriptional similarity Guidance),它能够通过基于表达的跨样本相似性信息共享以及样本内基因表达模式的空间相关性来执行多样本空间转录组学定点细胞解卷积。在一个半合成空间转录组学数据集和三个真实世界空间转录组学数据集上的实验证明了 MUSTANG 在揭示所研究组织样本细胞特征内在的生物学见解方面的有效性。
{"title":"MUSTANG: Multi-sample spatial transcriptomics data analysis with cross-sample transcriptional similarity guidance","authors":"Seyednami Niyakan, Jianting Sheng, Yuliang Cao, Xiang Zhang, Zhan Xu, Ling Wu, Stephen T.C. Wong, Xiaoning Qian","doi":"10.1016/j.patter.2024.100986","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100986","url":null,"abstract":"<p>Spatially resolved transcriptomics has revolutionized genome-scale transcriptomic profiling by providing high-resolution characterization of transcriptional patterns. Here, we present our spatial transcriptomics analysis framework, MUSTANG (MUlti-sample Spatial Transcriptomics data ANalysis with cross-sample transcriptional similarity Guidance), which is capable of performing multi-sample spatial transcriptomics spot cellular deconvolution by allowing both cross-sample expression-based similarity information sharing as well as spatial correlation in gene expression patterns within samples. Experiments on a semi-synthetic spatial transcriptomics dataset and three real-world spatial transcriptomics datasets demonstrate the effectiveness of MUSTANG in revealing biological insights inherent in the cellular characterization of tissue samples under study.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"32 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140832909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A generalized AI system for human embryo selection covering the entire IVF cycle via multi-modal contrastive learning 通过多模态对比学习,建立覆盖整个试管婴儿周期的人类胚胎选择通用人工智能系统
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-02 DOI: 10.1016/j.patter.2024.100985
Guangyu Wang, Kai Wang, Yuanxu Gao, Longbin Chen, Tianrun Gao, Yuanlin Ma, Zeyu Jiang, Guoxing Yang, Fajin Feng, Shuoping Zhang, Yifan Gu, Guangdong Liu, Lei Chen, Li-Shuang Ma, Ye Sang, Yanwen Xu, Ge Lin, Xiaohong Liu

In vitro fertilization (IVF) has revolutionized infertility treatment, benefiting millions of couples worldwide. However, current clinical practices for embryo selection rely heavily on visual inspection of morphology, which is highly variable and experience dependent. Here, we propose a comprehensive artificial intelligence (AI) system that can interpret embryo-developmental knowledge encoded in vast unlabeled multi-modal datasets and provide personalized embryo selection. This AI platform consists of a transformer-based network backbone named IVFormer and a self-supervised learning framework, VTCLR (visual-temporal contrastive learning of representations), for training multi-modal embryo representations pre-trained on large and unlabeled data. When evaluated on clinical scenarios covering the entire IVF cycle, our pre-trained AI model demonstrates accurate and reliable performance on euploidy ranking and live-birth occurrence prediction. For AI vs. physician for euploidy ranking, our model achieved superior performance across all score categories. The results demonstrate the potential of the AI system as a non-invasive, efficient, and cost-effective tool to improve embryo selection and IVF outcomes.

体外受精(IVF)彻底改变了不孕症的治疗,使全球数百万对夫妇受益。然而,目前的胚胎选择临床实践主要依赖于对形态的目测,而目测具有很大的可变性和经验依赖性。在这里,我们提出了一种综合性人工智能(AI)系统,它可以解读大量无标记多模态数据集中编码的胚胎发育知识,并提供个性化的胚胎选择。这个人工智能平台由一个名为 IVFormer 的基于变压器的网络骨干和一个自监督学习框架 VTCLR(视觉-时间对比表征学习)组成,用于在大量无标记数据上训练多模态胚胎表征。在涵盖整个试管婴儿周期的临床场景中进行评估时,我们预先训练的人工智能模型在非整倍性排序和活产预测方面表现出了准确可靠的性能。在人工智能与医生的非整倍体排序对比中,我们的模型在所有得分类别中都取得了优异的表现。这些结果证明了人工智能系统作为一种无创、高效、经济的工具,在改善胚胎选择和试管婴儿结果方面的潜力。
{"title":"A generalized AI system for human embryo selection covering the entire IVF cycle via multi-modal contrastive learning","authors":"Guangyu Wang, Kai Wang, Yuanxu Gao, Longbin Chen, Tianrun Gao, Yuanlin Ma, Zeyu Jiang, Guoxing Yang, Fajin Feng, Shuoping Zhang, Yifan Gu, Guangdong Liu, Lei Chen, Li-Shuang Ma, Ye Sang, Yanwen Xu, Ge Lin, Xiaohong Liu","doi":"10.1016/j.patter.2024.100985","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100985","url":null,"abstract":"<p><em>In vitro</em> fertilization (IVF) has revolutionized infertility treatment, benefiting millions of couples worldwide. However, current clinical practices for embryo selection rely heavily on visual inspection of morphology, which is highly variable and experience dependent. Here, we propose a comprehensive artificial intelligence (AI) system that can interpret embryo-developmental knowledge encoded in vast unlabeled multi-modal datasets and provide personalized embryo selection. This AI platform consists of a transformer-based network backbone named IVFormer and a self-supervised learning framework, VTCLR (visual-temporal contrastive learning of representations), for training multi-modal embryo representations pre-trained on large and unlabeled data. When evaluated on clinical scenarios covering the entire IVF cycle, our pre-trained AI model demonstrates accurate and reliable performance on euploidy ranking and live-birth occurrence prediction. For AI vs. physician for euploidy ranking, our model achieved superior performance across all score categories. The results demonstrate the potential of the AI system as a non-invasive, efficient, and cost-effective tool to improve embryo selection and IVF outcomes.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"75 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140832827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CURE: A deep learning framework pre-trained on large-scale patient data for treatment effect estimation CURE:在大规模患者数据上预先训练的深度学习框架,用于估计治疗效果
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-01 DOI: 10.1016/j.patter.2024.100973
Ruoqi Liu, Pin-Yu Chen, Ping Zhang

Treatment effect estimation (TEE) aims to identify the causal effects of treatments on important outcomes. Current machine-learning-based methods, mainly trained on labeled data for specific treatments or outcomes, can be sub-optimal with limited labeled data. In this article, we propose a new pre-training and fine-tuning framework, CURE (causal treatment effect estimation), for TEE from observational data. CURE is pre-trained on large-scale unlabeled patient data to learn representative contextual patient representations and fine-tuned on labeled patient data for TEE. We present a new sequence encoding approach for longitudinal patient data embedding both structure and time. Evaluated on four downstream TEE tasks, CURE outperforms the state-of-the-art methods, marking a 7% increase in area under the precision-recall curve and an 8% rise in the influence-function-based precision of estimating heterogeneous effects. Validation with four randomized clinical trials confirms its efficacy in producing trial conclusions, highlighting CURE’s capacity to supplement traditional clinical trials.

治疗效果估计(TEE)旨在确定治疗对重要结果的因果效应。目前基于机器学习的方法主要是针对特定治疗或结果的标注数据进行训练,但在标注数据有限的情况下,这些方法可能无法达到最佳效果。在本文中,我们提出了一种新的预训练和微调框架 CURE(因果治疗效果估计),用于从观察数据中获得 TEE。CURE 在大规模无标记患者数据上进行预训练,以学习有代表性的上下文患者表征,并在有标记患者数据上进行微调,以用于 TEE。我们提出了一种新的序列编码方法,用于嵌入结构和时间的纵向患者数据。在四项下游 TEE 任务的评估中,CURE 的表现优于最先进的方法,其精度-召回曲线下的面积增加了 7%,基于影响函数的异质效应估计精度提高了 8%。四项随机临床试验的验证证实了 CURE 在得出试验结论方面的功效,凸显了 CURE 补充传统临床试验的能力。
{"title":"CURE: A deep learning framework pre-trained on large-scale patient data for treatment effect estimation","authors":"Ruoqi Liu, Pin-Yu Chen, Ping Zhang","doi":"10.1016/j.patter.2024.100973","DOIUrl":"https://doi.org/10.1016/j.patter.2024.100973","url":null,"abstract":"<p>Treatment effect estimation (TEE) aims to identify the causal effects of treatments on important outcomes. Current machine-learning-based methods, mainly trained on labeled data for specific treatments or outcomes, can be sub-optimal with limited labeled data. In this article, we propose a new pre-training and fine-tuning framework, CURE (causal treatment effect estimation), for TEE from observational data. CURE is pre-trained on large-scale unlabeled patient data to learn representative contextual patient representations and fine-tuned on labeled patient data for TEE. We present a new sequence encoding approach for longitudinal patient data embedding both structure and time. Evaluated on four downstream TEE tasks, CURE outperforms the state-of-the-art methods, marking a 7% increase in area under the precision-recall curve and an 8% rise in the influence-function-based precision of estimating heterogeneous effects. Validation with four randomized clinical trials confirms its efficacy in producing trial conclusions, highlighting CURE’s capacity to supplement traditional clinical trials.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"2011 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140832895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Patterns
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1