首页 > 最新文献

Patterns最新文献

英文 中文
A FAIR, open-source virtual reality platform for dendritic spine analysis 用于树突棘分析的 FAIR 开源虚拟现实平台
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-12 DOI: 10.1016/j.patter.2024.101041

Neuroanatomy is fundamental to understanding the nervous system, particularly dendritic spines, which are vital for synaptic transmission and change in response to injury or disease. Advancements in imaging have allowed for detailed three-dimensional (3D) visualization of these structures. However, existing tools for analyzing dendritic spine morphology are limited. To address this, we developed an open-source virtual reality (VR) structural analysis software ecosystem (coined “VR-SASE”) that offers a powerful, intuitive approach for analyzing dendritic spines. Our validation process confirmed the method’s superior accuracy, outperforming recognized gold-standard neural reconstruction techniques. Importantly, the VR-SASE workflow automatically calculates key morphological metrics, such as dendritic spine length, volume, and surface area, and reliably replicates established datasets from published dendritic spine studies. By integrating the Neurodata Without Borders (NWB) data standard, VR-SASE datasets can be preserved/distributed through DANDI Archives, satisfying the NIH data sharing mandate.

神经解剖学是了解神经系统,特别是树突棘的基础,树突棘对突触传递和对损伤或疾病的反应变化至关重要。成像技术的进步使这些结构的详细三维(3D)可视化成为可能。然而,现有的树突棘形态分析工具非常有限。为了解决这个问题,我们开发了一个开源虚拟现实(VR)结构分析软件生态系统(被称为 "VR-SASE"),它为树突棘的分析提供了一种强大、直观的方法。我们的验证过程证实了该方法的卓越准确性,超过了公认的黄金标准神经重建技术。重要的是,VR-SASE 工作流程能自动计算树突棘长度、体积和表面积等关键形态指标,并可靠地复制已发表的树突棘研究数据集。通过整合神经数据无国界(NWB)数据标准,VR-SASE 数据集可以通过 DANDI 档案馆保存/分发,从而满足美国国立卫生研究院的数据共享要求。
{"title":"A FAIR, open-source virtual reality platform for dendritic spine analysis","authors":"","doi":"10.1016/j.patter.2024.101041","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101041","url":null,"abstract":"<p>Neuroanatomy is fundamental to understanding the nervous system, particularly dendritic spines, which are vital for synaptic transmission and change in response to injury or disease. Advancements in imaging have allowed for detailed three-dimensional (3D) visualization of these structures. However, existing tools for analyzing dendritic spine morphology are limited. To address this, we developed an open-source virtual reality (VR) structural analysis software ecosystem (coined “VR-SASE”) that offers a powerful, intuitive approach for analyzing dendritic spines. Our validation process confirmed the method’s superior accuracy, outperforming recognized gold-standard neural reconstruction techniques. Importantly, the VR-SASE workflow automatically calculates key morphological metrics, such as dendritic spine length, volume, and surface area, and reliably replicates established datasets from published dendritic spine studies. By integrating the Neurodata Without Borders (NWB) data standard, VR-SASE datasets can be preserved/distributed through DANDI Archives, satisfying the NIH data sharing mandate.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-objective latent space optimization of generative molecular design models 生成式分子设计模型的多目标潜空间优化
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-12 DOI: 10.1016/j.patter.2024.101042

Molecular design based on generative models, such as variational autoencoders (VAEs), has become increasingly popular in recent years due to its efficiency for exploring high-dimensional molecular space to identify molecules with desired properties. While the efficacy of the initial model strongly depends on the training data, the sampling efficiency of the model for suggesting novel molecules with enhanced properties can be further enhanced via latent space optimization (LSO). In this paper, we propose a multi-objective LSO method that can significantly enhance the performance of generative molecular design (GMD). The proposed method adopts an iterative weighted retraining approach, where the respective weights of the molecules in the training data are determined by their Pareto efficiency. We demonstrate that our multi-objective GMD LSO method can significantly improve the performance of GMD for jointly optimizing multiple molecular properties.

近年来,基于生成模型(如变异自动编码器(VAE))的分子设计越来越受欢迎,因为它能高效地探索高维分子空间,识别具有所需特性的分子。虽然初始模型的功效在很大程度上取决于训练数据,但通过潜在空间优化(LSO)可以进一步提高模型的采样效率,从而提出具有更强特性的新型分子。本文提出了一种多目标 LSO 方法,可显著提高生成式分子设计(GMD)的性能。所提出的方法采用了迭代加权再训练方法,其中训练数据中分子各自的权重由它们的帕累托效率决定。我们证明了我们的多目标 GMD LSO 方法可以显著提高 GMD 的性能,从而联合优化多种分子特性。
{"title":"Multi-objective latent space optimization of generative molecular design models","authors":"","doi":"10.1016/j.patter.2024.101042","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101042","url":null,"abstract":"<p>Molecular design based on generative models, such as variational autoencoders (VAEs), has become increasingly popular in recent years due to its efficiency for exploring high-dimensional molecular space to identify molecules with desired properties. While the efficacy of the initial model strongly depends on the training data, the sampling efficiency of the model for suggesting novel molecules with enhanced properties can be further enhanced via latent space optimization (LSO). In this paper, we propose a multi-objective LSO method that can significantly enhance the performance of generative molecular design (GMD). The proposed method adopts an iterative weighted retraining approach, where the respective weights of the molecules in the training data are determined by their Pareto efficiency. We demonstrate that our multi-objective GMD LSO method can significantly improve the performance of GMD for jointly optimizing multiple molecular properties.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Concepts and applications of digital twins in healthcare and medicine 数字双胞胎在医疗保健领域的概念和应用
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-09 DOI: 10.1016/j.patter.2024.101028

The digital twin (DT) is a concept widely used in industry to create digital replicas of physical objects or systems. The dynamic, bi-directional link between the physical entity and its digital counterpart enables a real-time update of the digital entity. It can predict perturbations related to the physical object’s function. The obvious applications of DTs in healthcare and medicine are extremely attractive prospects that have the potential to revolutionize patient diagnosis and treatment. However, challenges including technical obstacles, biological heterogeneity, and ethical considerations make it difficult to achieve the desired goal. Advances in multi-modal deep learning methods, embodied AI agents, and the metaverse may mitigate some difficulties. Here, we discuss the basic concepts underlying DTs, the requirements for implementing DTs in medicine, and their current and potential healthcare uses. We also provide our perspective on five hallmarks for a healthcare DT system to advance research in this field.

数字孪生(DT)是一个广泛应用于工业领域的概念,用于创建物理对象或系统的数字复制品。物理实体与其数字对应物之间的动态双向链接可实现数字实体的实时更新。它可以预测与物理对象功能相关的扰动。DT 在医疗保健和医药领域的明显应用前景极具吸引力,有可能彻底改变病人的诊断和治疗。然而,包括技术障碍、生物异质性和伦理考虑在内的各种挑战使其难以实现预期目标。多模态深度学习方法、嵌入式人工智能代理和元宇宙的进步可能会缓解一些困难。在此,我们将讨论 DT 的基本概念、在医学中实施 DT 的要求以及 DT 在医疗保健领域的当前和潜在用途。我们还提出了医疗保健 DT 系统的五个标志,以推动该领域的研究。
{"title":"Concepts and applications of digital twins in healthcare and medicine","authors":"","doi":"10.1016/j.patter.2024.101028","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101028","url":null,"abstract":"<p>The digital twin (DT) is a concept widely used in industry to create digital replicas of physical objects or systems. The dynamic, bi-directional link between the physical entity and its digital counterpart enables a real-time update of the digital entity. It can predict perturbations related to the physical object’s function. The obvious applications of DTs in healthcare and medicine are extremely attractive prospects that have the potential to revolutionize patient diagnosis and treatment. However, challenges including technical obstacles, biological heterogeneity, and ethical considerations make it difficult to achieve the desired goal. Advances in multi-modal deep learning methods, embodied AI agents, and the metaverse may mitigate some difficulties. Here, we discuss the basic concepts underlying DTs, the requirements for implementing DTs in medicine, and their current and potential healthcare uses. We also provide our perspective on five hallmarks for a healthcare DT system to advance research in this field.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How deep can we decipher protein evolution with deep learning models 利用深度学习模型解密蛋白质进化的深度有多深?
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-09 DOI: 10.1016/j.patter.2024.101043

Evolutionary-based machine learning models have emerged as a fascinating approach to mapping the landscape for protein evolution. Lian et al. demonstrated that evolution-based deep generative models, specifically variational autoencoders, can organize SH3 homologs in a hierarchical latent space, effectively distinguishing the specific Sho1SH3 domains.

基于进化的机器学习模型已成为绘制蛋白质进化图谱的迷人方法。Lian等人证明,基于进化的深度生成模型,特别是变异自动编码器,可以在分层的潜在空间中组织SH3同源物,有效区分特定的Sho1SH3结构域。
{"title":"How deep can we decipher protein evolution with deep learning models","authors":"","doi":"10.1016/j.patter.2024.101043","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101043","url":null,"abstract":"<p>Evolutionary-based machine learning models have emerged as a fascinating approach to mapping the landscape for protein evolution. Lian et al. demonstrated that evolution-based deep generative models, specifically variational autoencoders, can organize SH3 homologs in a hierarchical latent space, effectively distinguishing the specific Sho1<sup>SH3</sup> domains.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meet the authors: Zixin Jiang and Bing Dong 与作者见面蒋子欣和董冰
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-09 DOI: 10.1016/j.patter.2024.101044

What can we do to mitigate climate change and achieve carbon neutrality for buildings? In their recent publication in Patterns, the authors proposed a modularized neural network incorporating physical priors for future building energy modeling, paving the way for scalable and reliable building energy modeling, optimization, retrofit designs, and buildings-to-grid integration. In this interview, the authors talk about incorporating fundamental heat transfer and thermodynamics knowledge into data-driven models.

我们能做些什么来减缓气候变化并实现建筑碳中和?在最近发表于《模式》(Patterns)的文章中,作者提出了一种模块化神经网络,将物理先验纳入未来的建筑能源建模中,为可扩展和可靠的建筑能源建模、优化、改造设计以及建筑与电网的集成铺平了道路。在这篇访谈中,作者谈到了将基本传热学和热力学知识纳入数据驱动模型的问题。
{"title":"Meet the authors: Zixin Jiang and Bing Dong","authors":"","doi":"10.1016/j.patter.2024.101044","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101044","url":null,"abstract":"<p>What can we do to mitigate climate change and achieve carbon neutrality for buildings? In their recent publication in <em>Patterns</em>, the authors proposed a modularized neural network incorporating physical priors for future building energy modeling, paving the way for scalable and reliable building energy modeling, optimization, retrofit designs, and buildings-to-grid integration. In this interview, the authors talk about incorporating fundamental heat transfer and thermodynamics knowledge into data-driven models.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exposing image splicing traces in scientific publications via uncertainty-guided refinement 通过不确定性引导的精炼揭示科学出版物中的图像拼接痕迹
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-08 DOI: 10.1016/j.patter.2024.101038

Recently, a surge in image manipulations in scientific publications has led to numerous retractions, highlighting the importance of image integrity. Although forensic detectors for image duplication and synthesis have been researched, the detection of image splicing in scientific publications remains largely unexplored. Splicing detection is more challenging than duplication detection due to the lack of reference images and more difficult than synthesis detection because of the presence of smaller tampered-with areas. Moreover, disruptive factors in scientific images, such as artifacts, abnormal patterns, and noise, present misleading features like splicing traces, rendering this task difficult. In addition, the scarcity of high-quality datasets of spliced scientific images has limited advancements. Therefore, we propose the uncertainty-guided refinement network (URN) to mitigate these disruptive factors. We also construct a dataset for image splicing detection (SciSp) with 1,290 spliced images by collecting and manually splicing. Comprehensive experiments demonstrate the URN’s superior splicing detection performance.

最近,科学出版物中的图像篡改现象激增,导致许多出版物被撤回,这凸显了图像完整性的重要性。尽管针对图像复制和合成的法证检测器已经得到研究,但科学出版物中的图像拼接检测在很大程度上仍未得到探索。由于缺乏参考图像,拼接检测比复制检测更具挑战性;由于存在较小的篡改区域,拼接检测比合成检测更加困难。此外,科学图像中的干扰因素,如人工痕迹、异常模式和噪声,会呈现出拼接痕迹等误导性特征,从而使这项任务变得困难。此外,高质量拼接科学图像数据集的稀缺也限制了研究的进展。因此,我们提出了不确定性引导细化网络(URN)来减少这些干扰因素。我们还通过收集和手动拼接的方式,构建了一个包含 1,290 幅拼接图像的图像拼接检测数据集(SciSp)。综合实验证明了 URN 的卓越拼接检测性能。
{"title":"Exposing image splicing traces in scientific publications via uncertainty-guided refinement","authors":"","doi":"10.1016/j.patter.2024.101038","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101038","url":null,"abstract":"<p>Recently, a surge in image manipulations in scientific publications has led to numerous retractions, highlighting the importance of image integrity. Although forensic detectors for image duplication and synthesis have been researched, the detection of image splicing in scientific publications remains largely unexplored. Splicing detection is more challenging than duplication detection due to the lack of reference images and more difficult than synthesis detection because of the presence of smaller tampered-with areas. Moreover, disruptive factors in scientific images, such as artifacts, abnormal patterns, and noise, present misleading features like splicing traces, rendering this task difficult. In addition, the scarcity of high-quality datasets of spliced scientific images has limited advancements. Therefore, we propose the uncertainty-guided refinement network (URN) to mitigate these disruptive factors. We also construct a dataset for image splicing detection (SciSp) with 1,290 spliced images by collecting and manually splicing. Comprehensive experiments demonstrate the URN’s superior splicing detection performance.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Highly accurate and precise determination of mouse mass using computer vision 利用计算机视觉高精度地测定鼠标质量
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-07 DOI: 10.1016/j.patter.2024.101039

Changes in body mass are key indicators of health in humans and animals and are routinely monitored in animal husbandry and preclinical studies. In rodent studies, the current method of manually weighing the animal on a balance causes at least two issues. First, directly handling the animal induces stress, possibly confounding studies. Second, these data are static, limiting continuous assessment and obscuring rapid changes. A non-invasive, continuous method of monitoring animal mass would have utility in multiple biomedical research areas. We combine computer vision with statistical modeling to demonstrate the feasibility of determining mouse body mass by using video data. Our methods determine mass with a 4.8% error across genetically diverse mouse strains with varied coat colors and masses. This error is low enough to replace manual weighing in most mouse studies. We conclude that visually determining rodent mass enables non-invasive, continuous monitoring, improving preclinical studies and animal welfare.

体重变化是人类和动物健康的关键指标,也是动物饲养和临床前研究的常规监测指标。在啮齿动物研究中,目前在天平上手动称量动物体重的方法至少会造成两个问题。首先,直接处理动物会造成应激,可能会干扰研究。其次,这些数据是静态的,限制了连续评估,并掩盖了快速变化。一种非侵入式的连续动物质量监测方法将在多个生物医学研究领域发挥作用。我们将计算机视觉与统计建模相结合,证明了利用视频数据确定小鼠体重的可行性。我们的方法能确定具有不同毛色和体重的不同基因小鼠品系的体重,误差仅为 4.8%。这个误差很低,足以在大多数小鼠研究中取代人工称重。我们的结论是,目测啮齿动物的体重可以实现无创、连续的监测,从而改善临床前研究和动物福利。
{"title":"Highly accurate and precise determination of mouse mass using computer vision","authors":"","doi":"10.1016/j.patter.2024.101039","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101039","url":null,"abstract":"<p>Changes in body mass are key indicators of health in humans and animals and are routinely monitored in animal husbandry and preclinical studies. In rodent studies, the current method of manually weighing the animal on a balance causes at least two issues. First, directly handling the animal induces stress, possibly confounding studies. Second, these data are static, limiting continuous assessment and obscuring rapid changes. A non-invasive, continuous method of monitoring animal mass would have utility in multiple biomedical research areas. We combine computer vision with statistical modeling to demonstrate the feasibility of determining mouse body mass by using video data. Our methods determine mass with a 4.8% error across genetically diverse mouse strains with varied coat colors and masses. This error is low enough to replace manual weighing in most mouse studies. We conclude that visually determining rodent mass enables non-invasive, continuous monitoring, improving preclinical studies and animal welfare.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A federated learning architecture for secure and private neuroimaging analysis 用于安全保密神经成像分析的联合学习架构
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-01 DOI: 10.1016/j.patter.2024.101031

The amount of biomedical data continues to grow rapidly. However, collecting data from multiple sites for joint analysis remains challenging due to security, privacy, and regulatory concerns. To overcome this challenge, we use federated learning, which enables distributed training of neural network models over multiple data sources without sharing data. Each site trains the neural network over its private data for some time and then shares the neural network parameters (i.e., weights and/or gradients) with a federation controller, which in turn aggregates the local models and sends the resulting community model back to each site, and the process repeats. Our federated learning architecture, MetisFL, provides strong security and privacy. First, sample data never leave a site. Second, neural network parameters are encrypted before transmission and the global neural model is computed under fully homomorphic encryption. Finally, we use information-theoretic methods to limit information leakage from the neural model to prevent a “curious” site from performing model inversion or membership attacks. We present a thorough evaluation of the performance of secure, private federated learning in neuroimaging tasks, including for predicting Alzheimer’s disease and for brain age gap estimation (BrainAGE) from magnetic resonance imaging (MRI) studies in challenging, heterogeneous federated environments where sites have different amounts of data and statistical distributions.

生物医学数据量持续快速增长。然而,出于安全、隐私和监管等方面的考虑,从多个站点收集数据进行联合分析仍然具有挑战性。为了克服这一挑战,我们采用了联合学习的方法,即在不共享数据的情况下,通过多个数据源对神经网络模型进行分布式训练。每个站点通过其私有数据对神经网络进行一段时间的训练,然后与联盟控制器共享神经网络参数(即权重和/或梯度),该控制器反过来汇总本地模型,并将生成的社区模型发送回每个站点,整个过程重复进行。我们的联合学习架构 MetisFL 具有很强的安全性和私密性。首先,样本数据永远不会离开站点。其次,神经网络参数在传输前已加密,全局神经模型是在完全同态加密的情况下计算得出的。最后,我们使用信息论方法限制神经模型的信息泄露,以防止 "好奇 "的网站进行模型反转或成员攻击。我们对神经成像任务中的安全、私有联合学习的性能进行了全面评估,包括在具有挑战性的异构联合环境中预测阿尔茨海默氏症和从磁共振成像(MRI)研究中估计脑年龄差距(BrainAGE),这些环境中的站点拥有不同数量的数据和统计分布。
{"title":"A federated learning architecture for secure and private neuroimaging analysis","authors":"","doi":"10.1016/j.patter.2024.101031","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101031","url":null,"abstract":"<p>The amount of biomedical data continues to grow rapidly. However, collecting data from multiple sites for joint analysis remains challenging due to security, privacy, and regulatory concerns. To overcome this challenge, we use federated learning, which enables distributed training of neural network models over multiple data sources without sharing data. Each site trains the neural network over its private data for some time and then shares the neural network parameters (i.e., weights and/or gradients) with a federation controller, which in turn aggregates the local models and sends the resulting community model back to each site, and the process repeats. Our federated learning architecture, MetisFL, provides strong security and privacy. First, sample data never leave a site. Second, neural network parameters are encrypted before transmission and the global neural model is computed under fully homomorphic encryption. Finally, we use information-theoretic methods to limit information leakage from the neural model to prevent a “curious” site from performing model inversion or membership attacks. We present a thorough evaluation of the performance of secure, private federated learning in neuroimaging tasks, including for predicting Alzheimer’s disease and for brain age gap estimation (BrainAGE) from magnetic resonance imaging (MRI) studies in challenging, heterogeneous federated environments where sites have different amounts of data and statistical distributions.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141865539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The reanimation of pseudoscience in machine learning and its ethical repercussions 机器学习中伪科学的复活及其伦理反响
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-01 DOI: 10.1016/j.patter.2024.101027

The present perspective outlines how epistemically baseless and ethically pernicious paradigms are recycled back into the scientific literature via machine learning (ML) and explores connections between these two dimensions of failure. We hold up the renewed emergence of physiognomic methods, facilitated by ML, as a case study in the harmful repercussions of ML-laundered junk science. A summary and analysis of several such studies is delivered, with attention to the means by which unsound research lends itself to social harms. We explore some of the many factors contributing to poor practice in applied ML. In conclusion, we offer resources for research best practices to developers and practitioners.

本视角概述了在认识论上毫无根据、在伦理道德上有害的范式是如何通过机器学习(ML)重新回到科学文献中的,并探讨了这两方面失败之间的联系。我们将机器学习推动下重新出现的相貌学方法作为一个案例,研究机器学习垃圾科学的有害影响。我们将对几项此类研究进行总结和分析,并关注不靠谱的研究是如何造成社会危害的。我们探讨了造成应用 ML 不良实践的诸多因素。最后,我们为开发人员和从业人员提供了研究最佳实践的资源。
{"title":"The reanimation of pseudoscience in machine learning and its ethical repercussions","authors":"","doi":"10.1016/j.patter.2024.101027","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101027","url":null,"abstract":"<p>The present perspective outlines how epistemically baseless and ethically pernicious paradigms are recycled back into the scientific literature via machine learning (ML) and explores connections between these two dimensions of failure. We hold up the renewed emergence of physiognomic methods, facilitated by ML, as a case study in the harmful repercussions of ML-laundered junk science. A summary and analysis of several such studies is delivered, with attention to the means by which unsound research lends itself to social harms. We explore some of the many factors contributing to poor practice in applied ML. In conclusion, we offer resources for research best practices to developers and practitioners.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141865439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the reversal curse and other deductive logical reasoning in BERT and GPT-based large language models 探索基于 BERT 和 GPT 的大型语言模型中的逆转诅咒和其他演绎逻辑推理
IF 6.5 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-25 DOI: 10.1016/j.patter.2024.101030

The “Reversal Curse” describes the inability of autoregressive decoder large language models (LLMs) to deduce “B is A” from “A is B,” assuming that B and A are distinct and can be uniquely identified from each other. This logical failure suggests limitations in using generative pretrained transformer (GPT) models for tasks like constructing knowledge graphs. Our study revealed that a bidirectional LLM, bidirectional encoder representations from transformers (BERT), does not suffer from this issue. To investigate further, we focused on more complex deductive reasoning by training encoder and decoder LLMs to perform union and intersection operations on sets. While both types of models managed tasks involving two sets, they struggled with operations involving three sets. Our findings underscore the differences between encoder and decoder models in handling logical reasoning. Thus, selecting BERT or GPT should depend on the task’s specific needs, utilizing BERT’s bidirectional context comprehension or GPT’s sequence prediction strengths.

逆转诅咒 "描述的是自回归解码器大型语言模型(LLM)无法从 "A 是 B "推导出 "B 是 A",前提是 B 和 A 是不同的,并且可以从彼此中唯一地识别出来。这种逻辑上的失败表明,在构建知识图谱等任务中使用生成式预训练转换器(GPT)模型存在局限性。我们的研究表明,双向 LLM--来自变换器的双向编码器表征(BERT)并不存在这个问题。为了进一步研究,我们将重点放在了更复杂的演绎推理上,训练编码器和解码器 LLM 对集合进行联合和相交运算。虽然这两类模型都能完成涉及两个集合的任务,但它们在涉及三个集合的运算中却举步维艰。我们的发现强调了编码器模型和解码器模型在处理逻辑推理方面的差异。因此,选择 BERT 还是 GPT 应取决于任务的具体需求,利用 BERT 的双向上下文理解能力或 GPT 的序列预测能力。
{"title":"Exploring the reversal curse and other deductive logical reasoning in BERT and GPT-based large language models","authors":"","doi":"10.1016/j.patter.2024.101030","DOIUrl":"https://doi.org/10.1016/j.patter.2024.101030","url":null,"abstract":"<p>The “Reversal Curse” describes the inability of autoregressive decoder large language models (LLMs) to deduce “B is A” from “A is B,” assuming that B and A are distinct and can be uniquely identified from each other. This logical failure suggests limitations in using generative pretrained transformer (GPT) models for tasks like constructing knowledge graphs. Our study revealed that a bidirectional LLM, bidirectional encoder representations from transformers (BERT), does not suffer from this issue. To investigate further, we focused on more complex deductive reasoning by training encoder and decoder LLMs to perform union and intersection operations on sets. While both types of models managed tasks involving two sets, they struggled with operations involving three sets. Our findings underscore the differences between encoder and decoder models in handling logical reasoning. Thus, selecting BERT or GPT should depend on the task’s specific needs, utilizing BERT’s bidirectional context comprehension or GPT’s sequence prediction strengths.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.5,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141778215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Patterns
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1