A method for unsupervised learning of coherent spatiotemporal patterns in multiscale data.

IF 9.1 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Proceedings of the National Academy of Sciences of the United States of America Pub Date : 2025-02-18 Epub Date: 2025-02-14 DOI:10.1073/pnas.2415786122
Karl Lapo, Sara M Ichinaga, J Nathan Kutz
{"title":"A method for unsupervised learning of coherent spatiotemporal patterns in multiscale data.","authors":"Karl Lapo, Sara M Ichinaga, J Nathan Kutz","doi":"10.1073/pnas.2415786122","DOIUrl":null,"url":null,"abstract":"<p><p>The unsupervised and principled diagnosis of multiscale data is a fundamental obstacle in modern scientific problems from, for instance, weather and climate prediction, neurology, epidemiology, and turbulence. Multiscale data are characterized by a combination of processes acting along multiple dimensions simultaneously, spatiotemporal scales across orders of magnitude, nonstationarity, and/or invariances such as translation and rotation. Existing methods are not well-suited to multiscale data, usually requiring supervised strategies such as human intervention, extensive tuning, or selection of ideal time periods. We present the multiresolution coherent spatio-temporal scale separation (mrCOSTS), a hierarchical and automated algorithm for the diagnosis of coherent patterns or modes in multiscale data. mrCOSTS is a variant of dynamic mode decomposition which decomposes data into bands of spatial patterns with shared time dynamics, thereby providing a robust method for analyzing multiscale data. It requires no training but instead takes advantage of the hierarchical nature of multiscale systems. We demonstrate mrCOSTS using complex multiscale datasets that are canonically difficult to analyze: 1) climate patterns of sea surface temperature, 2) electrophysiological observations of neural signals of the motor cortex, and 3) horizontal wind in the mountain boundary layer. With mrCOSTS, we trivially retrieve complex dynamics that were previously difficult to resolve while additionally extracting hitherto unknown patterns of activity embedded in the dynamics, allowing for advancing the understanding of these fields of study. This method is an important advancement for addressing the multiscale data which characterize many of the grand challenges in science and engineering.</p>","PeriodicalId":20548,"journal":{"name":"Proceedings of the National Academy of Sciences of the United States of America","volume":"122 7","pages":"e2415786122"},"PeriodicalIF":9.1000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11848389/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the National Academy of Sciences of the United States of America","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1073/pnas.2415786122","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/14 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The unsupervised and principled diagnosis of multiscale data is a fundamental obstacle in modern scientific problems from, for instance, weather and climate prediction, neurology, epidemiology, and turbulence. Multiscale data are characterized by a combination of processes acting along multiple dimensions simultaneously, spatiotemporal scales across orders of magnitude, nonstationarity, and/or invariances such as translation and rotation. Existing methods are not well-suited to multiscale data, usually requiring supervised strategies such as human intervention, extensive tuning, or selection of ideal time periods. We present the multiresolution coherent spatio-temporal scale separation (mrCOSTS), a hierarchical and automated algorithm for the diagnosis of coherent patterns or modes in multiscale data. mrCOSTS is a variant of dynamic mode decomposition which decomposes data into bands of spatial patterns with shared time dynamics, thereby providing a robust method for analyzing multiscale data. It requires no training but instead takes advantage of the hierarchical nature of multiscale systems. We demonstrate mrCOSTS using complex multiscale datasets that are canonically difficult to analyze: 1) climate patterns of sea surface temperature, 2) electrophysiological observations of neural signals of the motor cortex, and 3) horizontal wind in the mountain boundary layer. With mrCOSTS, we trivially retrieve complex dynamics that were previously difficult to resolve while additionally extracting hitherto unknown patterns of activity embedded in the dynamics, allowing for advancing the understanding of these fields of study. This method is an important advancement for addressing the multiscale data which characterize many of the grand challenges in science and engineering.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多尺度数据中相干时空模式的无监督学习方法。
多尺度数据的无监督和原则性诊断是现代科学问题的根本障碍,例如天气和气候预测、神经病学、流行病学和湍流。多尺度数据的特点是同时在多个维度上作用的过程的组合,跨数量级的时空尺度,非平稳性和/或不变性,如平移和旋转。现有的方法不太适合多尺度数据,通常需要有监督的策略,如人为干预、广泛调整或选择理想的时间段。我们提出了多分辨率相干时空尺度分离(mrCOSTS),这是一种用于多尺度数据中相干模式或模式诊断的分层自动化算法。mrCOSTS是动态模式分解的一种变体,它将数据分解成具有共享时间动态的空间模式带,从而为多尺度数据分析提供了一种鲁棒的方法。它不需要训练,而是利用了多尺度系统的层次特性。我们使用复杂的多尺度数据集来演示mrCOSTS,这些数据集通常难以分析:1)海面温度的气候模式,2)运动皮层神经信号的电生理观测,以及3)山边界层的水平风。通过mrCOSTS,我们可以轻松地检索以前难以解决的复杂动态,同时额外提取嵌入在动态中的迄今未知的活动模式,从而促进对这些研究领域的理解。这种方法是处理多尺度数据的重要进步,多尺度数据是科学和工程中许多重大挑战的特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
19.00
自引率
0.90%
发文量
3575
审稿时长
2.5 months
期刊介绍: The Proceedings of the National Academy of Sciences (PNAS), a peer-reviewed journal of the National Academy of Sciences (NAS), serves as an authoritative source for high-impact, original research across the biological, physical, and social sciences. With a global scope, the journal welcomes submissions from researchers worldwide, making it an inclusive platform for advancing scientific knowledge.
期刊最新文献
Enhancer-directed gene delivery for digit regeneration based on conserved epidermal factors. Machine learning multiscale collective cell dynamics: From single-cell characterization to multicellular monolayer modeling. Monoclonal antibodies against nitrated nerve growth factor reveal an oxidation-dependent pathogenic hallmark in ALS. A conserved E3 ubiquitin ligase rewires carotenoid, sterol, and lipid metabolism in Xanthophyllomyces dendrorhous. Serotonin potentiates presynaptic calcium currents and transmitter release at cortical synapses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1