The Shape of Things: Topological Data Analysis

N. Lazar, Hyunnam Ryu
Journal: Chance (New York, N.Y.), volume 3 1, pages 59-64. Journal Article. Published 2021-04-03. DOI: https://doi.org/10.1080/09332480.2021.1915036
Citations: 1

Abstract

An interesting feature of much modern Big Data is that the data we collect, or the data we want to analyze, are not necessarily in the traditional matrix or array form familiar from our textbooks. They may be coerced to such a format for relative ease of analysis, but this is not a strong justification. Past columns have explored new methods that exploit the natural structure of such data sets more directly. Topological data analysis (TDA) is one such method. Much daunting mathematics lies behind the methods of TDA, but it is possible to gain an idea and understanding of the approach and its potential usefulness even without a deep dive into the intricacies of topology, homology classes, and the like. In fact, the basic idea is quite simple: to study data through their low-dimensional topological features, which translate into connected components (dimension 0), loops (dimension 1), and voids (dimension 2). Higher dimensions do exist, but often do not contain much useful information. For three-dimensional data, topological features can be considered up to dimension 2 at most. A good analogy to make the meaning of these features concrete is a piece of Swiss cheese. The piece of cheese itself is one connected component. The holes that are apparent on the ...
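To make the dimension-0 and dimension-1 features concrete, the following is a minimal sketch in Python, not the authors' method. It assumes a simplified setting: points are linked whenever they lie within a chosen distance `eps`, and the resulting graph is summarized by its number of connected components (b0) and its number of independent loops (b1 = edges - vertices + b0). The function name `betti_of_neighborhood_graph`, the example point sets, and the value of `eps` are all illustrative assumptions. Unlike the complexes typically used in TDA software, this sketch does not fill in triangles, so it only behaves sensibly when `eps` is small enough that no three points are mutually linked.

```python
import math
from itertools import combinations


def betti_of_neighborhood_graph(points, eps):
    """Return (b0, b1) for the graph linking points at distance <= eps.

    b0 counts connected components (dimension 0).
    b1 = E - V + b0 counts independent cycles of the graph (dimension 1).
    Simplifying assumption: triangles are NOT filled in, so this is the
    1-skeleton only, not a full Vietoris-Rips complex.
    """
    n = len(points)
    parent = list(range(n))  # union-find forest over point indices

    def find(i):
        # Follow parent links to the root, halving paths along the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    edges = 0
    for i, j in combinations(range(n), 2):
        if math.dist(points[i], points[j]) <= eps:
            edges += 1
            parent[find(i)] = find(j)  # merge the two components

    b0 = len({find(i) for i in range(n)})
    b1 = edges - n + b0
    return b0, b1


# Hypothetical data: 12 evenly spaced points on a unit circle, plus a
# second copy of the same ring shifted far to the right.
circle = [(math.cos(2 * math.pi * k / 12), math.sin(2 * math.pi * k / 12))
          for k in range(12)]
far_circle = [(x + 10.0, y) for x, y in circle]

# Adjacent points on the ring are about 0.52 apart, so eps = 0.6 links
# each point only to its two neighbours.
print(betti_of_neighborhood_graph(circle, 0.6))               # (1, 1)
print(betti_of_neighborhood_graph(circle + far_circle, 0.6))  # (2, 2)
print(betti_of_neighborhood_graph(circle, 0.1))               # (12, 0)
```

At `eps = 0.6` a single ring is one component with one loop; two well-separated rings give two components and two loops; at `eps = 0.1` nothing is linked, so every point is its own component and there are no loops. How these counts change as the scale varies is the kind of information TDA tracks.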