首页 > 最新文献

arXiv - PHYS - Data Analysis, Statistics and Probability最新文献

英文 中文
Burst-tree structure and higher-order temporal correlations 突发树结构和高阶时间相关性
Pub Date : 2024-09-03 DOI: arxiv-2409.01674
Tibebe Birhanu, Hang-Hyun Jo
Understanding characteristics of temporal correlations in time series iscrucial for developing accurate models in natural and social sciences. Theburst-tree decomposition method was recently introduced to reveal higher-ordertemporal correlations in time series in a form of an event sequence, inparticular, the hierarchical structure of bursty trains of events for theentire range of timescales [Jo et al., Sci.~Rep.~textbf{10}, 12202 (2020)].Such structure has been found to be simply characterized by the burst-mergingkernel governing which bursts are merged together as the timescale fordetecting bursts increases. In this work, we study the effects of kernels onthe higher-order temporal correlations in terms of burst size distributions,memory coefficients for bursts, and the autocorrelation function. We employseveral kernels, including the constant, additive, and product kernels as wellas those inspired by the empirical results. We find that kernels withpreferential mixing lead to the heavy-tailed burst size distributions, whilekernels with assortative mixing lead to positive correlations between burstsizes. The decaying exponent of the autocorrelation function depends not onlyon the kernel but also on the power-law exponent of the interevent timedistribution. In addition, thanks to the analogy to the coagulation process,analytical solutions of burst size distributions for some kernels could beobtained.
了解时间序列中时间相关性的特征对于建立自然科学和社会科学的精确模型至关重要。最近引入的爆发树分解方法以事件序列的形式揭示了时间序列中的高阶时间相关性,特别是整个时间尺度范围内事件爆发序列的层次结构[Jo 等,Sci.~Rep.~textbf{10},12202 (2020)]。在这项工作中,我们从突发大小分布、突发记忆系数和自相关函数等方面研究了核对高阶时间相关性的影响。我们采用了多种核,包括常数核、加法核、乘积核以及受经验结果启发的核。我们发现,具有偏好混合的核会导致重尾突发规模分布,而具有同类混合的核则会导致突发规模之间的正相关。自相关函数的衰减指数不仅取决于核,还取决于事件间时间分布的幂律指数。此外,由于类比了凝结过程,可以得到某些核的猝发大小分布的解析解。
{"title":"Burst-tree structure and higher-order temporal correlations","authors":"Tibebe Birhanu, Hang-Hyun Jo","doi":"arxiv-2409.01674","DOIUrl":"https://doi.org/arxiv-2409.01674","url":null,"abstract":"Understanding characteristics of temporal correlations in time series is\u0000crucial for developing accurate models in natural and social sciences. The\u0000burst-tree decomposition method was recently introduced to reveal higher-order\u0000temporal correlations in time series in a form of an event sequence, in\u0000particular, the hierarchical structure of bursty trains of events for the\u0000entire range of timescales [Jo et al., Sci.~Rep.~textbf{10}, 12202 (2020)].\u0000Such structure has been found to be simply characterized by the burst-merging\u0000kernel governing which bursts are merged together as the timescale for\u0000detecting bursts increases. In this work, we study the effects of kernels on\u0000the higher-order temporal correlations in terms of burst size distributions,\u0000memory coefficients for bursts, and the autocorrelation function. We employ\u0000several kernels, including the constant, additive, and product kernels as well\u0000as those inspired by the empirical results. We find that kernels with\u0000preferential mixing lead to the heavy-tailed burst size distributions, while\u0000kernels with assortative mixing lead to positive correlations between burst\u0000sizes. The decaying exponent of the autocorrelation function depends not only\u0000on the kernel but also on the power-law exponent of the interevent time\u0000distribution. In addition, thanks to the analogy to the coagulation process,\u0000analytical solutions of burst size distributions for some kernels could be\u0000obtained.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Categorising current-voltage curves in single-molecule junctions and their comparison to Single-Level Model 单分子结的电流-电压曲线分类及其与单级模型的比较
Pub Date : 2024-08-30 DOI: arxiv-2409.09051
Giovanna Angelis Schmidt
This thesis investigates the mechanically controlled break junctions, with aparticular emphasis on elucidating the behaviour of molecular currents at roomtemperature. The core of this experimental investigation involves a detailedanalysis of conductance, examining how it varies over time and with changes inthe gap between electrodes. Additionally, this study thoroughly evaluatestransmission properties, coupling effects, and current characteristics. Apivotal aspect of the research was the meticulous current measurement, followedby carefully selecting optimal data sets. This process set the stage for anin-depth analysis of resonant tunnelling phenomena observed through a singlechannel. Notably, these experiments were conducted under open atmosphericconditions at room temperature. A significant finding from this study is therecognition that our current model requires refinement. This adjustment isnecessary to encapsulate a broader spectrum of molecular transport mechanismsmore accurately. Furthermore, this work significantly advances ourcomprehension of quantum effects in single-molecule junctions, particularlyconcerning similar molecules to Corannulene extending to some organometallics.One of the essential disclosures is the identification of deviations in thetransport model, primarily attributable to electron-electron interactions. Thisinsight is crucial as it paves the way for developing a more comprehensive andprecise model, enhancing our understanding of molecular-scale electronictransport.
本论文研究机械控制的断裂结,尤其侧重于阐明分子电流在室温下的行为。实验研究的核心是对电导进行详细分析,研究电导如何随时间和电极间隙的变化而变化。此外,这项研究还全面评估了传输特性、耦合效应和电流特性。这项研究的关键是对电流进行细致的测量,然后仔细选择最佳数据集。这一过程为深入分析通过单通道观察到的共振隧道现象奠定了基础。值得注意的是,这些实验是在室温的开放大气条件下进行的。这项研究的一个重要发现是认识到我们目前的模型需要改进。为了更准确地囊括更广泛的分子传输机制,这一调整是必要的。此外,这项工作极大地推动了我们对单分子结中量子效应的理解,尤其是对类似于堇菜烯的分子以及某些有机金属的理解。这一发现至关重要,因为它为建立更全面、更精确的模型铺平了道路,增强了我们对分子尺度电子传输的理解。
{"title":"Categorising current-voltage curves in single-molecule junctions and their comparison to Single-Level Model","authors":"Giovanna Angelis Schmidt","doi":"arxiv-2409.09051","DOIUrl":"https://doi.org/arxiv-2409.09051","url":null,"abstract":"This thesis investigates the mechanically controlled break junctions, with a\u0000particular emphasis on elucidating the behaviour of molecular currents at room\u0000temperature. The core of this experimental investigation involves a detailed\u0000analysis of conductance, examining how it varies over time and with changes in\u0000the gap between electrodes. Additionally, this study thoroughly evaluates\u0000transmission properties, coupling effects, and current characteristics. A\u0000pivotal aspect of the research was the meticulous current measurement, followed\u0000by carefully selecting optimal data sets. This process set the stage for an\u0000in-depth analysis of resonant tunnelling phenomena observed through a single\u0000channel. Notably, these experiments were conducted under open atmospheric\u0000conditions at room temperature. A significant finding from this study is the\u0000recognition that our current model requires refinement. This adjustment is\u0000necessary to encapsulate a broader spectrum of molecular transport mechanisms\u0000more accurately. Furthermore, this work significantly advances our\u0000comprehension of quantum effects in single-molecule junctions, particularly\u0000concerning similar molecules to Corannulene extending to some organometallics.\u0000One of the essential disclosures is the identification of deviations in the\u0000transport model, primarily attributable to electron-electron interactions. This\u0000insight is crucial as it paves the way for developing a more comprehensive and\u0000precise model, enhancing our understanding of molecular-scale electronic\u0000transport.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"65 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142253356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Preservation of the Direct Photon and Neutral Meson Analysis in the PHENIX Experiment at RHIC 在 RHIC 的 PHENIX 实验中保留直接光子和中性介子分析
Pub Date : 2024-08-22 DOI: arxiv-2408.12072
Gabor David, Maxim Potekhin, Dmitri Smirnov
The PHENIX Collaboration has actively pursued a Data and AnalysisPreservation program since 2019, the first such dedicated effort at RHIC. Aparticularly challenging aspect of this endeavor is preservation of complexphysics analyses, selected for their scientific importance and the value of thespecific techniques developed as a part of the research. For this, we havechosen one of the most impactful PHENIX results, the joint study of directphotons and neutral pions in high-energy d+Au collisions. To ensurereproducibility of this analysis going forward, we partitioned it intoself-contained tasks and used a combination of containerization techniques,code management, and robust documentation. We then leveraged REANA (theplatform for reproducible analysis developed at CERN) to run the requiredsoftware. We present our experience based on this example, and outline ourfuture plans for analysis preservation.
自 2019 年以来,PHENIX 合作组织一直在积极开展数据和分析保存计划,这是 RHIC 首次开展此类专门工作。这项工作的一个特别具有挑战性的方面是保存综合物理分析,这些分析因其科学重要性和作为研究一部分开发的特定技术的价值而被选中。为此,我们选择了最具影响力的 PHENIX 成果之一,即在高能 d+Au 碰撞中对直接光子和中性离子的联合研究。为了确保这项分析的可重复性,我们将其划分为独立的任务,并结合使用了容器化技术、代码管理和强大的文档。然后,我们利用 REANA(欧洲核子研究中心开发的可重现分析平台)来运行所需的软件。我们根据这个例子介绍了我们的经验,并概述了我们未来的分析保存计划。
{"title":"Preservation of the Direct Photon and Neutral Meson Analysis in the PHENIX Experiment at RHIC","authors":"Gabor David, Maxim Potekhin, Dmitri Smirnov","doi":"arxiv-2408.12072","DOIUrl":"https://doi.org/arxiv-2408.12072","url":null,"abstract":"The PHENIX Collaboration has actively pursued a Data and Analysis\u0000Preservation program since 2019, the first such dedicated effort at RHIC. A\u0000particularly challenging aspect of this endeavor is preservation of complex\u0000physics analyses, selected for their scientific importance and the value of the\u0000specific techniques developed as a part of the research. For this, we have\u0000chosen one of the most impactful PHENIX results, the joint study of direct\u0000photons and neutral pions in high-energy d+Au collisions. To ensure\u0000reproducibility of this analysis going forward, we partitioned it into\u0000self-contained tasks and used a combination of containerization techniques,\u0000code management, and robust documentation. We then leveraged REANA (the\u0000platform for reproducible analysis developed at CERN) to run the required\u0000software. We present our experience based on this example, and outline our\u0000future plans for analysis preservation.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"82 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active Learning of Molecular Data for Task-Specific Objectives 针对特定任务目标主动学习分子数据
Pub Date : 2024-08-20 DOI: arxiv-2408.11191
Kunal Ghosh, Milica Todorović, Aki Vehtari, Patrick Rinke
Active learning (AL) has shown promise for being a particularlydata-efficient machine learning approach. Yet, its performance depends on theapplication and it is not clear when AL practitioners can expect computationalsavings. Here, we carry out a systematic AL performance assessment for threediverse molecular datasets and two common scientific tasks: compiling compact,informative datasets and targeted molecular searches. We implemented AL withGaussian processes (GP) and used the many-body tensor as molecularrepresentation. For the first task, we tested different data acquisitionstrategies, batch sizes and GP noise settings. AL was insensitive to theacquisition batch size and we observed the best AL performance for theacquisition strategy that combines uncertainty reduction with clustering topromote diversity. However, for optimal GP noise settings, AL did notoutperform randomized selection of data points. Conversely, for targetedsearches, AL outperformed random sampling and achieved data savings up to 64%.Our analysis provides insight into this task-specific performance difference interms of target distributions and data collection strategies. We establishedthat the performance of AL depends on the relative distribution of the targetmolecules in comparison to the total dataset distribution, with the largestcomputational savings achieved when their overlap is minimal.
主动学习(AL)有望成为一种数据效率特别高的机器学习方法。然而,主动学习的性能取决于应用,目前还不清楚主动学习实践者何时可以期望节省计算量。在这里,我们针对三种不同的分子数据集和两种常见的科学任务进行了系统的 AL 性能评估:编译紧凑、信息丰富的数据集和有针对性的分子搜索。我们用高斯过程(GP)实现了 AL,并使用多体张量作为分子描述。对于第一个任务,我们测试了不同的数据采集策略、批量大小和 GP 噪声设置。AL对采集批量大小不敏感,我们观察到,将减少不确定性与促进多样性的聚类相结合的采集策略具有最佳的AL性能。然而,对于最佳的 GP 噪声设置,AL 的表现并不优于随机选择数据点。相反,对于有针对性的搜索,AL 的性能优于随机抽样,并节省了高达 64% 的数据。我们的分析深入揭示了目标分布和数据采集策略在特定任务中的性能差异。我们的分析深入揭示了目标分布和数据收集策略方面的这种特定任务性能差异。我们发现,AL 的性能取决于目标分子相对于整个数据集分布的相对分布,当两者的重叠最小时,计算量节省最大。
{"title":"Active Learning of Molecular Data for Task-Specific Objectives","authors":"Kunal Ghosh, Milica Todorović, Aki Vehtari, Patrick Rinke","doi":"arxiv-2408.11191","DOIUrl":"https://doi.org/arxiv-2408.11191","url":null,"abstract":"Active learning (AL) has shown promise for being a particularly\u0000data-efficient machine learning approach. Yet, its performance depends on the\u0000application and it is not clear when AL practitioners can expect computational\u0000savings. Here, we carry out a systematic AL performance assessment for three\u0000diverse molecular datasets and two common scientific tasks: compiling compact,\u0000informative datasets and targeted molecular searches. We implemented AL with\u0000Gaussian processes (GP) and used the many-body tensor as molecular\u0000representation. For the first task, we tested different data acquisition\u0000strategies, batch sizes and GP noise settings. AL was insensitive to the\u0000acquisition batch size and we observed the best AL performance for the\u0000acquisition strategy that combines uncertainty reduction with clustering to\u0000promote diversity. However, for optimal GP noise settings, AL did not\u0000outperform randomized selection of data points. Conversely, for targeted\u0000searches, AL outperformed random sampling and achieved data savings up to 64%.\u0000Our analysis provides insight into this task-specific performance difference in\u0000terms of target distributions and data collection strategies. We established\u0000that the performance of AL depends on the relative distribution of the target\u0000molecules in comparison to the total dataset distribution, with the largest\u0000computational savings achieved when their overlap is minimal.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved precision and accuracy of electron energy-loss spectroscopy quantification via fine structure fitting with constrained optimization 通过约束优化精细结构拟合提高电子能量损失光谱量化的精度和准确性
Pub Date : 2024-08-19 DOI: arxiv-2408.11870
Daen Jannis, Wouter Van den Broek, Zezhong Zhang, Sandra Van Aert, Jo Verbeeck
By working out the Bethe sum rule, a boundary condition that takes the formof a linear equality is derived for the fine structure observed in ionizationedges present in electron energy-loss spectra. This condition is subsequentlyused as a constraint in the estimation process of the elemental abundances,demonstrating starkly improved precision and accuracy and reduced sensitivityto the number of model parameters. Furthermore, the fine structure is reliablyextracted from the spectra in an automated way, thus providing criticalinformation on the sample's electronic properties that is hard or impossible toobtain otherwise. Since this approach allows dispensing with the need foruser-provided input, a potential source of bias is prevented.
通过计算贝特和规则,得出了电子能量损失光谱中电离边的精细结构的线性相等边界条件。这一条件随后被用作元素丰度估算过程中的约束条件,其精确度和准确性明显提高,并降低了对模型参数数量的敏感性。此外,还能以自动化方式从光谱中可靠地提取精细结构,从而提供有关样品电子特性的重要信息,而这些信息是很难或不可能获得的。由于这种方法无需用户提供输入,因此避免了潜在的偏差来源。
{"title":"Improved precision and accuracy of electron energy-loss spectroscopy quantification via fine structure fitting with constrained optimization","authors":"Daen Jannis, Wouter Van den Broek, Zezhong Zhang, Sandra Van Aert, Jo Verbeeck","doi":"arxiv-2408.11870","DOIUrl":"https://doi.org/arxiv-2408.11870","url":null,"abstract":"By working out the Bethe sum rule, a boundary condition that takes the form\u0000of a linear equality is derived for the fine structure observed in ionization\u0000edges present in electron energy-loss spectra. This condition is subsequently\u0000used as a constraint in the estimation process of the elemental abundances,\u0000demonstrating starkly improved precision and accuracy and reduced sensitivity\u0000to the number of model parameters. Furthermore, the fine structure is reliably\u0000extracted from the spectra in an automated way, thus providing critical\u0000information on the sample's electronic properties that is hard or impossible to\u0000obtain otherwise. Since this approach allows dispensing with the need for\u0000user-provided input, a potential source of bias is prevented.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KAN 2.0: Kolmogorov-Arnold Networks Meet Science KAN 2.0:柯尔莫哥洛夫-阿诺德网络与科学相遇
Pub Date : 2024-08-19 DOI: arxiv-2408.10205
Ziming Liu, Pingchuan Ma, Yixuan Wang, Wojciech Matusik, Max Tegmark
A major challenge of AI + Science lies in their inherent incompatibility:today's AI is primarily based on connectionism, while science depends onsymbolism. To bridge the two worlds, we propose a framework to seamlesslysynergize Kolmogorov-Arnold Networks (KANs) and science. The frameworkhighlights KANs' usage for three aspects of scientific discovery: identifyingrelevant features, revealing modular structures, and discovering symbolicformulas. The synergy is bidirectional: science to KAN (incorporatingscientific knowledge into KANs), and KAN to science (extracting scientificinsights from KANs). We highlight major new functionalities in the pykanpackage: (1) MultKAN: KANs with multiplication nodes. (2) kanpiler: a KANcompiler that compiles symbolic formulas into KANs. (3) tree converter: convertKANs (or any neural networks) to tree graphs. Based on these tools, wedemonstrate KANs' capability to discover various types of physical laws,including conserved quantities, Lagrangians, symmetries, and constitutive laws.
当今的人工智能主要基于连接主义,而科学则依赖于符号主义。为了沟通这两个世界,我们提出了一个将科尔莫哥罗夫-阿诺德网络(KANs)与科学无缝协同的框架。该框架强调了 KAN 在科学发现三个方面的用途:识别相关特征、揭示模块结构和发现符号公式。协同作用是双向的:科学到 KAN(将科学知识纳入 KAN),KAN 到科学(从 KAN 中提取科学见解)。我们重点介绍 pykanpackage 中的主要新功能:(1) MultKAN:带有乘法节点的 KAN。(2) kanpiler:将符号公式编译成 KAN 的 KAN 编译器。(3) 树状图转换器:将 KAN(或任何神经网络)转换为树状图。基于这些工具,我们展示了 KAN 发现各类物理定律的能力,包括守恒量、拉格朗日、对称性和构成定律。
{"title":"KAN 2.0: Kolmogorov-Arnold Networks Meet Science","authors":"Ziming Liu, Pingchuan Ma, Yixuan Wang, Wojciech Matusik, Max Tegmark","doi":"arxiv-2408.10205","DOIUrl":"https://doi.org/arxiv-2408.10205","url":null,"abstract":"A major challenge of AI + Science lies in their inherent incompatibility:\u0000today's AI is primarily based on connectionism, while science depends on\u0000symbolism. To bridge the two worlds, we propose a framework to seamlessly\u0000synergize Kolmogorov-Arnold Networks (KANs) and science. The framework\u0000highlights KANs' usage for three aspects of scientific discovery: identifying\u0000relevant features, revealing modular structures, and discovering symbolic\u0000formulas. The synergy is bidirectional: science to KAN (incorporating\u0000scientific knowledge into KANs), and KAN to science (extracting scientific\u0000insights from KANs). We highlight major new functionalities in the pykan\u0000package: (1) MultKAN: KANs with multiplication nodes. (2) kanpiler: a KAN\u0000compiler that compiles symbolic formulas into KANs. (3) tree converter: convert\u0000KANs (or any neural networks) to tree graphs. Based on these tools, we\u0000demonstrate KANs' capability to discover various types of physical laws,\u0000including conserved quantities, Lagrangians, symmetries, and constitutive laws.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"153 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two points are enough 两点就够了
Pub Date : 2024-08-19 DOI: arxiv-2408.11872
Hao Liu, Yanbin Zhao, Huarong Zheng, Xiulin Fan, Zhihua Deng, Mengchi Chen, Xingkai Wang, Zhiyang Liu, Jianguo Lu, Jian Chen
Prognosis and diagnosis play an important role in accelerating thedevelopment of lithium-ion batteries, as well as reliable and long-lifeoperation. In this work, we answer an important question: What is the minimumamount of data required to extract features for accurate battery prognosis anddiagnosis? Based on the first principle, we successfully extracted the besttwo-point feature (BTPF) for accurate battery prognosis and diagnosis using thefewest data points (only two) and the simplest feature selection method(Pearson correlation coefficient). The BTPF extraction method is tested on 820cells from 6 open-source datasets (covering five different chemistry types,seven manufacturers, and three data types). It achieves comparable accuracy tostate-of-the-art features in both prognosis and diagnosis tasks. This workchallenges the cognition of existing studies on the difficulty of batteryprognosis and diagnosis tasks, subverts the fixed pattern of establishingprognosis and diagnosis methods for complex dynamic systems through deliberatefeature engineering, highlights the promise of data-driven methods for fieldbattery prognosis and diagnosis applications, and provides a new benchmark forfuture studies.
预测和诊断在加速锂离子电池的开发以及实现可靠和长寿命运行方面发挥着重要作用。在这项工作中,我们回答了一个重要问题:提取准确的电池预报和诊断特征所需的最小数据量是多少?基于第一条原则,我们使用最少的数据点(仅两个)和最简单的特征选择方法(皮尔逊相关系数),成功提取了用于准确电池预报和诊断的最佳两点特征(BTPF)。BTPF 提取方法在 6 个开源数据集(涵盖 5 种不同化学类型、7 家制造商和 3 种数据类型)的 820 个电池上进行了测试。在预后和诊断任务中,该方法的准确性与目前最先进的特征相当。这项工作挑战了现有研究对电池预测和诊断任务难度的认知,颠覆了通过深思熟虑的特征工程建立复杂动态系统预测和诊断方法的固定模式,凸显了数据驱动方法在现场电池预测和诊断应用中的前景,并为未来研究提供了新的基准。
{"title":"Two points are enough","authors":"Hao Liu, Yanbin Zhao, Huarong Zheng, Xiulin Fan, Zhihua Deng, Mengchi Chen, Xingkai Wang, Zhiyang Liu, Jianguo Lu, Jian Chen","doi":"arxiv-2408.11872","DOIUrl":"https://doi.org/arxiv-2408.11872","url":null,"abstract":"Prognosis and diagnosis play an important role in accelerating the\u0000development of lithium-ion batteries, as well as reliable and long-life\u0000operation. In this work, we answer an important question: What is the minimum\u0000amount of data required to extract features for accurate battery prognosis and\u0000diagnosis? Based on the first principle, we successfully extracted the best\u0000two-point feature (BTPF) for accurate battery prognosis and diagnosis using the\u0000fewest data points (only two) and the simplest feature selection method\u0000(Pearson correlation coefficient). The BTPF extraction method is tested on 820\u0000cells from 6 open-source datasets (covering five different chemistry types,\u0000seven manufacturers, and three data types). It achieves comparable accuracy to\u0000state-of-the-art features in both prognosis and diagnosis tasks. This work\u0000challenges the cognition of existing studies on the difficulty of battery\u0000prognosis and diagnosis tasks, subverts the fixed pattern of establishing\u0000prognosis and diagnosis methods for complex dynamic systems through deliberate\u0000feature engineering, highlights the promise of data-driven methods for field\u0000battery prognosis and diagnosis applications, and provides a new benchmark for\u0000future studies.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large-Scale Pretraining and Finetuning for Efficient Jet Classification in Particle Physics 大规模预训练和微调,实现粒子物理中的高效射流分类
Pub Date : 2024-08-18 DOI: arxiv-2408.09343
Zihan Zhao, Farouk Mokhtar, Raghav Kansal, Haoyang Li, Javier Duarte
This study introduces an innovative approach to analyzing unlabeled data inhigh-energy physics (HEP) through the application of self-supervised learning(SSL). Faced with the increasing computational cost of producing high-qualitylabeled simulation samples at the CERN LHC, we propose leveraging large volumesof unlabeled data to overcome the limitations of supervised learning methods,which heavily rely on detailed labeled simulations. By pretraining models onthese vast, mostly untapped datasets, we aim to learn generic representationsthat can be finetuned with smaller quantities of labeled data. Our methodologyemploys contrastive learning with augmentations on jet datasets to teach themodel to recognize common representations of jets, addressing the uniquechallenges of LHC physics. Building on the groundwork laid by previous studies,our work demonstrates the critical ability of SSL to utilize large-scaleunlabeled data effectively. We showcase the scalability and effectiveness ofour models by gradually increasing the size of the pretraining dataset andassessing the resultant performance enhancements. Our results, obtained fromexperiments on two datasets -- JetClass, representing unlabeled data, and TopTagging, serving as labeled simulation data -- show significant improvements indata efficiency, computational efficiency, and overall performance. Thesefindings suggest that SSL can greatly enhance the adaptability of ML models tothe HEP domain. This work opens new avenues for the use of unlabeled data inHEP and contributes to a better understanding the potential of SSL forscientific discovery.
本研究介绍了一种通过应用自监督学习(SSL)来分析高能物理(HEP)中未标记数据的创新方法。面对欧洲核子研究中心大型强子对撞机(CERN LHC)制作高质量标签模拟样本的计算成本不断增加的问题,我们建议利用大量未标签数据来克服监督学习方法的局限性,因为监督学习方法严重依赖于详细的标签模拟。通过在这些庞大的、大部分尚未开发的数据集上预训练模型,我们的目标是学习通用表示法,然后再用较小数量的标注数据进行微调。我们的方法利用对比学习和喷流数据集上的增强来教模型识别喷流的常见表示,从而解决大型强子对撞机物理的独特挑战。在以往研究奠定的基础上,我们的工作展示了 SSL 有效利用大规模无标记数据的关键能力。我们通过逐步增加预训练数据集的规模和评估由此带来的性能提升,展示了我们模型的可扩展性和有效性。我们在两个数据集(代表无标签数据的 JetClass 和作为有标签模拟数据的 TopTagging)上的实验结果表明,数据效率、计算效率和整体性能都有显著提高。这些发现表明,SSL 可以大大提高 ML 模型在 HEP 领域的适应性。这项工作为在 HEP 中使用无标记数据开辟了新途径,有助于更好地了解 SSL 在科学发现方面的潜力。
{"title":"Large-Scale Pretraining and Finetuning for Efficient Jet Classification in Particle Physics","authors":"Zihan Zhao, Farouk Mokhtar, Raghav Kansal, Haoyang Li, Javier Duarte","doi":"arxiv-2408.09343","DOIUrl":"https://doi.org/arxiv-2408.09343","url":null,"abstract":"This study introduces an innovative approach to analyzing unlabeled data in\u0000high-energy physics (HEP) through the application of self-supervised learning\u0000(SSL). Faced with the increasing computational cost of producing high-quality\u0000labeled simulation samples at the CERN LHC, we propose leveraging large volumes\u0000of unlabeled data to overcome the limitations of supervised learning methods,\u0000which heavily rely on detailed labeled simulations. By pretraining models on\u0000these vast, mostly untapped datasets, we aim to learn generic representations\u0000that can be finetuned with smaller quantities of labeled data. Our methodology\u0000employs contrastive learning with augmentations on jet datasets to teach the\u0000model to recognize common representations of jets, addressing the unique\u0000challenges of LHC physics. Building on the groundwork laid by previous studies,\u0000our work demonstrates the critical ability of SSL to utilize large-scale\u0000unlabeled data effectively. We showcase the scalability and effectiveness of\u0000our models by gradually increasing the size of the pretraining dataset and\u0000assessing the resultant performance enhancements. Our results, obtained from\u0000experiments on two datasets -- JetClass, representing unlabeled data, and Top\u0000Tagging, serving as labeled simulation data -- show significant improvements in\u0000data efficiency, computational efficiency, and overall performance. These\u0000findings suggest that SSL can greatly enhance the adaptability of ML models to\u0000the HEP domain. This work opens new avenues for the use of unlabeled data in\u0000HEP and contributes to a better understanding the potential of SSL for\u0000scientific discovery.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"47 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Euler Characteristic Surfaces: A Stable Multiscale Topological Summary of Time Series Data 欧拉特征曲面:时间序列数据的稳定多尺度拓扑总结
Pub Date : 2024-08-18 DOI: arxiv-2408.09400
Anamika Roy, Atish J. Mitra, Tapati Dutta
We present Euler Characteristic Surfaces as a multiscale spatiotemporaltopological summary of time series data encapsulating the topology of thesystem at different time instants and length scales. Euler CharacteristicSurfaces with an appropriate metric is used to quantify stability and locatecritical changes in a dynamical system with respect to variations in aparameter, while being substantially computationally cheaper than availablealternate methods such as persistent homology. The stability of theconstruction is demonstrated by a quantitative comparison bound with persistenthomology, and a quantitative stability bound under small changes in time isestablished. The proposed construction is used to analyze two different kindsof simulated disordered flow situations.
我们提出的欧拉特征曲面是时间序列数据的多尺度时空拓扑总结,囊括了系统在不同时间瞬间和长度尺度上的拓扑结构。欧拉特征曲面与适当的度量被用来量化动力学系统的稳定性和定位临界变化,与持久同调等现有替代方法相比,计算成本大大降低。通过与持久同源性的定量比较约束证明了该构造的稳定性,并建立了时间微小变化下的定量稳定性约束。提出的构造被用于分析两种不同的模拟无序流情况。
{"title":"Euler Characteristic Surfaces: A Stable Multiscale Topological Summary of Time Series Data","authors":"Anamika Roy, Atish J. Mitra, Tapati Dutta","doi":"arxiv-2408.09400","DOIUrl":"https://doi.org/arxiv-2408.09400","url":null,"abstract":"We present Euler Characteristic Surfaces as a multiscale spatiotemporal\u0000topological summary of time series data encapsulating the topology of the\u0000system at different time instants and length scales. Euler Characteristic\u0000Surfaces with an appropriate metric is used to quantify stability and locate\u0000critical changes in a dynamical system with respect to variations in a\u0000parameter, while being substantially computationally cheaper than available\u0000alternate methods such as persistent homology. The stability of the\u0000construction is demonstrated by a quantitative comparison bound with persistent\u0000homology, and a quantitative stability bound under small changes in time is\u0000established. The proposed construction is used to analyze two different kinds\u0000of simulated disordered flow situations.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"110 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance NEAR:机器学习模型性能的免训练预估器
Pub Date : 2024-08-16 DOI: arxiv-2408.08776
Raphael T. Husistein, Markus Reiher, Marco Eckhoff
Artificial neural networks have been shown to be state-of-the-art machinelearning models in a wide variety of applications, including natural languageprocessing and image recognition. However, building a performant neural networkis a laborious task and requires substantial computing power. NeuralArchitecture Search (NAS) addresses this issue by an automatic selection of theoptimal network from a set of potential candidates. While many NAS methodsstill require training of (some) neural networks, zero-cost proxies promise toidentify the optimal network without training. In this work, we propose thezero-cost proxy Network Expressivity by Activation Rank (NEAR). It is based onthe effective rank of the pre- and post-activation matrix, i.e., the values ofa neural network layer before and after applying its activation function. Wedemonstrate the cutting-edge correlation between this network score and themodel accuracy on NAS-Bench-101 and NATS-Bench-SSS/TSS. In addition, we presenta simple approach to estimate the optimal layer sizes in multi-layerperceptrons. Furthermore, we show that this score can be utilized to selecthyperparameters such as the activation function and the neural network weightinitialization scheme.
在自然语言处理和图像识别等多种应用中,人工神经网络已被证明是最先进的机器学习模型。然而,构建一个性能良好的神经网络是一项艰巨的任务,需要强大的计算能力。神经架构搜索(NAS)通过从一组潜在候选网络中自动选择最佳网络来解决这一问题。虽然许多 NAS 方法仍需要对(某些)神经网络进行训练,但零成本代理有望在无需训练的情况下识别出最佳网络。在这项工作中,我们提出了零成本代理 "激活等级网络表达性"(NEAR)。它基于激活前和激活后矩阵的有效秩,即神经网络层在应用激活函数前后的值。我们在 NAS-Bench-101 和 NATS-Bench-SSS/TSS 上展示了该网络得分与模型准确性之间的尖端相关性。此外,我们还介绍了一种估算多层感知器最佳层大小的简单方法。此外,我们还展示了可以利用这一分数来选择激活函数和神经网络权重初始化方案等参数。
{"title":"NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance","authors":"Raphael T. Husistein, Markus Reiher, Marco Eckhoff","doi":"arxiv-2408.08776","DOIUrl":"https://doi.org/arxiv-2408.08776","url":null,"abstract":"Artificial neural networks have been shown to be state-of-the-art machine\u0000learning models in a wide variety of applications, including natural language\u0000processing and image recognition. However, building a performant neural network\u0000is a laborious task and requires substantial computing power. Neural\u0000Architecture Search (NAS) addresses this issue by an automatic selection of the\u0000optimal network from a set of potential candidates. While many NAS methods\u0000still require training of (some) neural networks, zero-cost proxies promise to\u0000identify the optimal network without training. In this work, we propose the\u0000zero-cost proxy Network Expressivity by Activation Rank (NEAR). It is based on\u0000the effective rank of the pre- and post-activation matrix, i.e., the values of\u0000a neural network layer before and after applying its activation function. We\u0000demonstrate the cutting-edge correlation between this network score and the\u0000model accuracy on NAS-Bench-101 and NATS-Bench-SSS/TSS. In addition, we present\u0000a simple approach to estimate the optimal layer sizes in multi-layer\u0000perceptrons. Furthermore, we show that this score can be utilized to select\u0000hyperparameters such as the activation function and the neural network weight\u0000initialization scheme.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - PHYS - Data Analysis, Statistics and Probability
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1