Modeling Performance of Data Collection Systems for High-Energy Physics

Wilkie Olin-Ammentorp, Xingfu Wu, Andrew A. Chien
{"title":"Modeling Performance of Data Collection Systems for High-Energy Physics","authors":"Wilkie Olin-Ammentorp, Xingfu Wu, Andrew A. Chien","doi":"arxiv-2407.00123","DOIUrl":null,"url":null,"abstract":"Exponential increases in scientific experimental data are outstripping the\nrate of progress in silicon technology. As a result, heterogeneous combinations\nof architectures and process or device technologies are increasingly important\nto meet the computing demands of future scientific experiments. However, the\ncomplexity of heterogeneous computing systems requires systematic modeling to\nunderstand performance. We present a model which addresses this need by framing key aspects of data\ncollection pipelines and constraints, and combines them with the important\nvectors of technology that shape alternatives, computing metrics that allow\ncomplex alternatives to be compared. For instance, a data collection pipeline\nmay be characterized by parameters such as sensor sampling rates, amount of\ndata collected, and the overall relevancy of retrieved samples. Alternatives to\nthis pipeline are enabled by hardware development vectors including advancing\nCMOS, GPUs, neuromorphic computing, and edge computing. By calculating metrics\nfor each alternative such as overall F1 score, power, hardware cost, and energy\nexpended per relevant sample, this model allows alternate data collection\nsystems to be rigorously compared. To demonstrate this model's capability, we apply it to the CMS experiment\n(and planned HL-LHC upgrade) to evaluate and compare the application of novel\ntechnologies in the data acquisition system (DAQ). We demonstrate that\nimprovements to early stages in the DAQ are highly beneficial, greatly reducing\nthe resources required at later stages of processing (such as a 60% power\nreduction) and increasing the amount of relevant data retrieved from the\nexperiment per unit power (improving from 0.065 to 0.31 samples/kJ) However, we\npredict further advances will be required in order to meet overall power and\ncost constraints for the DAQ.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"26 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Data Analysis, Statistics and Probability","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.00123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Exponential increases in scientific experimental data are outstripping the rate of progress in silicon technology. As a result, heterogeneous combinations of architectures and process or device technologies are increasingly important to meet the computing demands of future scientific experiments. However, the complexity of heterogeneous computing systems requires systematic modeling to understand performance. We present a model which addresses this need by framing key aspects of data collection pipelines and constraints, and combines them with the important vectors of technology that shape alternatives, computing metrics that allow complex alternatives to be compared. For instance, a data collection pipeline may be characterized by parameters such as sensor sampling rates, amount of data collected, and the overall relevancy of retrieved samples. Alternatives to this pipeline are enabled by hardware development vectors including advancing CMOS, GPUs, neuromorphic computing, and edge computing. By calculating metrics for each alternative such as overall F1 score, power, hardware cost, and energy expended per relevant sample, this model allows alternate data collection systems to be rigorously compared. To demonstrate this model's capability, we apply it to the CMS experiment (and planned HL-LHC upgrade) to evaluate and compare the application of novel technologies in the data acquisition system (DAQ). We demonstrate that improvements to early stages in the DAQ are highly beneficial, greatly reducing the resources required at later stages of processing (such as a 60% power reduction) and increasing the amount of relevant data retrieved from the experiment per unit power (improving from 0.065 to 0.31 samples/kJ) However, we predict further advances will be required in order to meet overall power and cost constraints for the DAQ.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
高能物理数据采集系统性能建模
科学实验数据的指数级增长超过了硅技术的进步速度。因此,为了满足未来科学实验的计算需求,架构与工艺或器件技术的异构组合变得越来越重要。然而,异构计算系统的复杂性需要系统建模来了解其性能。为了满足这一需求,我们提出了一个模型,该模型将数据收集管道和约束条件的关键方面框定出来,并将它们与影响替代方案的重要技术向量、计算指标结合起来,以便对复杂的替代方案进行比较。例如,数据收集管道可以通过传感器采样率、收集的数据量和检索样本的总体相关性等参数来表征。该流水线的替代方案由硬件开发矢量支持,包括不断进步的CMOS、GPU、神经形态计算和边缘计算。通过计算每个替代方案的指标,如总体 F1 分数、功率、硬件成本和每个相关样本的能源消耗,该模型允许对替代数据收集系统进行严格比较。为了证明该模型的能力,我们将其应用于 CMS 实验(以及计划中的 HL-LHC 升级),以评估和比较数据采集系统(DAQ)中新型技术的应用。我们证明,对数据采集系统早期阶段的改进是非常有益的,大大减少了后期处理阶段所需的资源(例如降低了 60% 的功率),并提高了单位功率从实验中检索到的相关数据量(从 0.065 样本/千焦提高到 0.31 样本/千焦),但是,我们预测还需要进一步的改进才能满足数据采集系统的总体功率和成本限制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
PASS: An Asynchronous Probabilistic Processor for Next Generation Intelligence Astrometric Binary Classification Via Artificial Neural Networks XENONnT Analysis: Signal Reconstruction, Calibration and Event Selection Converting sWeights to Probabilities with Density Ratios Challenges and perspectives in recurrence analyses of event time series
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1