首页 > 最新文献

BenchCouncil Transactions on Benchmarks, Standards and Evaluations最新文献

英文 中文
A BenchCouncil view on benchmarking emerging and future computing 基准委员会对新兴和未来计算基准的看法
Pub Date : 2022-04-01 DOI: 10.1016/j.tbench.2022.100064
Jianfeng Zhan

The measurable properties of the artifacts or objects in the computer, management, or finance disciplines are extrinsic, not inherent — dependent on their problem definitions and solution instantiations. The processes of problem definition, solution instantiation, and measurement are entangled. Only after the instantiation can the solutions to the problem be measured. Definition, instantiation, and measurement have complex mutual influences. Meanwhile, the technology inertia brings instantiation bias — trapped into a subspace or even a point at a high-dimension solution space. These daunting challenges, which emerging computing aggravates, make metrology cannot work for benchmark communities. It is pressing to establish independent benchmark science and engineering.

This article presents a unifying benchmark definition, a conceptual framework, and a traceable and supervised learning-based benchmarking methodology, laying the foundation for benchmark science and engineering. I also discuss BenchCouncil’s plans for emerging and future computing. The ongoing projects include defining the challenges of intelligence, instinct, quantum computers, Metaverse, planet-scale computers, and reformulating data centers, artificial intelligence for science, and CPU benchmark suites. Also, BenchCouncil will collaborate with ComputerCouncil on open-source computer systems for planet-scale computing, AI for science systems, and Metaverse.

计算机、管理或金融学科中的工件或对象的可测量属性是外在的,而不是内在的——依赖于它们的问题定义和解决方案实例。问题定义、解决方案实例化和度量的过程是相互纠缠的。只有在实例化之后,才能衡量问题的解决方案。定义、实例化和度量具有复杂的相互影响。同时,技术惯性带来了实例化偏差——被困在一个子空间甚至高维解空间中的一个点上。新兴计算加剧了这些令人生畏的挑战,使得计量学无法适用于基准社区。建立独立的科学与工程标杆迫在眉睫。本文提出了一个统一的基准定义,一个概念框架,以及一个可跟踪和监督的基于学习的基准测试方法,为基准科学和工程奠定了基础。我还讨论了BenchCouncil对新兴和未来计算的计划。正在进行的项目包括定义智能、本能、量子计算机、元宇宙、行星级计算机的挑战,以及重新制定数据中心、科学人工智能和CPU基准套件。此外,BenchCouncil将与ComputerCouncil合作,开发用于行星规模计算的开源计算机系统、用于科学系统的人工智能和元宇宙。
{"title":"A BenchCouncil view on benchmarking emerging and future computing","authors":"Jianfeng Zhan","doi":"10.1016/j.tbench.2022.100064","DOIUrl":"https://doi.org/10.1016/j.tbench.2022.100064","url":null,"abstract":"<div><p>The measurable properties of the artifacts or objects in the computer, management, or finance disciplines are extrinsic, not inherent — dependent on their problem definitions and solution instantiations. The processes of problem definition, solution instantiation, and measurement are entangled. Only after the instantiation can the solutions to the problem be measured. Definition, instantiation, and measurement have complex mutual influences. Meanwhile, the technology inertia brings instantiation bias — trapped into a subspace or even a point at a high-dimension solution space. These daunting challenges, which emerging computing aggravates, make metrology cannot work for benchmark communities. It is pressing to establish independent benchmark science and engineering.</p><p>This article presents a unifying benchmark definition, a conceptual framework, and a traceable and supervised learning-based benchmarking methodology, laying the foundation for benchmark science and engineering. I also discuss BenchCouncil’s plans for emerging and future computing. The ongoing projects include defining the challenges of intelligence, instinct, quantum computers, Metaverse, planet-scale computers, and reformulating data centers, artificial intelligence for science, and CPU benchmark suites. Also, BenchCouncil will collaborate with ComputerCouncil on open-source computer systems for planet-scale computing, AI for science systems, and Metaverse.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 2","pages":"Article 100064"},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000515/pdfft?md5=e08bdc20e367ab431cafc4a66c0be3d8&pid=1-s2.0-S2772485922000515-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137281306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SAIBench: Benchmarking AI for Science SAIBench:为科学测试人工智能
Pub Date : 2022-04-01 DOI: 10.1016/j.tbench.2022.100063
Yatao Li , Jianfeng Zhan

Scientific research communities are embracing AI-based solutions to target tractable scientific tasks and improve research work flows. However, the development and evaluation of such solutions are scattered across multiple disciplines. We formalize the problem of scientific AI benchmarking, and propose a system called SAIBench in the hope of unifying the efforts and enabling low-friction on-boarding of new disciplines. The system approaches this goal with SAIL, a domain-specific language to decouple research problems, AI models, ranking criteria, and software/hardware configuration into reusable modules. We show that this approach is flexible and can adapt to problems, AI models, and evaluation methods defined in different perspectives. The project homepage is https://www.computercouncil.org/SAIBench.

科学研究界正在采用基于人工智能的解决方案来针对可处理的科学任务并改善研究工作流程。然而,这些解决方案的开发和评估分散在多个学科中。我们将科学的人工智能基准问题形式化,并提出了一个名为SAIBench的系统,希望能够统一努力并实现新学科的低摩擦入职。该系统通过SAIL实现了这一目标,SAIL是一种领域特定的语言,可以将研究问题、人工智能模型、排名标准和软件/硬件配置解耦到可重用的模块中。我们表明,这种方法是灵活的,可以适应不同角度定义的问题、人工智能模型和评估方法。项目主页是https://www.computercouncil.org/SAIBench。
{"title":"SAIBench: Benchmarking AI for Science","authors":"Yatao Li ,&nbsp;Jianfeng Zhan","doi":"10.1016/j.tbench.2022.100063","DOIUrl":"10.1016/j.tbench.2022.100063","url":null,"abstract":"<div><p>Scientific research communities are embracing AI-based solutions to target tractable scientific tasks and improve research work flows. However, the development and evaluation of such solutions are scattered across multiple disciplines. We formalize the problem of scientific AI benchmarking, and propose a system called SAIBench in the hope of unifying the efforts and enabling low-friction on-boarding of new disciplines. The system approaches this goal with <em>SAIL</em>, a domain-specific language to decouple research problems, AI models, ranking criteria, and software/hardware configuration into reusable modules. We show that this approach is flexible and can adapt to problems, AI models, and evaluation methods defined in different perspectives. The project homepage is <span>https://www.computercouncil.org/SAIBench</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 2","pages":"Article 100063"},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000503/pdfft?md5=505b11231536e6de9f0ebf9c8f5747d2&pid=1-s2.0-S2772485922000503-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77244854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Performance and energy consumption tradeoff in server consolidation 服务器整合中的性能和能耗权衡
Pub Date : 2022-04-01 DOI: 10.1016/j.tbench.2022.100060
Belen Bermejo, Carlos Juiz

Server consolidation is one of the techniques used to increase energy efficiency in datacentres. Nevertheless, the server consolidation has an inherent trade-off between performance degradation and energy consumption which has to be quantified to be managed. In this paper, the CiS2 index is proposed to quantify the mentioned trade-off. We validated de use of the CiS2 index through real experimentation. Also, these observations lead us to propose the second contribution, which focuses on the consolidation overhead. We proposed a general method to quantify this overhead and be able to manage its effect on performance degradation. To sum up, this paper improved the management of energy efficiency in datacentres’ servers through the CiS2 index and the server consolidation determination method.

服务器整合是用于提高数据中心能源效率的技术之一。然而,服务器整合在性能下降和能源消耗之间存在固有的权衡,必须对其进行量化以进行管理。本文提出了CiS2指数来量化上述权衡。我们通过实际实验验证了CiS2索引的使用。此外,这些观察结果使我们提出了第二个贡献,它关注于合并开销。我们提出了一种通用的方法来量化这种开销,并能够管理其对性能下降的影响。综上所述,本文通过CiS2索引和服务器整合确定方法改进了数据中心服务器的能效管理。
{"title":"Performance and energy consumption tradeoff in server consolidation","authors":"Belen Bermejo,&nbsp;Carlos Juiz","doi":"10.1016/j.tbench.2022.100060","DOIUrl":"10.1016/j.tbench.2022.100060","url":null,"abstract":"<div><p>Server consolidation is one of the techniques used to increase energy efficiency in datacentres. Nevertheless, the server consolidation has an inherent trade-off between performance degradation and energy consumption which has to be quantified to be managed. In this paper, the <span><math><mrow><mi>C</mi><mi>i</mi><msup><mrow><mi>S</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></math></span> index is proposed to quantify the mentioned trade-off. We validated de use of the <span><math><mrow><mi>C</mi><mi>i</mi><msup><mrow><mi>S</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></math></span> index through real experimentation. Also, these observations lead us to propose the second contribution, which focuses on the consolidation overhead. We proposed a general method to quantify this overhead and be able to manage its effect on performance degradation. To sum up, this paper improved the management of energy efficiency in datacentres’ servers through the <span><math><mrow><mi>C</mi><mi>i</mi><msup><mrow><mi>S</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></math></span> index and the server consolidation determination method.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 2","pages":"Article 100060"},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000473/pdfft?md5=249fa5d38cea71ee99e8472c64edf7af&pid=1-s2.0-S2772485922000473-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82476007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Asynchronous memory access unit for general purpose processors 通用处理器的异步存储器访问单元
Pub Date : 2022-04-01 DOI: 10.1016/j.tbench.2022.100061
Luming Wang, Xu Zhang, Tianyue Lu, Mingyu Chen

In future data centers, applications will make heavy use of far memory (including disaggregated memory pools and NVM). The access latency of far memory is more widely distributed than that of local memory accesses. This makes the efficiency of traditional out-of-order load/store mechanism in most general-purpose processors decrease in this scenario. Therefore, this work proposes an in-core asynchronous memory access unit to fully utilize the far memory resources.

在未来的数据中心中,应用程序将大量使用远端内存(包括分解内存池和NVM)。远端内存的访问延迟比本地内存的访问延迟分布更广。在这种情况下,这使得大多数通用处理器中传统的乱序加载/存储机制的效率降低。因此,本文提出了一种核内异步内存访问单元,以充分利用远端内存资源。
{"title":"Asynchronous memory access unit for general purpose processors","authors":"Luming Wang,&nbsp;Xu Zhang,&nbsp;Tianyue Lu,&nbsp;Mingyu Chen","doi":"10.1016/j.tbench.2022.100061","DOIUrl":"10.1016/j.tbench.2022.100061","url":null,"abstract":"<div><p>In future data centers, applications will make heavy use of far memory (including disaggregated memory pools and NVM). The access latency of far memory is more widely distributed than that of local memory accesses. This makes the efficiency of traditional out-of-order load/store mechanism in most general-purpose processors decrease in this scenario. Therefore, this work proposes an in-core asynchronous memory access unit to fully utilize the far memory resources.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 2","pages":"Article 100061"},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000485/pdfft?md5=54d32533bcc872f110f985167a950308&pid=1-s2.0-S2772485922000485-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79522755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An efficient encrypted deduplication scheme with security-enhanced proof of ownership in edge computing 一种有效的加密重复数据删除方案,在边缘计算中具有安全增强的所有权证明
Pub Date : 2022-04-01 DOI: 10.1016/j.tbench.2022.100062
Yukun Zhou , Zhibin Yu , Liang Gu , Dan Feng

With the rapid expansion of Internet of Things (IoT), relevant files are stored and transmitted at the network edge by employing data deduplication to eliminate redundant data for the best accessibility. Although deduplication improves storage and network efficiency, it decreases security strength and performance. Existing schemes usually adopt message-locked encryption (MLE) to encrypt data, which is vulnerable to brute-force attacks. Meanwhile, these schemes utilize proof-of-ownership (PoW) to prevent duplicate-faking attacks, while they suffer from replay attacks or incur large computation overheads. This paper proposes SE-PoW, an efficient and location-aware hybrid encrypted deduplication scheme with a dual-level security-enhanced Proof-of-Ownership in edge computing. Specifically, SE-PoW firstly encrypts files with an inter-edge server-aided randomized convergent encryption (RCE) method and then protects blocks with an intra-edge edge-aided MLE method to balance security and system efficiency. To resist duplicate-faking attacks and replay attacks, SE-PoW performs the dual-level PoW algorithm. Then it combines the verification of a cuckoo filter and the homomorphism of algebraic signatures in sequence to enhance security and improve ownership checking efficiency. Security analysis demonstrates that SE-PoW ensures data security and resists the mentioned attacks. Evaluation results show that SE-PoW reduces up to 61.9% upload time overheads compared with the state-of-the-art schemes.

随着物联网的快速发展,相关文件在网络边缘存储和传输,通过重复数据删除技术消除冗余数据,以达到最佳的可访问性。重复数据删除虽然可以提高存储效率和网络效率,但会降低安全强度和性能。现有方案通常采用消息锁定加密(message-locked encryption, MLE)对数据进行加密,容易受到暴力攻击。同时,这些方案利用所有权证明(PoW)来防止重复伪造攻击,同时遭受重放攻击或产生大量计算开销。SE-PoW是一种高效、位置感知的混合加密重复数据删除方案,在边缘计算中具有双重安全增强的所有权证明。SE-PoW首先使用边缘间服务器辅助随机收敛加密(RCE)方法加密文件,然后使用边缘内边缘辅助MLE方法保护块,以平衡安全性和系统效率。为了防止重复伪造攻击和重放攻击,SE-PoW采用了双级PoW算法。然后将布谷鸟滤波器的验证与代数签名的同态序列验证相结合,增强了安全性,提高了所有权检查效率。安全性分析表明,SE-PoW能够保证数据安全,抵御上述攻击。评估结果表明,与最先进的方案相比,SE-PoW可减少61.9%的上传时间开销。
{"title":"An efficient encrypted deduplication scheme with security-enhanced proof of ownership in edge computing","authors":"Yukun Zhou ,&nbsp;Zhibin Yu ,&nbsp;Liang Gu ,&nbsp;Dan Feng","doi":"10.1016/j.tbench.2022.100062","DOIUrl":"10.1016/j.tbench.2022.100062","url":null,"abstract":"<div><p>With the rapid expansion of Internet of Things (IoT), relevant files are stored and transmitted at the network edge by employing data deduplication to eliminate redundant data for the best accessibility. Although deduplication improves storage and network efficiency, it decreases security strength and performance. Existing schemes usually adopt message-locked encryption (MLE) to encrypt data, which is vulnerable to brute-force attacks. Meanwhile, these schemes utilize proof-of-ownership (PoW) to prevent duplicate-faking attacks, while they suffer from replay attacks or incur large computation overheads. This paper proposes SE-PoW, an efficient and location-aware hybrid encrypted deduplication scheme with a dual-level security-enhanced Proof-of-Ownership in edge computing. Specifically, SE-PoW firstly encrypts files with an inter-edge server-aided randomized convergent encryption (RCE) method and then protects blocks with an intra-edge edge-aided MLE method to balance security and system efficiency. To resist duplicate-faking attacks and replay attacks, SE-PoW performs the dual-level PoW algorithm. Then it combines the verification of a cuckoo filter and the homomorphism of algebraic signatures in sequence to enhance security and improve ownership checking efficiency. Security analysis demonstrates that SE-PoW ensures data security and resists the mentioned attacks. Evaluation results show that SE-PoW reduces up to 61.9% upload time overheads compared with the state-of-the-art schemes.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 2","pages":"Article 100062"},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000497/pdfft?md5=6d431fd53173a00cc3005f03b1e16151&pid=1-s2.0-S2772485922000497-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75950489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Three laws of technology rise or fall 技术兴衰的三大法则
Pub Date : 2022-03-01 DOI: 10.1016/j.tbench.2022.100034
Jianfeng Zhan

Newton’s laws of motion perfectly explain or approximate physical phenomena in our everyday life. Are there any laws that explain or approximate technology’s rise or fall? After reviewing thirteen information technologies that succeeded, this article concludes three laws of technology and derives five corollaries to explain or approximate the rise or fall of technology. Three laws are the laws of technology inertia, technology change force, and technology action and reaction. Five corollaries are the corollaries of measurement of technology change force, technology breakthrough, technology monopoly, technology openness, and technology business opportunity. I present how to use the laws and the corollaries to analyze an emerging technology—the open-source RISC-V processor. Also, I elaborate on benchmarks’ role in applying those laws.

牛顿的运动定律完美地解释或近似于我们日常生活中的物理现象。有没有什么定律可以解释或近似地解释技术的兴衰?在回顾了13项成功的信息技术之后,本文总结了技术的三个定律,并推导出五个推论来解释或近似技术的兴衰。三大规律分别是技术惯性规律、技术变革力规律和技术作用与反作用规律。五个推论分别是技术变革力、技术突破、技术垄断、技术开放、技术商业机会计量的推论。我介绍了如何使用这些定律和推论来分析一项新兴技术——开源RISC-V处理器。此外,我还详细阐述了基准在适用这些法律方面的作用。
{"title":"Three laws of technology rise or fall","authors":"Jianfeng Zhan","doi":"10.1016/j.tbench.2022.100034","DOIUrl":"10.1016/j.tbench.2022.100034","url":null,"abstract":"<div><p>Newton’s laws of motion perfectly explain or approximate physical phenomena in our everyday life. Are there any laws that explain or approximate technology’s rise or fall? After reviewing thirteen information technologies that succeeded, this article concludes three laws of technology and derives five corollaries to explain or approximate the rise or fall of technology. Three laws are the laws of technology inertia, technology change force, and technology action and reaction. Five corollaries are the corollaries of measurement of technology change force, technology breakthrough, technology monopoly, technology openness, and technology business opportunity. I present how to use the laws and the corollaries to analyze an emerging technology—the open-source RISC-V processor. Also, I elaborate on benchmarks’ role in applying those laws.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 1","pages":"Article 100034"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000217/pdfft?md5=5c56aa807df0d773d317e45841d2b270&pid=1-s2.0-S2772485922000217-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80318518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Training, testing and benchmarking medical AI models using Clinical AIBench 使用临床AIBench培训,测试和对标医疗人工智能模型
Pub Date : 2022-03-01 DOI: 10.1016/j.tbench.2022.100037
Yunyou Huang , Xiuxia Miao , Ruchang Zhang , Li Ma , Wenjing Liu , Fan Zhang , Xianglong Guan , Xiaoshuang Liang , Xiangjiang Lu , Suqing Tang , Zhifei Zhang

AI technology has been used in many clinical research fields, but most AI technologies are difficult to land in real-world clinical settings. In most current clinical AI research settings, the diagnosis task is to identify different types of diseases among the given ones. However, the diagnosis in real-world settings needs dynamically developing inspection strategies based on the existing resources of medical institutions and identifying different kinds of diseases out of many possibilities. To promote the development of different clinical AI technologies and the implementation of clinical applications, we propose a benchmark named Clinical AIBench for developing, verifying, and evaluating clinical AI technologies in real-world clinical settings. Specifically, Clinical AIBench can be used for: (1) Model training and testing: Researchers can use the data to train and test their models. (2)Model evaluation: Researchers can use Clinical AIBench to objectively, fairly, and comparably evaluate various models of different researchers. (3) Clinical value evaluation: Researchers can use the clinical indicators provided by Clinical AIBench to evaluate the clinical value of models, which will be applied in real-world clinical settings. For convenience, Clinical AIBench provides three different levels of clinical settings: restricted clinical setting, which is named closed clinical setting, data island clinical setting, and real-world clinical setting, which is called open clinical setting. In addition, Clinical AIBench covers three diseases: Alzheimer’s disease, COVID-19, and dental. Clinical AIBench provides python APIs to researchers. The data and source code are publicly available from the project website https://www.benchcouncil.org/clinical_aibench/.

人工智能技术已经应用于许多临床研究领域,但大多数人工智能技术很难在现实世界的临床环境中落地。在目前大多数临床人工智能研究环境中,诊断任务是在给定的疾病中识别不同类型的疾病。然而,现实环境中的诊断需要基于医疗机构现有资源动态制定检查策略,并从多种可能性中识别不同类型的疾病。为了促进不同临床人工智能技术的发展和临床应用的实施,我们提出了一个名为临床AIBench的基准,用于在现实临床环境中开发、验证和评估临床人工智能技术。具体来说,临床AIBench可以用于:(1)模型训练和测试:研究人员可以使用数据来训练和测试他们的模型。(2)模型评价:研究人员可以使用Clinical AIBench对不同研究人员的各种模型进行客观、公正、可比性的评价。(3)临床价值评价:研究人员可以利用临床AIBench提供的临床指标对模型的临床价值进行评价,并将其应用于实际临床环境中。为方便起见,临床AIBench提供了三种不同层次的临床设置:限制性临床设置,称为封闭临床设置;数据孤岛临床设置;真实世界临床设置,称为开放临床设置。此外,Clinical AIBench还涵盖了三种疾病:阿尔茨海默病、COVID-19和牙科。临床AIBench为研究人员提供python api。数据和源代码可从项目网站https://www.benchcouncil.org/clinical_aibench/公开获取。
{"title":"Training, testing and benchmarking medical AI models using Clinical AIBench","authors":"Yunyou Huang ,&nbsp;Xiuxia Miao ,&nbsp;Ruchang Zhang ,&nbsp;Li Ma ,&nbsp;Wenjing Liu ,&nbsp;Fan Zhang ,&nbsp;Xianglong Guan ,&nbsp;Xiaoshuang Liang ,&nbsp;Xiangjiang Lu ,&nbsp;Suqing Tang ,&nbsp;Zhifei Zhang","doi":"10.1016/j.tbench.2022.100037","DOIUrl":"10.1016/j.tbench.2022.100037","url":null,"abstract":"<div><p>AI technology has been used in many clinical research fields, but most AI technologies are difficult to land in real-world clinical settings. In most current clinical AI research settings, the diagnosis task is to identify different types of diseases among the given ones. However, the diagnosis in real-world settings needs dynamically developing inspection strategies based on the existing resources of medical institutions and identifying different kinds of diseases out of many possibilities. To promote the development of different clinical AI technologies and the implementation of clinical applications, we propose a benchmark named Clinical AIBench for developing, verifying, and evaluating clinical AI technologies in real-world clinical settings. Specifically, Clinical AIBench can be used for: (1) Model training and testing: Researchers can use the data to train and test their models. (2)Model evaluation: Researchers can use Clinical AIBench to objectively, fairly, and comparably evaluate various models of different researchers. (3) Clinical value evaluation: Researchers can use the clinical indicators provided by Clinical AIBench to evaluate the clinical value of models, which will be applied in real-world clinical settings. For convenience, Clinical AIBench provides three different levels of clinical settings: restricted clinical setting, which is named closed clinical setting, data island clinical setting, and real-world clinical setting, which is called open clinical setting. In addition, Clinical AIBench covers three diseases: Alzheimer’s disease, COVID-19, and dental. Clinical AIBench provides python APIs to researchers. The data and source code are publicly available from the project website <span>https://www.benchcouncil.org/clinical_aibench/</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 1","pages":"Article 100037"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000242/pdfft?md5=20f33241cdf793f91b18a1c74673a127&pid=1-s2.0-S2772485922000242-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88872350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Combating disinformation on social media: A computational perspective 打击社交媒体上的虚假信息:一个计算的视角
Pub Date : 2022-03-01 DOI: 10.1016/j.tbench.2022.100035
Kai Shu

The use of social media has accelerated information sharing and instantaneous communications. The low barrier to enter social media enables more users to participate and makes them stay engaged longer, while incentivizing individuals with a hidden agenda to use disinformation to manipulate information and influence opinions. Disinformation, such as fake news, hoaxes, and conspiracy theories, has increasingly been weaponized to divide people and create detrimental societal effects. Therefore, it is imperative to understand disinformation and systematically investigate how we can improve resistance against it, taking into account the tension between the need for information and the need for security and protection against disinformation. In this survey, we look into the concepts, methods, and recent advancements of detecting disinformation from a computational perspective. We will also discuss open issues and future research directions for combating disinformation on social media.

社交媒体的使用加速了信息共享和即时通信。进入社交媒体的低门槛使更多的用户能够参与,并使他们保持更长时间的参与,同时激励有隐藏议程的个人使用虚假信息来操纵信息和影响意见。虚假信息,如假新闻、骗局和阴谋论,越来越多地被用来分裂人们并造成有害的社会影响。因此,必须了解虚假信息,并系统地研究我们如何提高对它的抵抗力,同时考虑到信息需求与安全需求之间的紧张关系,以及对虚假信息的保护。在这项调查中,我们从计算的角度研究了检测虚假信息的概念、方法和最新进展。我们还将讨论打击社交媒体虚假信息的开放性问题和未来的研究方向。
{"title":"Combating disinformation on social media: A computational perspective","authors":"Kai Shu","doi":"10.1016/j.tbench.2022.100035","DOIUrl":"10.1016/j.tbench.2022.100035","url":null,"abstract":"<div><p>The use of social media has accelerated information sharing and instantaneous communications. The low barrier to enter social media enables more users to participate and makes them stay engaged longer, while incentivizing individuals with a hidden agenda to use disinformation to manipulate information and influence opinions. Disinformation, such as fake news, hoaxes, and conspiracy theories, has increasingly been weaponized to divide people and create detrimental societal effects. Therefore, it is imperative to understand disinformation and systematically investigate how we can improve resistance against it, taking into account the tension between the need for information and the need for security and protection against disinformation. In this survey, we look into the concepts, methods, and recent advancements of detecting disinformation from a computational perspective. We will also discuss open issues and future research directions for combating disinformation on social media.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 1","pages":"Article 100035"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000229/pdfft?md5=276c9f039616b23c7d0aa7446e484c6b&pid=1-s2.0-S2772485922000229-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86488538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Challenges and recent advances in the design of real-time wireless Cyber-Physical Systems 实时无线信息物理系统设计的挑战和最新进展
Pub Date : 2022-03-01 DOI: 10.1016/j.tbench.2022.100036
Romain Jacob

Cyber-Physical Systems (CPS) refer to systems where some intelligence is embedded into devices that interact with their environment. Using wireless technology in such systems is desirable for better flexibility, improved maintainability, and cost reduction, among others. Moreover, CPS applications often specify deadlines; that is, maximal tolerable delays between the execution of distributed tasks. Systems that guarantee to meet such deadlines are called real-time systems. In the past few years, a technique known as synchronous transmissions (ST) has been shown to enable reliable and energy efficient communication, which is promising for the design of real-time wireless CPS.

We identify at least three issues that limit the adoption of ST in this domain: (i) ST is difficult to use due to stringent time synchronization requirements (in the order of μs). There is a lack of tools to facilitate the implementation of ST by CPS engineers, which are often not wireless communication experts. (ii) There are only few examples showcasing the use of ST for CPS applications and academic works based on ST tend to focus on communication rather than applications. Convincing proof-of-concept CPS applications are missing. (iii) The inherent variability of the wireless environment makes performance evaluation challenging. The lack of an agreed-upon methodology hinders experiment reproducibility and limits the confidence in the performance claims. This paper synthesizes recent advances what address these three problems, thereby enabling significant progress for future applications of low-power wireless technology in real-time CPS.

网络物理系统(CPS)是指将一些智能嵌入到与环境交互的设备中的系统。在这样的系统中使用无线技术是为了获得更好的灵活性、改进的可维护性和降低成本等。此外,CPS申请通常会指定截止日期;也就是说,分布式任务执行之间的最大可容忍延迟。保证在这样的期限内完成任务的系统被称为实时系统。在过去的几年里,一种被称为同步传输(ST)的技术已经被证明可以实现可靠和节能的通信,这对实时无线CPS的设计很有希望。我们发现至少有三个问题限制了ST在该领域的采用:(i)由于严格的时间同步要求(以μs为顺序),ST难以使用。CPS工程师通常不是无线通信专家,因此缺乏工具来促进ST的实施。(ii)在CPS应用中使用科技的例子很少,基于科技的学术作品往往侧重于交流而不是应用。缺乏令人信服的概念验证CPS应用程序。(iii)无线环境固有的可变性使性能评估具有挑战性。缺乏商定的方法阻碍了实验的可重复性,并限制了对性能声明的信心。本文综合了解决这三个问题的最新进展,从而为低功耗无线技术在实时CPS中的未来应用取得了重大进展。
{"title":"Challenges and recent advances in the design of real-time wireless Cyber-Physical Systems","authors":"Romain Jacob","doi":"10.1016/j.tbench.2022.100036","DOIUrl":"10.1016/j.tbench.2022.100036","url":null,"abstract":"<div><p>Cyber-Physical Systems (CPS) refer to systems where some intelligence is embedded into devices that interact with their environment. Using wireless technology in such systems is desirable for better flexibility, improved maintainability, and cost reduction, among others. Moreover, CPS applications often specify deadlines; that is, maximal tolerable delays between the execution of distributed tasks. Systems that guarantee to meet such deadlines are called real-time systems. In the past few years, a technique known as synchronous transmissions (ST) has been shown to enable reliable and energy efficient communication, which is promising for the design of real-time wireless CPS.</p><p>We identify at least three issues that limit the adoption of ST in this domain: (i) ST is difficult to use due to stringent time synchronization requirements (in the order of <span><math><mrow><mspace></mspace><mi>μ</mi><mtext>s</mtext></mrow></math></span>). There is a lack of tools to facilitate the implementation of ST by CPS engineers, which are often not wireless communication experts. (ii) There are only few examples showcasing the use of ST for CPS applications and academic works based on ST tend to focus on communication rather than applications. Convincing proof-of-concept CPS applications are missing. (iii) The inherent variability of the wireless environment makes performance evaluation challenging. The lack of an agreed-upon methodology hinders experiment reproducibility and limits the confidence in the performance claims. This paper synthesizes recent advances what address these three problems, thereby enabling significant progress for future applications of low-power wireless technology in real-time CPS.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 1","pages":"Article 100036"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000230/pdfft?md5=eb1b51cae3b646d955655faccd0430b8&pid=1-s2.0-S2772485922000230-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78885692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Are current benchmarks adequate to evaluate distributed transactional databases? 当前的基准测试是否足以评估分布式事务数据库?
Pub Date : 2022-03-01 DOI: 10.1016/j.tbench.2022.100031
Luyi Qu , Qingshuai Wang , Ting Chen , Keqiang Li , Rong Zhang , Xuan Zhou , Quanqing Xu , Zhifeng Yang , Chuanhui Yang , Weining Qian , Aoying Zhou

With the rapid development of distributed transactional databases in recent years, there is an urgent need for fair performance evaluation and comparison. Though there are various open-source benchmarks built for databases, it is lack of a comprehensive study about the applicability for distributed transactional databases. This paper presents a review of the state-of-art benchmarks with respect to distributed transactional databases. We first summarize the representative architectures of distributed transactional databases and then provide an overview about the chock points in distributed transactional databases. Then, we classify the classic transactional benchmarks based on their characteristics and design purposes. Finally, we review these benchmarks from schema and data definition, workload generation, and evaluation and metrics to check whether they are still applicable to distributed transactional databases with respect to the chock points. This paper exposes a potential research direction to motivate future benchmark designs in the area of distributed transactional databases.

随着近年来分布式事务数据库的快速发展,人们迫切需要对分布式事务数据库进行公平的性能评估和比较。虽然有各种各样的开源数据库基准,但缺乏对分布式事务数据库适用性的全面研究。本文介绍了关于分布式事务数据库的最新基准测试。我们首先总结了分布式事务数据库的代表性体系结构,然后概述了分布式事务数据库中的阻塞点。然后,我们根据它们的特征和设计目的对经典事务性基准进行分类。最后,我们从模式和数据定义、工作负载生成、评估和度量等方面回顾这些基准,以检查它们是否仍然适用于分布式事务数据库中的阻塞点。本文揭示了一个潜在的研究方向,以激励分布式事务数据库领域未来的基准设计。
{"title":"Are current benchmarks adequate to evaluate distributed transactional databases?","authors":"Luyi Qu ,&nbsp;Qingshuai Wang ,&nbsp;Ting Chen ,&nbsp;Keqiang Li ,&nbsp;Rong Zhang ,&nbsp;Xuan Zhou ,&nbsp;Quanqing Xu ,&nbsp;Zhifeng Yang ,&nbsp;Chuanhui Yang ,&nbsp;Weining Qian ,&nbsp;Aoying Zhou","doi":"10.1016/j.tbench.2022.100031","DOIUrl":"10.1016/j.tbench.2022.100031","url":null,"abstract":"<div><p>With the rapid development of distributed transactional databases in recent years, there is an urgent need for fair performance evaluation and comparison. Though there are various open-source benchmarks built for databases, it is lack of a comprehensive study about the applicability for distributed transactional databases. This paper presents a review of the state-of-art benchmarks with respect to distributed transactional databases. We first summarize the representative architectures of distributed transactional databases and then provide an overview about the chock points in distributed transactional databases. Then, we classify the classic transactional benchmarks based on their characteristics and design purposes. Finally, we review these benchmarks from schema and data definition, workload generation, and evaluation and metrics to check whether they are still applicable to distributed transactional databases with respect to the chock points. This paper exposes a potential research direction to motivate future benchmark designs in the area of distributed transactional databases.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"2 1","pages":"Article 100031"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485922000187/pdfft?md5=f4298cd3b83df8248ba96df30a0f7411&pid=1-s2.0-S2772485922000187-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91530405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
BenchCouncil Transactions on Benchmarks, Standards and Evaluations
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1