首页 > 最新文献

BenchCouncil Transactions on Benchmarks, Standards and Evaluations最新文献

英文 中文
Unlocking the opportunities through ChatGPT Tool towards ameliorating the education system 通过ChatGPT工具释放改善教育系统的机会
Pub Date : 2023-06-01 DOI: 10.1016/j.tbench.2023.100115
Mohd Javaid , Abid Haleem , Ravi Pratap Singh , Shahbaz Khan , Ibrahim Haleem Khan

Artificial Intelligence (AI)-based ChatGPT developed by OpenAI is now widely accepted in several fields, including education. Students can learn about ideas and theories by using this technology while generating content with it. ChatGPT is built on State of the Art (SOA), like Deep Learning (DL), Natural Language Processing (NLP), and Machine Learning (ML), an extrapolation of a class of ML-NLP models known as Large Language Model (LLMs). It may be used to automate test and assignment grading, giving instructors more time to concentrate on instruction. This technology can be utilised to customise learning for kids, enabling them to focus more intently on the subject matter and critical thinking ChatGPT is an excellent tool for language lessons since it can translate text from one language to another. It may provide lists of vocabulary terms and meanings, assisting students in developing their language proficiency with resources. Personalised learning opportunities are one of ChatGPT’s significant applications in the classroom. This might include creating educational resources and content tailored to a student’s unique interests, skills, and learning goals. This paper discusses the need for ChatGPT and the significant features of ChatGPT in the education system. Further, it identifies and discusses the significant applications of ChatGPT in education. Using ChatGPT, educators may design lessons and instructional materials specific to each student’s requirements and skills based on current trends. Students may work at their speed and concentrate on the areas where they need the most support, resulting in a more effective and efficient learning environment. Both instructors and students may profit significantly from using ChatGPT in the classroom. Instructors may save time on numerous duties by using this technology. In future, ChatGPT will become a powerful tool for enhancing students’ and teachers’ experience.

由OpenAI开发的基于人工智能(AI)的ChatGPT现在在包括教育在内的多个领域被广泛接受。学生可以使用这项技术学习思想和理论,同时使用它生成内容。ChatGPT建立在最先进的(SOA)之上,如深度学习(DL)、自然语言处理(NLP)和机器学习(ML),这是一类ML-NLP模型(LLM)的外推。它可以用于自动化测试和作业评分,让教师有更多的时间专注于教学。这项技术可以用于为孩子们定制学习,使他们能够更加专注于主题和批判性思维。ChatGPT是语言课程的一个优秀工具,因为它可以将文本从一种语言翻译成另一种语言。它可以提供词汇术语和含义的列表,帮助学生利用资源发展他们的语言能力。个性化学习机会是ChatGPT在课堂上的重要应用之一。这可能包括创建适合学生独特兴趣、技能和学习目标的教育资源和内容。本文讨论了ChatGPT的必要性以及ChatGPT在教育系统中的显著特点。此外,它确定并讨论了ChatGPT在教育中的重要应用。使用ChatGPT,教育工作者可以根据当前趋势设计针对每个学生的要求和技能的课程和教学材料。学生可以按照自己的速度工作,专注于他们最需要支持的领域,从而创造一个更有效、更高效的学习环境。教师和学生都可能从课堂上使用ChatGPT中受益匪浅。教员可以通过使用这项技术来节省许多任务的时间。未来,ChatGPT将成为增强学生和教师体验的强大工具。
{"title":"Unlocking the opportunities through ChatGPT Tool towards ameliorating the education system","authors":"Mohd Javaid ,&nbsp;Abid Haleem ,&nbsp;Ravi Pratap Singh ,&nbsp;Shahbaz Khan ,&nbsp;Ibrahim Haleem Khan","doi":"10.1016/j.tbench.2023.100115","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100115","url":null,"abstract":"<div><p>Artificial Intelligence (AI)-based ChatGPT developed by OpenAI is now widely accepted in several fields, including education. Students can learn about ideas and theories by using this technology while generating content with it. ChatGPT is built on State of the Art (SOA), like Deep Learning (DL), Natural Language Processing (NLP), and Machine Learning (ML), an extrapolation of a class of ML-NLP models known as Large Language Model (LLMs). It may be used to automate test and assignment grading, giving instructors more time to concentrate on instruction. This technology can be utilised to customise learning for kids, enabling them to focus more intently on the subject matter and critical thinking ChatGPT is an excellent tool for language lessons since it can translate text from one language to another. It may provide lists of vocabulary terms and meanings, assisting students in developing their language proficiency with resources. Personalised learning opportunities are one of ChatGPT’s significant applications in the classroom. This might include creating educational resources and content tailored to a student’s unique interests, skills, and learning goals. This paper discusses the need for ChatGPT and the significant features of ChatGPT in the education system. Further, it identifies and discusses the significant applications of ChatGPT in education. Using ChatGPT, educators may design lessons and instructional materials specific to each student’s requirements and skills based on current trends. Students may work at their speed and concentrate on the areas where they need the most support, resulting in a more effective and efficient learning environment. Both instructors and students may profit significantly from using ChatGPT in the classroom. Instructors may save time on numerous duties by using this technology. In future, ChatGPT will become a powerful tool for enhancing students’ and teachers’ experience.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49716023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Benchmarking HTAP databases for performance isolation and real-time analytics 对HTAP数据库进行性能隔离和实时分析的基准测试
Pub Date : 2023-06-01 DOI: 10.1016/j.tbench.2023.100122
Guoxin Kang, Simin Chen, Hongxiao Li

Hybrid Transactional/Analytical Processing (HTAP) databases are designed to execute real-time analytics and provide performance isolation for online transactions and analytical queries. Real-time analytics emphasize analyzing the fresh data generated by online transactions. And performance isolation depicts the performance interference between concurrently executing online transactions and analytical queries. However, HTAP databases are extreme lack micro-benchmarks to accurately measure data freshness. Despite the abundance of HTAP databases and benchmarks, there needs to be more thorough research on the performance isolation and real-time analytics capabilities of HTAP databases. This paper focuses on the critical designs of mainstream HTAP databases and the state-of-the-art and state-of-the-practice HTAP benchmarks. First, we systematically introduce the advanced technologies adopted by HTAP databases for real-time analytics and performance isolation capabilities. Then, we summarize the pros and cons of the state-of-the-art and state-of-the-practice HTAP benchmarks. Next, we design and implement a micro-benchmark for HTAP databases, which can precisely control the rate of fresh data generation and the granularity of fresh data access. Finally, we devise experiments to evaluate the performance isolation and real-time analytics capabilities of the state-of-the-art HTAP database. In our continued pursuit of transparency and community collaboration, we will soon make available our comprehensive specifications, meticulously crafted source code, and significant results for public access at https://www.benchcouncil.org/mOLxPBench.

混合事务/分析处理(HTAP)数据库旨在执行实时分析,并为在线事务和分析查询提供性能隔离。实时分析强调分析在线交易产生的新鲜数据。性能隔离描述了并发执行的在线事务和分析查询之间的性能干扰。然而,HTAP数据库极度缺乏精确衡量数据新鲜度的微观基准。尽管有大量的HTAP数据库和基准测试,但需要对HTAP数据库的性能隔离和实时分析能力进行更彻底的研究。本文重点介绍了主流HTAP数据库的关键设计以及HTAP基准测试的最新技术和现状。首先,我们系统地介绍了HTAP数据库在实时分析和性能隔离功能方面采用的先进技术。然后,我们总结了最先进的HTAP基准的优缺点和实践状况。接下来,我们设计并实现了一个用于HTAP数据库的微基准测试,它可以精确地控制新数据生成的速率和新数据访问的粒度。最后,我们设计了实验来评估最先进的HTAP数据库的性能隔离和实时分析能力。在我们持续追求透明度和社区合作的过程中,我们将很快在https://www.benchcouncil.org/mOLxPBench.
{"title":"Benchmarking HTAP databases for performance isolation and real-time analytics","authors":"Guoxin Kang,&nbsp;Simin Chen,&nbsp;Hongxiao Li","doi":"10.1016/j.tbench.2023.100122","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100122","url":null,"abstract":"<div><p><strong>H</strong>ybrid <strong>T</strong>ransactional/<strong>A</strong>nalytical <strong>P</strong>rocessing (HTAP) databases are designed to execute real-time analytics and provide performance isolation for online transactions and analytical queries. Real-time analytics emphasize analyzing the fresh data generated by online transactions. And performance isolation depicts the performance interference between concurrently executing online transactions and analytical queries. However, HTAP databases are extreme lack micro-benchmarks to accurately measure data freshness. Despite the abundance of HTAP databases and benchmarks, there needs to be more thorough research on the performance isolation and real-time analytics capabilities of HTAP databases. This paper focuses on the critical designs of mainstream HTAP databases and the state-of-the-art and state-of-the-practice HTAP benchmarks. First, we systematically introduce the advanced technologies adopted by HTAP databases for real-time analytics and performance isolation capabilities. Then, we summarize the pros and cons of the state-of-the-art and state-of-the-practice HTAP benchmarks. Next, we design and implement a micro-benchmark for HTAP databases, which can precisely control the rate of fresh data generation and the granularity of fresh data access. Finally, we devise experiments to evaluate the performance isolation and real-time analytics capabilities of the state-of-the-art HTAP database. In our continued pursuit of transparency and community collaboration, we will soon make available our comprehensive specifications, meticulously crafted source code, and significant results for public access at <span>https://www.benchcouncil.org/mOLxPBench</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49715975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ChatGPT for healthcare services: An emerging stage for an innovative perspective 医疗保健服务的ChatGPT:创新视角的新兴阶段
Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100105
Mohd Javaid , Abid Haleem , Ravi Pratap Singh

Generative Pretrained Transformer, often known as GPT, is an innovative kind of Artificial Intelligence (AI) which can produce writing that seems to have been written by a person. OpenAI created this AI language model called ChatGPT. It is built using the GPT architecture and is trained on a large corpus of text data to respond to natural language inquiries that resemble a person’s requirements. This technology has lots of applications in healthcare. The need for accurate and current data is one of the major obstacles to adopting ChatGPT in healthcare. GPT must have access to precise and up-to-date medical data to provide trustworthy suggestions and treatment options. It might be accomplished by ensuring that the data used by GPT is received from reliable sources and that the data is updated regularly. Since sensitive medical information would be involved, it will also be crucial to consider privacy and security issues while utilising GPT in the healthcare industry. This paper briefs about ChatGPT and its need for healthcare, its significant Work Flow Dimensions and typical features of ChatGPT for the Healthcare domain. Finally, it identified and discussed significant applications of ChatGPT for healthcare. ChatGPT can comprehend the conversational context and provide contextually appropriate replies. Its effectiveness as a conversational AI tool makes it useful for chatbots, virtual assistants, and other applications. However, we see many limitations in medical ethics, data interpretation, accountability and other issues related to the privacy. Regarding specialised tasks like text creation, language translation, text categorisation, text summarisation, and creating conversation systems, ChatGPT has been pre-trained on a large corpus of text data, and somewhat satisfactory results can be expected. Moreover, it can also be utilised for various Natural Language Processing (NLP) activities, including sentiment analysis, part-of-speech tagging, and named entity identification.

生成预训练转换器(Generative Pretrained Transformer),通常被称为GPT,是一种创新的人工智能(AI),它可以产生似乎是由个人编写的文字。OpenAI创建了一个名为ChatGPT的人工智能语言模型。它是使用GPT架构构建的,并在大型文本数据语料库上进行训练,以响应类似于个人需求的自然语言查询。这项技术在医疗保健领域有很多应用。对准确和最新数据的需求是在医疗保健中采用ChatGPT的主要障碍之一。GPT必须能够访问精确和最新的医疗数据,以提供值得信赖的建议和治疗选择。这可以通过确保GPT使用的数据是从可靠的来源接收的,并定期更新数据来实现。由于涉及敏感的医疗信息,在医疗保健行业使用GPT时,考虑隐私和安全问题也至关重要。本文简要介绍了ChatGPT及其对医疗保健的需求、其重要的工作流程维度以及ChatGPT在医疗保健领域的典型特征。最后,它确定并讨论了ChatGPT在医疗保健方面的重要应用。ChatGPT可以理解会话上下文并提供上下文适当的回复。它作为一种对话式人工智能工具的有效性使其对聊天机器人、虚拟助理和其他应用程序非常有用。然而,我们看到在医学伦理、数据解释、问责制和其他与隐私相关的问题上存在许多局限性。关于文本创建、语言翻译、文本分类、文本总结和创建对话系统等专业任务,ChatGPT已经在大量文本数据上进行了预训练,预计会取得一些令人满意的结果。此外,它还可以用于各种自然语言处理(NLP)活动,包括情感分析、词性标记和命名实体识别。
{"title":"ChatGPT for healthcare services: An emerging stage for an innovative perspective","authors":"Mohd Javaid ,&nbsp;Abid Haleem ,&nbsp;Ravi Pratap Singh","doi":"10.1016/j.tbench.2023.100105","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100105","url":null,"abstract":"<div><p>Generative Pretrained Transformer, often known as GPT, is an innovative kind of Artificial Intelligence (AI) which can produce writing that seems to have been written by a person. OpenAI created this AI language model called ChatGPT. It is built using the GPT architecture and is trained on a large corpus of text data to respond to natural language inquiries that resemble a person’s requirements. This technology has lots of applications in healthcare. The need for accurate and current data is one of the major obstacles to adopting ChatGPT in healthcare. GPT must have access to precise and up-to-date medical data to provide trustworthy suggestions and treatment options. It might be accomplished by ensuring that the data used by GPT is received from reliable sources and that the data is updated regularly. Since sensitive medical information would be involved, it will also be crucial to consider privacy and security issues while utilising GPT in the healthcare industry. This paper briefs about ChatGPT and its need for healthcare, its significant Work Flow Dimensions and typical features of ChatGPT for the Healthcare domain. Finally, it identified and discussed significant applications of ChatGPT for healthcare. ChatGPT can comprehend the conversational context and provide contextually appropriate replies. Its effectiveness as a conversational AI tool makes it useful for chatbots, virtual assistants, and other applications. However, we see many limitations in medical ethics, data interpretation, accountability and other issues related to the privacy. Regarding specialised tasks like text creation, language translation, text categorisation, text summarisation, and creating conversation systems, ChatGPT has been pre-trained on a large corpus of text data, and somewhat satisfactory results can be expected. Moreover, it can also be utilised for various Natural Language Processing (NLP) activities, including sentiment analysis, part-of-speech tagging, and named entity identification.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49731345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
SNNBench: End-to-end AI-oriented spiking neural network benchmarking SNNBench:端到端面向ai的峰值神经网络基准测试
Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100108
Fei Tang, Wanling Gao

Spiking Neural Networks (SNNs) show great potential for solving Artificial Intelligence (AI) applications. At the preliminary stage of SNNs, benchmarks are essential for evaluating and optimizing SNN algorithms, software, and hardware toward AI scenarios. However, a majority of SNN benchmarks focus on evaluating SNN for brain science, which has distinct neural network architectures and targets. Even though there have several benchmarks evaluating SNN for AI, they only focus on a single stage of training and inference or a processing fragment of a whole stage without accuracy information. Thus, the existing SNN benchmarks lack an end-to-end perspective that not only covers both training and inference but also provides a whole training process to a target accuracy level.

This paper presents SNNBench—the first end-to-end AI-oriented SNN benchmark covering the processing stages of training and inference and containing the accuracy information. Focusing on two typical AI applications: image classification and speech recognition, we provide nine workloads that consider the typical characteristics of SNN, i.e., the dynamics of spiking neurons, and AI, i.e., learning paradigms including supervised and unsupervised learning, learning rules like backpropagation, connection types like fully connected, and accuracy. The evaluations of SNNBench on both CPU and GPU show its effectiveness. The specifications, source code, and results will be publicly available from https://www.benchcouncil.org/SNNBench.

Spiking神经网络在解决人工智能应用方面显示出巨大的潜力。在SNN的初步阶段,基准对于评估和优化面向人工智能场景的SNN算法、软件和硬件至关重要。然而,大多数SNN基准都侧重于评估脑科学的SNN,脑科学具有不同的神经网络架构和目标。尽管有几个评估人工智能SNN的基准,但它们只关注训练和推理的单个阶段或整个阶段的处理片段,而没有准确性信息。因此,现有的SNN基准缺乏端到端的视角,不仅涵盖了训练和推理,而且还提供了达到目标精度水平的整个训练过程。本文提出了SNNBench——第一个面向人工智能的端到端SNN基准,涵盖了训练和推理的处理阶段,并包含准确性信息。专注于两个典型的人工智能应用:图像分类和语音识别,我们提供了九种工作负载,这些工作负载考虑了SNN的典型特征,即尖峰神经元的动力学,以及人工智能,即学习范式,包括监督和非监督学习,学习规则,如反向传播,连接类型,如完全连接,以及准确性。SNNBench在CPU和GPU上的测试表明了它的有效性。规范、源代码和结果将在https://www.benchcouncil.org/SNNBench.
{"title":"SNNBench: End-to-end AI-oriented spiking neural network benchmarking","authors":"Fei Tang,&nbsp;Wanling Gao","doi":"10.1016/j.tbench.2023.100108","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100108","url":null,"abstract":"<div><p>Spiking Neural Networks (SNNs) show great potential for solving Artificial Intelligence (AI) applications. At the preliminary stage of SNNs, benchmarks are essential for evaluating and optimizing SNN algorithms, software, and hardware toward AI scenarios. However, a majority of SNN benchmarks focus on evaluating SNN for brain science, which has distinct neural network architectures and targets. Even though there have several benchmarks evaluating SNN for AI, they only focus on a single stage of training and inference or a processing fragment of a whole stage without accuracy information. Thus, the existing SNN benchmarks lack an end-to-end perspective that not only covers both training and inference but also provides a whole training process to a target accuracy level.</p><p>This paper presents SNNBench—the first end-to-end AI-oriented SNN benchmark covering the processing stages of training and inference and containing the accuracy information. Focusing on two typical AI applications: image classification and speech recognition, we provide nine workloads that consider the typical characteristics of SNN, i.e., the dynamics of spiking neurons, and AI, i.e., learning paradigms including supervised and unsupervised learning, learning rules like backpropagation, connection types like fully connected, and accuracy. The evaluations of SNNBench on both CPU and GPU show its effectiveness. The specifications, source code, and results will be publicly available from <span>https://www.benchcouncil.org/SNNBench</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49714571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
e₹—The digital currency in India: Challenges and prospects 印度的数字货币:挑战与前景
Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100107
Md. Asraful Haque, Mohd Shoaib

The Reserve Bank of India (RBI) has recently launched the country’s first pilot project for the digital currency known as the digital rupee or e-Rupee (e). The launch of the digital rupee represents a significant advancement in the “Digital India” revolution. It will be a fantastic opportunity for India since it might make conducting business easier while enhancing the security and resilience of the overall payments system. Digital currency attempts to rapidly progress monetary policy to disrupt physical money, lower the cost of financial transactions, and reshape how the money will circulate. Although the effects of digital currency cannot be foreseen, it is extremely important to thoroughly research digital currency and its effects on the operational stage. The development of a digital currency infrastructure has some challenges in terms of performance, scalability, and different usage scenarios. The article clarifies what e is. How does it work? What makes it different from cryptocurrencies? What are the major challenges and prospects for it in India?

印度储备银行(RBI)最近启动了该国首个数字货币试点项目,即数字卢比或电子卢比(e₹). 数字卢比的推出代表着“数字印度”革命的重大进展。这对印度来说将是一个绝佳的机会,因为它可能会让开展业务变得更容易,同时增强整个支付系统的安全性和弹性。数字货币试图迅速推进货币政策,扰乱实物货币,降低金融交易成本,重塑货币流通方式。尽管数字货币的影响是无法预见的,但深入研究数字货币及其在运营阶段的影响是极其重要的。数字货币基础设施的开发在性能、可扩展性和不同的使用场景方面存在一些挑战。这篇文章阐明了₹是。它是如何工作的?它与加密货币的区别是什么?它在印度面临的主要挑战和前景是什么?
{"title":"e₹—The digital currency in India: Challenges and prospects","authors":"Md. Asraful Haque,&nbsp;Mohd Shoaib","doi":"10.1016/j.tbench.2023.100107","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100107","url":null,"abstract":"<div><p>The Reserve Bank of India (RBI) has recently launched the country’s first pilot project for the digital currency known as the digital rupee or e-Rupee (e<sup>₹</sup>). The launch of the digital rupee represents a significant advancement in the “Digital India” revolution. It will be a fantastic opportunity for India since it might make conducting business easier while enhancing the security and resilience of the overall payments system. Digital currency attempts to rapidly progress monetary policy to disrupt physical money, lower the cost of financial transactions, and reshape how the money will circulate. Although the effects of digital currency cannot be foreseen, it is extremely important to thoroughly research digital currency and its effects on the operational stage. The development of a digital currency infrastructure has some challenges in terms of performance, scalability, and different usage scenarios. The article clarifies what e<sup>₹</sup> is. How does it work? What makes it different from cryptocurrencies? What are the major challenges and prospects for it in India?</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49731127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bench 2023 Calls For Papers 2023号法官席要求提交文件
Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100116
{"title":"Bench 2023 Calls For Papers","authors":"","doi":"10.1016/j.tbench.2023.100116","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100116","url":null,"abstract":"","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49714853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ERMDS: A obfuscation dataset for evaluating robustness of learning-based malware detection system 一个用于评估基于学习的恶意软件检测系统鲁棒性的混淆数据集
Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100106
Lichen Jia , Yang Yang , Bowen Tang , Zihan Jiang

Learning-based malware detection systems (LB-MDS) play a crucial role in defending computer systems from malicious attacks. Nevertheless, these systems can be vulnerable to various attacks, which can have significant consequences. Software obfuscation techniques can be used to modify the features of malware, thereby avoiding its classification as malicious by LB-MDS. However, existing portable executable (PE) malware datasets primarily use a single obfuscation technique, which LB-MDS has already learned, leading to a loss of their robustness evaluation ability. Therefore, creating a dataset with diverse features that were not observed during LB-MDS training has become the main challenge in evaluating the robustness of LB-MDS.

We propose a obfuscation dataset ERMDS that solves the problem of evaluating the robustness of LB-MDS by generating malwares with diverse features. When designing this dataset, we created three types of obfuscation spaces, corresponding to binary obfuscation, source code obfuscation, and packing obfuscation. Each obfuscation space has multiple obfuscation techniques, each with different parameters. The obfuscation techniques in these three obfuscation spaces can be used in combination and can be reused. This enables us to theoretically obtain an infinite number of obfuscation combinations, thereby creating malwares with a diverse range of features that have not been captured by LB-MDS.

To assess the effectiveness of the ERMDS obfuscation dataset, we create an instance of the obfuscation dataset called ERMDS-X. By utilizing this dataset, we conducted an evaluation of the robustness of two LB-MDS models, namely MalConv and EMBER, as well as six commercial antivirus software products, which are anonymized as AV1-AV6. The results of our experiments showed that ERMDS-X effectively reveals the limitations in the robustness of existing LB-MDS models, leading to an average accuracy reduction of 20% in LB-MDS and 32% in commercial antivirus software. We conducted a comprehensive analysis of the factors that contributed to the observed accuracy decline in both LB-MDS and commercial antivirus software. We have released the ERMDS-X dataset as an open-source resource, available on GitHub at https://github.com/lcjia94/ERMDS.

基于学习的恶意软件检测系统(LB-MDS)在保护计算机系统免受恶意攻击方面发挥着至关重要的作用。然而,这些系统可能容易受到各种攻击,从而产生重大后果。软件混淆技术可用于修改恶意软件的特征,从而避免其被LB-MDS归类为恶意软件。然而,现有的可移植可执行(PE)恶意软件数据集主要使用LB-MDS已经学会的单一模糊技术,导致其鲁棒性评估能力的丧失。因此,创建一个具有在LB-MDS训练过程中没有观察到的各种特征的数据集已成为评估LB-MDS鲁棒性的主要挑战。我们提出了一个模糊数据集ERMDS,该数据集通过生成具有各种特征的恶意软件来解决评估LB-MDS-鲁棒性的问题。在设计该数据集时,我们创建了三种类型的模糊空间,分别对应于二进制模糊、源代码模糊和打包模糊。每个模糊处理空间都有多种模糊处理技术,每种技术都有不同的参数。这三个模糊空间中的模糊技术可以组合使用,并且可以重用。这使我们能够在理论上获得无限数量的模糊组合,从而创建具有LB-MDS尚未捕获的各种功能的恶意软件。为了评估ERMDS模糊数据集的有效性,我们创建了一个名为ERMDS-X的模糊数据集实例。通过利用该数据集,我们对两个LB-MDS模型(即MalConv和EMBR)以及六个商业杀毒软件产品(匿名为AV1-AV6)的稳健性进行了评估。我们的实验结果表明,ERMDS-X有效地揭示了现有LB-MDS模型稳健性的局限性,导致LB-MDS的平均准确率降低了20%,商业反病毒软件的平均准确度降低了32%。我们对LB-MDS和商业杀毒软件中导致观察到的准确性下降的因素进行了全面分析。我们已经发布了ERMDS-X数据集作为开源资源,可在GitHub上获得,网址为https://github.com/lcjia94/ERMDS.
{"title":"ERMDS: A obfuscation dataset for evaluating robustness of learning-based malware detection system","authors":"Lichen Jia ,&nbsp;Yang Yang ,&nbsp;Bowen Tang ,&nbsp;Zihan Jiang","doi":"10.1016/j.tbench.2023.100106","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100106","url":null,"abstract":"<div><p>Learning-based malware detection systems (LB-MDS) play a crucial role in defending computer systems from malicious attacks. Nevertheless, these systems can be vulnerable to various attacks, which can have significant consequences. Software obfuscation techniques can be used to modify the features of malware, thereby avoiding its classification as malicious by LB-MDS. However, existing portable executable (PE) malware datasets primarily use a single obfuscation technique, which LB-MDS has already learned, leading to a loss of their robustness evaluation ability. Therefore, creating a dataset with diverse features that were not observed during LB-MDS training has become the main challenge in evaluating the robustness of LB-MDS.</p><p>We propose a obfuscation dataset ERMDS that solves the problem of evaluating the robustness of LB-MDS by generating malwares with diverse features. When designing this dataset, we created three types of obfuscation spaces, corresponding to binary obfuscation, source code obfuscation, and packing obfuscation. Each obfuscation space has multiple obfuscation techniques, each with different parameters. The obfuscation techniques in these three obfuscation spaces can be used in combination and can be reused. This enables us to theoretically obtain an infinite number of obfuscation combinations, thereby creating malwares with a diverse range of features that have not been captured by LB-MDS.</p><p>To assess the effectiveness of the ERMDS obfuscation dataset, we create an instance of the obfuscation dataset called ERMDS-X. By utilizing this dataset, we conducted an evaluation of the robustness of two LB-MDS models, namely MalConv and EMBER, as well as six commercial antivirus software products, which are anonymized as AV1-AV6. The results of our experiments showed that ERMDS-X effectively reveals the limitations in the robustness of existing LB-MDS models, leading to an average accuracy reduction of 20% in LB-MDS and 32% in commercial antivirus software. We conducted a comprehensive analysis of the factors that contributed to the observed accuracy decline in both LB-MDS and commercial antivirus software. We have released the ERMDS-X dataset as an open-source resource, available on GitHub at <span>https://github.com/lcjia94/ERMDS</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49714566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enabling hyperscale web services 启用超大规模web服务
Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100092
Akshitha Sriraman

Modern web services such as social media, online messaging, and web search support billions of users, requiring data centers that scale to hundreds of thousands of servers, i.e., hyperscale. The key challenge in enabling hyperscale web services arise from (1) an unprecedented growth in data, users, and service functionality and (2) a decline in hardware performance scaling. We highlight a dissertation’s contributions in bridging the software and hardware worlds to realize more efficient hyperscale services despite these challenges.

社交媒体、在线消息和网络搜索等现代网络服务支持数十亿用户,需要扩展到数十万服务器的数据中心,即超大规模。实现超大规模web服务的关键挑战来自(1)数据、用户和服务功能的空前增长,以及(2)硬件性能扩展的下降。我们强调了一篇论文在连接软件和硬件世界以实现更高效的超规模服务方面的贡献,尽管存在这些挑战。
{"title":"Enabling hyperscale web services","authors":"Akshitha Sriraman","doi":"10.1016/j.tbench.2023.100092","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100092","url":null,"abstract":"<div><p>Modern web services such as social media, online messaging, and web search support billions of users, requiring data centers that scale to hundreds of thousands of servers, i.e., <em>hyperscale</em>. The key challenge in enabling hyperscale web services arise from (1) an unprecedented growth in data, users, and service functionality and (2) a decline in hardware performance scaling. We highlight a dissertation’s contributions in bridging the software and hardware worlds to realize more efficient hyperscale services despite these challenges.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49714575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing the sparse approximate inverse preconditioning algorithm on GPU 基于GPU的稀疏近似逆预处理算法优化
Pub Date : 2022-10-01 DOI: 10.1016/j.tbench.2023.100087
Xinyue Chu, Yizhou Wang, Qi Chen, Jiaquan Gao

In this study, we present an optimization sparse approximate inverse (SPAI) preconditioning algorithm on GPU, called GSPAI-Opt. In GSPAI-Opt, it fuses the advantages of two popular SPAI preconditioning algorithms, and has the following novelties: (1) an optimization strategy is proposed to choose whether to use the constant or non-constant thread group for any sparse pattern of the preprocessor, and (2) a parallel framework of optimizing the SPAI preconditioner is proposed on GPU, and (3) for each component of the preconditioner, a decision tree is established to choose the optimal kernel of computing it. Experimental results validate the effectiveness of GSPAI-Opt.

在本研究中,我们提出了一种GPU上的优化稀疏近似逆(SPAI)预处理算法,称为GSPAI-Opt。GSPAI-Opt,融合两种流行的优势SPAI预处理算法,和下面的小礼品:(1)提出了一种优化策略选择是否使用常数或不恒定线程组的稀疏模式预处理器,和(2)一个并行的框架上,提出了优化SPAI预处理GPU,和(3)对于每个组件的预处理,建立决策树来选择最优计算内核。实验结果验证了GSPAI-Opt算法的有效性。
{"title":"Optimizing the sparse approximate inverse preconditioning algorithm on GPU","authors":"Xinyue Chu,&nbsp;Yizhou Wang,&nbsp;Qi Chen,&nbsp;Jiaquan Gao","doi":"10.1016/j.tbench.2023.100087","DOIUrl":"10.1016/j.tbench.2023.100087","url":null,"abstract":"<div><p>In this study, we present an optimization sparse approximate inverse (SPAI) preconditioning algorithm on GPU, called GSPAI-Opt. In GSPAI-Opt, it fuses the advantages of two popular SPAI preconditioning algorithms, and has the following novelties: (1) an optimization strategy is proposed to choose whether to use the constant or non-constant thread group for any sparse pattern of the preprocessor, and (2) a parallel framework of optimizing the SPAI preconditioner is proposed on GPU, and (3) for each component of the preconditioner, a decision tree is established to choose the optimal kernel of computing it. Experimental results validate the effectiveness of GSPAI-Opt.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485923000042/pdfft?md5=8592fe298c854dc9f2e85112414f0c44&pid=1-s2.0-S2772485923000042-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77940299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Performance characterization and optimization of pruning patterns for sparse DNN inference 稀疏DNN推理中剪枝模式的性能表征与优化
Pub Date : 2022-10-01 DOI: 10.1016/j.tbench.2023.100090
Yunjie Liu, Jingwei Sun, Jiaqiang Liu, Guangzhong Sun

Deep neural networks are suffering from over parameterized high storage and high consumption problems. Pruning can effectively reduce storage and computation costs of deep neural networks by eliminating their redundant parameters. In existing pruning methods, filter pruning achieves more efficient inference, while element-wise pruning maintains better accuracy. To make a trade-off between the two endpoints, a variety of pruning patterns has been proposed. This study analyzes the performance characteristics of sparse DNNs pruned by different patterns, including element-wise, vector-wise, block-wise, and group-wise. Based on the analysis, we propose an efficient implementation of group-wise sparse DNN inference, which can make better use of GPUs. Experimental results on VGG, ResNet, BERT and ViT show that our optimized group-wise pruning pattern achieves much lower inference latency on GPU than other sparse patterns and the existing group-wise pattern implementation.

深度神经网络存在过参数化的高存储和高消耗问题。剪枝通过消除冗余参数,有效地降低了深度神经网络的存储和计算成本。在现有的剪枝方法中,过滤器剪枝可以实现更高效的推理,而元素剪枝可以保持更好的准确性。为了在两个端点之间进行权衡,提出了各种修剪模式。本研究分析了不同模式下的稀疏dnn的性能特征,包括元素型、矢量型、块型和组型。在此基础上,我们提出了一种有效的分组稀疏DNN推理实现方法,可以更好地利用gpu。在VGG、ResNet、BERT和ViT上的实验结果表明,优化后的组明智修剪模式在GPU上的推理延迟比其他稀疏模式和现有的组明智模式实现要低得多。
{"title":"Performance characterization and optimization of pruning patterns for sparse DNN inference","authors":"Yunjie Liu,&nbsp;Jingwei Sun,&nbsp;Jiaqiang Liu,&nbsp;Guangzhong Sun","doi":"10.1016/j.tbench.2023.100090","DOIUrl":"10.1016/j.tbench.2023.100090","url":null,"abstract":"<div><p>Deep neural networks are suffering from over parameterized high storage and high consumption problems. Pruning can effectively reduce storage and computation costs of deep neural networks by eliminating their redundant parameters. In existing pruning methods, filter pruning achieves more efficient inference, while element-wise pruning maintains better accuracy. To make a trade-off between the two endpoints, a variety of pruning patterns has been proposed. This study analyzes the performance characteristics of sparse DNNs pruned by different patterns, including element-wise, vector-wise, block-wise, and group-wise. Based on the analysis, we propose an efficient implementation of group-wise sparse DNN inference, which can make better use of GPUs. Experimental results on VGG, ResNet, BERT and ViT show that our optimized group-wise pruning pattern achieves much lower inference latency on GPU than other sparse patterns and the existing group-wise pattern implementation.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485923000078/pdfft?md5=47f436d7570515bb39cfffeda4376c89&pid=1-s2.0-S2772485923000078-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84120145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BenchCouncil Transactions on Benchmarks, Standards and Evaluations
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1