BenchCouncil Transactions on Benchmarks, Standards and Evaluations最新文献_第3页

CoviDetector: A transfer learning-based semi supervised approach to detect Covid-19 using CXR images CoviDetector:一种基于迁移学习的半监督方法，使用CXR图像检测Covid-19

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-06-01 DOI: 10.1016/j.tbench.2023.100119

Deepraj Chowdhury , Anik Das , Ajoy Dey , Soham Banerjee , Muhammed Golec , Dimitrios Kollias , Mohit Kumar , Guneet Kaur , Rupinder Kaur , Rajesh Chand Arya , Gurleen Wander , Praneet Wander , Gurpreet Singh Wander , Ajith Kumar Parlikad , Sukhpal Singh Gill , Steve Uhlig

COVID-19 was one of the deadliest and most infectious illnesses of this century. Research has been done to decrease pandemic deaths and slow down its spread. COVID-19 detection investigations have utilised Chest X-ray (CXR) images with deep learning techniques with its sensitivity in identifying pneumonic alterations. However, CXR images are not publicly available due to users’ privacy concerns, resulting in a challenge to train a highly accurate deep learning model from scratch. Therefore, we proposed CoviDetector, a new semi-supervised approach based on transfer learning and clustering, which displays improved performance and requires less training data. CXR images are given as input to this model, and individuals are categorised into three classes: (1) COVID-19 positive; (2) Viral pneumonia; and (3) Normal. The performance of CoviDetector has been evaluated on four different datasets, achieving over 99% accuracy on them. Additionally, we generate heatmaps utilising Grad-CAM and overlay them on the CXR images to present the highlighted areas that were deciding factors in detecting COVID-19. Finally, we developed an Android app to offer a user-friendly interface. We release the code, datasets and results’ scripts of CoviDetector for reproducibility purposes; they are available at: https://github.com/dasanik2001/CoviDetector

新冠肺炎是本世纪最致命、最具传染性的疾病之一。已经进行了减少大流行死亡人数和减缓其传播的研究。新冠肺炎检测调查利用了具有深度学习技术的胸部X射线（CXR）图像，其在识别肺炎改变方面的敏感性。然而，由于用户的隐私问题，CXR图像尚未公开，这给从头开始训练高度准确的深度学习模型带来了挑战。因此，我们提出了CoviDetector，这是一种基于迁移学习和聚类的新的半监督方法，它显示出改进的性能，并且需要更少的训练数据。CXR图像被作为该模型的输入，个体被分为三类：（1）新冠肺炎阳性；（2）病毒性肺炎；和（3）正常。CoviDetector的性能已经在四个不同的数据集上进行了评估，在这些数据集上实现了99%以上的准确率。此外，我们使用Grad-CAM生成热图，并将其覆盖在CXR图像上，以呈现突出显示的区域，这些区域是检测新冠肺炎的决定性因素。最后，我们开发了一个Android应用程序，提供了一个用户友好的界面。我们发布CoviDetector的代码、数据集和结果脚本，以实现再现性；它们可在以下位置获得：https://github.com/dasanik2001/CoviDetector

{"title":"CoviDetector: A transfer learning-based semi supervised approach to detect Covid-19 using CXR images","authors":"Deepraj Chowdhury , Anik Das , Ajoy Dey , Soham Banerjee , Muhammed Golec , Dimitrios Kollias , Mohit Kumar , Guneet Kaur , Rupinder Kaur , Rajesh Chand Arya , Gurleen Wander , Praneet Wander , Gurpreet Singh Wander , Ajith Kumar Parlikad , Sukhpal Singh Gill , Steve Uhlig","doi":"10.1016/j.tbench.2023.100119","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100119","url":null,"abstract":"<div><p>COVID-19 was one of the deadliest and most infectious illnesses of this century. Research has been done to decrease pandemic deaths and slow down its spread. COVID-19 detection investigations have utilised Chest X-ray (CXR) images with deep learning techniques with its sensitivity in identifying pneumonic alterations. However, CXR images are not publicly available due to users’ privacy concerns, resulting in a challenge to train a highly accurate deep learning model from scratch. Therefore, we proposed <strong>CoviDetector</strong>, a new semi-supervised approach based on transfer learning and clustering, which displays improved performance and requires less training data. CXR images are given as input to this model, and individuals are categorised into three classes: (1) COVID-19 positive; (2) Viral pneumonia; and (3) Normal. The performance of CoviDetector has been evaluated on four different datasets, achieving over 99% accuracy on them. Additionally, we generate heatmaps utilising Grad-CAM and overlay them on the CXR images to present the highlighted areas that were deciding factors in detecting COVID-19. Finally, we developed an Android app to offer a user-friendly interface. We release the code, datasets and results’ scripts of CoviDetector for reproducibility purposes; they are available at: <span>https://github.com/dasanik2001/CoviDetector</span><svg><path></path></svg></p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 2","pages":"Article 100119"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49716000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

The Third BenchCouncil International Symposium on Intelligent Computers, Algorithms, and Applications (IC 2023) Call for Papers 第三届智能计算机，算法和应用国际研讨会(IC 2023)征文

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-06-01 DOI: 10.1016/j.tbench.2023.100123

Sponsored and organized by the International Open Benchmark Council (BenchCouncil), the IC conference is to provide a pioneering technology map through searching and advancing state-of-the-art and state-of-the-practice in processors, systems, algorithms, and applications for machine learning, deep learning, spiking neural network and other AI techniques across multidisciplinary and interdisciplinary areas. IC 2023 invites manuscripts describing original work in the above areas and topics. All accepted papers will be presented at the IC 2023 conference and published by Springer CCIS (Indexed by EI). The IC conferences have been successfully held for two series from 2019 to 2022 and attracted plenty of paper submissions and participants. IC 2023 will be held on December 4-6, 2023 in Sanya and invites manuscripts describing original work in processors, systems, algorithms, and applications for AI techniques across multidisciplinary and interdisciplinary areas. The conference website is https://www.benchcouncil.org/ic2023/.

Important Dates: Paper Submission: July 31, 2023, at 11:59 PM AoE Notification: September 30, 2023, at 11:59 PM AoE Final Papers Due: October 31, 2023, at 11:59 PM AoE Conference Date: December 4-6, 2023 Submission Site: https://ic2023.hotcrp.com/

IC会议由国际开放基准理事会（BenchCouncil）赞助和组织，旨在通过搜索和推进处理器、系统、算法和机器学习、深度学习、，尖峰神经网络和其他跨学科和跨学科领域的人工智能技术。IC2023邀请描述上述领域和主题的原创作品的手稿。所有被接受的论文将在IC 2023会议上发表，并由Springer CCIS（EI索引）出版。从2019年到2022年，IC会议已经成功举办了两个系列，吸引了大量的论文提交和参与者。IC2023将于2023年12月4日至6日在三亚举行，邀请手稿描述处理器、系统、算法以及人工智能技术在多学科和跨学科领域的应用。会议网站是https://www.benchcouncil.org/ic2023/.Important日期：论文提交时间：2023年7月31日，上午11:59 AoE通知：2023月30日，下午11:59 Ao E最终论文截止时间：2025年10月31日下午11:59https://ic2023.hotcrp.com/

{"title":"The Third BenchCouncil International Symposium on Intelligent Computers, Algorithms, and Applications (IC 2023) Call for Papers","authors":"","doi":"10.1016/j.tbench.2023.100123","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100123","url":null,"abstract":"<div><p>Sponsored and organized by the International Open Benchmark Council (BenchCouncil), the IC conference is to provide a pioneering technology map through searching and advancing state-of-the-art and state-of-the-practice in processors, systems, algorithms, and applications for machine learning, deep learning, spiking neural network and other AI techniques across multidisciplinary and interdisciplinary areas. IC 2023 invites manuscripts describing original work in the above areas and topics. All accepted papers will be presented at the IC 2023 conference and published by Springer CCIS (Indexed by EI). The IC conferences have been successfully held for two series from 2019 to 2022 and attracted plenty of paper submissions and participants. IC 2023 will be held on December 4-6, 2023 in Sanya and invites manuscripts describing original work in processors, systems, algorithms, and applications for AI techniques across multidisciplinary and interdisciplinary areas. The conference website is <span>https://www.benchcouncil.org/ic2023/</span><svg><path></path></svg>.</p><p><strong>Important Dates:</strong> Paper Submission: July 31, 2023, at 11:59 PM AoE Notification: September 30, 2023, at 11:59 PM AoE Final Papers Due: October 31, 2023, at 11:59 PM AoE Conference Date: December 4-6, 2023 Submission Site: <span>https://ic2023.hotcrp.com/</span><svg><path></path></svg></p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 2","pages":"Article 100123"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49732679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

TBench (BenchCouncil Transactions on Benchmarks, Standards and Evaluations) Calls for Papers bench (BenchCouncil Transactions on benchmark, Standards and evaluation)征文

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-06-01 DOI: 10.1016/S2772-4859(23)00048-0

BenchCouncil Transactions on Benchmarks, Standards and Evaluations (TBench) is an open-access journal dedicated to advancing the field of benchmarks, data sets, standards, evaluations and optimizations. This journal is a peer-reviewed, subsidized open-access journal where The International Open Benchmark Council (BenchCouncil) pays the open-access fee. Authors do not have to pay any open-access publication fee. However, at least one of the authors must register BenchCouncil International Symposium on Benchmarking, Measuring and Optimizing (Bench) (https://www.benchcouncil.org/bench/) and present their work. It seeks a fast-track publication with an average turnaround time of one month.

We invite submissions covering a wide range of topics from various disciplines, with a particular emphasis on interdisciplinary research. Whether it pertains to computers, AI, medicine, education, finance, business, psychology, or other social disciplines, all relevant contributions are welcome.

At TBench, we prioritize the reproducibility of research. We strongly encourage authors to ensure that their articles are prepared for open-source or artifact evaluation before submission. The journal website is https://www.benchcouncil.org/tbench.

BenchCouncil Transactions on Benchmarks，Standards and Evaluation（TBench）是一本开放获取期刊，致力于推进基准、数据集、标准、评估和优化领域的发展。本期刊是一本同行评审、有补贴的开放获取期刊，由国际开放基准理事会（BenchCouncil）支付开放获取费。作者无需支付任何开放访问出版费。然而，至少有一位作者必须注册BenchCouncil国际基准、测量和优化研讨会（Bench）(https://www.benchcouncil.org/bench/)并展示他们的作品。它寻求一种平均周转时间为一个月的快速出版物。我们邀请来自不同学科的广泛主题的投稿，特别强调跨学科研究。无论是计算机、人工智能、医学、教育、金融、商业、心理学还是其他社会学科，都欢迎所有相关贡献。在TBench，我们优先考虑研究的再现性。我们强烈鼓励作者确保他们的文章在提交前准备好进行开源或工件评估。期刊网站是https://www.benchcouncil.org/tbench.

{"title":"TBench (BenchCouncil Transactions on Benchmarks, Standards and Evaluations) Calls for Papers","authors":"","doi":"10.1016/S2772-4859(23)00048-0","DOIUrl":"https://doi.org/10.1016/S2772-4859(23)00048-0","url":null,"abstract":"<div><p>BenchCouncil Transactions on Benchmarks, Standards and Evaluations (TBench) is an open-access journal dedicated to advancing the field of benchmarks, data sets, standards, evaluations and optimizations. This journal is a peer-reviewed, subsidized open-access journal where The International Open Benchmark Council (BenchCouncil) pays the open-access fee. Authors do not have to pay any open-access publication fee. However, at least one of the authors must register BenchCouncil International Symposium on Benchmarking, Measuring and Optimizing (Bench) (<span>https://www.benchcouncil.org/bench/</span><svg><path></path></svg>) and present their work. It seeks a fast-track publication with an average turnaround time of one month.</p><p>We invite submissions covering a wide range of topics from various disciplines, with a particular emphasis on interdisciplinary research. Whether it pertains to computers, AI, medicine, education, finance, business, psychology, or other social disciplines, all relevant contributions are welcome.</p><p>At TBench, we prioritize the reproducibility of research. We strongly encourage authors to ensure that their articles are prepared for open-source or artifact evaluation before submission. The journal website is <span>https://www.benchcouncil.org/tbench</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 2","pages":"Article 100131"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49715448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unlocking the opportunities through ChatGPT Tool towards ameliorating the education system 通过ChatGPT工具释放改善教育系统的机会

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-06-01 DOI: 10.1016/j.tbench.2023.100115

Mohd Javaid , Abid Haleem , Ravi Pratap Singh , Shahbaz Khan , Ibrahim Haleem Khan

Artificial Intelligence (AI)-based ChatGPT developed by OpenAI is now widely accepted in several fields, including education. Students can learn about ideas and theories by using this technology while generating content with it. ChatGPT is built on State of the Art (SOA), like Deep Learning (DL), Natural Language Processing (NLP), and Machine Learning (ML), an extrapolation of a class of ML-NLP models known as Large Language Model (LLMs). It may be used to automate test and assignment grading, giving instructors more time to concentrate on instruction. This technology can be utilised to customise learning for kids, enabling them to focus more intently on the subject matter and critical thinking ChatGPT is an excellent tool for language lessons since it can translate text from one language to another. It may provide lists of vocabulary terms and meanings, assisting students in developing their language proficiency with resources. Personalised learning opportunities are one of ChatGPT’s significant applications in the classroom. This might include creating educational resources and content tailored to a student’s unique interests, skills, and learning goals. This paper discusses the need for ChatGPT and the significant features of ChatGPT in the education system. Further, it identifies and discusses the significant applications of ChatGPT in education. Using ChatGPT, educators may design lessons and instructional materials specific to each student’s requirements and skills based on current trends. Students may work at their speed and concentrate on the areas where they need the most support, resulting in a more effective and efficient learning environment. Both instructors and students may profit significantly from using ChatGPT in the classroom. Instructors may save time on numerous duties by using this technology. In future, ChatGPT will become a powerful tool for enhancing students’ and teachers’ experience.

由OpenAI开发的基于人工智能（AI）的ChatGPT现在在包括教育在内的多个领域被广泛接受。学生可以使用这项技术学习思想和理论，同时使用它生成内容。ChatGPT建立在最先进的（SOA）之上，如深度学习（DL）、自然语言处理（NLP）和机器学习（ML），这是一类ML-NLP模型（LLM）的外推。它可以用于自动化测试和作业评分，让教师有更多的时间专注于教学。这项技术可以用于为孩子们定制学习，使他们能够更加专注于主题和批判性思维。ChatGPT是语言课程的一个优秀工具，因为它可以将文本从一种语言翻译成另一种语言。它可以提供词汇术语和含义的列表，帮助学生利用资源发展他们的语言能力。个性化学习机会是ChatGPT在课堂上的重要应用之一。这可能包括创建适合学生独特兴趣、技能和学习目标的教育资源和内容。本文讨论了ChatGPT的必要性以及ChatGPT在教育系统中的显著特点。此外，它确定并讨论了ChatGPT在教育中的重要应用。使用ChatGPT，教育工作者可以根据当前趋势设计针对每个学生的要求和技能的课程和教学材料。学生可以按照自己的速度工作，专注于他们最需要支持的领域，从而创造一个更有效、更高效的学习环境。教师和学生都可能从课堂上使用ChatGPT中受益匪浅。教员可以通过使用这项技术来节省许多任务的时间。未来，ChatGPT将成为增强学生和教师体验的强大工具。

{"title":"Unlocking the opportunities through ChatGPT Tool towards ameliorating the education system","authors":"Mohd Javaid , Abid Haleem , Ravi Pratap Singh , Shahbaz Khan , Ibrahim Haleem Khan","doi":"10.1016/j.tbench.2023.100115","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100115","url":null,"abstract":"<div><p>Artificial Intelligence (AI)-based ChatGPT developed by OpenAI is now widely accepted in several fields, including education. Students can learn about ideas and theories by using this technology while generating content with it. ChatGPT is built on State of the Art (SOA), like Deep Learning (DL), Natural Language Processing (NLP), and Machine Learning (ML), an extrapolation of a class of ML-NLP models known as Large Language Model (LLMs). It may be used to automate test and assignment grading, giving instructors more time to concentrate on instruction. This technology can be utilised to customise learning for kids, enabling them to focus more intently on the subject matter and critical thinking ChatGPT is an excellent tool for language lessons since it can translate text from one language to another. It may provide lists of vocabulary terms and meanings, assisting students in developing their language proficiency with resources. Personalised learning opportunities are one of ChatGPT’s significant applications in the classroom. This might include creating educational resources and content tailored to a student’s unique interests, skills, and learning goals. This paper discusses the need for ChatGPT and the significant features of ChatGPT in the education system. Further, it identifies and discusses the significant applications of ChatGPT in education. Using ChatGPT, educators may design lessons and instructional materials specific to each student’s requirements and skills based on current trends. Students may work at their speed and concentrate on the areas where they need the most support, resulting in a more effective and efficient learning environment. Both instructors and students may profit significantly from using ChatGPT in the classroom. Instructors may save time on numerous duties by using this technology. In future, ChatGPT will become a powerful tool for enhancing students’ and teachers’ experience.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 2","pages":"Article 100115"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49716023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 24

Benchmarking HTAP databases for performance isolation and real-time analytics 对HTAP数据库进行性能隔离和实时分析的基准测试

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-06-01 DOI: 10.1016/j.tbench.2023.100122

Guoxin Kang, Simin Chen, Hongxiao Li

Hybrid Transactional/Analytical Processing (HTAP) databases are designed to execute real-time analytics and provide performance isolation for online transactions and analytical queries. Real-time analytics emphasize analyzing the fresh data generated by online transactions. And performance isolation depicts the performance interference between concurrently executing online transactions and analytical queries. However, HTAP databases are extreme lack micro-benchmarks to accurately measure data freshness. Despite the abundance of HTAP databases and benchmarks, there needs to be more thorough research on the performance isolation and real-time analytics capabilities of HTAP databases. This paper focuses on the critical designs of mainstream HTAP databases and the state-of-the-art and state-of-the-practice HTAP benchmarks. First, we systematically introduce the advanced technologies adopted by HTAP databases for real-time analytics and performance isolation capabilities. Then, we summarize the pros and cons of the state-of-the-art and state-of-the-practice HTAP benchmarks. Next, we design and implement a micro-benchmark for HTAP databases, which can precisely control the rate of fresh data generation and the granularity of fresh data access. Finally, we devise experiments to evaluate the performance isolation and real-time analytics capabilities of the state-of-the-art HTAP database. In our continued pursuit of transparency and community collaboration, we will soon make available our comprehensive specifications, meticulously crafted source code, and significant results for public access at https://www.benchcouncil.org/mOLxPBench.

混合事务/分析处理（HTAP）数据库旨在执行实时分析，并为在线事务和分析查询提供性能隔离。实时分析强调分析在线交易产生的新鲜数据。性能隔离描述了并发执行的在线事务和分析查询之间的性能干扰。然而，HTAP数据库极度缺乏精确衡量数据新鲜度的微观基准。尽管有大量的HTAP数据库和基准测试，但需要对HTAP数据库的性能隔离和实时分析能力进行更彻底的研究。本文重点介绍了主流HTAP数据库的关键设计以及HTAP基准测试的最新技术和现状。首先，我们系统地介绍了HTAP数据库在实时分析和性能隔离功能方面采用的先进技术。然后，我们总结了最先进的HTAP基准的优缺点和实践状况。接下来，我们设计并实现了一个用于HTAP数据库的微基准测试，它可以精确地控制新数据生成的速率和新数据访问的粒度。最后，我们设计了实验来评估最先进的HTAP数据库的性能隔离和实时分析能力。在我们持续追求透明度和社区合作的过程中，我们将很快在https://www.benchcouncil.org/mOLxPBench.

{"title":"Benchmarking HTAP databases for performance isolation and real-time analytics","authors":"Guoxin Kang, Simin Chen, Hongxiao Li","doi":"10.1016/j.tbench.2023.100122","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100122","url":null,"abstract":"<div><p><strong>H</strong>ybrid <strong>T</strong>ransactional/<strong>A</strong>nalytical <strong>P</strong>rocessing (HTAP) databases are designed to execute real-time analytics and provide performance isolation for online transactions and analytical queries. Real-time analytics emphasize analyzing the fresh data generated by online transactions. And performance isolation depicts the performance interference between concurrently executing online transactions and analytical queries. However, HTAP databases are extreme lack micro-benchmarks to accurately measure data freshness. Despite the abundance of HTAP databases and benchmarks, there needs to be more thorough research on the performance isolation and real-time analytics capabilities of HTAP databases. This paper focuses on the critical designs of mainstream HTAP databases and the state-of-the-art and state-of-the-practice HTAP benchmarks. First, we systematically introduce the advanced technologies adopted by HTAP databases for real-time analytics and performance isolation capabilities. Then, we summarize the pros and cons of the state-of-the-art and state-of-the-practice HTAP benchmarks. Next, we design and implement a micro-benchmark for HTAP databases, which can precisely control the rate of fresh data generation and the granularity of fresh data access. Finally, we devise experiments to evaluate the performance isolation and real-time analytics capabilities of the state-of-the-art HTAP database. In our continued pursuit of transparency and community collaboration, we will soon make available our comprehensive specifications, meticulously crafted source code, and significant results for public access at <span>https://www.benchcouncil.org/mOLxPBench</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 2","pages":"Article 100122"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49715975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ChatGPT for healthcare services: An emerging stage for an innovative perspective 医疗保健服务的ChatGPT:创新视角的新兴阶段

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100105

Mohd Javaid , Abid Haleem , Ravi Pratap Singh

Generative Pretrained Transformer, often known as GPT, is an innovative kind of Artificial Intelligence (AI) which can produce writing that seems to have been written by a person. OpenAI created this AI language model called ChatGPT. It is built using the GPT architecture and is trained on a large corpus of text data to respond to natural language inquiries that resemble a person’s requirements. This technology has lots of applications in healthcare. The need for accurate and current data is one of the major obstacles to adopting ChatGPT in healthcare. GPT must have access to precise and up-to-date medical data to provide trustworthy suggestions and treatment options. It might be accomplished by ensuring that the data used by GPT is received from reliable sources and that the data is updated regularly. Since sensitive medical information would be involved, it will also be crucial to consider privacy and security issues while utilising GPT in the healthcare industry. This paper briefs about ChatGPT and its need for healthcare, its significant Work Flow Dimensions and typical features of ChatGPT for the Healthcare domain. Finally, it identified and discussed significant applications of ChatGPT for healthcare. ChatGPT can comprehend the conversational context and provide contextually appropriate replies. Its effectiveness as a conversational AI tool makes it useful for chatbots, virtual assistants, and other applications. However, we see many limitations in medical ethics, data interpretation, accountability and other issues related to the privacy. Regarding specialised tasks like text creation, language translation, text categorisation, text summarisation, and creating conversation systems, ChatGPT has been pre-trained on a large corpus of text data, and somewhat satisfactory results can be expected. Moreover, it can also be utilised for various Natural Language Processing (NLP) activities, including sentiment analysis, part-of-speech tagging, and named entity identification.

生成预训练转换器（Generative Pretrained Transformer），通常被称为GPT，是一种创新的人工智能（AI），它可以产生似乎是由个人编写的文字。OpenAI创建了一个名为ChatGPT的人工智能语言模型。它是使用GPT架构构建的，并在大型文本数据语料库上进行训练，以响应类似于个人需求的自然语言查询。这项技术在医疗保健领域有很多应用。对准确和最新数据的需求是在医疗保健中采用ChatGPT的主要障碍之一。GPT必须能够访问精确和最新的医疗数据，以提供值得信赖的建议和治疗选择。这可以通过确保GPT使用的数据是从可靠的来源接收的，并定期更新数据来实现。由于涉及敏感的医疗信息，在医疗保健行业使用GPT时，考虑隐私和安全问题也至关重要。本文简要介绍了ChatGPT及其对医疗保健的需求、其重要的工作流程维度以及ChatGPT在医疗保健领域的典型特征。最后，它确定并讨论了ChatGPT在医疗保健方面的重要应用。ChatGPT可以理解会话上下文并提供上下文适当的回复。它作为一种对话式人工智能工具的有效性使其对聊天机器人、虚拟助理和其他应用程序非常有用。然而，我们看到在医学伦理、数据解释、问责制和其他与隐私相关的问题上存在许多局限性。关于文本创建、语言翻译、文本分类、文本总结和创建对话系统等专业任务，ChatGPT已经在大量文本数据上进行了预训练，预计会取得一些令人满意的结果。此外，它还可以用于各种自然语言处理（NLP）活动，包括情感分析、词性标记和命名实体识别。

{"title":"ChatGPT for healthcare services: An emerging stage for an innovative perspective","authors":"Mohd Javaid , Abid Haleem , Ravi Pratap Singh","doi":"10.1016/j.tbench.2023.100105","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100105","url":null,"abstract":"<div><p>Generative Pretrained Transformer, often known as GPT, is an innovative kind of Artificial Intelligence (AI) which can produce writing that seems to have been written by a person. OpenAI created this AI language model called ChatGPT. It is built using the GPT architecture and is trained on a large corpus of text data to respond to natural language inquiries that resemble a person’s requirements. This technology has lots of applications in healthcare. The need for accurate and current data is one of the major obstacles to adopting ChatGPT in healthcare. GPT must have access to precise and up-to-date medical data to provide trustworthy suggestions and treatment options. It might be accomplished by ensuring that the data used by GPT is received from reliable sources and that the data is updated regularly. Since sensitive medical information would be involved, it will also be crucial to consider privacy and security issues while utilising GPT in the healthcare industry. This paper briefs about ChatGPT and its need for healthcare, its significant Work Flow Dimensions and typical features of ChatGPT for the Healthcare domain. Finally, it identified and discussed significant applications of ChatGPT for healthcare. ChatGPT can comprehend the conversational context and provide contextually appropriate replies. Its effectiveness as a conversational AI tool makes it useful for chatbots, virtual assistants, and other applications. However, we see many limitations in medical ethics, data interpretation, accountability and other issues related to the privacy. Regarding specialised tasks like text creation, language translation, text categorisation, text summarisation, and creating conversation systems, ChatGPT has been pre-trained on a large corpus of text data, and somewhat satisfactory results can be expected. Moreover, it can also be utilised for various Natural Language Processing (NLP) activities, including sentiment analysis, part-of-speech tagging, and named entity identification.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 1","pages":"Article 100105"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49731345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 52

SNNBench: End-to-end AI-oriented spiking neural network benchmarking SNNBench:端到端面向ai的峰值神经网络基准测试

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100108

Fei Tang, Wanling Gao

Spiking Neural Networks (SNNs) show great potential for solving Artificial Intelligence (AI) applications. At the preliminary stage of SNNs, benchmarks are essential for evaluating and optimizing SNN algorithms, software, and hardware toward AI scenarios. However, a majority of SNN benchmarks focus on evaluating SNN for brain science, which has distinct neural network architectures and targets. Even though there have several benchmarks evaluating SNN for AI, they only focus on a single stage of training and inference or a processing fragment of a whole stage without accuracy information. Thus, the existing SNN benchmarks lack an end-to-end perspective that not only covers both training and inference but also provides a whole training process to a target accuracy level.

This paper presents SNNBench—the first end-to-end AI-oriented SNN benchmark covering the processing stages of training and inference and containing the accuracy information. Focusing on two typical AI applications: image classification and speech recognition, we provide nine workloads that consider the typical characteristics of SNN, i.e., the dynamics of spiking neurons, and AI, i.e., learning paradigms including supervised and unsupervised learning, learning rules like backpropagation, connection types like fully connected, and accuracy. The evaluations of SNNBench on both CPU and GPU show its effectiveness. The specifications, source code, and results will be publicly available from https://www.benchcouncil.org/SNNBench.

Spiking神经网络在解决人工智能应用方面显示出巨大的潜力。在SNN的初步阶段，基准对于评估和优化面向人工智能场景的SNN算法、软件和硬件至关重要。然而，大多数SNN基准都侧重于评估脑科学的SNN，脑科学具有不同的神经网络架构和目标。尽管有几个评估人工智能SNN的基准，但它们只关注训练和推理的单个阶段或整个阶段的处理片段，而没有准确性信息。因此，现有的SNN基准缺乏端到端的视角，不仅涵盖了训练和推理，而且还提供了达到目标精度水平的整个训练过程。本文提出了SNNBench——第一个面向人工智能的端到端SNN基准，涵盖了训练和推理的处理阶段，并包含准确性信息。专注于两个典型的人工智能应用：图像分类和语音识别，我们提供了九种工作负载，这些工作负载考虑了SNN的典型特征，即尖峰神经元的动力学，以及人工智能，即学习范式，包括监督和非监督学习，学习规则，如反向传播，连接类型，如完全连接，以及准确性。SNNBench在CPU和GPU上的测试表明了它的有效性。规范、源代码和结果将在https://www.benchcouncil.org/SNNBench.

{"title":"SNNBench: End-to-end AI-oriented spiking neural network benchmarking","authors":"Fei Tang, Wanling Gao","doi":"10.1016/j.tbench.2023.100108","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100108","url":null,"abstract":"<div><p>Spiking Neural Networks (SNNs) show great potential for solving Artificial Intelligence (AI) applications. At the preliminary stage of SNNs, benchmarks are essential for evaluating and optimizing SNN algorithms, software, and hardware toward AI scenarios. However, a majority of SNN benchmarks focus on evaluating SNN for brain science, which has distinct neural network architectures and targets. Even though there have several benchmarks evaluating SNN for AI, they only focus on a single stage of training and inference or a processing fragment of a whole stage without accuracy information. Thus, the existing SNN benchmarks lack an end-to-end perspective that not only covers both training and inference but also provides a whole training process to a target accuracy level.</p><p>This paper presents SNNBench—the first end-to-end AI-oriented SNN benchmark covering the processing stages of training and inference and containing the accuracy information. Focusing on two typical AI applications: image classification and speech recognition, we provide nine workloads that consider the typical characteristics of SNN, i.e., the dynamics of spiking neurons, and AI, i.e., learning paradigms including supervised and unsupervised learning, learning rules like backpropagation, connection types like fully connected, and accuracy. The evaluations of SNNBench on both CPU and GPU show its effectiveness. The specifications, source code, and results will be publicly available from <span>https://www.benchcouncil.org/SNNBench</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 1","pages":"Article 100108"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49714571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

e₹—The digital currency in India: Challenges and prospects 印度的数字货币:挑战与前景

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100107

Md. Asraful Haque, Mohd Shoaib

The Reserve Bank of India (RBI) has recently launched the country’s first pilot project for the digital currency known as the digital rupee or e-Rupee (e^₹). The launch of the digital rupee represents a significant advancement in the “Digital India” revolution. It will be a fantastic opportunity for India since it might make conducting business easier while enhancing the security and resilience of the overall payments system. Digital currency attempts to rapidly progress monetary policy to disrupt physical money, lower the cost of financial transactions, and reshape how the money will circulate. Although the effects of digital currency cannot be foreseen, it is extremely important to thoroughly research digital currency and its effects on the operational stage. The development of a digital currency infrastructure has some challenges in terms of performance, scalability, and different usage scenarios. The article clarifies what e^₹ is. How does it work? What makes it different from cryptocurrencies? What are the major challenges and prospects for it in India?

印度储备银行（RBI）最近启动了该国首个数字货币试点项目，即数字卢比或电子卢比（e₹). 数字卢比的推出代表着“数字印度”革命的重大进展。这对印度来说将是一个绝佳的机会，因为它可能会让开展业务变得更容易，同时增强整个支付系统的安全性和弹性。数字货币试图迅速推进货币政策，扰乱实物货币，降低金融交易成本，重塑货币流通方式。尽管数字货币的影响是无法预见的，但深入研究数字货币及其在运营阶段的影响是极其重要的。数字货币基础设施的开发在性能、可扩展性和不同的使用场景方面存在一些挑战。这篇文章阐明了₹是。它是如何工作的？它与加密货币的区别是什么？它在印度面临的主要挑战和前景是什么？

引用次数: 2

Bench 2023 Calls For Papers 2023号法官席要求提交文件

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100116

引用次数: 0

ERMDS: A obfuscation dataset for evaluating robustness of learning-based malware detection system 一个用于评估基于学习的恶意软件检测系统鲁棒性的混淆数据集

BenchCouncil Transactions on Benchmarks, Standards and Evaluations

Pub Date : 2023-02-01 DOI: 10.1016/j.tbench.2023.100106

Lichen Jia , Yang Yang , Bowen Tang , Zihan Jiang

Learning-based malware detection systems (LB-MDS) play a crucial role in defending computer systems from malicious attacks. Nevertheless, these systems can be vulnerable to various attacks, which can have significant consequences. Software obfuscation techniques can be used to modify the features of malware, thereby avoiding its classification as malicious by LB-MDS. However, existing portable executable (PE) malware datasets primarily use a single obfuscation technique, which LB-MDS has already learned, leading to a loss of their robustness evaluation ability. Therefore, creating a dataset with diverse features that were not observed during LB-MDS training has become the main challenge in evaluating the robustness of LB-MDS.

We propose a obfuscation dataset ERMDS that solves the problem of evaluating the robustness of LB-MDS by generating malwares with diverse features. When designing this dataset, we created three types of obfuscation spaces, corresponding to binary obfuscation, source code obfuscation, and packing obfuscation. Each obfuscation space has multiple obfuscation techniques, each with different parameters. The obfuscation techniques in these three obfuscation spaces can be used in combination and can be reused. This enables us to theoretically obtain an infinite number of obfuscation combinations, thereby creating malwares with a diverse range of features that have not been captured by LB-MDS.

To assess the effectiveness of the ERMDS obfuscation dataset, we create an instance of the obfuscation dataset called ERMDS-X. By utilizing this dataset, we conducted an evaluation of the robustness of two LB-MDS models, namely MalConv and EMBER, as well as six commercial antivirus software products, which are anonymized as AV1-AV6. The results of our experiments showed that ERMDS-X effectively reveals the limitations in the robustness of existing LB-MDS models, leading to an average accuracy reduction of 20% in LB-MDS and 32% in commercial antivirus software. We conducted a comprehensive analysis of the factors that contributed to the observed accuracy decline in both LB-MDS and commercial antivirus software. We have released the ERMDS-X dataset as an open-source resource, available on GitHub at https://github.com/lcjia94/ERMDS.

基于学习的恶意软件检测系统（LB-MDS）在保护计算机系统免受恶意攻击方面发挥着至关重要的作用。然而，这些系统可能容易受到各种攻击，从而产生重大后果。软件混淆技术可用于修改恶意软件的特征，从而避免其被LB-MDS归类为恶意软件。然而，现有的可移植可执行（PE）恶意软件数据集主要使用LB-MDS已经学会的单一模糊技术，导致其鲁棒性评估能力的丧失。因此，创建一个具有在LB-MDS训练过程中没有观察到的各种特征的数据集已成为评估LB-MDS鲁棒性的主要挑战。我们提出了一个模糊数据集ERMDS，该数据集通过生成具有各种特征的恶意软件来解决评估LB-MDS-鲁棒性的问题。在设计该数据集时，我们创建了三种类型的模糊空间，分别对应于二进制模糊、源代码模糊和打包模糊。每个模糊处理空间都有多种模糊处理技术，每种技术都有不同的参数。这三个模糊空间中的模糊技术可以组合使用，并且可以重用。这使我们能够在理论上获得无限数量的模糊组合，从而创建具有LB-MDS尚未捕获的各种功能的恶意软件。为了评估ERMDS模糊数据集的有效性，我们创建了一个名为ERMDS-X的模糊数据集实例。通过利用该数据集，我们对两个LB-MDS模型（即MalConv和EMBR）以及六个商业杀毒软件产品（匿名为AV1-AV6）的稳健性进行了评估。我们的实验结果表明，ERMDS-X有效地揭示了现有LB-MDS模型稳健性的局限性，导致LB-MDS的平均准确率降低了20%，商业反病毒软件的平均准确度降低了32%。我们对LB-MDS和商业杀毒软件中导致观察到的准确性下降的因素进行了全面分析。我们已经发布了ERMDS-X数据集作为开源资源，可在GitHub上获得，网址为https://github.com/lcjia94/ERMDS.

{"title":"ERMDS: A obfuscation dataset for evaluating robustness of learning-based malware detection system","authors":"Lichen Jia , Yang Yang , Bowen Tang , Zihan Jiang","doi":"10.1016/j.tbench.2023.100106","DOIUrl":"https://doi.org/10.1016/j.tbench.2023.100106","url":null,"abstract":"<div><p>Learning-based malware detection systems (LB-MDS) play a crucial role in defending computer systems from malicious attacks. Nevertheless, these systems can be vulnerable to various attacks, which can have significant consequences. Software obfuscation techniques can be used to modify the features of malware, thereby avoiding its classification as malicious by LB-MDS. However, existing portable executable (PE) malware datasets primarily use a single obfuscation technique, which LB-MDS has already learned, leading to a loss of their robustness evaluation ability. Therefore, creating a dataset with diverse features that were not observed during LB-MDS training has become the main challenge in evaluating the robustness of LB-MDS.</p><p>We propose a obfuscation dataset ERMDS that solves the problem of evaluating the robustness of LB-MDS by generating malwares with diverse features. When designing this dataset, we created three types of obfuscation spaces, corresponding to binary obfuscation, source code obfuscation, and packing obfuscation. Each obfuscation space has multiple obfuscation techniques, each with different parameters. The obfuscation techniques in these three obfuscation spaces can be used in combination and can be reused. This enables us to theoretically obtain an infinite number of obfuscation combinations, thereby creating malwares with a diverse range of features that have not been captured by LB-MDS.</p><p>To assess the effectiveness of the ERMDS obfuscation dataset, we create an instance of the obfuscation dataset called ERMDS-X. By utilizing this dataset, we conducted an evaluation of the robustness of two LB-MDS models, namely MalConv and EMBER, as well as six commercial antivirus software products, which are anonymized as AV1-AV6. The results of our experiments showed that ERMDS-X effectively reveals the limitations in the robustness of existing LB-MDS models, leading to an average accuracy reduction of 20% in LB-MDS and 32% in commercial antivirus software. We conducted a comprehensive analysis of the factors that contributed to the observed accuracy decline in both LB-MDS and commercial antivirus software. We have released the ERMDS-X dataset as an open-source resource, available on GitHub at <span>https://github.com/lcjia94/ERMDS</span><svg><path></path></svg>.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"3 1","pages":"Article 100106"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49714566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0