首页 > 最新文献

Big Data and Cognitive Computing最新文献

英文 中文
Crafting a Museum Guide Using ChatGPT4 使用ChatGPT4制作博物馆指南
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-09-04 DOI: 10.3390/bdcc7030148
Georgios Trichopoulos, M. Konstantakis, G. Caridakis, A. Katifori, Myrto Koukouli
This paper introduces a groundbreaking approach to enriching the museum experience using ChatGPT4, a state-of-the-art language model by OpenAI. By developing a museum guide powered by ChatGPT4, we aimed to address the challenges visitors face in navigating vast collections of artifacts and interpreting their significance. Leveraging the model’s natural-language-understanding and -generation capabilities, our guide offers personalized, informative, and engaging experiences. However, caution must be exercised as the generated information may lack scientific integrity and accuracy. To mitigate this, we propose incorporating human oversight and validation mechanisms. The subsequent sections present our own case study, detailing the design, architecture, and experimental evaluation of the museum guide system, highlighting its practical implementation and insights into the benefits and limitations of employing ChatGPT4 in the cultural heritage context.
本文介绍了一种突破性的方法,使用ChatGPT4来丰富博物馆体验,ChatGPT4是OpenAI最先进的语言模型。通过开发一个由ChatGPT4驱动的博物馆指南,我们旨在解决游客在浏览大量文物收藏和解释它们的意义时面临的挑战。利用模型的自然语言理解和生成能力,我们的指南提供个性化的、信息丰富的和引人入胜的体验。然而,必须谨慎行事,因为生成的信息可能缺乏科学的完整性和准确性。为了减轻这种情况,我们建议合并人为监督和验证机制。接下来的部分展示了我们自己的案例研究,详细介绍了博物馆导览系统的设计、架构和实验评估,重点介绍了其实际实施情况,以及在文化遗产背景下使用ChatGPT4的好处和局限性。
{"title":"Crafting a Museum Guide Using ChatGPT4","authors":"Georgios Trichopoulos, M. Konstantakis, G. Caridakis, A. Katifori, Myrto Koukouli","doi":"10.3390/bdcc7030148","DOIUrl":"https://doi.org/10.3390/bdcc7030148","url":null,"abstract":"This paper introduces a groundbreaking approach to enriching the museum experience using ChatGPT4, a state-of-the-art language model by OpenAI. By developing a museum guide powered by ChatGPT4, we aimed to address the challenges visitors face in navigating vast collections of artifacts and interpreting their significance. Leveraging the model’s natural-language-understanding and -generation capabilities, our guide offers personalized, informative, and engaging experiences. However, caution must be exercised as the generated information may lack scientific integrity and accuracy. To mitigate this, we propose incorporating human oversight and validation mechanisms. The subsequent sections present our own case study, detailing the design, architecture, and experimental evaluation of the museum guide system, highlighting its practical implementation and insights into the benefits and limitations of employing ChatGPT4 in the cultural heritage context.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43363068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Innovative Robotic Technologies and Artificial Intelligence in Pharmacy and Medicine: Paving the Way for the Future of Health Care—A Review 制药和医学领域的创新机器人技术和人工智能:为医疗保健的未来铺平道路——综述
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-30 DOI: 10.3390/bdcc7030147
M. Stasevych, V. Zvarych
The future of innovative robotic technologies and artificial intelligence (AI) in pharmacy and medicine is promising, with the potential to revolutionize various aspects of health care. These advances aim to increase efficiency, improve patient outcomes, and reduce costs while addressing pressing challenges such as personalized medicine and the need for more effective therapies. This review examines the major advances in robotics and AI in the pharmaceutical and medical fields, analyzing the advantages, obstacles, and potential implications for future health care. In addition, prominent organizations and research institutions leading the way in these technological advancements are highlighted, showcasing their pioneering efforts in creating and utilizing state-of-the-art robotic solutions in pharmacy and medicine. By thoroughly analyzing the current state of robotic technologies in health care and exploring the possibilities for further progress, this work aims to provide readers with a comprehensive understanding of the transformative power of robotics and AI in the evolution of the healthcare sector. Striking a balance between embracing technology and preserving the human touch, investing in R&D, and establishing regulatory frameworks within ethical guidelines will shape a future for robotics and AI systems. The future of pharmacy and medicine is in the seamless integration of robotics and AI systems to benefit patients and healthcare providers.
创新机器人技术和人工智能(AI)在制药和医学领域的未来是有希望的,有可能彻底改变医疗保健的各个方面。这些进展旨在提高效率、改善患者治疗效果和降低成本,同时应对个性化医疗和对更有效疗法的需求等紧迫挑战。本文综述了机器人技术和人工智能在制药和医疗领域的主要进展,分析了它们的优势、障碍以及对未来医疗保健的潜在影响。此外,在这些技术进步方面处于领先地位的杰出组织和研究机构也得到了强调,展示了他们在创造和利用最先进的制药和医学机器人解决方案方面的开创性努力。通过深入分析机器人技术在医疗保健领域的现状,并探索进一步发展的可能性,这项工作旨在让读者全面了解机器人和人工智能在医疗保健领域发展中的变革力量。在拥抱技术和保持人情味之间取得平衡,投资研发,在道德准则范围内建立监管框架,将塑造机器人和人工智能系统的未来。制药和医学的未来在于机器人和人工智能系统的无缝集成,以使患者和医疗保健提供者受益。
{"title":"Innovative Robotic Technologies and Artificial Intelligence in Pharmacy and Medicine: Paving the Way for the Future of Health Care—A Review","authors":"M. Stasevych, V. Zvarych","doi":"10.3390/bdcc7030147","DOIUrl":"https://doi.org/10.3390/bdcc7030147","url":null,"abstract":"The future of innovative robotic technologies and artificial intelligence (AI) in pharmacy and medicine is promising, with the potential to revolutionize various aspects of health care. These advances aim to increase efficiency, improve patient outcomes, and reduce costs while addressing pressing challenges such as personalized medicine and the need for more effective therapies. This review examines the major advances in robotics and AI in the pharmaceutical and medical fields, analyzing the advantages, obstacles, and potential implications for future health care. In addition, prominent organizations and research institutions leading the way in these technological advancements are highlighted, showcasing their pioneering efforts in creating and utilizing state-of-the-art robotic solutions in pharmacy and medicine. By thoroughly analyzing the current state of robotic technologies in health care and exploring the possibilities for further progress, this work aims to provide readers with a comprehensive understanding of the transformative power of robotics and AI in the evolution of the healthcare sector. Striking a balance between embracing technology and preserving the human touch, investing in R&D, and establishing regulatory frameworks within ethical guidelines will shape a future for robotics and AI systems. The future of pharmacy and medicine is in the seamless integration of robotics and AI systems to benefit patients and healthcare providers.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46567026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enhancing Speech Emotions Recognition Using Multivariate Functional Data Analysis 利用多元函数数据分析增强语音情感识别
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-25 DOI: 10.3390/bdcc7030146
Matthieu Saumard
Speech Emotions Recognition (SER) has gained significant attention in the fields of human–computer interaction and speech processing. In this article, we present a novel approach to improve SER performance by interpreting the Mel Frequency Cepstral Coefficients (MFCC) as a multivariate functional data object, which accelerates learning while maintaining high accuracy. To treat MFCCs as functional data, we preprocess them as images and apply resizing techniques. By representing MFCCs as functional data, we leverage the temporal dynamics of speech, capturing essential emotional cues more effectively. Consequently, this enhancement significantly contributes to the learning process of SER methods without compromising performance. Subsequently, we employ a supervised learning model, specifically a functional Support Vector Machine (SVM), directly on the MFCC represented as functional data. This enables the utilization of the full functional information, allowing for more accurate emotion recognition. The proposed approach is rigorously evaluated on two distinct databases, EMO-DB and IEMOCAP, serving as benchmarks for SER evaluation. Our method demonstrates competitive results in terms of accuracy, showcasing its effectiveness in emotion recognition. Furthermore, our approach significantly reduces the learning time, making it computationally efficient and practical for real-world applications. In conclusion, our novel approach of treating MFCCs as multivariate functional data objects exhibits superior performance in SER tasks, delivering both improved accuracy and substantial time savings during the learning process. This advancement holds great potential for enhancing human–computer interaction and enabling more sophisticated emotion-aware applications.
语音情绪识别(SER)在人机交互和语音处理领域受到广泛关注。在本文中,我们提出了一种通过将Mel频率倒谱系数(MFCC)解释为多元功能数据对象来提高SER性能的新方法,该方法在保持高精度的同时加速了学习。为了将mfc作为功能数据处理,我们将其预处理为图像并应用大小调整技术。通过将mfccc表示为功能数据,我们利用了语音的时间动态,更有效地捕获了基本的情感线索。因此,这种增强极大地促进了SER方法的学习过程,而不会影响性能。随后,我们采用监督学习模型,特别是功能支持向量机(SVM),直接对表示为功能数据的MFCC进行学习。这使得充分利用功能信息,允许更准确的情绪识别。所提出的方法在两个不同的数据库(EMO-DB和IEMOCAP)上进行了严格的评估,作为SER评估的基准。我们的方法在准确性方面显示出竞争力,展示了其在情绪识别方面的有效性。此外,我们的方法显著减少了学习时间,使其计算效率高,适用于实际应用。总之,我们将mfccc作为多元功能数据对象的新方法在SER任务中表现出卓越的性能,在学习过程中既提高了准确性,又节省了大量时间。这一进步在增强人机交互和实现更复杂的情感感知应用方面具有巨大的潜力。
{"title":"Enhancing Speech Emotions Recognition Using Multivariate Functional Data Analysis","authors":"Matthieu Saumard","doi":"10.3390/bdcc7030146","DOIUrl":"https://doi.org/10.3390/bdcc7030146","url":null,"abstract":"Speech Emotions Recognition (SER) has gained significant attention in the fields of human–computer interaction and speech processing. In this article, we present a novel approach to improve SER performance by interpreting the Mel Frequency Cepstral Coefficients (MFCC) as a multivariate functional data object, which accelerates learning while maintaining high accuracy. To treat MFCCs as functional data, we preprocess them as images and apply resizing techniques. By representing MFCCs as functional data, we leverage the temporal dynamics of speech, capturing essential emotional cues more effectively. Consequently, this enhancement significantly contributes to the learning process of SER methods without compromising performance. Subsequently, we employ a supervised learning model, specifically a functional Support Vector Machine (SVM), directly on the MFCC represented as functional data. This enables the utilization of the full functional information, allowing for more accurate emotion recognition. The proposed approach is rigorously evaluated on two distinct databases, EMO-DB and IEMOCAP, serving as benchmarks for SER evaluation. Our method demonstrates competitive results in terms of accuracy, showcasing its effectiveness in emotion recognition. Furthermore, our approach significantly reduces the learning time, making it computationally efficient and practical for real-world applications. In conclusion, our novel approach of treating MFCCs as multivariate functional data objects exhibits superior performance in SER tasks, delivering both improved accuracy and substantial time savings during the learning process. This advancement holds great potential for enhancing human–computer interaction and enabling more sophisticated emotion-aware applications.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48425737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Applied Digital Twin Concepts Contributing to Heat Transition in Building, Campus, Neighborhood, and Urban Scale 应用数字孪生概念促进建筑、校园、社区和城市规模的热传导
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-25 DOI: 10.3390/bdcc7030145
Ekaterina Lesnyak, Tabea Belkot, Johannes Hurka, Jan Philipp Hörding, L. Kuhlmann, Pavel Paulau, Marvin Schnabel, P. Schönfeldt, Jan Middelberg
The heat transition is a central pillar of the energy transition, aiming to decarbonize and improve the energy efficiency of the heat supply in both the private and industrial sectors. On the one hand, this is achieved by substituting fossil fuels with renewable energy. On the other hand, it involves reducing overall heat consumption and associated transmission and ventilation losses. In addition to refurbishment, digitalization contributes significantly. Despite substantial research on Digital Twins (DTs) for heat transition at different scales, a cross-scale perspective on heat optimization still needs to be developed. In response to this research gap, the present study examines four instances of applied DTs across various scales: building, campus, neighborhood, and urban. The study compares their objectives and conceptual frameworks while also identifying common challenges and potential synergies. The study’s findings indicate that all DT scales face similar data-related challenges, such as gathering, ownership, connectivity, and reliability. Also, hierarchical synergy is identified among the DTs, implying the need for collaboration and exchange. In response to this, the “Wärmewende” data platform, whose objectives and concepts are presented in the paper, promotes research data and knowledge exchange with internal and external stakeholders.
热转型是能源转型的核心支柱,旨在脱碳并提高私营和工业部门供热的能源效率。一方面,这是通过用可再生能源取代化石燃料来实现的。另一方面,它涉及到减少总热量消耗以及相关的传输和通风损失。除了翻新,数字化也起到了重要作用。尽管对用于不同尺度热转换的数字孪晶(DT)进行了大量研究,但热优化的跨尺度视角仍有待发展。为了应对这一研究空白,本研究考察了四个不同规模的应用DT实例:建筑、校园、社区和城市。该研究比较了它们的目标和概念框架,同时也确定了共同的挑战和潜在的协同作用。研究结果表明,所有DT量表都面临着类似的数据相关挑战,如收集、所有权、连接性和可靠性。此外,DTs之间确定了层次协同,这意味着需要合作和交流。为此,“Wärmewende”数据平台促进了与内部和外部利益相关者的研究数据和知识交流。
{"title":"Applied Digital Twin Concepts Contributing to Heat Transition in Building, Campus, Neighborhood, and Urban Scale","authors":"Ekaterina Lesnyak, Tabea Belkot, Johannes Hurka, Jan Philipp Hörding, L. Kuhlmann, Pavel Paulau, Marvin Schnabel, P. Schönfeldt, Jan Middelberg","doi":"10.3390/bdcc7030145","DOIUrl":"https://doi.org/10.3390/bdcc7030145","url":null,"abstract":"The heat transition is a central pillar of the energy transition, aiming to decarbonize and improve the energy efficiency of the heat supply in both the private and industrial sectors. On the one hand, this is achieved by substituting fossil fuels with renewable energy. On the other hand, it involves reducing overall heat consumption and associated transmission and ventilation losses. In addition to refurbishment, digitalization contributes significantly. Despite substantial research on Digital Twins (DTs) for heat transition at different scales, a cross-scale perspective on heat optimization still needs to be developed. In response to this research gap, the present study examines four instances of applied DTs across various scales: building, campus, neighborhood, and urban. The study compares their objectives and conceptual frameworks while also identifying common challenges and potential synergies. The study’s findings indicate that all DT scales face similar data-related challenges, such as gathering, ownership, connectivity, and reliability. Also, hierarchical synergy is identified among the DTs, implying the need for collaboration and exchange. In response to this, the “Wärmewende” data platform, whose objectives and concepts are presented in the paper, promotes research data and knowledge exchange with internal and external stakeholders.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45613028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing the Early Detection of Chronic Kidney Disease: A Robust Machine Learning Model 增强慢性肾脏疾病的早期检测:一个鲁棒的机器学习模型
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-16 DOI: 10.3390/bdcc7030144
M. Arif, A. Mukheimer, Daniyal Asif
Clinical decision-making in chronic disorder prognosis is often hampered by high variance, leading to uncertainty and negative outcomes, especially in cases such as chronic kidney disease (CKD). Machine learning (ML) techniques have emerged as valuable tools for reducing randomness and enhancing clinical decision-making. However, conventional methods for CKD detection often lack accuracy due to their reliance on limited sets of biological attributes. This research proposes a novel ML model for predicting CKD, incorporating various preprocessing steps, feature selection, a hyperparameter optimization technique, and ML algorithms. To address challenges in medical datasets, we employ iterative imputation for missing values and a novel sequential approach for data scaling, combining robust scaling, z-standardization, and min-max scaling. Feature selection is performed using the Boruta algorithm, and the model is developed using ML algorithms. The proposed model was validated on the UCI CKD dataset, achieving outstanding performance with 100% accuracy. Our approach, combining innovative preprocessing steps, the Boruta feature selection, and the k-nearest neighbors algorithm, along with a hyperparameter optimization using grid-search cross-validation (CV), demonstrates its effectiveness in enhancing the early detection of CKD. This research highlights the potential of ML techniques in improving clinical support systems and reducing the impact of uncertainty in chronic disorder prognosis.
慢性疾病预后的临床决策往往受到高方差的阻碍,导致不确定性和负面结果,特别是在慢性肾脏疾病(CKD)等病例中。机器学习(ML)技术已经成为减少随机性和增强临床决策的宝贵工具。然而,传统的CKD检测方法往往缺乏准确性,因为它们依赖于有限的生物学属性集。本研究提出了一种用于预测CKD的新型ML模型,该模型结合了各种预处理步骤、特征选择、超参数优化技术和ML算法。为了解决医疗数据集中的挑战,我们采用了缺失值的迭代插值和一种新的数据缩放顺序方法,结合了鲁棒缩放、z-标准化和最小-最大缩放。使用Boruta算法进行特征选择,使用ML算法开发模型。该模型在UCI CKD数据集上进行了验证,准确率达到100%。我们的方法结合了创新的预处理步骤、Boruta特征选择和k近邻算法,以及使用网格搜索交叉验证(CV)的超参数优化,证明了其在增强CKD早期检测方面的有效性。这项研究强调了ML技术在改善临床支持系统和减少慢性疾病预后不确定性影响方面的潜力。
{"title":"Enhancing the Early Detection of Chronic Kidney Disease: A Robust Machine Learning Model","authors":"M. Arif, A. Mukheimer, Daniyal Asif","doi":"10.3390/bdcc7030144","DOIUrl":"https://doi.org/10.3390/bdcc7030144","url":null,"abstract":"Clinical decision-making in chronic disorder prognosis is often hampered by high variance, leading to uncertainty and negative outcomes, especially in cases such as chronic kidney disease (CKD). Machine learning (ML) techniques have emerged as valuable tools for reducing randomness and enhancing clinical decision-making. However, conventional methods for CKD detection often lack accuracy due to their reliance on limited sets of biological attributes. This research proposes a novel ML model for predicting CKD, incorporating various preprocessing steps, feature selection, a hyperparameter optimization technique, and ML algorithms. To address challenges in medical datasets, we employ iterative imputation for missing values and a novel sequential approach for data scaling, combining robust scaling, z-standardization, and min-max scaling. Feature selection is performed using the Boruta algorithm, and the model is developed using ML algorithms. The proposed model was validated on the UCI CKD dataset, achieving outstanding performance with 100% accuracy. Our approach, combining innovative preprocessing steps, the Boruta feature selection, and the k-nearest neighbors algorithm, along with a hyperparameter optimization using grid-search cross-validation (CV), demonstrates its effectiveness in enhancing the early detection of CKD. This research highlights the potential of ML techniques in improving clinical support systems and reducing the impact of uncertainty in chronic disorder prognosis.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48683149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Ransomware Detection Using Machine Learning: A Survey 利用机器学习进行勒索软件检测:综述
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-16 DOI: 10.3390/bdcc7030143
Amjad Alraizza, Abdulmohsen Algarni
Ransomware attacks pose significant security threats to personal and corporate data and information. The owners of computer-based resources suffer from verification and privacy violations, monetary losses, and reputational damage due to successful ransomware assaults. As a result, it is critical to accurately and swiftly identify ransomware. Numerous methods have been proposed for identifying ransomware, each with its own advantages and disadvantages. The main objective of this research is to discuss current trends in and potential future debates on automated ransomware detection. This document includes an overview of ransomware, a timeline of assaults, and details on their background. It also provides comprehensive research on existing methods for identifying, avoiding, minimizing, and recovering from ransomware attacks. An analysis of studies between 2017 and 2022 is another advantage of this research. This provides readers with up-to-date knowledge of the most recent developments in ransomware detection and highlights advancements in methods for combating ransomware attacks. In conclusion, this research highlights unanswered concerns and potential research challenges in ransomware detection.
勒索软件攻击对个人和企业数据和信息构成重大安全威胁。计算机资源的所有者因成功的勒索软件攻击而遭受验证和隐私侵犯、金钱损失和声誉损害。因此,准确、快速地识别勒索软件至关重要。已经提出了许多识别勒索软件的方法,每种方法都有自己的优点和缺点。本研究的主要目的是讨论自动勒索软件检测的当前趋势和未来可能的争论。这份文件包括勒索软件的概述、攻击的时间表以及背景细节。它还对识别、避免、最小化和从勒索软件攻击中恢复的现有方法进行了全面研究。对2017年至2022年期间的研究进行分析是这项研究的另一个优势。这为读者提供了勒索软件检测最新发展的最新知识,并突出了打击勒索软件攻击方法的进步。总之,这项研究突出了勒索软件检测中尚未解决的问题和潜在的研究挑战。
{"title":"Ransomware Detection Using Machine Learning: A Survey","authors":"Amjad Alraizza, Abdulmohsen Algarni","doi":"10.3390/bdcc7030143","DOIUrl":"https://doi.org/10.3390/bdcc7030143","url":null,"abstract":"Ransomware attacks pose significant security threats to personal and corporate data and information. The owners of computer-based resources suffer from verification and privacy violations, monetary losses, and reputational damage due to successful ransomware assaults. As a result, it is critical to accurately and swiftly identify ransomware. Numerous methods have been proposed for identifying ransomware, each with its own advantages and disadvantages. The main objective of this research is to discuss current trends in and potential future debates on automated ransomware detection. This document includes an overview of ransomware, a timeline of assaults, and details on their background. It also provides comprehensive research on existing methods for identifying, avoiding, minimizing, and recovering from ransomware attacks. An analysis of studies between 2017 and 2022 is another advantage of this research. This provides readers with up-to-date knowledge of the most recent developments in ransomware detection and highlights advancements in methods for combating ransomware attacks. In conclusion, this research highlights unanswered concerns and potential research challenges in ransomware detection.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43798277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Breast Cancer Classification Using Concatenated Triple Convolutional Neural Networks Model 基于级联三重卷积神经网络模型的癌症乳腺分类
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-16 DOI: 10.3390/bdcc7030142
M. Alshayeji, Jassim Al-Buloushi
Improved disease prediction accuracy and reliability are the main concerns in the development of models for the medical field. This study examined methods for increasing classification accuracy and proposed a precise and reliable framework for categorizing breast cancers using mammography scans. Concatenated Convolutional Neural Networks (CNN) were developed based on three models: Two by transfer learning and one entirely from scratch. Misclassification of lesions from mammography images can also be reduced using this approach. Bayesian optimization performs hyperparameter tuning of the layers, and data augmentation will refine the model by using more training samples. Analysis of the model’s accuracy revealed that it can accurately predict disease with 97.26% accuracy in binary cases and 99.13% accuracy in multi-classification cases. These findings are in contrast with recent studies on the same issue using the same dataset and demonstrated a 16% increase in multi-classification accuracy. In addition, an accuracy improvement of 6.4% was achieved after hyperparameter modification and augmentation. Thus, the model tested in this study was deemed superior to those presented in the extant literature. Hence, the concatenation of three different CNNs from scratch and transfer learning allows the extraction of distinct and significant features without leaving them out, enabling the model to make exact diagnoses.
提高疾病预测的准确性和可靠性是医学领域模型发展的主要问题。本研究探讨了提高分类准确性的方法,并提出了一个精确可靠的框架,用于使用乳房x线摄影扫描对乳腺癌进行分类。串联卷积神经网络(CNN)是基于三个模型开发的:两个是通过迁移学习开发的,另一个是完全从零开始开发的。使用这种方法也可以减少乳房x线摄影图像中病变的错误分类。贝叶斯优化执行层的超参数调优,数据增强将通过使用更多的训练样本来改进模型。对模型的准确率分析表明,该模型对二元病例的准确率为97.26%,对多分类病例的准确率为99.13%。这些发现与最近使用相同数据集对同一问题进行的研究形成对比,并证明多重分类准确率提高了16%。此外,经过超参数修正和增强后,精度提高了6.4%。因此,本研究中测试的模型被认为优于现有文献中提出的模型。因此,从头开始连接三个不同的cnn并进行迁移学习,可以提取出不同且重要的特征,而不会遗漏它们,从而使模型能够做出准确的诊断。
{"title":"Breast Cancer Classification Using Concatenated Triple Convolutional Neural Networks Model","authors":"M. Alshayeji, Jassim Al-Buloushi","doi":"10.3390/bdcc7030142","DOIUrl":"https://doi.org/10.3390/bdcc7030142","url":null,"abstract":"Improved disease prediction accuracy and reliability are the main concerns in the development of models for the medical field. This study examined methods for increasing classification accuracy and proposed a precise and reliable framework for categorizing breast cancers using mammography scans. Concatenated Convolutional Neural Networks (CNN) were developed based on three models: Two by transfer learning and one entirely from scratch. Misclassification of lesions from mammography images can also be reduced using this approach. Bayesian optimization performs hyperparameter tuning of the layers, and data augmentation will refine the model by using more training samples. Analysis of the model’s accuracy revealed that it can accurately predict disease with 97.26% accuracy in binary cases and 99.13% accuracy in multi-classification cases. These findings are in contrast with recent studies on the same issue using the same dataset and demonstrated a 16% increase in multi-classification accuracy. In addition, an accuracy improvement of 6.4% was achieved after hyperparameter modification and augmentation. Thus, the model tested in this study was deemed superior to those presented in the extant literature. Hence, the concatenation of three different CNNs from scratch and transfer learning allows the extraction of distinct and significant features without leaving them out, enabling the model to make exact diagnoses.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42639297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hadiths Classification Using a Novel Author-Based Hadith Classification Dataset (ABCD) 基于作者的圣训分类数据集(ABCD)
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-14 DOI: 10.3390/bdcc7030141
A. Ramzy, Marwan Torki, Mohamed Abdeen, O. Saif, Mustafa ElNainay, AbdAllah Alshanqiti, E. Nabil
Religious studies are a rich land for Natural Language Processing (NLP). The reason is that all religions have their instructions as written texts. In this paper, we apply NLP to Islamic Hadiths, which are the written traditions, sayings, actions, approvals, and discussions of the Prophet Muhammad, his companions, or his followers. A Hadith is composed of two parts: the chain of narrators (Sanad) and the content of the Hadith (Matn). A Hadith is transmitted from its author to a Hadith book author using a chain of narrators. The problem we solve focuses on the classification of Hadiths based on their origin of narration. This is important for several reasons. First, it helps determine the authenticity and reliability of the Hadiths. Second, it helps trace the chain of narration and identify the narrators involved in transmitting Hadiths. Finally, it helps understand the historical and cultural contexts in which Hadiths were transmitted, and the different levels of authority attributed to the narrators. To the best of our knowledge, and based on our literature review, this problem is not solved before using machine/deep learning approaches. To solve this classification problem, we created a novel Author-Based Hadith Classification Dataset (ABCD) collected from classical Hadiths’ books. The ABCD size is 29 K Hadiths and it contains unique 18 K narrators, with all their information. We applied machine learning (ML), and deep learning (DL) approaches. ML was applied on Sanad and Matn separately; then, we did the same with DL. The results revealed that ML performs better than DL using the Matn input data, with a 77% F1-score. DL performed better than ML using the Sanad input data, with a 92% F1-score. We used precision and recall alongside the F1-score; details of the results are explained at the end of the paper. We claim that the ABCD and the reported results will motivate the community to work in this new area. Our dataset and results will represent a baseline for further research on the same problem.
宗教研究是自然语言处理的丰富领域。原因是所有宗教都有书面的指示。在本文中,我们将NLP应用于伊斯兰圣训,这是先知穆罕默德、他的同伴或他的追随者的书面传统、言论、行动、认可和讨论。圣训由两部分组成:讲述者链(Sanad)和圣训内容(Matn)。圣训是通过一连串的叙述者从作者传给圣训书的作者。我们所解决的问题集中在根据圣训的叙述起源对其进行分类上。这一点之所以重要,有几个原因。首先,它有助于确定圣训的真实性和可靠性。其次,它有助于追踪叙事链,并识别参与传播圣训的叙事者。最后,它有助于理解圣训传播的历史和文化背景,以及讲述者的不同权威级别。据我们所知,根据我们的文献综述,在使用机器/深度学习方法之前,这个问题并没有得到解决。为了解决这个分类问题,我们创建了一个新颖的基于作者的圣训分类数据集(ABCD),该数据集收集自经典的圣训书籍。ABCD的大小是29K圣训,它包含唯一的18K叙述者,以及他们的所有信息。我们应用了机器学习(ML)和深度学习(DL)方法。ML分别应用于Sanad和Matn;然后,我们对DL做了同样的处理。结果显示,使用Matn输入数据,ML比DL表现更好,F1得分为77%。使用Sanad输入数据,DL的表现优于ML,F1得分为92%。我们在F1成绩的同时使用了准确性和召回率;文末对结果进行了详细说明。我们声称ABCD和报告的结果将激励社区在这一新领域开展工作。我们的数据集和结果将代表对同一问题进行进一步研究的基线。
{"title":"Hadiths Classification Using a Novel Author-Based Hadith Classification Dataset (ABCD)","authors":"A. Ramzy, Marwan Torki, Mohamed Abdeen, O. Saif, Mustafa ElNainay, AbdAllah Alshanqiti, E. Nabil","doi":"10.3390/bdcc7030141","DOIUrl":"https://doi.org/10.3390/bdcc7030141","url":null,"abstract":"Religious studies are a rich land for Natural Language Processing (NLP). The reason is that all religions have their instructions as written texts. In this paper, we apply NLP to Islamic Hadiths, which are the written traditions, sayings, actions, approvals, and discussions of the Prophet Muhammad, his companions, or his followers. A Hadith is composed of two parts: the chain of narrators (Sanad) and the content of the Hadith (Matn). A Hadith is transmitted from its author to a Hadith book author using a chain of narrators. The problem we solve focuses on the classification of Hadiths based on their origin of narration. This is important for several reasons. First, it helps determine the authenticity and reliability of the Hadiths. Second, it helps trace the chain of narration and identify the narrators involved in transmitting Hadiths. Finally, it helps understand the historical and cultural contexts in which Hadiths were transmitted, and the different levels of authority attributed to the narrators. To the best of our knowledge, and based on our literature review, this problem is not solved before using machine/deep learning approaches. To solve this classification problem, we created a novel Author-Based Hadith Classification Dataset (ABCD) collected from classical Hadiths’ books. The ABCD size is 29 K Hadiths and it contains unique 18 K narrators, with all their information. We applied machine learning (ML), and deep learning (DL) approaches. ML was applied on Sanad and Matn separately; then, we did the same with DL. The results revealed that ML performs better than DL using the Matn input data, with a 77% F1-score. DL performed better than ML using the Sanad input data, with a 92% F1-score. We used precision and recall alongside the F1-score; details of the results are explained at the end of the paper. We claim that the ABCD and the reported results will motivate the community to work in this new area. Our dataset and results will represent a baseline for further research on the same problem.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47848493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Intelligent Bat Algorithm for Web Service Selection with QoS Uncertainty 具有QoS不确定性的Web服务选择智能Bat算法
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-10 DOI: 10.3390/bdcc7030140
Abdelhak Etchiali, Fethallah Hadjila, Amina Bekkouche
Currently, the selection of web services with an uncertain quality of service (QoS) is gaining much attention in the service-oriented computing paradigm (SOC). In fact, searching for a service composition that fulfills a complex user’s request is known to be NP-complete. The search time is mainly dependent on the number of requested tasks, the size of the available services, and the size of the QoS realizations (i.e., sample size). To handle this problem, we propose a two-stage approach that reduces the search space using heuristics for ranking the task services and a bat algorithm metaheuristic for selecting the final near-optimal compositions. The fitness used by the metaheuristic aims to fulfil all the global constraints of the user. The experimental study showed that the ranking heuristics, termed “fuzzy Pareto dominance” and “Zero-order stochastic dominance”, are highly effective compared to the other heuristics and most of the existing state-of-the-art methods.
目前,在面向服务的计算范式(SOC)中,选择具有不确定服务质量(QoS)的web服务越来越受到关注。事实上,搜索满足复杂用户请求的服务组合是NP完全的。搜索时间主要取决于请求的任务数量、可用服务的大小和QoS实现的大小(即样本大小)。为了解决这个问题,我们提出了一种两阶段的方法,该方法使用启发式方法对任务服务进行排名,并使用bat算法元启发式方法来选择最终的接近最优的组合,从而减少搜索空间。元启发式所使用的适应度旨在满足用户的所有全局约束。实验研究表明,与其他启发式方法和大多数现有的最先进方法相比,被称为“模糊Pareto优势”和“零阶随机优势”的排序启发式方法是非常有效的。
{"title":"An Intelligent Bat Algorithm for Web Service Selection with QoS Uncertainty","authors":"Abdelhak Etchiali, Fethallah Hadjila, Amina Bekkouche","doi":"10.3390/bdcc7030140","DOIUrl":"https://doi.org/10.3390/bdcc7030140","url":null,"abstract":"Currently, the selection of web services with an uncertain quality of service (QoS) is gaining much attention in the service-oriented computing paradigm (SOC). In fact, searching for a service composition that fulfills a complex user’s request is known to be NP-complete. The search time is mainly dependent on the number of requested tasks, the size of the available services, and the size of the QoS realizations (i.e., sample size). To handle this problem, we propose a two-stage approach that reduces the search space using heuristics for ranking the task services and a bat algorithm metaheuristic for selecting the final near-optimal compositions. The fitness used by the metaheuristic aims to fulfil all the global constraints of the user. The experimental study showed that the ranking heuristics, termed “fuzzy Pareto dominance” and “Zero-order stochastic dominance”, are highly effective compared to the other heuristics and most of the existing state-of-the-art methods.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46069088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Executable Digital Process Twins: Towards the Enhancement of Process-Driven Systems 可执行数字过程双胞胎:迈向过程驱动系统的增强
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-08 DOI: 10.3390/bdcc7030139
F. Corradini, Sara Pettinari, B. Re, Lorenzo Rossi, F. Tiezzi
The development of process-driven systems and the advancements in digital twins have led to the birth of new ways of monitoring and analyzing systems, i.e., digital process twins. Specifically, a digital process twin can allow the monitoring of system behavior and the analysis of the execution status to improve the whole system. However, the concept of the digital process twin is still theoretical, and process-driven systems cannot really benefit from them. In this regard, this work discusses how to effectively exploit a digital process twin and proposes an implementation that combines the monitoring, refinement, and enactment of system behavior. We demonstrated the proposed solution in a multi-robot scenario.
过程驱动系统的发展和数字孪生的进步导致了监测和分析系统的新方法的诞生,即数字过程孪生。具体来说,数字流程双胞胎可以允许监控系统行为和分析执行状态,以改进整个系统。然而,数字过程孪生的概念仍然是理论上的,过程驱动的系统不能真正从中受益。在这方面,这项工作讨论了如何有效地利用数字过程双胞胎,并提出了一种将系统行为的监控、细化和制定相结合的实现方式。我们在多机器人场景中演示了所提出的解决方案。
{"title":"Executable Digital Process Twins: Towards the Enhancement of Process-Driven Systems","authors":"F. Corradini, Sara Pettinari, B. Re, Lorenzo Rossi, F. Tiezzi","doi":"10.3390/bdcc7030139","DOIUrl":"https://doi.org/10.3390/bdcc7030139","url":null,"abstract":"The development of process-driven systems and the advancements in digital twins have led to the birth of new ways of monitoring and analyzing systems, i.e., digital process twins. Specifically, a digital process twin can allow the monitoring of system behavior and the analysis of the execution status to improve the whole system. However, the concept of the digital process twin is still theoretical, and process-driven systems cannot really benefit from them. In this regard, this work discusses how to effectively exploit a digital process twin and proposes an implementation that combines the monitoring, refinement, and enactment of system behavior. We demonstrated the proposed solution in a multi-robot scenario.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48919845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Big Data and Cognitive Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1