首页 > 最新文献

Future Internet最新文献

英文 中文
kClusterHub: An AutoML-Driven Tool for Effortless Partition-Based Clustering over Varied Data Types kClusterHub:一个自动驱动的工具,用于在各种数据类型上轻松地基于分区进行聚类
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-18 DOI: 10.3390/fi15100341
Konstantinos Gratsos , Stefanos Ougiaroglou , Dionisis Margaris 
Partition-based clustering is widely applied over diverse domains. Researchers and practitioners from various scientific disciplines engage with partition-based algorithms relying on specialized software or programming libraries. Addressing the need to bridge the knowledge gap associated with these tools, this paper introduces kClusterHub, an AutoML-driven web tool that simplifies the execution of partition-based clustering over numerical, categorical and mixed data types, while facilitating the identification of the optimal number of clusters, using the elbow method. Through automatic feature analysis, kClusterHub selects the most appropriate algorithm from the trio of k-means, k-modes, and k-prototypes. By empowering users to seamlessly upload datasets and select features, kClusterHub selects the algorithm, provides the elbow graph, recommends the optimal number of clusters, executes clustering, and presents the cluster assignment, through tabular representations and exploratory plots. Therefore, kClusterHub reduces the need for specialized software and programming skills, making clustering more accessible to non-experts. For further enhancing its utility, kClusterHub integrates a REST API to support the programmatic execution of cluster analysis. The paper concludes with an evaluation of kClusterHub’s usability via the System Usability Scale and CPU performance experiments. The results emerge that kClusterHub is a streamlined, efficient and user-friendly AutoML-inspired tool for cluster analysis.
基于分区的聚类广泛应用于各个领域。来自不同科学学科的研究人员和实践者依靠专门的软件或编程库从事基于分区的算法。为了解决与这些工具相关的知识鸿沟的需要,本文介绍了kClusterHub,这是一个自动驱动的web工具,它简化了基于分区的数字、分类和混合数据类型的聚类的执行,同时使用肘形方法促进了簇的最佳数量的识别。通过自动特征分析,kClusterHub从k-means、k-modes和k-prototype中选择最合适的算法。通过使用户能够无缝地上传数据集和选择特征,kClusterHub选择算法,提供肘形图,推荐最优集群数量,执行集群,并通过表格表示和探索图表示集群分配。因此,kClusterHub减少了对专门软件和编程技能的需求,使非专家更容易使用集群。为了进一步增强其实用性,kClusterHub集成了一个REST API来支持集群分析的程序化执行。本文最后通过系统可用性量表和CPU性能实验对kClusterHub的可用性进行了评估。结果表明,kClusterHub是一个简化的、高效的、用户友好的、受automl启发的聚类分析工具。
{"title":"kClusterHub: An AutoML-Driven Tool for Effortless Partition-Based Clustering over Varied Data Types","authors":"Konstantinos Gratsos , Stefanos Ougiaroglou , Dionisis Margaris ","doi":"10.3390/fi15100341","DOIUrl":"https://doi.org/10.3390/fi15100341","url":null,"abstract":"Partition-based clustering is widely applied over diverse domains. Researchers and practitioners from various scientific disciplines engage with partition-based algorithms relying on specialized software or programming libraries. Addressing the need to bridge the knowledge gap associated with these tools, this paper introduces kClusterHub, an AutoML-driven web tool that simplifies the execution of partition-based clustering over numerical, categorical and mixed data types, while facilitating the identification of the optimal number of clusters, using the elbow method. Through automatic feature analysis, kClusterHub selects the most appropriate algorithm from the trio of k-means, k-modes, and k-prototypes. By empowering users to seamlessly upload datasets and select features, kClusterHub selects the algorithm, provides the elbow graph, recommends the optimal number of clusters, executes clustering, and presents the cluster assignment, through tabular representations and exploratory plots. Therefore, kClusterHub reduces the need for specialized software and programming skills, making clustering more accessible to non-experts. For further enhancing its utility, kClusterHub integrates a REST API to support the programmatic execution of cluster analysis. The paper concludes with an evaluation of kClusterHub’s usability via the System Usability Scale and CPU performance experiments. The results emerge that kClusterHub is a streamlined, efficient and user-friendly AutoML-inspired tool for cluster analysis.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135823219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flying Watchdog-Based Guard Patrol with Check Point Data Verification 基于飞行看门狗的警卫巡逻与检查站数据验证
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-16 DOI: 10.3390/fi15100340
Endrowednes Kuantama, Avishkar Seth, Alice James, Yihao Zhang
The effectiveness of human security-based guard patrol systems often faces challenges related to the consistency of perimeter checks regarding timing and patterns. Some solutions use autonomous drones for monitoring assistance but primarily optimize their camera-based object detection capabilities for favorable lighting conditions. This research introduces an innovative approach to address these limitations—a flying watchdog designed to augment patrol operations with predetermined flight patterns, enabling checkpoint identification and position verification through vision-based methods. The system has a laser-based data transmitter to relay real-time location and timing information to a receiver. The proposed system consists of drone and ground checkpoints with distinctive shapes and colored lights, further enhanced by solar panels serving as laser data receivers. The result demonstrates the drone’s ability to detect four white dot LEDs with square configurations at distances ranging from 18 to 20 m, even under deficient light conditions based on the OpenCV detection algorithm. Notably, the study underscores the significance of achieving an even distribution of light shapes to mitigate light scattering effects on readings while also confirming that ambient light levels up to a maximum of 390 Lux have no adverse impact on the performance of the sensing device.
基于人类安全的警卫巡逻系统的有效性经常面临与周界检查时间和模式的一致性有关的挑战。一些解决方案使用自主无人机进行监控辅助,但主要是优化基于摄像头的物体检测能力,以适应有利的照明条件。这项研究引入了一种创新的方法来解决这些限制——一种飞行看门狗,旨在通过预定的飞行模式来增强巡逻行动,通过基于视觉的方法实现检查站识别和位置验证。该系统有一个基于激光的数据发射器,将实时位置和定时信息中继到接收器。该系统由无人机和地面检查点组成,具有独特的形状和彩色灯光,并由太阳能电池板作为激光数据接收器进一步增强。结果表明,即使在基于OpenCV检测算法的弱光条件下,无人机也能够在18至20米的距离内检测到四个方形配置的白点led。值得注意的是,该研究强调了实现光形状均匀分布的重要性,以减轻光散射对读数的影响,同时也确认了环境光水平最高可达390勒克斯,对传感设备的性能没有不利影响。
{"title":"Flying Watchdog-Based Guard Patrol with Check Point Data Verification","authors":"Endrowednes Kuantama, Avishkar Seth, Alice James, Yihao Zhang","doi":"10.3390/fi15100340","DOIUrl":"https://doi.org/10.3390/fi15100340","url":null,"abstract":"The effectiveness of human security-based guard patrol systems often faces challenges related to the consistency of perimeter checks regarding timing and patterns. Some solutions use autonomous drones for monitoring assistance but primarily optimize their camera-based object detection capabilities for favorable lighting conditions. This research introduces an innovative approach to address these limitations—a flying watchdog designed to augment patrol operations with predetermined flight patterns, enabling checkpoint identification and position verification through vision-based methods. The system has a laser-based data transmitter to relay real-time location and timing information to a receiver. The proposed system consists of drone and ground checkpoints with distinctive shapes and colored lights, further enhanced by solar panels serving as laser data receivers. The result demonstrates the drone’s ability to detect four white dot LEDs with square configurations at distances ranging from 18 to 20 m, even under deficient light conditions based on the OpenCV detection algorithm. Notably, the study underscores the significance of achieving an even distribution of light shapes to mitigate light scattering effects on readings while also confirming that ambient light levels up to a maximum of 390 Lux have no adverse impact on the performance of the sensing device.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136115552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement Learning Approach for Adaptive C-V2X Resource Management 自适应C-V2X资源管理的强化学习方法
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-15 DOI: 10.3390/fi15100339
Teguh Indra Bayu, Yung-Fa Huang, Jeang-Kuo Chen
The modulation coding scheme (MCS) index is the essential configuration parameter in cellular vehicle-to-everything (C-V2X) communication. As referenced by the 3rd Generation Partnership Project (3GPP), the MCS index will dictate the transport block size (TBS) index, which will affect the size of transport blocks and the number of physical resource blocks. These numbers are crucial in the C-V2X resource management since it is also bound to the transmission power used in the system. To the authors’ knowledge, this particular area of research has not been previously investigated. Ultimately, this research establishes the fundamental principles for future studies seeking to use the MCS adaptability in many contexts. In this work, we proposed the application of the reinforcement learning (RL) algorithm, as we used the Q-learning approach to adaptively change the MCS index according to the current environmental states. The simulation results showed that our proposed RL approach outperformed the static MCS index and was able to attain stability in a short number of events.
调制编码方案(MCS)索引是蜂窝车联网(C-V2X)通信中必不可少的配置参数。根据第三代合作伙伴计划(3GPP), MCS索引将决定传输块大小(TBS)索引,这将影响传输块的大小和物理资源块的数量。这些数字在C-V2X资源管理中至关重要,因为它也与系统中使用的传输功率有关。据作者所知,这一特定领域的研究以前没有被调查过。最后,本研究为未来在多种情境下使用MCS适应性的研究奠定了基本原则。在这项工作中,我们提出了强化学习(RL)算法的应用,因为我们使用q -学习方法根据当前环境状态自适应地改变MCS指数。仿真结果表明,我们提出的RL方法优于静态MCS指数,并且能够在短数量的事件中获得稳定性。
{"title":"Reinforcement Learning Approach for Adaptive C-V2X Resource Management","authors":"Teguh Indra Bayu, Yung-Fa Huang, Jeang-Kuo Chen","doi":"10.3390/fi15100339","DOIUrl":"https://doi.org/10.3390/fi15100339","url":null,"abstract":"The modulation coding scheme (MCS) index is the essential configuration parameter in cellular vehicle-to-everything (C-V2X) communication. As referenced by the 3rd Generation Partnership Project (3GPP), the MCS index will dictate the transport block size (TBS) index, which will affect the size of transport blocks and the number of physical resource blocks. These numbers are crucial in the C-V2X resource management since it is also bound to the transmission power used in the system. To the authors’ knowledge, this particular area of research has not been previously investigated. Ultimately, this research establishes the fundamental principles for future studies seeking to use the MCS adaptability in many contexts. In this work, we proposed the application of the reinforcement learning (RL) algorithm, as we used the Q-learning approach to adaptively change the MCS index according to the current environmental states. The simulation results showed that our proposed RL approach outperformed the static MCS index and was able to attain stability in a short number of events.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136185861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Financial Data Quality Evaluation Method Based on Multiple Linear Regression 基于多元线性回归的财务数据质量评价方法
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-14 DOI: 10.3390/fi15100338
Meng Li, Jiqiang Liu, Yeping Yang
With the rapid growth of customer data in financial institutions, such as trusts, issues of data quality have become increasingly prominent. The main challenge lies in constructing an effective evaluation method that ensures accurate and efficient assessment of customer data quality when dealing with massive customer data. In this paper, we construct a data quality evaluation index system based on the analytic hierarchy process through a comprehensive investigation of existing research on data quality. Then, redundant features are filtered based on the Shapley value, and the multiple linear regression model is employed to adjust the weight of different indices. Finally, a case study of the customer and institution information of a trust institution is conducted. The results demonstrate that the utilization of completeness, accuracy, timeliness, consistency, uniqueness, and compliance to establish a quality evaluation index system proves instrumental in conducting extensive and in-depth research on data quality measurement dimensions. Additionally, the data quality evaluation approach based on multiple linear regression facilitates the batch scoring of data, and the incorporation of the Shapley value facilitates the elimination of invalid features. This enables the intelligent evaluation of large-scale data quality for financial data.
随着信托等金融机构客户数据的快速增长,数据质量问题日益突出。在处理海量客户数据时,如何构建一种有效的评估方法,确保对客户数据质量进行准确、高效的评估,是当前客户数据管理面临的主要挑战。本文在全面调研现有数据质量研究的基础上,构建了基于层次分析法的数据质量评价指标体系。然后,根据Shapley值对冗余特征进行过滤,并采用多元线性回归模型调整不同指标的权重。最后,以某信托机构的客户信息和机构信息为例进行了研究。结果表明,利用完备性、准确性、时效性、一致性、唯一性、遵从性等指标体系构建数据质量评价指标体系,有助于对数据质量度量维度进行广泛而深入的研究。此外,基于多元线性回归的数据质量评价方法便于对数据进行批量评分,Shapley值的引入便于剔除无效特征。这使得对金融数据的大规模数据质量的智能评估成为可能。
{"title":"Financial Data Quality Evaluation Method Based on Multiple Linear Regression","authors":"Meng Li, Jiqiang Liu, Yeping Yang","doi":"10.3390/fi15100338","DOIUrl":"https://doi.org/10.3390/fi15100338","url":null,"abstract":"With the rapid growth of customer data in financial institutions, such as trusts, issues of data quality have become increasingly prominent. The main challenge lies in constructing an effective evaluation method that ensures accurate and efficient assessment of customer data quality when dealing with massive customer data. In this paper, we construct a data quality evaluation index system based on the analytic hierarchy process through a comprehensive investigation of existing research on data quality. Then, redundant features are filtered based on the Shapley value, and the multiple linear regression model is employed to adjust the weight of different indices. Finally, a case study of the customer and institution information of a trust institution is conducted. The results demonstrate that the utilization of completeness, accuracy, timeliness, consistency, uniqueness, and compliance to establish a quality evaluation index system proves instrumental in conducting extensive and in-depth research on data quality measurement dimensions. Additionally, the data quality evaluation approach based on multiple linear regression facilitates the batch scoring of data, and the incorporation of the Shapley value facilitates the elimination of invalid features. This enables the intelligent evaluation of large-scale data quality for financial data.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135766610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Edge-Computing-Based People-Counting System for Elevators Using MobileNet–Single-Stage Object Detection 基于mobilenet -单阶段目标检测的边缘计算电梯人员计数系统
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-14 DOI: 10.3390/fi15100337
Tsu-Chuan Shen, Edward T.-H. Chu
Existing elevator systems lack the ability to display the number of people waiting on each floor and inside the elevator. This causes an inconvenience as users cannot tell if they should wait or seek alternatives, leading to unnecessary time wastage. In this work, we adopted edge computing by running the MobileNet–Single-Stage Object Detection (SSD) algorithm on edge devices to recognize the number of people inside an elevator and waiting on each floor. To ensure the accuracy of people counting, we fine-tuned the SSD parameters, such as the recognition frequency and confidence thresholds, and utilized the line of interest (LOI) counting strategy for people counting. In our experiment, we deployed four NVIDIA Jetson Nano boards in a four-floor building as edge devices to count people when they entered specific areas. The counting results, such as the number of people waiting on each floor and inside the elevator, were provided to users through a web app. Our experimental results demonstrate that the proposed method achieved an average accuracy of 85% for people counting. Furthermore, when comparing it to sending all images back to a remote server for people counting, the execution time required for edge computing was shorter, without compromising the accuracy significantly.
现有的电梯系统缺乏显示每层楼和电梯内等待人数的能力。这造成了不便,因为用户无法判断他们应该等待还是寻找替代方案,从而导致不必要的时间浪费。在这项工作中,我们采用边缘计算,在边缘设备上运行MobileNet-Single-Stage Object Detection (SSD)算法来识别电梯内和各层等待的人数。为了保证计数的准确性,我们对识别频率和置信度阈值等SSD参数进行了微调,并采用兴趣线计数策略进行计数。在我们的实验中,我们在一栋四层楼的建筑中部署了四块NVIDIA Jetson Nano板,作为边缘设备,在人们进入特定区域时进行计数。计数结果,如每层楼和电梯内的等待人数,通过web应用程序提供给用户。我们的实验结果表明,所提出的方法对人数计数的平均准确率达到85%。此外,与将所有图像发送回远程服务器进行人员计数相比,边缘计算所需的执行时间更短,而且不会显著影响准确性。
{"title":"Edge-Computing-Based People-Counting System for Elevators Using MobileNet–Single-Stage Object Detection","authors":"Tsu-Chuan Shen, Edward T.-H. Chu","doi":"10.3390/fi15100337","DOIUrl":"https://doi.org/10.3390/fi15100337","url":null,"abstract":"Existing elevator systems lack the ability to display the number of people waiting on each floor and inside the elevator. This causes an inconvenience as users cannot tell if they should wait or seek alternatives, leading to unnecessary time wastage. In this work, we adopted edge computing by running the MobileNet–Single-Stage Object Detection (SSD) algorithm on edge devices to recognize the number of people inside an elevator and waiting on each floor. To ensure the accuracy of people counting, we fine-tuned the SSD parameters, such as the recognition frequency and confidence thresholds, and utilized the line of interest (LOI) counting strategy for people counting. In our experiment, we deployed four NVIDIA Jetson Nano boards in a four-floor building as edge devices to count people when they entered specific areas. The counting results, such as the number of people waiting on each floor and inside the elevator, were provided to users through a web app. Our experimental results demonstrate that the proposed method achieved an average accuracy of 85% for people counting. Furthermore, when comparing it to sending all images back to a remote server for people counting, the execution time required for edge computing was shorter, without compromising the accuracy significantly.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135803422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fluent but Not Factual: A Comparative Analysis of ChatGPT and Other AI Chatbots’ Proficiency and Originality in Scientific Writing for Humanities 流利但不真实:ChatGPT与其他人工智能聊天机器人在人文科学写作中的熟练程度和独创性的比较分析
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-13 DOI: 10.3390/fi15100336
Edisa Lozić, Benjamin Štular
Historically, mastery of writing was deemed essential to human progress. However, recent advances in generative AI have marked an inflection point in this narrative, including for scientific writing. This article provides a comprehensive analysis of the capabilities and limitations of six AI chatbots in scholarly writing in the humanities and archaeology. The methodology was based on tagging AI-generated content for quantitative accuracy and qualitative precision by human experts. Quantitative accuracy assessed the factual correctness in a manner similar to grading students, while qualitative precision gauged the scientific contribution similar to reviewing a scientific article. In the quantitative test, ChatGPT-4 scored near the passing grade (−5) whereas ChatGPT-3.5 (−18), Bing (−21) and Bard (−31) were not far behind. Claude 2 (−75) and Aria (−80) scored much lower. In the qualitative test, all AI chatbots, but especially ChatGPT-4, demonstrated proficiency in recombining existing knowledge, but all failed to generate original scientific content. As a side note, our results suggest that with ChatGPT-4, the size of large language models has reached a plateau. Furthermore, this paper underscores the intricate and recursive nature of human research. This process of transforming raw data into refined knowledge is computationally irreducible, highlighting the challenges AI chatbots face in emulating human originality in scientific writing. Our results apply to the state of affairs in the third quarter of 2023. In conclusion, while large language models have revolutionised content generation, their ability to produce original scientific contributions in the humanities remains limited. We expect this to change in the near future as current large language model-based AI chatbots evolve into large language model-powered software.
从历史上看,掌握文字被认为是人类进步的必要条件。然而,生成式人工智能的最新进展标志着这一叙事的转折点,包括科学写作。本文全面分析了六种人工智能聊天机器人在人文科学和考古学学术写作中的能力和局限性。该方法基于对人工智能生成的内容进行标记,以便由人类专家进行定量准确性和定性精度。定量准确性评估事实正确性的方式类似于给学生打分,而定性精度评估科学贡献的方式类似于审查一篇科学文章。在定量测试中,ChatGPT-4得分接近及格(- 5),而ChatGPT-3.5 (- 18), Bing(- 21)和Bard(- 31)紧随其后。Claude 2(- 75)和Aria(- 80)得分要低得多。在定性测试中,所有AI聊天机器人,尤其是ChatGPT-4,都表现出对现有知识重组的熟练程度,但都未能产生原创的科学内容。作为旁注,我们的结果表明,使用ChatGPT-4,大型语言模型的规模已经达到了一个平台。此外,本文强调了人类研究的复杂性和递归性。这种将原始数据转化为精炼知识的过程在计算上是不可约的,这凸显了人工智能聊天机器人在模仿人类科学写作原创性方面面临的挑战。我们的结果适用于2023年第三季度的情况。总之,尽管大型语言模型彻底改变了内容生成,但它们在人文科学领域做出原创科学贡献的能力仍然有限。随着目前基于大型语言模型的人工智能聊天机器人进化成基于大型语言模型的软件,我们预计这种情况将在不久的将来发生变化。
{"title":"Fluent but Not Factual: A Comparative Analysis of ChatGPT and Other AI Chatbots’ Proficiency and Originality in Scientific Writing for Humanities","authors":"Edisa Lozić, Benjamin Štular","doi":"10.3390/fi15100336","DOIUrl":"https://doi.org/10.3390/fi15100336","url":null,"abstract":"Historically, mastery of writing was deemed essential to human progress. However, recent advances in generative AI have marked an inflection point in this narrative, including for scientific writing. This article provides a comprehensive analysis of the capabilities and limitations of six AI chatbots in scholarly writing in the humanities and archaeology. The methodology was based on tagging AI-generated content for quantitative accuracy and qualitative precision by human experts. Quantitative accuracy assessed the factual correctness in a manner similar to grading students, while qualitative precision gauged the scientific contribution similar to reviewing a scientific article. In the quantitative test, ChatGPT-4 scored near the passing grade (−5) whereas ChatGPT-3.5 (−18), Bing (−21) and Bard (−31) were not far behind. Claude 2 (−75) and Aria (−80) scored much lower. In the qualitative test, all AI chatbots, but especially ChatGPT-4, demonstrated proficiency in recombining existing knowledge, but all failed to generate original scientific content. As a side note, our results suggest that with ChatGPT-4, the size of large language models has reached a plateau. Furthermore, this paper underscores the intricate and recursive nature of human research. This process of transforming raw data into refined knowledge is computationally irreducible, highlighting the challenges AI chatbots face in emulating human originality in scientific writing. Our results apply to the state of affairs in the third quarter of 2023. In conclusion, while large language models have revolutionised content generation, their ability to produce original scientific contributions in the humanities remains limited. We expect this to change in the near future as current large language model-based AI chatbots evolve into large language model-powered software.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135858189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Comparison of Supervised Learning Algorithms on a 5G Dataset Reduced via Principal Component Analysis (PCA) 基于主成分分析(PCA)的5G数据集监督学习算法比较
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-11 DOI: 10.3390/fi15100335
Joan D. Gonzalez-Franco, Jorge E. Preciado-Velasco, Jose E. Lozano-Rizk, Raul Rivera-Rodriguez, Jorge Torres-Rodriguez, Miguel A. Alonso-Arevalo
Improving the quality of service (QoS) and meeting service level agreements (SLAs) are critical objectives in next-generation networks. This article presents a study on applying supervised learning (SL) algorithms in a 5G/B5G service dataset after being subjected to a principal component analysis (PCA). The study objective is to evaluate if the reduction of the dimensionality of the dataset via PCA affects the predictive capacity of the SL algorithms. A machine learning (ML) scheme proposed in a previous article used the same algorithms and parameters, which allows for a fair comparison with the results obtained in this work. We searched the best hyperparameters for each SL algorithm, and the simulation results indicate that the support vector machine (SVM) algorithm obtained a precision of 98% and a F1 score of 98.1%. We concluded that the findings of this study hold significance for research in the field of next-generation networks, which involve a wide range of input parameters and can benefit from the application of principal component analysis (PCA) on the performance of QoS and maintaining the SLA.
提高服务质量(QoS)和满足服务水平协议(sla)是下一代网络的关键目标。本文研究了在主成分分析(PCA)之后在5G/B5G服务数据集中应用监督学习(SL)算法的方法。本研究的目的是评估通过PCA对数据集进行降维是否会影响SL算法的预测能力。在前一篇文章中提出的机器学习(ML)方案使用了相同的算法和参数,这允许与本工作中获得的结果进行公平比较。我们为每个SL算法搜索了最佳超参数,仿真结果表明,支持向量机(SVM)算法获得了98%的精度和98.1%的F1分数。我们的结论是,本研究的发现对下一代网络领域的研究具有重要意义,下一代网络涉及广泛的输入参数,并且可以受益于主成分分析(PCA)在QoS性能和SLA维护方面的应用。
{"title":"Comparison of Supervised Learning Algorithms on a 5G Dataset Reduced via Principal Component Analysis (PCA)","authors":"Joan D. Gonzalez-Franco, Jorge E. Preciado-Velasco, Jose E. Lozano-Rizk, Raul Rivera-Rodriguez, Jorge Torres-Rodriguez, Miguel A. Alonso-Arevalo","doi":"10.3390/fi15100335","DOIUrl":"https://doi.org/10.3390/fi15100335","url":null,"abstract":"Improving the quality of service (QoS) and meeting service level agreements (SLAs) are critical objectives in next-generation networks. This article presents a study on applying supervised learning (SL) algorithms in a 5G/B5G service dataset after being subjected to a principal component analysis (PCA). The study objective is to evaluate if the reduction of the dimensionality of the dataset via PCA affects the predictive capacity of the SL algorithms. A machine learning (ML) scheme proposed in a previous article used the same algorithms and parameters, which allows for a fair comparison with the results obtained in this work. We searched the best hyperparameters for each SL algorithm, and the simulation results indicate that the support vector machine (SVM) algorithm obtained a precision of 98% and a F1 score of 98.1%. We concluded that the findings of this study hold significance for research in the field of next-generation networks, which involve a wide range of input parameters and can benefit from the application of principal component analysis (PCA) on the performance of QoS and maintaining the SLA.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136210141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-Driven Safe Deliveries: The Synergy of IoT and Machine Learning in Shared Mobility 数据驱动的安全交付:物联网和机器学习在共享移动中的协同作用
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-10 DOI: 10.3390/fi15100333
Fatema Elwy, Raafat Aburukba, A. R. Al-Ali, Ahmad Al Nabulsi, Alaa Tarek, Ameen Ayub, Mariam Elsayeh
Shared mobility is one of the smart city applications in which traditional individually owned vehicles are transformed into shared and distributed ownership. Ensuring the safety of both drivers and riders is a fundamental requirement in shared mobility. This work aims to design and implement an adequate framework for shared mobility within the context of a smart city. The characteristics of shared mobility are identified, leading to the proposal of an effective solution for real-time data collection, tracking, and automated decisions focusing on safety. Driver and rider safety is considered by identifying dangerous driving behaviors and the prompt response to accidents. Furthermore, a trip log is recorded to identify the reasons behind the accident. A prototype implementation is presented to validate the proposed framework for a delivery service using motorbikes. The results demonstrate the scalability of the proposed design and the integration of the overall system to enhance the rider’s safety using machine learning techniques. The machine learning approach identifies dangerous driving behaviors with an accuracy of 91.59% using the decision tree approach when compared against the support vector machine and K-nearest neighbor approaches.
共享出行是智慧城市的应用之一,将传统的个人拥有的车辆转变为共享和分布式所有权。确保司机和乘客的安全是共享出行的基本要求。这项工作旨在为智慧城市背景下的共享移动设计和实施一个适当的框架。识别了共享移动的特征,从而提出了一种有效的解决方案,用于实时数据收集、跟踪和以安全为重点的自动决策。驾驶员和乘客的安全是通过识别危险驾驶行为和对事故的迅速反应来考虑的。此外,还记录了旅行日志,以确定事故背后的原因。提出了一个原型实现来验证使用摩托车的交付服务的建议框架。结果证明了所提出设计的可扩展性和整个系统的集成,以使用机器学习技术提高骑手的安全性。与支持向量机和k近邻方法相比,机器学习方法使用决策树方法识别危险驾驶行为的准确率为91.59%。
{"title":"Data-Driven Safe Deliveries: The Synergy of IoT and Machine Learning in Shared Mobility","authors":"Fatema Elwy, Raafat Aburukba, A. R. Al-Ali, Ahmad Al Nabulsi, Alaa Tarek, Ameen Ayub, Mariam Elsayeh","doi":"10.3390/fi15100333","DOIUrl":"https://doi.org/10.3390/fi15100333","url":null,"abstract":"Shared mobility is one of the smart city applications in which traditional individually owned vehicles are transformed into shared and distributed ownership. Ensuring the safety of both drivers and riders is a fundamental requirement in shared mobility. This work aims to design and implement an adequate framework for shared mobility within the context of a smart city. The characteristics of shared mobility are identified, leading to the proposal of an effective solution for real-time data collection, tracking, and automated decisions focusing on safety. Driver and rider safety is considered by identifying dangerous driving behaviors and the prompt response to accidents. Furthermore, a trip log is recorded to identify the reasons behind the accident. A prototype implementation is presented to validate the proposed framework for a delivery service using motorbikes. The results demonstrate the scalability of the proposed design and the integration of the overall system to enhance the rider’s safety using machine learning techniques. The machine learning approach identifies dangerous driving behaviors with an accuracy of 91.59% using the decision tree approach when compared against the support vector machine and K-nearest neighbor approaches.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136353063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Oceania’s 5G Multi-Tier Fixed Wireless Access Link’s Long-Term Resilience and Feasibility Analysis 大洋洲5G多层固定无线接入链路的长期弹性和可行性分析
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-10 DOI: 10.3390/fi15100334
Satyanand Singh, Joanna Rosak-Szyrocka, István Drotár, Xavier Fernando
Information and communications technologies play a vital role in achieving the Sustainable Development Goals (SDGs) and bridging the gap between developed and developing countries. However, various socioeconomic factors adversely impact the deployment of digital infrastructure, such as 5G networks, in the countries of Oceania. The high-speed broadband fifth-generation cellular network (5G) will improve the quality of service for growing mobile users and the massive Internet of Things (IoT). It will also provide ultra-low-latency services required by smart city applications. This study investigates the planning process for a 5G radio access network incorporating sub-6 GHz macro-remote radio units (MRRUs) and mmWave micro-remote radio units (mRRUs). We carefully define an optimization problem for 5G network planning, considering the characteristics of urban macro-cells (UMa) and urban micro-cells (UMi) with appropriate channel models and link budgets. We determine the minimum number of MRRUs and mRRUs that can be installed in each area while meeting coverage and user traffic requirements. This will ensure adequate broadband low-latency network coverage with micro-cells instead of macro-cells. This study evaluates the technical feasibility analysis of combining terrestrial and airborne networks to provide 5G coverage in Oceania, with a special emphasis on Fiji.
信息通信技术在实现可持续发展目标和缩小发达国家与发展中国家之间的差距方面发挥着至关重要的作用。然而,各种社会经济因素对大洋洲国家5G网络等数字基础设施的部署产生了不利影响。高速宽带第五代蜂窝网络(5G)将为不断增长的移动用户和大规模物联网(IoT)提高服务质量。它还将提供智慧城市应用所需的超低延迟服务。本研究探讨了包含sub-6 GHz宏远程无线电单元(MRRUs)和毫米波微远程无线电单元(MRRUs)的5G无线接入网络的规划过程。考虑到城市宏蜂窝(UMa)和城市微蜂窝(UMi)的特点,采用适当的信道模型和链路预算,我们仔细定义了5G网络规划的优化问题。我们确定在满足覆盖范围和用户流量需求的情况下,每个区域可以安装的mrru和mrru的最小数量。这将确保足够的宽带低延迟网络覆盖与微蜂窝,而不是大蜂窝。本研究评估了结合地面和机载网络在大洋洲提供5G覆盖的技术可行性分析,特别强调了斐济。
{"title":"Oceania’s 5G Multi-Tier Fixed Wireless Access Link’s Long-Term Resilience and Feasibility Analysis","authors":"Satyanand Singh, Joanna Rosak-Szyrocka, István Drotár, Xavier Fernando","doi":"10.3390/fi15100334","DOIUrl":"https://doi.org/10.3390/fi15100334","url":null,"abstract":"Information and communications technologies play a vital role in achieving the Sustainable Development Goals (SDGs) and bridging the gap between developed and developing countries. However, various socioeconomic factors adversely impact the deployment of digital infrastructure, such as 5G networks, in the countries of Oceania. The high-speed broadband fifth-generation cellular network (5G) will improve the quality of service for growing mobile users and the massive Internet of Things (IoT). It will also provide ultra-low-latency services required by smart city applications. This study investigates the planning process for a 5G radio access network incorporating sub-6 GHz macro-remote radio units (MRRUs) and mmWave micro-remote radio units (mRRUs). We carefully define an optimization problem for 5G network planning, considering the characteristics of urban macro-cells (UMa) and urban micro-cells (UMi) with appropriate channel models and link budgets. We determine the minimum number of MRRUs and mRRUs that can be installed in each area while meeting coverage and user traffic requirements. This will ensure adequate broadband low-latency network coverage with micro-cells instead of macro-cells. This study evaluates the technical feasibility analysis of combining terrestrial and airborne networks to provide 5G coverage in Oceania, with a special emphasis on Fiji.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136295507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning: Models, Challenges, and Research Directions 机器学习:模型、挑战和研究方向
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-09 DOI: 10.3390/fi15100332
Tala Talaei Khoei, Naima Kaabouch
Machine learning techniques have emerged as a transformative force, revolutionizing various application domains, particularly cybersecurity. The development of optimal machine learning applications requires the integration of multiple processes, such as data pre-processing, model selection, and parameter optimization. While existing surveys have shed light on these techniques, they have mainly focused on specific application domains. A notable gap that exists in current studies is the lack of a comprehensive overview of machine learning architecture and its essential phases in the cybersecurity field. To address this gap, this survey provides a holistic review of current studies in machine learning, covering techniques applicable to any domain. Models are classified into four categories: supervised, semi-supervised, unsupervised, and reinforcement learning. Each of these categories and their models are described. In addition, the survey discusses the current progress related to data pre-processing and hyperparameter tuning techniques. Moreover, this survey identifies and reviews the research gaps and key challenges that the cybersecurity field faces. By analyzing these gaps, we propose some promising research directions for the future. Ultimately, this survey aims to serve as a valuable resource for researchers interested in learning about machine learning, providing them with insights to foster innovation and progress across diverse application domains.
机器学习技术已经成为一股变革力量,彻底改变了各个应用领域,尤其是网络安全。开发最优的机器学习应用需要集成多个过程,如数据预处理、模型选择和参数优化。虽然现有的调查已经阐明了这些技术,但它们主要集中在特定的应用领域。当前研究中存在的一个显著差距是缺乏对机器学习架构及其在网络安全领域的基本阶段的全面概述。为了解决这一差距,本调查提供了当前机器学习研究的全面回顾,涵盖了适用于任何领域的技术。模型分为四类:监督学习、半监督学习、无监督学习和强化学习。描述了每一个类别及其模型。此外,本文还讨论了数据预处理和超参数调优技术的最新进展。此外,本调查确定并回顾了网络安全领域面临的研究差距和关键挑战。通过分析这些差距,我们提出了未来的研究方向。最终,本调查旨在为有兴趣学习机器学习的研究人员提供宝贵的资源,为他们提供见解,以促进不同应用领域的创新和进步。
{"title":"Machine Learning: Models, Challenges, and Research Directions","authors":"Tala Talaei Khoei, Naima Kaabouch","doi":"10.3390/fi15100332","DOIUrl":"https://doi.org/10.3390/fi15100332","url":null,"abstract":"Machine learning techniques have emerged as a transformative force, revolutionizing various application domains, particularly cybersecurity. The development of optimal machine learning applications requires the integration of multiple processes, such as data pre-processing, model selection, and parameter optimization. While existing surveys have shed light on these techniques, they have mainly focused on specific application domains. A notable gap that exists in current studies is the lack of a comprehensive overview of machine learning architecture and its essential phases in the cybersecurity field. To address this gap, this survey provides a holistic review of current studies in machine learning, covering techniques applicable to any domain. Models are classified into four categories: supervised, semi-supervised, unsupervised, and reinforcement learning. Each of these categories and their models are described. In addition, the survey discusses the current progress related to data pre-processing and hyperparameter tuning techniques. Moreover, this survey identifies and reviews the research gaps and key challenges that the cybersecurity field faces. By analyzing these gaps, we propose some promising research directions for the future. Ultimately, this survey aims to serve as a valuable resource for researchers interested in learning about machine learning, providing them with insights to foster innovation and progress across diverse application domains.","PeriodicalId":37982,"journal":{"name":"Future Internet","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135095251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Future Internet
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1