首页 > 最新文献

Algorithms最新文献

英文 中文
Using Graph Neural Networks for Social Recommendations 使用图神经网络进行社交推荐
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-10 DOI: 10.3390/a16110515
Dharahas Tallapally, John Wang, Katerina Potika, Magdalini Eirinaki
Recommender systems have revolutionized the way users discover and engage with content. Moving beyond the collaborative filtering approach, most modern recommender systems leverage additional sources of information, such as context and social network data. Such data can be modeled using graphs, and the recent advances in Graph Neural Networks have led to the prominence of a new family of graph-based recommender system algorithms. In this work, we propose the RelationalNet algorithm, which not only models user–item, and user–user relationships but also item–item relationships with graphs and uses them as input to the recommendation process. The rationale for utilizing item–item interactions is to enrich the item embeddings by leveraging the similarities between items. By using Graph Neural Networks (GNNs), RelationalNet incorporates social influence and similar item influence into the recommendation process and captures more accurate user interests, especially when traditional methods fall short due to data sparsity. Such models improve the accuracy and effectiveness of recommendation systems by leveraging social connections and item interactions. Results demonstrate that RelationalNet outperforms current state-of-the-art social recommendation algorithms.
推荐系统已经彻底改变了用户发现和参与内容的方式。除了协作过滤方法之外,大多数现代推荐系统还利用其他信息源,例如上下文和社交网络数据。这些数据可以使用图来建模,图神经网络的最新进展导致了一系列新的基于图的推荐系统算法的突出。在这项工作中,我们提出了RelationalNet算法,该算法不仅对用户-物品、用户-用户关系以及物品-物品关系进行建模,并将其作为推荐过程的输入。利用项目-项目交互的基本原理是通过利用项目之间的相似性来丰富项目嵌入。通过使用图神经网络(gnn), RelationalNet将社会影响和类似项目影响纳入推荐过程,并捕获更准确的用户兴趣,特别是当传统方法由于数据稀疏而无法实现时。这些模型通过利用社会联系和项目交互来提高推荐系统的准确性和有效性。结果表明,RelationalNet优于当前最先进的社交推荐算法。
{"title":"Using Graph Neural Networks for Social Recommendations","authors":"Dharahas Tallapally, John Wang, Katerina Potika, Magdalini Eirinaki","doi":"10.3390/a16110515","DOIUrl":"https://doi.org/10.3390/a16110515","url":null,"abstract":"Recommender systems have revolutionized the way users discover and engage with content. Moving beyond the collaborative filtering approach, most modern recommender systems leverage additional sources of information, such as context and social network data. Such data can be modeled using graphs, and the recent advances in Graph Neural Networks have led to the prominence of a new family of graph-based recommender system algorithms. In this work, we propose the RelationalNet algorithm, which not only models user–item, and user–user relationships but also item–item relationships with graphs and uses them as input to the recommendation process. The rationale for utilizing item–item interactions is to enrich the item embeddings by leveraging the similarities between items. By using Graph Neural Networks (GNNs), RelationalNet incorporates social influence and similar item influence into the recommendation process and captures more accurate user interests, especially when traditional methods fall short due to data sparsity. Such models improve the accuracy and effectiveness of recommendation systems by leveraging social connections and item interactions. Results demonstrate that RelationalNet outperforms current state-of-the-art social recommendation algorithms.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":" 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135188167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on a Classification Method for Strip Steel Surface Defects Based on Knowledge Distillation and a Self-Adaptive Residual Shrinkage Network 基于知识蒸馏和自适应残余收缩网络的带钢表面缺陷分类方法研究
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-10 DOI: 10.3390/a16110516
Xinbo Huang, Zhiwei Song, Chao Ji, Ye Zhang, Luya Yang
Different types of surface defects will occur during the production of strip steel. To ensure production quality, it is essential to classify these defects. Our research indicates that two main problems exist in the existing strip steel surface defect classification methods: (1) they cannot solve the problem of unbalanced data using few-shot in reality, (2) they cannot meet the requirement of online real-time classification. To solve the aforementioned problems, a relational knowledge distillation self-adaptive residual shrinkage network (RKD-SARSN) is presented in this work. First, the data enhancement strategy of Cycle GAN defective sample migration is designed. Second, the self-adaptive residual shrinkage network (SARSN) is intended as the backbone network for feature extraction. An adaptive loss function based on accuracy and geometric mean (Gmean) is proposed to solve the problem of unbalanced samples. Finally, a relational knowledge distillation model (RKD) is proposed, and the functions of GUI operation interface encapsulation are designed by combining image processing technology. SARSN is used as a teacher model, its generalization performance is transferred to the lightweight network ResNet34, and it is conveniently deployed as a student model. The results show that the proposed method can improve the deployment efficiency of the model and ensure the real-time performance of the classification algorithms. It is superior to other mainstream algorithms for fine-grained images with unbalanced data classification.
带钢在生产过程中会出现不同类型的表面缺陷。为了保证产品质量,对这些缺陷进行分类是必要的。我们的研究表明,现有的带钢表面缺陷分类方法存在两个主要问题:(1)无法解决现实中数据不平衡的问题,(2)不能满足在线实时分类的要求。为了解决上述问题,本文提出了一种关系知识蒸馏自适应残余收缩网络(RKD-SARSN)。首先,设计了循环GAN缺陷样本迁移的数据增强策略。其次,将自适应残差收缩网络(SARSN)作为特征提取的骨干网络。为了解决样本不平衡问题,提出了一种基于精度和几何均值的自适应损失函数。最后,提出了关系知识精馏模型(RKD),并结合图像处理技术设计了GUI操作界面封装功能。SARSN作为教师模型,将其泛化性能转移到轻量级网络ResNet34中,方便地部署为学生模型。结果表明,该方法可以提高模型的部署效率,保证分类算法的实时性。对于具有非平衡数据的细粒度图像,该算法优于其他主流算法。
{"title":"Research on a Classification Method for Strip Steel Surface Defects Based on Knowledge Distillation and a Self-Adaptive Residual Shrinkage Network","authors":"Xinbo Huang, Zhiwei Song, Chao Ji, Ye Zhang, Luya Yang","doi":"10.3390/a16110516","DOIUrl":"https://doi.org/10.3390/a16110516","url":null,"abstract":"Different types of surface defects will occur during the production of strip steel. To ensure production quality, it is essential to classify these defects. Our research indicates that two main problems exist in the existing strip steel surface defect classification methods: (1) they cannot solve the problem of unbalanced data using few-shot in reality, (2) they cannot meet the requirement of online real-time classification. To solve the aforementioned problems, a relational knowledge distillation self-adaptive residual shrinkage network (RKD-SARSN) is presented in this work. First, the data enhancement strategy of Cycle GAN defective sample migration is designed. Second, the self-adaptive residual shrinkage network (SARSN) is intended as the backbone network for feature extraction. An adaptive loss function based on accuracy and geometric mean (Gmean) is proposed to solve the problem of unbalanced samples. Finally, a relational knowledge distillation model (RKD) is proposed, and the functions of GUI operation interface encapsulation are designed by combining image processing technology. SARSN is used as a teacher model, its generalization performance is transferred to the lightweight network ResNet34, and it is conveniently deployed as a student model. The results show that the proposed method can improve the deployment efficiency of the model and ensure the real-time performance of the classification algorithms. It is superior to other mainstream algorithms for fine-grained images with unbalanced data classification.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"2 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135186031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trustworthy Digital Representations of Analog Information—An Application-Guided Analysis of a Fundamental Theoretical Problem in Digital Twinning 模拟信息的可信数字表示——应用导向的数字孪生基本理论问题分析
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-09 DOI: 10.3390/a16110514
Holger Boche, Yannik N. Böck, Ullrich J. Mönich, Frank H. P. Fitzek
This article compares two methods of algorithmically processing bandlimited time-continuous signals in light of the general problem of finding “suitable” representations of analog information on digital hardware. Albeit abstract, we argue that this problem is fundamental in digital twinning, a signal-processing paradigm the upcoming 6G communication-technology standard relies on heavily. Using computable analysis, we formalize a general framework of machine-readable descriptions for representing analytic objects on Turing machines. Subsequently, we apply this framework to sampling and interpolation theory, providing a thoroughly formalized method for digitally processing the information carried by bandlimited analog signals. We investigate discrete-time descriptions, which form the implicit quasi-standard in digital signal processing, and establish continuous-time descriptions that take the signal’s continuous-time behavior into account. Motivated by an exemplary application of digital twinning, we analyze a textbook model of digital communication systems accordingly. We show that technologically fundamental properties, such as a signal’s (Banach-space) norm, can be computed from continuous-time, but not from discrete-time descriptions of the signal. Given the high trustworthiness requirements within 6G, e.g., employed software must satisfy assessment criteria in a provable manner, we conclude that the problem of “trustworthy” digital representations of analog information is indeed essential to near-future information technology.
本文针对在数字硬件上寻找模拟信息的“合适”表示的一般问题,比较了两种算法处理带限时间连续信号的方法。尽管是抽象的,但我们认为这个问题是数字孪生的基础,这是即将到来的6G通信技术标准严重依赖的信号处理范式。利用可计算分析,我们形式化了一个机器可读描述的一般框架,用于表示图灵机上的分析对象。随后,我们将该框架应用于采样和插值理论,为数字处理带宽有限的模拟信号所携带的信息提供了一种彻底形式化的方法。我们研究了离散时间描述,它构成了数字信号处理中的隐式准标准,并建立了考虑信号连续时间行为的连续时间描述。在数字孪生应用的启发下,我们相应地分析了数字通信系统的教科书模型。我们表明,技术上的基本性质,如信号的(巴拿赫空间)范数,可以从连续时间计算,但不能从信号的离散时间描述计算。考虑到6G内的高可信度要求,例如,所使用的软件必须以可证明的方式满足评估标准,我们得出结论,模拟信息的“可信”数字表示问题对于近期的信息技术确实至关重要。
{"title":"Trustworthy Digital Representations of Analog Information—An Application-Guided Analysis of a Fundamental Theoretical Problem in Digital Twinning","authors":"Holger Boche, Yannik N. Böck, Ullrich J. Mönich, Frank H. P. Fitzek","doi":"10.3390/a16110514","DOIUrl":"https://doi.org/10.3390/a16110514","url":null,"abstract":"This article compares two methods of algorithmically processing bandlimited time-continuous signals in light of the general problem of finding “suitable” representations of analog information on digital hardware. Albeit abstract, we argue that this problem is fundamental in digital twinning, a signal-processing paradigm the upcoming 6G communication-technology standard relies on heavily. Using computable analysis, we formalize a general framework of machine-readable descriptions for representing analytic objects on Turing machines. Subsequently, we apply this framework to sampling and interpolation theory, providing a thoroughly formalized method for digitally processing the information carried by bandlimited analog signals. We investigate discrete-time descriptions, which form the implicit quasi-standard in digital signal processing, and establish continuous-time descriptions that take the signal’s continuous-time behavior into account. Motivated by an exemplary application of digital twinning, we analyze a textbook model of digital communication systems accordingly. We show that technologically fundamental properties, such as a signal’s (Banach-space) norm, can be computed from continuous-time, but not from discrete-time descriptions of the signal. Given the high trustworthiness requirements within 6G, e.g., employed software must satisfy assessment criteria in a provable manner, we conclude that the problem of “trustworthy” digital representations of analog information is indeed essential to near-future information technology.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":" 22","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135244385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A System to Support Readers in Automatically Acquiring Complete Summarized Information on an Event from Different Sources 支持读者从不同来源自动获取事件的完整汇总信息的系统
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-08 DOI: 10.3390/a16110513
Pietro Dell’Oglio, Alessandro Bondielli, Francesco Marcelloni
Today, most newspapers utilize social media to disseminate news. On the one hand, this results in an overload of related articles for social media users. On the other hand, since social media tends to form echo chambers around their users, different opinions and information may be hidden. Enabling users to access different information (possibly outside of their echo chambers, without the burden of reading entire articles, often containing redundant information) may be a step forward in allowing them to form their own opinions. To address this challenge, we propose a system that integrates Transformer neural models and text summarization models along with decision rules. Given a reference article already read by the user, our system first collects articles related to the same topic from a configurable number of different sources. Then, it identifies and summarizes the information that differs from the reference article and outputs the summary to the user. The core of the system is the sentence classification algorithm, which classifies sentences in the collected articles into three classes based on similarity with the reference article: sentences classified as dissimilar are summarized by using a pre-trained abstractive summarization model. We evaluated the proposed system in two steps. First, we assessed its effectiveness in identifying content differences between the reference article and the related articles by using human judgments obtained through crowdsourcing as ground truth. We obtained an average F1 score of 0.772 against average F1 scores of 0.797 and 0.676 achieved by two state-of-the-art approaches based, respectively, on model tuning and prompt tuning, which require an appropriate tuning phase and, therefore, greater computational effort. Second, we asked a sample of people to evaluate how well the summary generated by the system represents the information that is not present in the article read by the user. The results are extremely encouraging. Finally, we present a use case.
今天,大多数报纸利用社交媒体传播新闻。一方面,这导致社交媒体用户的相关文章过载。另一方面,由于社交媒体倾向于在用户周围形成回音室,不同的意见和信息可能会被隐藏。允许用户访问不同的信息(可能在他们的回音室之外,没有阅读整篇文章的负担,通常包含冗余信息)可能是允许他们形成自己观点的一步。为了应对这一挑战,我们提出了一个集成Transformer神经模型和文本摘要模型以及决策规则的系统。给定用户已经阅读的参考文章,我们的系统首先从可配置数量的不同来源收集与同一主题相关的文章。然后,它识别和总结与参考文章不同的信息,并将摘要输出给用户。该系统的核心是句子分类算法,该算法根据与参考文章的相似度将收集到的文章中的句子分为三类:分类为不相似的句子通过预训练的抽象摘要模型进行汇总。我们分两步评估了提议的系统。首先,我们评估了它在识别参考文章和相关文章之间的内容差异方面的有效性,通过使用通过众包获得的人类判断作为基础事实。我们获得了0.772的平均F1分数,而分别基于模型调优和提示调优的两种最先进的方法获得的平均F1分数分别为0.797和0.676,这两种方法需要适当的调优阶段,因此需要更多的计算工作量。其次,我们要求一些人评估系统生成的摘要如何很好地代表用户阅读的文章中没有出现的信息。结果非常令人鼓舞。最后,我们给出一个用例。
{"title":"A System to Support Readers in Automatically Acquiring Complete Summarized Information on an Event from Different Sources","authors":"Pietro Dell’Oglio, Alessandro Bondielli, Francesco Marcelloni","doi":"10.3390/a16110513","DOIUrl":"https://doi.org/10.3390/a16110513","url":null,"abstract":"Today, most newspapers utilize social media to disseminate news. On the one hand, this results in an overload of related articles for social media users. On the other hand, since social media tends to form echo chambers around their users, different opinions and information may be hidden. Enabling users to access different information (possibly outside of their echo chambers, without the burden of reading entire articles, often containing redundant information) may be a step forward in allowing them to form their own opinions. To address this challenge, we propose a system that integrates Transformer neural models and text summarization models along with decision rules. Given a reference article already read by the user, our system first collects articles related to the same topic from a configurable number of different sources. Then, it identifies and summarizes the information that differs from the reference article and outputs the summary to the user. The core of the system is the sentence classification algorithm, which classifies sentences in the collected articles into three classes based on similarity with the reference article: sentences classified as dissimilar are summarized by using a pre-trained abstractive summarization model. We evaluated the proposed system in two steps. First, we assessed its effectiveness in identifying content differences between the reference article and the related articles by using human judgments obtained through crowdsourcing as ground truth. We obtained an average F1 score of 0.772 against average F1 scores of 0.797 and 0.676 achieved by two state-of-the-art approaches based, respectively, on model tuning and prompt tuning, which require an appropriate tuning phase and, therefore, greater computational effort. Second, we asked a sample of people to evaluate how well the summary generated by the system represents the information that is not present in the article read by the user. The results are extremely encouraging. Finally, we present a use case.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"106 s415","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135342397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Special Issue on Algorithms in Decision Support Systems Vol.2 决策支持系统中的算法专刊Vol.2
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-08 DOI: 10.3390/a16110512
Edward Rolando Núñez-Valdez
Currently, decision support systems (DSSs) are essential tools that provide information and support for decision making on possible problems that, due to their level of complexity, cannot be easily solved by humans [...]
目前,决策支持系统(DSSs)是必要的工具,它为可能出现的问题提供信息和支持,这些问题由于其复杂性,不容易由人类解决[…]
{"title":"Special Issue on Algorithms in Decision Support Systems Vol.2","authors":"Edward Rolando Núñez-Valdez","doi":"10.3390/a16110512","DOIUrl":"https://doi.org/10.3390/a16110512","url":null,"abstract":"Currently, decision support systems (DSSs) are essential tools that provide information and support for decision making on possible problems that, due to their level of complexity, cannot be easily solved by humans [...]","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"107 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135342395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Recommendation System Supporting the Implementation of Sustainable Risk Management Measures in Airport Operations 支持在机场运作中推行可持续风险管理措施的建议系统
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-07 DOI: 10.3390/a16110511
Silvia Carpitella, Bruno Brentan, Antonella Certa, Joaquín Izquierdo
This paper introduces a recommendation system aimed at enhancing the sustainable process of risk management within airport operations, with a special focus on Occupational Stress Risks (OSRs). The recommendation system is implemented via a flexible Python code that offers seamless integration into various operational contexts. It leverages Fuzzy Cognitive Maps (FCMs) to conduct comprehensive risk assessments, subsequently generating prioritized recommendations for predefined risk management measures aimed at preventing and/or reducing the most critical OSRs. The system’s reliability has been validated by iterating the procedure with diverse input data (i.e., matrices of varying sizes) and measures. This confirms the system’s effectiveness across a broad spectrum of engineering scenarios.
本文介绍了一个建议系统,旨在加强机场运营风险管理的可持续过程,特别关注职业压力风险(OSRs)。推荐系统是通过灵活的Python代码实现的,可以无缝集成到各种操作环境中。它利用模糊认知地图(fcm)进行全面的风险评估,随后为预先定义的风险管理措施生成优先建议,旨在预防和/或减少最严重的osr。通过对不同输入数据(即不同大小的矩阵)和测量方法进行迭代,验证了系统的可靠性。这证实了该系统在广泛的工程场景中的有效性。
{"title":"A Recommendation System Supporting the Implementation of Sustainable Risk Management Measures in Airport Operations","authors":"Silvia Carpitella, Bruno Brentan, Antonella Certa, Joaquín Izquierdo","doi":"10.3390/a16110511","DOIUrl":"https://doi.org/10.3390/a16110511","url":null,"abstract":"This paper introduces a recommendation system aimed at enhancing the sustainable process of risk management within airport operations, with a special focus on Occupational Stress Risks (OSRs). The recommendation system is implemented via a flexible Python code that offers seamless integration into various operational contexts. It leverages Fuzzy Cognitive Maps (FCMs) to conduct comprehensive risk assessments, subsequently generating prioritized recommendations for predefined risk management measures aimed at preventing and/or reducing the most critical OSRs. The system’s reliability has been validated by iterating the procedure with diverse input data (i.e., matrices of varying sizes) and measures. This confirms the system’s effectiveness across a broad spectrum of engineering scenarios.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"45 31","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135432506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting and Processing Unsuspected Sensitive Variables for Robust Machine Learning 鲁棒机器学习中未知敏感变量的检测与处理
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-07 DOI: 10.3390/a16110510
Laurent Risser, Agustin Martin Picard, Lucas Hervier, Jean-Michel Loubes
The problem of algorithmic bias in machine learning has recently gained a lot of attention due to its potentially strong impact on our societies. In much the same manner, algorithmic biases can alter industrial and safety-critical machine learning applications, where high-dimensional inputs are used. This issue has, however, been mostly left out of the spotlight in the machine learning literature. Contrary to societal applications, where a set of potentially sensitive variables, such as gender or race, can be defined by common sense or by regulations to draw attention to potential risks, the sensitive variables are often unsuspected in industrial and safety-critical applications. In addition, these unsuspected sensitive variables may be indirectly represented as a latent feature of the input data. For instance, the predictions of an image classifier may be altered by reconstruction artefacts in a small subset of the training images. This raises serious and well-founded concerns about the commercial deployment of AI-based solutions, especially in a context where new regulations address bias issues in AI. The purpose of our paper is, then, to first give a large overview of recent advances in robust machine learning. Then, we propose a new procedure to detect and to treat such unknown biases. As far as we know, no equivalent procedure has been proposed in the literature so far. The procedure is also generic enough to be used in a wide variety of industrial contexts. Its relevance is demonstrated on a set of satellite images used to train a classifier. In this illustration, our technique detects that a subset of the training images has reconstruction faults, leading to systematic prediction errors that would have been unsuspected using conventional cross-validation techniques.
机器学习中的算法偏差问题最近引起了很多关注,因为它可能对我们的社会产生巨大影响。以同样的方式,算法偏差可以改变工业和安全关键型机器学习应用,在这些应用中使用高维输入。然而,这个问题在机器学习文献中却很少被关注。与社会应用相反,在社会应用中,一组潜在的敏感变量,如性别或种族,可以通过常识或法规来定义,以引起对潜在风险的注意,而在工业和安全关键应用中,敏感变量通常是不被怀疑的。此外,这些未预料到的敏感变量可以间接地表示为输入数据的潜在特征。例如,图像分类器的预测可能会被一小部分训练图像中的重建伪影所改变。这引发了对基于人工智能的解决方案的商业部署的严重和有根据的担忧,特别是在新法规解决人工智能中的偏见问题的背景下。因此,本文的目的是首先对鲁棒机器学习的最新进展进行概述。然后,我们提出了一种新的方法来检测和处理这些未知的偏差。据我们所知,到目前为止,文献中还没有提出相应的程序。该程序也足够通用,可以在各种工业环境中使用。其相关性在一组用于训练分类器的卫星图像上得到了证明。在这个例子中,我们的技术检测到训练图像的一个子集有重建错误,导致使用传统交叉验证技术无法预料的系统预测错误。
{"title":"Detecting and Processing Unsuspected Sensitive Variables for Robust Machine Learning","authors":"Laurent Risser, Agustin Martin Picard, Lucas Hervier, Jean-Michel Loubes","doi":"10.3390/a16110510","DOIUrl":"https://doi.org/10.3390/a16110510","url":null,"abstract":"The problem of algorithmic bias in machine learning has recently gained a lot of attention due to its potentially strong impact on our societies. In much the same manner, algorithmic biases can alter industrial and safety-critical machine learning applications, where high-dimensional inputs are used. This issue has, however, been mostly left out of the spotlight in the machine learning literature. Contrary to societal applications, where a set of potentially sensitive variables, such as gender or race, can be defined by common sense or by regulations to draw attention to potential risks, the sensitive variables are often unsuspected in industrial and safety-critical applications. In addition, these unsuspected sensitive variables may be indirectly represented as a latent feature of the input data. For instance, the predictions of an image classifier may be altered by reconstruction artefacts in a small subset of the training images. This raises serious and well-founded concerns about the commercial deployment of AI-based solutions, especially in a context where new regulations address bias issues in AI. The purpose of our paper is, then, to first give a large overview of recent advances in robust machine learning. Then, we propose a new procedure to detect and to treat such unknown biases. As far as we know, no equivalent procedure has been proposed in the literature so far. The procedure is also generic enough to be used in a wide variety of industrial contexts. Its relevance is demonstrated on a set of satellite images used to train a classifier. In this illustration, our technique detects that a subset of the training images has reconstruction faults, leading to systematic prediction errors that would have been unsuspected using conventional cross-validation techniques.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"45 38","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135432502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting the Gap in the Day-Ahead and Real-Time Market Prices Leveraging Exogenous Weather Data 利用外生天气数据预测前一天和实时市场价格的差距
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-04 DOI: 10.3390/a16110508
Nika Nizharadze, Arash Farokhi Soofi, Saeed Manshadi
Predicting the price gap between the day-ahead Market (DAM) and the real-time Market (RTM) plays a vital role in the convergence bidding mechanism of Independent System Operators (ISOs) in wholesale electricity markets. This paper presents a model to predict the values of the price gap between the DAM and RTM using statistical machine learning algorithms and deep neural networks. In this paper, we seek to answer these questions: What will be the impact of predicting the DAM and RTM price gap directly on the prediction performance of learning methods? How can exogenous weather data affect the price gap prediction? In this paper, several exogenous features are collected, and the impacts of these features are examined to capture the best relations between the features and the target variable. An ensemble learning algorithm, namely the Random Forest (RF), is used to select the most important features. A Long Short-Term Memory (LSTM) network is used to capture long-term dependencies in predicting direct gap values between the markets stated. Moreover, the advantages of directly predicting the gap price rather than subtracting the price predictions of the DAM and RTM are shown. The presented results are based on the California Independent System Operator (CAISO)’s electricity market data for two years. The results show that direct gap prediction using exogenous weather features decreases the error of learning methods by 46%. Therefore, the presented method mitigates the prediction error of the price gap between the DAM and RTM. Thus, the convergence bidders can increase their profit, and the ISOs can tune their mechanism accordingly.
日前市场(DAM)与实时市场(RTM)之间的价格差预测对独立系统运营商(iso)在电力批发市场的聚合竞价机制中起着至关重要的作用。本文提出了一个利用统计机器学习算法和深度神经网络预测DAM和RTM之间价格差值的模型。在本文中,我们试图回答这些问题:预测DAM和RTM价格差距对学习方法的预测性能有什么直接影响?外生天气数据如何影响价差预测?本文收集了几个外生特征,并研究了这些特征的影响,以捕获特征与目标变量之间的最佳关系。一种集成学习算法,即随机森林(RF),用于选择最重要的特征。长短期记忆(LSTM)网络用于捕获预测市场之间直接缺口值的长期依赖关系。此外,还显示了直接预测缺口价格而不是减去DAM和RTM的价格预测的优势。本文给出的结果是基于加州独立系统运营商(CAISO)两年的电力市场数据。结果表明,使用外源天气特征的直接间隙预测使学习方法的误差降低了46%。因此,该方法减轻了DAM与RTM之间价格差距的预测误差。因此,收敛竞标者可以增加他们的利润,iso可以相应地调整他们的机制。
{"title":"Predicting the Gap in the Day-Ahead and Real-Time Market Prices Leveraging Exogenous Weather Data","authors":"Nika Nizharadze, Arash Farokhi Soofi, Saeed Manshadi","doi":"10.3390/a16110508","DOIUrl":"https://doi.org/10.3390/a16110508","url":null,"abstract":"Predicting the price gap between the day-ahead Market (DAM) and the real-time Market (RTM) plays a vital role in the convergence bidding mechanism of Independent System Operators (ISOs) in wholesale electricity markets. This paper presents a model to predict the values of the price gap between the DAM and RTM using statistical machine learning algorithms and deep neural networks. In this paper, we seek to answer these questions: What will be the impact of predicting the DAM and RTM price gap directly on the prediction performance of learning methods? How can exogenous weather data affect the price gap prediction? In this paper, several exogenous features are collected, and the impacts of these features are examined to capture the best relations between the features and the target variable. An ensemble learning algorithm, namely the Random Forest (RF), is used to select the most important features. A Long Short-Term Memory (LSTM) network is used to capture long-term dependencies in predicting direct gap values between the markets stated. Moreover, the advantages of directly predicting the gap price rather than subtracting the price predictions of the DAM and RTM are shown. The presented results are based on the California Independent System Operator (CAISO)’s electricity market data for two years. The results show that direct gap prediction using exogenous weather features decreases the error of learning methods by 46%. Therefore, the presented method mitigates the prediction error of the price gap between the DAM and RTM. Thus, the convergence bidders can increase their profit, and the ISOs can tune their mechanism accordingly.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"32 13","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135773349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parkinson’s Disease Classification Framework Using Vocal Dynamics in Connected Speech 在连接语音中使用声音动力学的帕金森病分类框架
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-04 DOI: 10.3390/a16110509
Sai Bharadwaj Appakaya, Ruchira Pratihar, Ravi Sankar
Parkinson’s disease (PD) classification through speech has been an advancing field of research because of its ease of acquisition and processing. The minimal infrastructure requirements of the system have also made it suitable for telemonitoring applications. Researchers have studied the effects of PD on speech from various perspectives using different speech tasks. Typical speech deficits due to PD include voice monotony (e.g., monopitch), breathy or rough quality, and articulatory errors. In connected speech, these symptoms are more emphatic, which is also the basis for speech assessment in popular rating scales used for PD, like the Unified Parkinson’s Disease Rating Scale (UPDRS) and Hoehn and Yahr (HY). The current study introduces an innovative framework that integrates pitch-synchronous segmentation and an optimized set of features to investigate and analyze continuous speech from both PD patients and healthy controls (HC). Comparison of the proposed framework against existing methods has shown its superiority in classification performance and mitigation of overfitting in machine learning models. A set of optimal classifiers with unbiased decision-making was identified after comparing several machine learning models. The outcomes yielded by the classifiers demonstrate that the framework effectively learns the intrinsic characteristics of PD from connected speech, which can potentially offer valuable assistance in clinical diagnosis.
通过言语分类帕金森病(PD)因其易于获取和处理而成为一个前沿研究领域。该系统对基础设施的最低要求也使其适合远程监控应用。研究者利用不同的言语任务从不同的角度研究了PD对言语的影响。由PD引起的典型言语缺陷包括声音单调(例如,单音),呼吸或粗糙的质量,以及发音错误。在关联言语中,这些症状更加突出,这也是常用的PD评定量表(如统一帕金森病评定量表(UPDRS)和Hoehn and Yahr (HY))的言语评估基础。目前的研究引入了一个创新的框架,该框架集成了音高同步分割和一组优化的功能,用于调查和分析PD患者和健康对照(HC)的连续语音。将提出的框架与现有方法进行比较,表明其在分类性能和缓解机器学习模型的过拟合方面具有优势。通过比较几种机器学习模型,确定了一组具有无偏决策的最优分类器。分类器产生的结果表明,该框架可以有效地从连接语音中学习PD的内在特征,这可能为临床诊断提供有价值的帮助。
{"title":"Parkinson’s Disease Classification Framework Using Vocal Dynamics in Connected Speech","authors":"Sai Bharadwaj Appakaya, Ruchira Pratihar, Ravi Sankar","doi":"10.3390/a16110509","DOIUrl":"https://doi.org/10.3390/a16110509","url":null,"abstract":"Parkinson’s disease (PD) classification through speech has been an advancing field of research because of its ease of acquisition and processing. The minimal infrastructure requirements of the system have also made it suitable for telemonitoring applications. Researchers have studied the effects of PD on speech from various perspectives using different speech tasks. Typical speech deficits due to PD include voice monotony (e.g., monopitch), breathy or rough quality, and articulatory errors. In connected speech, these symptoms are more emphatic, which is also the basis for speech assessment in popular rating scales used for PD, like the Unified Parkinson’s Disease Rating Scale (UPDRS) and Hoehn and Yahr (HY). The current study introduces an innovative framework that integrates pitch-synchronous segmentation and an optimized set of features to investigate and analyze continuous speech from both PD patients and healthy controls (HC). Comparison of the proposed framework against existing methods has shown its superiority in classification performance and mitigation of overfitting in machine learning models. A set of optimal classifiers with unbiased decision-making was identified after comparing several machine learning models. The outcomes yielded by the classifiers demonstrate that the framework effectively learns the intrinsic characteristics of PD from connected speech, which can potentially offer valuable assistance in clinical diagnosis.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"30 16","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135774659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Dive into Fake News Detection: Feature-Centric Classification with Ensemble and Deep Learning Methods 深入研究假新闻检测:以特征为中心的集成和深度学习方法分类
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-03 DOI: 10.3390/a16110507
Fawaz Khaled Alarfaj, Jawad Abbas Khan
The online spread of fake news on various platforms has emerged as a significant concern, posing threats to public opinion, political stability, and the dissemination of reliable information. Researchers have turned to advanced technologies, including machine learning (ML) and deep learning (DL) techniques, to detect and classify fake news to address this issue. This research study explores fake news classification using diverse ML and DL approaches. We utilized a well-known “Fake News” dataset sourced from Kaggle, encompassing a labelled news collection. We implemented diverse ML models, including multinomial naïve bayes (MNB), gaussian naïve bayes (GNB), Bernoulli naïve Bayes (BNB), logistic regression (LR), and passive aggressive classifier (PAC). Additionally, we explored DL models, such as long short-term memory (LSTM), convolutional neural networks (CNN), and CNN-LSTM. We compared the performance of these models based on key evaluation metrics, such as accuracy, precision, recall, and the F1 score. Additionally, we conducted cross-validation and hyperparameter tuning to ensure optimal performance. The results provide valuable insights into the strengths and weaknesses of each model in classifying fake news. We observed that DL models, particularly LSTM and CNN-LSTM, showed better performance compared to traditional ML models. These models achieved higher accuracy and demonstrated robustness in classification tasks. These findings emphasize the potential of DL models to tackle the spread of fake news effectively and highlight the importance of utilizing advanced techniques to address this challenging problem.
假新闻在各种平台上的网络传播已经成为一个重大问题,对公众舆论、政治稳定和可靠信息的传播构成威胁。研究人员已经转向先进的技术,包括机器学习(ML)和深度学习(DL)技术,来检测和分类假新闻,以解决这个问题。本研究使用不同的ML和DL方法探索假新闻分类。我们使用了来自Kaggle的著名“假新闻”数据集,包括一个标记的新闻集合。我们实现了多种机器学习模型,包括多项naïve贝叶斯(MNB)、高斯naïve贝叶斯(GNB)、伯努利naïve贝叶斯(BNB)、逻辑回归(LR)和被动攻击分类器(PAC)。此外,我们还探索了深度学习模型,如长短期记忆(LSTM)、卷积神经网络(CNN)和CNN-LSTM。我们基于关键评估指标(如准确性、精度、召回率和F1分数)比较了这些模型的性能。此外,我们进行了交叉验证和超参数调优以确保最佳性能。结果为每个模型在分类假新闻方面的优缺点提供了有价值的见解。我们观察到DL模型,特别是LSTM和CNN-LSTM,与传统的ML模型相比表现出更好的性能。这些模型在分类任务中获得了更高的精度和鲁棒性。这些发现强调了深度学习模型有效解决假新闻传播的潜力,并强调了利用先进技术解决这一具有挑战性问题的重要性。
{"title":"Deep Dive into Fake News Detection: Feature-Centric Classification with Ensemble and Deep Learning Methods","authors":"Fawaz Khaled Alarfaj, Jawad Abbas Khan","doi":"10.3390/a16110507","DOIUrl":"https://doi.org/10.3390/a16110507","url":null,"abstract":"The online spread of fake news on various platforms has emerged as a significant concern, posing threats to public opinion, political stability, and the dissemination of reliable information. Researchers have turned to advanced technologies, including machine learning (ML) and deep learning (DL) techniques, to detect and classify fake news to address this issue. This research study explores fake news classification using diverse ML and DL approaches. We utilized a well-known “Fake News” dataset sourced from Kaggle, encompassing a labelled news collection. We implemented diverse ML models, including multinomial naïve bayes (MNB), gaussian naïve bayes (GNB), Bernoulli naïve Bayes (BNB), logistic regression (LR), and passive aggressive classifier (PAC). Additionally, we explored DL models, such as long short-term memory (LSTM), convolutional neural networks (CNN), and CNN-LSTM. We compared the performance of these models based on key evaluation metrics, such as accuracy, precision, recall, and the F1 score. Additionally, we conducted cross-validation and hyperparameter tuning to ensure optimal performance. The results provide valuable insights into the strengths and weaknesses of each model in classifying fake news. We observed that DL models, particularly LSTM and CNN-LSTM, showed better performance compared to traditional ML models. These models achieved higher accuracy and demonstrated robustness in classification tasks. These findings emphasize the potential of DL models to tackle the spread of fake news effectively and highlight the importance of utilizing advanced techniques to address this challenging problem.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"39 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135868337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Algorithms
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1