首页 > 最新文献

Algorithms最新文献

英文 中文
Artificial Intelligence-Based System for Retinal Disease Diagnosis 基于人工智能的视网膜疾病诊断系统
Pub Date : 2024-07-18 DOI: 10.3390/a17070315
E. V. Orlova
The growth in the number of people suffering from eye diseases determines the relevance of research in the field of diagnosing retinal pathologies. Artificial intelligence models and algorithms based on measurements obtained via electrophysiological methods can significantly improve and speed up the analysis of results and diagnostics. We propose an approach to designing an artificial intelligent diagnosis system (AI diagnosis system) which includes an electrophysiological complex to collect objective information and an intelligent decision support system to justify the diagnosis. The task of diagnosing retinal diseases based on a set of heterogeneous data is considered as a multi-class classification on unbalanced data. The decision support system includes two classifiers—one classifier is based on a fuzzy model and a fuzzy rule base (RB-classifier) and one uses the stochastic gradient boosting algorithm (SGB-classifier). The efficiency of algorithms in a multi-class classification on unbalanced data is assessed based on two indicators—MAUC (multi-class area under curve) and MMCC (multi-class Matthews correlation coefficient). Combining two algorithms in a decision support system provides more accurate and reliable pathology identification. The accuracy of diagnostics using the proposed AI diagnosis system is 5–8% higher than the accuracy of a system using only diagnostics based on electrophysical indicators. The AI diagnosis system differs from other systems of this class in that it is based on the processing of objective electrophysiological data and socio-demographic data about patients, as well as subjective information from the anamnesis, which ensures increased efficiency of medical decision-making. The system is tested using actual data about retinal diseases from the Russian Institute of Eye Diseases and its high efficiency is proven. Simulation experiments conducted in various scenario conditions with different combinations of factors ensured the identification of the main determinants (markers) for each diagnosis of retinal pathology.
眼疾患者人数的增长决定了视网膜病变诊断领域研究的重要性。基于电生理方法测量结果的人工智能模型和算法可以显著改善和加快结果分析和诊断。我们提出了一种设计人工智能诊断系统(AI 诊断系统)的方法,其中包括收集客观信息的电生理综合系统和证明诊断合理性的智能决策支持系统。根据一组异构数据诊断视网膜疾病的任务被视为对不平衡数据的多类分类。决策支持系统包括两个分类器,一个是基于模糊模型和模糊规则库的分类器(RB-分类器),另一个是使用随机梯度提升算法的分类器(SGB-分类器)。在对不平衡数据进行多类分类时,算法的效率是根据两个指标--多类曲线下面积(MAUC)和多类马太相关系数(MMCC)来评估的。在决策支持系统中结合两种算法,可以提供更准确、更可靠的病理鉴定。使用所提出的人工智能诊断系统的诊断准确率比仅使用基于电物理指标的诊断系统的准确率高出 5-8%。该人工智能诊断系统与其他同类系统的不同之处在于,它基于对患者的客观电生理数据和社会人口学数据以及来自病史的主观信息的处理,从而确保提高医疗决策的效率。该系统使用俄罗斯眼科疾病研究所提供的视网膜疾病实际数据进行了测试,其高效性得到了证实。在不同的场景条件下,通过不同的因素组合进行了模拟实验,确保确定了每种视网膜病变诊断的主要决定因素(标记)。
{"title":"Artificial Intelligence-Based System for Retinal Disease Diagnosis","authors":"E. V. Orlova","doi":"10.3390/a17070315","DOIUrl":"https://doi.org/10.3390/a17070315","url":null,"abstract":"The growth in the number of people suffering from eye diseases determines the relevance of research in the field of diagnosing retinal pathologies. Artificial intelligence models and algorithms based on measurements obtained via electrophysiological methods can significantly improve and speed up the analysis of results and diagnostics. We propose an approach to designing an artificial intelligent diagnosis system (AI diagnosis system) which includes an electrophysiological complex to collect objective information and an intelligent decision support system to justify the diagnosis. The task of diagnosing retinal diseases based on a set of heterogeneous data is considered as a multi-class classification on unbalanced data. The decision support system includes two classifiers—one classifier is based on a fuzzy model and a fuzzy rule base (RB-classifier) and one uses the stochastic gradient boosting algorithm (SGB-classifier). The efficiency of algorithms in a multi-class classification on unbalanced data is assessed based on two indicators—MAUC (multi-class area under curve) and MMCC (multi-class Matthews correlation coefficient). Combining two algorithms in a decision support system provides more accurate and reliable pathology identification. The accuracy of diagnostics using the proposed AI diagnosis system is 5–8% higher than the accuracy of a system using only diagnostics based on electrophysical indicators. The AI diagnosis system differs from other systems of this class in that it is based on the processing of objective electrophysiological data and socio-demographic data about patients, as well as subjective information from the anamnesis, which ensures increased efficiency of medical decision-making. The system is tested using actual data about retinal diseases from the Russian Institute of Eye Diseases and its high efficiency is proven. Simulation experiments conducted in various scenario conditions with different combinations of factors ensured the identification of the main determinants (markers) for each diagnosis of retinal pathology.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":" 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141824561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Threshold Active Learning Approach for Physical Violence Detection on Images Obtained from Video (Frame-Level) Using Pre-Trained Deep Learning Neural Network Models 使用预训练的深度学习神经网络模型在视频图像(帧级)上进行人身暴力检测的阈值主动学习方法
Pub Date : 2024-07-18 DOI: 10.3390/a17070316
Itzel M. Abundez, Roberto Alejo, Francisco Primero Primero, E. Granda-Gutiérrez, O. Portillo-Rodríguez, Juan Alberto Antonio Velázquez
Public authorities and private companies have used video cameras as part of surveillance systems, and one of their objectives is the rapid detection of physically violent actions. This task is usually performed by human visual inspection, which is labor-intensive. For this reason, different deep learning models have been implemented to remove the human eye from this task, yielding positive results. One of the main problems in detecting physical violence in videos is the variety of scenarios that can exist, which leads to different models being trained on datasets, leading them to detect physical violence in only one or a few types of videos. In this work, we present an approach for physical violence detection on images obtained from video based on threshold active learning, that increases the classifier’s robustness in environments where it was not trained. The proposed approach consists of two stages: In the first stage, pre-trained neural network models are trained on initial datasets, and we use a threshold (μ) to identify those images that the classifier considers ambiguous or hard to classify. Then, they are included in the training dataset, and the model is retrained to improve its classification performance. In the second stage, we test the model with video images from other environments, and we again employ (μ) to detect ambiguous images that a human expert analyzes to determine the real class or delete the ambiguity on them. After that, the ambiguous images are added to the original training set and the classifier is retrained; this process is repeated while ambiguous images exist. The model is a hybrid neural network that uses transfer learning and a threshold μ to detect physical violence on images obtained from video files successfully. In this active learning process, the classifier can detect physical violence in different environments, where the main contribution is the method used to obtain a threshold μ (which is based on the neural network output) that allows human experts to contribute to the classification process to obtain more robust neural networks and high-quality datasets. The experimental results show the proposed approach’s effectiveness in detecting physical violence, where it is trained using an initial dataset, and new images are added to improve its robustness in diverse environments.
公共机构和私营公司已将摄像机用作监控系统的一部分,其目标之一是快速发现身体暴力行动。这项任务通常由人工目测完成,耗费大量人力物力。为此,人们采用了不同的深度学习模型,使人眼不再参与这项任务,并取得了积极的成果。检测视频中的身体暴力的主要问题之一是可能存在的各种场景,这导致在数据集上训练不同的模型,导致它们只能在一种或几种类型的视频中检测到身体暴力。在这项工作中,我们提出了一种基于阈值主动学习的方法,用于检测视频图像中的身体暴力,从而提高分类器在未经训练的环境中的鲁棒性。所提出的方法包括两个阶段:第一阶段,在初始数据集上训练预训练的神经网络模型,我们使用阈值(μ)来识别分类器认为模糊或难以分类的图像。然后,将它们纳入训练数据集,对模型进行再训练,以提高其分类性能。在第二阶段,我们用其他环境中的视频图像来测试模型,并再次使用 (μ) 来检测模糊图像,由人类专家对其进行分析,以确定真正的类别或删除其模糊性。然后,将模糊图像添加到原始训练集,并重新训练分类器;只要存在模糊图像,这一过程就会重复。该模型是一个混合神经网络,利用迁移学习和阈值 μ 成功地检测了从视频文件中获取的图像上的身体暴力。在这个主动学习过程中,分类器可以检测到不同环境中的身体暴力,其主要贡献在于用于获得阈值μ(基于神经网络输出)的方法,该方法允许人类专家参与分类过程,以获得更强大的神经网络和高质量的数据集。实验结果表明了所提出的方法在检测身体暴力方面的有效性,该方法使用初始数据集进行训练,并添加新的图像以提高其在不同环境中的鲁棒性。
{"title":"Threshold Active Learning Approach for Physical Violence Detection on Images Obtained from Video (Frame-Level) Using Pre-Trained Deep Learning Neural Network Models","authors":"Itzel M. Abundez, Roberto Alejo, Francisco Primero Primero, E. Granda-Gutiérrez, O. Portillo-Rodríguez, Juan Alberto Antonio Velázquez","doi":"10.3390/a17070316","DOIUrl":"https://doi.org/10.3390/a17070316","url":null,"abstract":"Public authorities and private companies have used video cameras as part of surveillance systems, and one of their objectives is the rapid detection of physically violent actions. This task is usually performed by human visual inspection, which is labor-intensive. For this reason, different deep learning models have been implemented to remove the human eye from this task, yielding positive results. One of the main problems in detecting physical violence in videos is the variety of scenarios that can exist, which leads to different models being trained on datasets, leading them to detect physical violence in only one or a few types of videos. In this work, we present an approach for physical violence detection on images obtained from video based on threshold active learning, that increases the classifier’s robustness in environments where it was not trained. The proposed approach consists of two stages: In the first stage, pre-trained neural network models are trained on initial datasets, and we use a threshold (μ) to identify those images that the classifier considers ambiguous or hard to classify. Then, they are included in the training dataset, and the model is retrained to improve its classification performance. In the second stage, we test the model with video images from other environments, and we again employ (μ) to detect ambiguous images that a human expert analyzes to determine the real class or delete the ambiguity on them. After that, the ambiguous images are added to the original training set and the classifier is retrained; this process is repeated while ambiguous images exist. The model is a hybrid neural network that uses transfer learning and a threshold μ to detect physical violence on images obtained from video files successfully. In this active learning process, the classifier can detect physical violence in different environments, where the main contribution is the method used to obtain a threshold μ (which is based on the neural network output) that allows human experts to contribute to the classification process to obtain more robust neural networks and high-quality datasets. The experimental results show the proposed approach’s effectiveness in detecting physical violence, where it is trained using an initial dataset, and new images are added to improve its robustness in diverse environments.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":" 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141827195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Linear System Identification-Oriented Optimal Tampering Attack Strategy and Implementation Based on Information Entropy with Multiple Binary Observations 基于多重二进制观测信息熵的线性系统识别导向最优篡改攻击策略与实施
Pub Date : 2024-06-03 DOI: 10.3390/a17060239
Zhongwei Bai, Peng Yu, Yan Liu, Jin Guo
With the rapid development of computer technology, communication technology, and control technology, cyber-physical systems (CPSs) have been widely used and developed. However, there are massive information interactions in CPSs, which lead to an increase in the amount of data transmitted over the network. The data communication, once attacked by the network, will seriously affect the security and stability of the system. In this paper, for the data tampering attack existing in the linear system with multiple binary observations, in the case where the estimation algorithm of the defender is unknown, the optimization index is constructed based on information entropy from the attacker’s point of view, and the problem is modeled. For the problem of the multi-parameter optimization with energy constraints, this paper uses particle swarm optimization (PSO) to obtain the optimal data tampering attack solution set, and gives the estimation method of unknown parameters in the case of unknown parameters. To implement the real-time improvement of online implementation, the BP neural network is designed. Finally, the validity of the conclusions is verified through numerical simulation. This means that the attacker can construct effective metrics based on information entropy without the knowledge of the defense’s discrimination algorithm. In addition, the optimal attack strategy implementation based on PSO and BP is also effective.
随着计算机技术、通信技术和控制技术的飞速发展,网络物理系统(CPS)得到了广泛的应用和发展。然而,CPS 中存在大量的信息交互,导致网络传输的数据量不断增加。数据通信一旦受到网络攻击,将严重影响系统的安全性和稳定性。本文针对具有多个二进制观测值的线性系统中存在的数据篡改攻击,在防御方估计算法未知的情况下,从攻击方的角度出发,构建了基于信息熵的优化指标,并对问题进行了建模。针对有能量约束的多参数优化问题,本文利用粒子群优化(PSO)获得最优数据篡改攻击解集,并给出了未知参数情况下的未知参数估计方法。为了实现在线执行的实时改进,设计了 BP 神经网络。最后,通过数值模拟验证了结论的正确性。这意味着攻击方可以在不知道防御方判别算法的情况下,根据信息熵构建有效的度量。此外,基于 PSO 和 BP 的最优攻击策略实施也是有效的。
{"title":"Linear System Identification-Oriented Optimal Tampering Attack Strategy and Implementation Based on Information Entropy with Multiple Binary Observations","authors":"Zhongwei Bai, Peng Yu, Yan Liu, Jin Guo","doi":"10.3390/a17060239","DOIUrl":"https://doi.org/10.3390/a17060239","url":null,"abstract":"With the rapid development of computer technology, communication technology, and control technology, cyber-physical systems (CPSs) have been widely used and developed. However, there are massive information interactions in CPSs, which lead to an increase in the amount of data transmitted over the network. The data communication, once attacked by the network, will seriously affect the security and stability of the system. In this paper, for the data tampering attack existing in the linear system with multiple binary observations, in the case where the estimation algorithm of the defender is unknown, the optimization index is constructed based on information entropy from the attacker’s point of view, and the problem is modeled. For the problem of the multi-parameter optimization with energy constraints, this paper uses particle swarm optimization (PSO) to obtain the optimal data tampering attack solution set, and gives the estimation method of unknown parameters in the case of unknown parameters. To implement the real-time improvement of online implementation, the BP neural network is designed. Finally, the validity of the conclusions is verified through numerical simulation. This means that the attacker can construct effective metrics based on information entropy without the knowledge of the defense’s discrimination algorithm. In addition, the optimal attack strategy implementation based on PSO and BP is also effective.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"31 49","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141270475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Extraction Based on Sparse Coding Approach for Hand Grasp Type Classification 基于稀疏编码方法的手抓类型分类特征提取
Pub Date : 2024-06-03 DOI: 10.3390/a17060240
Jirayu Samkunta, P. Ketthong, N. T. Mai, M.A.S. Kamal, I. Murakami, Kou Yamada
The kinematics of the human hand exhibit complex and diverse characteristics unique to each individual. Various techniques such as vision-based, ultrasonic-based, and data-glove-based approaches have been employed to analyze human hand movements. However, a critical challenge remains in efficiently analyzing and classifying hand grasp types based on time-series kinematic data. In this paper, we propose a novel sparse coding feature extraction technique based on dictionary learning to address this challenge. Our method enhances model accuracy, reduces training time, and minimizes overfitting risk. We benchmarked our approach against principal component analysis (PCA) and sparse coding based on a Gaussian random dictionary. Our results demonstrate a significant improvement in classification accuracy: achieving 81.78% with our method compared to 31.43% for PCA and 77.27% for the Gaussian random dictionary. Furthermore, our technique outperforms in terms of macro-average F1-score and average area under the curve (AUC) while also significantly reducing the number of features required.
人的手部运动学表现出复杂多样的特征,每个人都有其独特之处。各种技术,如基于视觉的方法、基于超声波的方法和基于数据手套的方法,已被用于分析人的手部运动。然而,基于时间序列运动学数据对手部抓握类型进行有效分析和分类仍是一项严峻挑战。在本文中,我们提出了一种基于字典学习的新型稀疏编码特征提取技术来应对这一挑战。我们的方法提高了模型的准确性,缩短了训练时间,并最大限度地降低了过拟合风险。我们将我们的方法与主成分分析(PCA)和基于高斯随机字典的稀疏编码进行了比较。结果表明,我们的方法显著提高了分类准确率:与 PCA 的 31.43% 和高斯随机字典的 77.27% 相比,我们的方法达到了 81.78%。此外,我们的技术在宏观平均 F1 分数和平均曲线下面积 (AUC) 方面表现出色,同时还显著减少了所需特征的数量。
{"title":"Feature Extraction Based on Sparse Coding Approach for Hand Grasp Type Classification","authors":"Jirayu Samkunta, P. Ketthong, N. T. Mai, M.A.S. Kamal, I. Murakami, Kou Yamada","doi":"10.3390/a17060240","DOIUrl":"https://doi.org/10.3390/a17060240","url":null,"abstract":"The kinematics of the human hand exhibit complex and diverse characteristics unique to each individual. Various techniques such as vision-based, ultrasonic-based, and data-glove-based approaches have been employed to analyze human hand movements. However, a critical challenge remains in efficiently analyzing and classifying hand grasp types based on time-series kinematic data. In this paper, we propose a novel sparse coding feature extraction technique based on dictionary learning to address this challenge. Our method enhances model accuracy, reduces training time, and minimizes overfitting risk. We benchmarked our approach against principal component analysis (PCA) and sparse coding based on a Gaussian random dictionary. Our results demonstrate a significant improvement in classification accuracy: achieving 81.78% with our method compared to 31.43% for PCA and 77.27% for the Gaussian random dictionary. Furthermore, our technique outperforms in terms of macro-average F1-score and average area under the curve (AUC) while also significantly reducing the number of features required.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"49 50","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141269734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Competitive Analysis of Algorithms for an Online Distribution Problem 在线配送问题算法的竞争力分析
Pub Date : 2024-06-03 DOI: 10.3390/a17060237
Alessandro Barba, L. Bertazzi, Bruce L. Golden
We study an online distribution problem in which a producer has to send a load from an origin to a destination. At each time period before the deadline, they ask for transportation price quotes and have to decide to either accept or not accept the minimum offered price. If this price is not accepted, they have to pay a penalty cost, which may be the cost to ask for new quotes, the penalty cost for a late delivery, or the inventory cost to store the load for a certain duration. The aim is to minimize the sum of the transportation and the penalty costs. This problem has interesting real-world applications, given that transportation quotes can be obtained from professional websites nowadays. We show that the classical online algorithm used to solve the well-known Secretary problem is not able to provide, on average, effective solutions to our problem, given the trade-off between the transportation and the penalty costs. Therefore, we design two classes of online algorithms. The first class is based on a given time of acceptance, while the second is based on a given threshold price. We formally prove the competitive ratio of each algorithm, i.e., the worst-case performance of the online algorithm with respect to the optimal solution of the offline problem, in which all transportation prices are known at the beginning, rather than being revealed over time. The computational results show the algorithms’ performance on average and in the worst-case scenario when the transportation prices are generated on the basis of given probability distributions.
我们研究的是一个在线配送问题,在这个问题中,生产者必须将货物从起点运往终点。在截止日期前的每个时间段,他们都会询问运输报价,并决定接受或不接受最低报价。如果不接受该价格,则必须支付惩罚成本,可能是要求新报价的成本、延迟交货的惩罚成本或在一定时间内储存货物的库存成本。我们的目标是最大限度地降低运输成本和惩罚成本之和。这个问题在现实世界中有着有趣的应用,因为现在可以从专业网站上获得运输报价。我们的研究表明,考虑到运输成本和罚款成本之间的权衡,用于解决众所周知的秘书问题的经典在线算法平均而言无法为我们的问题提供有效的解决方案。因此,我们设计了两类在线算法。第一类基于给定的接受时间,第二类基于给定的门槛价格。我们正式证明了每种算法的竞争比率,即在线算法相对于离线问题最优解的最坏情况性能,在离线问题中,所有运输价格在一开始都是已知的,而不是随着时间的推移而揭示的。计算结果显示了根据给定概率分布生成运输价格时,算法的平均性能和最坏情况下的性能。
{"title":"Competitive Analysis of Algorithms for an Online Distribution Problem","authors":"Alessandro Barba, L. Bertazzi, Bruce L. Golden","doi":"10.3390/a17060237","DOIUrl":"https://doi.org/10.3390/a17060237","url":null,"abstract":"We study an online distribution problem in which a producer has to send a load from an origin to a destination. At each time period before the deadline, they ask for transportation price quotes and have to decide to either accept or not accept the minimum offered price. If this price is not accepted, they have to pay a penalty cost, which may be the cost to ask for new quotes, the penalty cost for a late delivery, or the inventory cost to store the load for a certain duration. The aim is to minimize the sum of the transportation and the penalty costs. This problem has interesting real-world applications, given that transportation quotes can be obtained from professional websites nowadays. We show that the classical online algorithm used to solve the well-known Secretary problem is not able to provide, on average, effective solutions to our problem, given the trade-off between the transportation and the penalty costs. Therefore, we design two classes of online algorithms. The first class is based on a given time of acceptance, while the second is based on a given threshold price. We formally prove the competitive ratio of each algorithm, i.e., the worst-case performance of the online algorithm with respect to the optimal solution of the offline problem, in which all transportation prices are known at the beginning, rather than being revealed over time. The computational results show the algorithms’ performance on average and in the worst-case scenario when the transportation prices are generated on the basis of given probability distributions.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"116 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141271908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simple Histogram Equalization Technique Improves Performance of VGG Models on Facial Emotion Recognition Datasets 简单的直方图均衡化技术提高了 VGG 模型在人脸情感识别数据集上的性能
Pub Date : 2024-06-03 DOI: 10.3390/a17060238
Jaher Hassan Chowdhury, Qian Liu, S. Ramanna
Facial emotion recognition (FER) is crucial across psychology, neuroscience, computer vision, and machine learning due to the diversified and subjective nature of emotions, varying considerably across individuals, cultures, and contexts. This study explored FER through convolutional neural networks (CNNs) and Histogram Equalization techniques. It investigated the impact of histogram equalization, data augmentation, and various model optimization strategies on FER accuracy across different datasets like KDEF, CK+, and FER2013. Using pre-trained VGG architectures, such as VGG19 and VGG16, this study also examined the effectiveness of fine-tuning hyperparameters and implementing different learning rate schedulers. The evaluation encompassed diverse metrics including accuracy, Area Under the Receiver Operating Characteristic Curve (AUC-ROC), Area Under the Precision–Recall Curve (AUC-PRC), and Weighted F1 score. Notably, the fine-tuned VGG architecture demonstrated a state-of-the-art performance compared to conventional transfer learning models and achieved 100%, 95.92%, and 69.65% on the CK+, KDEF, and FER2013 datasets, respectively.
面部情绪识别(FER)在心理学、神经科学、计算机视觉和机器学习领域都至关重要,因为情绪具有多样化和主观性的特点,不同的个体、文化和环境会有很大的差异。本研究通过卷积神经网络(CNN)和直方图均衡化技术对 FER 进行了探索。它研究了直方图均衡化、数据增强和各种模型优化策略对 KDEF、CK+ 和 FER2013 等不同数据集的 FER 准确率的影响。这项研究还使用预先训练好的 VGG 架构(如 VGG19 和 VGG16),检验了微调超参数和实施不同学习率调度器的效果。评估涵盖了多种指标,包括准确率、接收者操作特征曲线下面积(AUC-ROC)、精度-召回曲线下面积(AUC-PRC)和加权 F1 分数。值得注意的是,与传统的迁移学习模型相比,经过微调的 VGG 架构表现出了最先进的性能,在 CK+、KDEF 和 FER2013 数据集上的准确率分别达到了 100%、95.92% 和 69.65%。
{"title":"Simple Histogram Equalization Technique Improves Performance of VGG Models on Facial Emotion Recognition Datasets","authors":"Jaher Hassan Chowdhury, Qian Liu, S. Ramanna","doi":"10.3390/a17060238","DOIUrl":"https://doi.org/10.3390/a17060238","url":null,"abstract":"Facial emotion recognition (FER) is crucial across psychology, neuroscience, computer vision, and machine learning due to the diversified and subjective nature of emotions, varying considerably across individuals, cultures, and contexts. This study explored FER through convolutional neural networks (CNNs) and Histogram Equalization techniques. It investigated the impact of histogram equalization, data augmentation, and various model optimization strategies on FER accuracy across different datasets like KDEF, CK+, and FER2013. Using pre-trained VGG architectures, such as VGG19 and VGG16, this study also examined the effectiveness of fine-tuning hyperparameters and implementing different learning rate schedulers. The evaluation encompassed diverse metrics including accuracy, Area Under the Receiver Operating Characteristic Curve (AUC-ROC), Area Under the Precision–Recall Curve (AUC-PRC), and Weighted F1 score. Notably, the fine-tuned VGG architecture demonstrated a state-of-the-art performance compared to conventional transfer learning models and achieved 100%, 95.92%, and 69.65% on the CK+, KDEF, and FER2013 datasets, respectively.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"60 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141269498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Machine Learning Algorithms to Evaluate Prostate Cancer 评估前列腺癌的混合机器学习算法
Pub Date : 2024-06-02 DOI: 10.3390/a17060236
Dimitrios Morakis, Adam Adamopoulos
The adequacy and efficacy of simple and hybrid machine learning and Computational Intelligence algorithms were evaluated for the classification of potential prostate cancer patients in two distinct categories, the high- and the low-risk group for PCa. The evaluation is based on randomly generated surrogate data for the biomarker PSA, considering that reported epidemiological data indicated that PSA values follow a lognormal distribution. In addition, four more biomarkers were considered, namely, PSAD (PSA density), PSAV (PSA velocity), PSA ratio, and Digital Rectal Exam evaluation (DRE), as well as patient age. Seven simple classification algorithms, namely, Decision Trees, Random Forests, Support Vector Machines, K-Nearest Neighbors, Logistic Regression, Naïve Bayes, and Artificial Neural Networks, were evaluated in terms of classification accuracy. In addition, three hybrid algorithms were developed and introduced in the present work, where Genetic Algorithms were utilized as a metaheuristic searching technique in order to optimize the training set, in terms of minimizing its size, to give optimal classification accuracy for the simple algorithms including K-Nearest Neighbors, a K-means clustering algorithm, and a genetic clustering algorithm. Results indicated that prostate cancer cases can be classified with high accuracy, even by the use of small training sets, with sizes that could be even smaller than 30% of the dataset. Numerous computer experiments indicated that the proposed training set minimization does not cause overfitting of the hybrid algorithms. Finally, an easy-to-use Graphical User Interface (GUI) was implemented, incorporating all the evaluated algorithms and the decision-making procedure.
该研究评估了简单和混合机器学习与计算智能算法在将潜在前列腺癌患者分为两个不同类别(PCa 高风险组和低风险组)方面的充分性和有效性。考虑到已报道的流行病学数据表明 PSA 值呈对数正态分布,该评估基于随机生成的生物标志物 PSA 的替代数据。此外,还考虑了另外四种生物标志物,即 PSAD(PSA 密度)、PSAV(PSA 速度)、PSA 比值和数字直肠检查评估(DRE)以及患者年龄。根据分类准确性评估了七种简单的分类算法,即决策树、随机森林、支持向量机、K-最近邻、逻辑回归、奈夫贝叶和人工神经网络。此外,本研究还开发并引入了三种混合算法,其中遗传算法被用作一种元启发式搜索技术,以优化训练集,使其规模最小,从而使包括 K-近邻算法、K-均值聚类算法和遗传聚类算法在内的简单算法获得最佳分类准确性。结果表明,即使使用较小的训练集,前列腺癌病例的分类准确率也很高,训练集的大小甚至可以小于数据集的 30%。大量计算机实验表明,建议的训练集最小化不会导致混合算法过度拟合。最后,还实现了一个易于使用的图形用户界面(GUI),其中包含了所有经过评估的算法和决策过程。
{"title":"Hybrid Machine Learning Algorithms to Evaluate Prostate Cancer","authors":"Dimitrios Morakis, Adam Adamopoulos","doi":"10.3390/a17060236","DOIUrl":"https://doi.org/10.3390/a17060236","url":null,"abstract":"The adequacy and efficacy of simple and hybrid machine learning and Computational Intelligence algorithms were evaluated for the classification of potential prostate cancer patients in two distinct categories, the high- and the low-risk group for PCa. The evaluation is based on randomly generated surrogate data for the biomarker PSA, considering that reported epidemiological data indicated that PSA values follow a lognormal distribution. In addition, four more biomarkers were considered, namely, PSAD (PSA density), PSAV (PSA velocity), PSA ratio, and Digital Rectal Exam evaluation (DRE), as well as patient age. Seven simple classification algorithms, namely, Decision Trees, Random Forests, Support Vector Machines, K-Nearest Neighbors, Logistic Regression, Naïve Bayes, and Artificial Neural Networks, were evaluated in terms of classification accuracy. In addition, three hybrid algorithms were developed and introduced in the present work, where Genetic Algorithms were utilized as a metaheuristic searching technique in order to optimize the training set, in terms of minimizing its size, to give optimal classification accuracy for the simple algorithms including K-Nearest Neighbors, a K-means clustering algorithm, and a genetic clustering algorithm. Results indicated that prostate cancer cases can be classified with high accuracy, even by the use of small training sets, with sizes that could be even smaller than 30% of the dataset. Numerous computer experiments indicated that the proposed training set minimization does not cause overfitting of the hybrid algorithms. Finally, an easy-to-use Graphical User Interface (GUI) was implemented, incorporating all the evaluated algorithms and the decision-making procedure.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"25 23","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141272773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Personalized Loudness Control for Multi-Track Recordings 多轨录音的自动个性化响度控制
Pub Date : 2024-05-24 DOI: 10.3390/a17060228
Bogdan Moroșanu, Marian Negru, C. Paleologu
This paper presents a novel approach to automated music mixing, focusing on the optimization of loudness control in multi-track recordings. By taking into consideration the complexity and artistic nature of traditional mixing processes, we introduce a personalized multi-track leveling method using two types of approaches: a customized genetic algorithm and a neural network-based method. Our method tackles common challenges encountered by audio professionals during prolonged mixing sessions, where consistency can decrease as a result of fatigue. Our algorithm serves as a ‘virtual assistant’ to consistently uphold the initial mixing objectives, hence assuring consistent quality throughout the process. In addition, our system automates the repetitive elements of the mixing process, resulting in a substantial reduction in production time. This enables engineers to dedicate their attention to more innovative and intricate jobs. Our experimental framework involves 20 diverse songs and 10 sound engineers possessing a wide range of expertise, offering a useful perspective on the adaptability and effectiveness of our method in real-world scenarios. The results demonstrate the capacity of the algorithms to mimic decision-making, achieving an optimal balance in the mix that resonates with the emotional and technical aspects of music production.
本文介绍了一种新颖的自动音乐混音方法,重点是优化多轨录音的响度控制。考虑到传统混音过程的复杂性和艺术性,我们介绍了一种个性化的多轨音量调节方法,该方法采用了两种类型的方法:一种是定制遗传算法,另一种是基于神经网络的方法。我们的方法可以解决音频专业人员在长时间混音过程中遇到的共同难题,因为疲劳会导致一致性降低。我们的算法可充当 "虚拟助手",始终坚持最初的混音目标,从而确保整个过程的质量始终如一。此外,我们的系统还能自动处理混合过程中的重复性工作,从而大幅缩短生产时间。这样,工程师们就能将精力投入到更具创新性的复杂工作中。我们的实验框架涉及 20 首不同的歌曲和 10 位拥有各种专业知识的录音工程师,为我们的方法在现实世界中的适应性和有效性提供了一个有用的视角。实验结果表明,算法能够模拟决策,在混音中达到最佳平衡,与音乐制作的情感和技术方面产生共鸣。
{"title":"Automated Personalized Loudness Control for Multi-Track Recordings","authors":"Bogdan Moroșanu, Marian Negru, C. Paleologu","doi":"10.3390/a17060228","DOIUrl":"https://doi.org/10.3390/a17060228","url":null,"abstract":"This paper presents a novel approach to automated music mixing, focusing on the optimization of loudness control in multi-track recordings. By taking into consideration the complexity and artistic nature of traditional mixing processes, we introduce a personalized multi-track leveling method using two types of approaches: a customized genetic algorithm and a neural network-based method. Our method tackles common challenges encountered by audio professionals during prolonged mixing sessions, where consistency can decrease as a result of fatigue. Our algorithm serves as a ‘virtual assistant’ to consistently uphold the initial mixing objectives, hence assuring consistent quality throughout the process. In addition, our system automates the repetitive elements of the mixing process, resulting in a substantial reduction in production time. This enables engineers to dedicate their attention to more innovative and intricate jobs. Our experimental framework involves 20 diverse songs and 10 sound engineers possessing a wide range of expertise, offering a useful perspective on the adaptability and effectiveness of our method in real-world scenarios. The results demonstrate the capacity of the algorithms to mimic decision-making, achieving an optimal balance in the mix that resonates with the emotional and technical aspects of music production.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"51 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141102211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitigating Co-Activity Conflicts and Resource Overallocation in Construction Projects: A Modular Heuristic Scheduling Approach with Primavera P6 EPPM Integration 缓解建筑项目中的共同活动冲突和资源过度分配:与 Primavera P6 EPPM 集成的模块化启发式排程方法
Pub Date : 2024-05-24 DOI: 10.3390/a17060230
Khwansiri Ninpan, Shuzhang Huang, Francesco Vitillo, Mohamad Ali Assaad, Lies Benmiloud Bechet, Robert Plana
This paper proposes a heuristic approach for managing complex construction projects. The tool incorporates Primavera P6 EPPM and Synchro 4D, enabling proactive clash detection and resolution of spatial conflicts during concurrent tasks. Additionally, it performs resource verification for sufficient allocation before task initiation. This integrated approach facilitates the generation of conflict-free and feasible construction schedules. By adhering to project constraints and seamlessly integrating with existing industry tools, the proposed solution offers a comprehensive and robust approach to construction project management. This constitutes, to our knowledge, the first dynamic digital twin for the delivery of a complex project.
本文提出了一种管理复杂建筑项目的启发式方法。该工具结合了 Primavera P6 EPPM 和 Synchro 4D,可在并发任务期间主动检测冲突并解决空间冲突。此外,该工具还能在任务启动前进行资源核查,以便充分分配资源。这种集成方法有助于生成无冲突、可行的施工进度计划。通过遵守项目约束条件并与现有行业工具无缝集成,所提出的解决方案为建筑项目管理提供了一种全面而稳健的方法。据我们所知,这是首个用于交付复杂项目的动态数字孪生系统。
{"title":"Mitigating Co-Activity Conflicts and Resource Overallocation in Construction Projects: A Modular Heuristic Scheduling Approach with Primavera P6 EPPM Integration","authors":"Khwansiri Ninpan, Shuzhang Huang, Francesco Vitillo, Mohamad Ali Assaad, Lies Benmiloud Bechet, Robert Plana","doi":"10.3390/a17060230","DOIUrl":"https://doi.org/10.3390/a17060230","url":null,"abstract":"This paper proposes a heuristic approach for managing complex construction projects. The tool incorporates Primavera P6 EPPM and Synchro 4D, enabling proactive clash detection and resolution of spatial conflicts during concurrent tasks. Additionally, it performs resource verification for sufficient allocation before task initiation. This integrated approach facilitates the generation of conflict-free and feasible construction schedules. By adhering to project constraints and seamlessly integrating with existing industry tools, the proposed solution offers a comprehensive and robust approach to construction project management. This constitutes, to our knowledge, the first dynamic digital twin for the delivery of a complex project.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"66 23","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141101663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable AI Frameworks: Navigating the Present Challenges and Unveiling Innovative Applications 可解释的人工智能框架:驾驭当前挑战,揭示创新应用
Pub Date : 2024-05-24 DOI: 10.3390/a17060227
Neeraj Anand Sharma, Rishal Ravikesh Chand, Zain Buksh, A. B. M. S. Ali, Ambreen Hanif, A. Beheshti
This study delves into the realm of Explainable Artificial Intelligence (XAI) frameworks, aiming to empower researchers and practitioners with a deeper understanding of these tools. We establish a comprehensive knowledge base by classifying and analyzing prominent XAI solutions based on key attributes like explanation type, model dependence, and use cases. This resource equips users to navigate the diverse XAI landscape and select the most suitable framework for their specific needs. Furthermore, the study proposes a novel framework called XAIE (eXplainable AI Evaluator) for informed decision-making in XAI adoption. This framework empowers users to assess different XAI options based on their application context objectively. This will lead to more responsible AI development by fostering transparency and trust. Finally, the research identifies the limitations and challenges associated with the existing XAI frameworks, paving the way for future advancements. By highlighting these areas, the study guides researchers and developers in enhancing the capabilities of Explainable AI.
本研究深入探讨了可解释人工智能(XAI)框架,旨在让研究人员和从业人员更深入地了解这些工具。我们根据解释类型、模型依赖性和用例等关键属性对著名的 XAI 解决方案进行分类和分析,从而建立了一个全面的知识库。这一资源使用户能够在多样化的 XAI 领域中游刃有余,并根据自己的具体需求选择最合适的框架。此外,该研究还提出了一个名为 XAIE(eXplainable AI Evaluator)的新框架,用于在采用 XAI 时做出明智决策。该框架使用户能够根据其应用背景客观地评估不同的 XAI 选项。这将通过提高透明度和信任度,促进更负责任的人工智能发展。最后,研究确定了与现有 XAI 框架相关的局限性和挑战,为未来的进步铺平了道路。通过强调这些领域,本研究将指导研究人员和开发人员提高可解释人工智能的能力。
{"title":"Explainable AI Frameworks: Navigating the Present Challenges and Unveiling Innovative Applications","authors":"Neeraj Anand Sharma, Rishal Ravikesh Chand, Zain Buksh, A. B. M. S. Ali, Ambreen Hanif, A. Beheshti","doi":"10.3390/a17060227","DOIUrl":"https://doi.org/10.3390/a17060227","url":null,"abstract":"This study delves into the realm of Explainable Artificial Intelligence (XAI) frameworks, aiming to empower researchers and practitioners with a deeper understanding of these tools. We establish a comprehensive knowledge base by classifying and analyzing prominent XAI solutions based on key attributes like explanation type, model dependence, and use cases. This resource equips users to navigate the diverse XAI landscape and select the most suitable framework for their specific needs. Furthermore, the study proposes a novel framework called XAIE (eXplainable AI Evaluator) for informed decision-making in XAI adoption. This framework empowers users to assess different XAI options based on their application context objectively. This will lead to more responsible AI development by fostering transparency and trust. Finally, the research identifies the limitations and challenges associated with the existing XAI frameworks, paving the way for future advancements. By highlighting these areas, the study guides researchers and developers in enhancing the capabilities of Explainable AI.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"10 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141099486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Algorithms
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1