首页 > 最新文献

Advances in computational intelligence最新文献

英文 中文
Corn cash-futures basis forecasting via neural networks 基于神经网络的玉米现金期货基差预测
Pub Date : 2023-04-12 DOI: 10.1007/s43674-023-00054-2
Xiaojie Xu, Yun Zhang

Cash-futures basis forecasting represents a vital concern for various market participants in the agricultural sector, which has been rarely explored due to limitations on data and traditional econometric methods. The current study explores usefulness of the nonlinear autoregressive neural network technique for the forecasting problem in a unique and proprietary data set of daily corn cash-futures basis across nearly five-hundred cash markets from sixteen most important harvest states in the United States over a 5-year period. Through investigations of various model settings across the hidden neuron, delay, data splitting ratio, and algorithm, a chosen model with five delays and twenty hidden neurons is reached, trained using the Levenberg–Marquardt algorithm and data splitting ratio of 70% vs. 15% vs. 15% for training, validation, and testing. This model results in accurate and stable performance across the cash markets explored, which illustrates usefulness of the machine learning technique for corn cash-futures basis forecasting. Particularly, the model leads to average relative root mean square errors (RRMSEs) of 9.97%, 8.51%, and 9.64% for the training, validation, and testing phases, respectively, and the average RRMSE of 9.83% for the overall sample across all cash markets. Results here might be used as standalone technical forecasts or combined with fundamental forecasts for forming perspectives of cash-futures basis trends and carrying out policy analysis. The empirical framework here is easy to implement, which is an essential consideration to many decision makers, and has potential to be generalized for forecasting cash-futures basis of other commodities.

现金期货基础预测是农业部门各市场参与者关注的一个重要问题,由于数据和传统计量经济方法的限制,很少对其进行探索。目前的研究探索了非线性自回归神经网络技术在5年内对美国16个最重要的收获州的近500个现金市场的每日玉米现金期货基础的独特专有数据集中的预测问题的有用性。通过研究隐藏神经元、延迟、数据分割率和算法的各种模型设置,得出了一个具有5个延迟和20个隐藏神经元的选定模型,使用Levenberg–Marquardt算法进行训练,数据分割率为70%对15%对15%,用于训练、验证和测试。该模型在所探索的现金市场中产生了准确稳定的表现,这说明了机器学习技术在玉米现金期货基差预测中的有用性。特别是,该模型导致训练、验证和测试阶段的平均相对均方根误差(RRMSE)分别为9.97%、8.51%和9.64%,所有现金市场的总体样本的平均RRMSE为9.83%。这里的结果可以用作独立的技术预测,也可以与基本面预测相结合,以形成现金期货基差趋势的视角并进行政策分析。这里的实证框架易于实施,这是许多决策者的重要考虑因素,并且有可能推广用于预测其他商品的现金期货基础。
{"title":"Corn cash-futures basis forecasting via neural networks","authors":"Xiaojie Xu,&nbsp;Yun Zhang","doi":"10.1007/s43674-023-00054-2","DOIUrl":"10.1007/s43674-023-00054-2","url":null,"abstract":"<div><p>Cash-futures basis forecasting represents a vital concern for various market participants in the agricultural sector, which has been rarely explored due to limitations on data and traditional econometric methods. The current study explores usefulness of the nonlinear autoregressive neural network technique for the forecasting problem in a unique and proprietary data set of daily corn cash-futures basis across nearly five-hundred cash markets from sixteen most important harvest states in the United States over a 5-year period. Through investigations of various model settings across the hidden neuron, delay, data splitting ratio, and algorithm, a chosen model with five delays and twenty hidden neurons is reached, trained using the Levenberg–Marquardt algorithm and data splitting ratio of 70% vs. 15% vs. 15% for training, validation, and testing. This model results in accurate and stable performance across the cash markets explored, which illustrates usefulness of the machine learning technique for corn cash-futures basis forecasting. Particularly, the model leads to average relative root mean square errors (RRMSEs) of 9.97%, 8.51%, and 9.64% for the training, validation, and testing phases, respectively, and the average RRMSE of 9.83% for the overall sample across all cash markets. Results here might be used as standalone technical forecasts or combined with fundamental forecasts for forming perspectives of cash-futures basis trends and carrying out policy analysis. The empirical framework here is easy to implement, which is an essential consideration to many decision makers, and has potential to be generalized for forecasting cash-futures basis of other commodities.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43674-023-00054-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50473535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Optimization of deep learning models: benchmark and analysis 深度学习模型的优化:基准和分析
Pub Date : 2023-03-30 DOI: 10.1007/s43674-023-00055-1
Rasheed Ahmad, Izzat Alsmadi, Mohammad Al-Ramahi

Model optimization in deep learning (DL) and neural networks is concerned about how and why the model can be successfully trained towards one or more objective functions. The evolutionary learning or training process continuously considers the dynamic parameters of the model. Many researchers propose a deep learning-based solution by randomly selecting a single classifier model architecture. Such approaches generally overlook the hidden and complex nature of the model’s internal working, producing biased results. Larger and deeper NN models bring many complexities and logistic challenges while building and deploying them. To obtain high-quality performance results, an optimal model generally depends on the appropriate architectural settings, such as the number of hidden layers and the number of neurons at each layer. A challenging and time-consuming task is to select and test various combinations of these settings manually. This paper presents an extensive empirical analysis of various deep learning algorithms trained recursively using permutated settings to establish benchmarks and find an optimal model. The paper analyzed the Stack Overflow dataset to predict the quality of posted questions. The extensive empirical analysis revealed that some famous deep learning algorithms such as CNN are the least effective algorithm in solving this problem compared to multilayer perceptron (MLP), which provides efficient computing and the best results in terms of prediction accuracy. The analysis also shows that manipulating the number of neurons alone at each layer in a network does not influence model optimization. This paper’s findings will help to recognize the fact that future models should be built by considering a vast range of model architectural settings for an optimal solution.

深度学习(DL)和神经网络中的模型优化关注如何以及为什么可以针对一个或多个目标函数成功地训练模型。进化学习或训练过程不断地考虑模型的动态参数。许多研究人员通过随机选择单个分类器模型架构,提出了一种基于深度学习的解决方案。这种方法通常忽略了模型内部工作的隐蔽性和复杂性,从而产生有偏见的结果。更大、更深层次的神经网络模型在构建和部署时带来了许多复杂性和后勤挑战。为了获得高质量的性能结果,最佳模型通常取决于适当的架构设置,例如隐藏层的数量和每层神经元的数量。手动选择和测试这些设置的各种组合是一项具有挑战性且耗时的任务。本文对使用置换设置递归训练的各种深度学习算法进行了广泛的实证分析,以建立基准并找到最优模型。本文分析了Stack Overflow数据集来预测发布问题的质量。广泛的实证分析表明,与多层感知器(MLP)相比,一些著名的深度学习算法(如CNN)是解决这一问题的最不有效的算法,多层感知机提供了高效的计算和预测精度方面的最佳结果。分析还表明,单独操纵网络中每一层的神经元数量不会影响模型优化。本文的发现将有助于认识到这样一个事实,即未来的模型应该通过考虑广泛的模型体系结构设置来构建最佳解决方案。
{"title":"Optimization of deep learning models: benchmark and analysis","authors":"Rasheed Ahmad,&nbsp;Izzat Alsmadi,&nbsp;Mohammad Al-Ramahi","doi":"10.1007/s43674-023-00055-1","DOIUrl":"10.1007/s43674-023-00055-1","url":null,"abstract":"<div><p>Model optimization in deep learning (DL) and neural networks is concerned about how and why the model can be successfully trained towards one or more objective functions. The evolutionary learning or training process continuously considers the dynamic parameters of the model. Many researchers propose a deep learning-based solution by randomly selecting a single classifier model architecture. Such approaches generally overlook the hidden and complex nature of the model’s internal working, producing biased results. Larger and deeper NN models bring many complexities and logistic challenges while building and deploying them. To obtain high-quality performance results, an optimal model generally depends on the appropriate architectural settings, such as the number of hidden layers and the number of neurons at each layer. A challenging and time-consuming task is to select and test various combinations of these settings manually. This paper presents an extensive empirical analysis of various deep learning algorithms trained recursively using permutated settings to establish benchmarks and find an optimal model. The paper analyzed the Stack Overflow dataset to predict the quality of posted questions. The extensive empirical analysis revealed that some famous deep learning algorithms such as CNN are the least effective algorithm in solving this problem compared to multilayer perceptron (MLP), which provides efficient computing and the best results in terms of prediction accuracy. The analysis also shows that manipulating the number of neurons alone at each layer in a network does not influence model optimization. This paper’s findings will help to recognize the fact that future models should be built by considering a vast range of model architectural settings for an optimal solution.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50526712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gorge: graph convolutional networks on heterogeneous multi-relational graphs for polypharmacy side effect prediction Gorge:用于多药副作用预测的异构多关系图上的图卷积网络
Pub Date : 2023-03-03 DOI: 10.1007/s43674-023-00053-3
Yike Wang, Huifang Ma, Ruoyi Zhang, Zihao Gao

Determining the side effects of multidrug combinations is a very important issue in drug risk studies. However, designing clinical trials to determine frequencies is often time-consuming and expensive, and previous work has often been limited to using the target protein of a drug without screening. Although this alleviates the sparsity of the raw data to some extent, blindly introducing proteins as auxiliary information can lead to a large amount of noisy information being added, which in turn leads to less efficient models. For this reason, we propose a new method called Gorge (graph convolutional networks on heterogeneous multi-relational graphs for polypharmacy side effect prediction). Specifically, we designed two protein auxiliary pathways directly related to drugs and combined these two auxiliary pathways with a multi-relational graph of drug side effects, which both alleviates data sparsity and filters noisy data. Then, we introduce a query-aware attention mechanism that generates different attention pathways for drug entities based on different drug pairs, fine-grained to determine the extent of information delivery. Finally, we output the exact frequency of drug side effects occurring through a tensor factorization decoder, in contrast to most existing methods that can only predict the presence or association of side effects, but not their frequency. We found that Gorge achieves excellent performance on real-world datasets (average AUROC of 0.822 and average AUPR of 0.775), outperforming existing methods. Further examination provides literature evidence for highly ranked predictions.

确定多药联合用药的副作用是药物风险研究中的一个非常重要的问题。然而,设计临床试验来确定频率通常耗时且昂贵,而且以前的工作通常仅限于在没有筛选的情况下使用药物的靶蛋白。尽管这在一定程度上缓解了原始数据的稀疏性,但盲目引入蛋白质作为辅助信息会导致添加大量噪声信息,进而导致模型效率降低。因此,我们提出了一种新的方法,称为Gorge(用于多药副作用预测的异构多关系图上的图卷积网络)。具体而言,我们设计了两种与药物直接相关的蛋白质辅助途径,并将这两种辅助途径与药物副作用的多关系图相结合,既缓解了数据稀疏性,又过滤了噪声数据。然后,我们引入了一种查询感知注意力机制,该机制基于不同的药物对为药物实体生成不同的注意力途径,细粒度地确定信息传递的程度。最后,我们通过张量分解解码器输出药物副作用发生的确切频率,这与大多数现有方法不同,这些方法只能预测副作用的存在或关联,而不能预测其频率。我们发现Gorge在真实世界的数据集上实现了出色的性能(平均AUROC为0.822,平均AUPR为0.775),优于现有方法。进一步的研究为高排名的预测提供了文献证据。
{"title":"Gorge: graph convolutional networks on heterogeneous multi-relational graphs for polypharmacy side effect prediction","authors":"Yike Wang,&nbsp;Huifang Ma,&nbsp;Ruoyi Zhang,&nbsp;Zihao Gao","doi":"10.1007/s43674-023-00053-3","DOIUrl":"10.1007/s43674-023-00053-3","url":null,"abstract":"<div><p>Determining the side effects of multidrug combinations is a very important issue in drug risk studies. However, designing clinical trials to determine frequencies is often time-consuming and expensive, and previous work has often been limited to using the target protein of a drug without screening. Although this alleviates the sparsity of the raw data to some extent, blindly introducing proteins as auxiliary information can lead to a large amount of noisy information being added, which in turn leads to less efficient models. For this reason, we propose a new method called Gorge (graph convolutional networks on heterogeneous multi-relational graphs for polypharmacy side effect prediction). Specifically, we designed two protein auxiliary pathways directly related to drugs and combined these two auxiliary pathways with a multi-relational graph of drug side effects, which both alleviates data sparsity and filters noisy data. Then, we introduce a query-aware attention mechanism that generates different attention pathways for drug entities based on different drug pairs, fine-grained to determine the extent of information delivery. Finally, we output the exact frequency of drug side effects occurring through a tensor factorization decoder, in contrast to most existing methods that can only predict the presence or association of side effects, but not their frequency. We found that Gorge achieves excellent performance on real-world datasets (average AUROC of 0.822 and average AUPR of 0.775), outperforming existing methods. Further examination provides literature evidence for highly ranked predictions.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50446531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A transparent machine learning algorithm to manage diabetes: TDMSML 管理糖尿病的透明机器学习算法:TDMSML
Pub Date : 2023-02-10 DOI: 10.1007/s43674-022-00051-x
Amrit Kumar Verma, Saroj Kr. Biswas, Manomita Chakraborty, Arpita Nath Boruah

Diabetes is nowadays a very common medical problem among the people worldwide. The disease is becoming more prevalent with the modern and hectic lifestyle followed by people. As a result, designing an adequate medical expert system to assist physicians in treating the disease on time is critical. Expert systems are required to identify the major cause(s) of the disease, so that precautionary measures can be taken ahead of time. Several medical expert systems have already been proposed, but each has its own set of shortcomings, such as the use of trial and error methods, trivial decision-making procedures, and so on. As a result, this paper proposes a Transparent Diabetes Management System Using Machine Learning (TDMSML) expert system that uses decision tree rules to identify the major factor(s) of diabetes. The TDMSML model comprises of three phases: rule generation, transparent rule selection, and major factor identification. The rule generation phase generates rules using decision tree. Transparent rule selection stage selects the transparent rules followed by pruning the redundant rules to get the minimized rule-set. The major factor identification stage extracts the major factor(s) with range(s) from the minimized rule-set. These factor(s) with certain range(s) are characterized as major cause(s) of diabetes disease. The model is validated with the Pima Indian diabetes data set collected from Kaggle.

糖尿病是当今世界人民中一个非常常见的医学问题。随着人们追求现代繁忙的生活方式,这种疾病变得越来越普遍。因此,设计一个足够的医学专家系统来帮助医生按时治疗这种疾病至关重要。需要专家系统来确定疾病的主要原因,以便提前采取预防措施。已经提出了几种医学专家系统,但每种系统都有自己的缺点,如使用试错法、琐碎的决策程序等。因此,本文提出了一种使用机器学习的透明糖尿病管理系统(TDMSML)专家系统,该系统使用决策树规则来识别糖尿病的主要因素。TDMSML模型包括三个阶段:规则生成、透明规则选择和主要因素识别。规则生成阶段使用决策树生成规则。透明规则选择阶段选择透明规则,然后修剪冗余规则以获得最小化的规则集。主要因素识别阶段从最小化规则集中提取具有范围的主要因素。这些具有一定范围的因素是糖尿病的主要病因。该模型通过从Kaggle收集的Pima Indian糖尿病数据集进行了验证。
{"title":"A transparent machine learning algorithm to manage diabetes: TDMSML","authors":"Amrit Kumar Verma,&nbsp;Saroj Kr. Biswas,&nbsp;Manomita Chakraborty,&nbsp;Arpita Nath Boruah","doi":"10.1007/s43674-022-00051-x","DOIUrl":"10.1007/s43674-022-00051-x","url":null,"abstract":"<div><p>Diabetes is nowadays a very common medical problem among the people worldwide. The disease is becoming more prevalent with the modern and hectic lifestyle followed by people. As a result, designing an adequate medical expert system to assist physicians in treating the disease on time is critical. Expert systems are required to identify the major cause(s) of the disease, so that precautionary measures can be taken ahead of time. Several medical expert systems have already been proposed, but each has its own set of shortcomings, such as the use of trial and error methods, trivial decision-making procedures, and so on. As a result, this paper proposes a Transparent Diabetes Management System Using Machine Learning (TDMSML) expert system that uses decision tree rules to identify the major factor(s) of diabetes. The TDMSML model comprises of three phases: rule generation, transparent rule selection, and major factor identification. The rule generation phase generates rules using decision tree. Transparent rule selection stage selects the transparent rules followed by pruning the redundant rules to get the minimized rule-set. The major factor identification stage extracts the major factor(s) with range(s) from the minimized rule-set. These factor(s) with certain range(s) are characterized as major cause(s) of diabetes disease. The model is validated with the Pima Indian diabetes data set collected from Kaggle.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50468175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LAD in finance: accounting analytics and fraud detection 金融领域的LAD:会计分析和欺诈检测
Pub Date : 2023-01-30 DOI: 10.1007/s43674-023-00052-4
Aditi Kar Gangopadhyay, Tanay Sheth, Sneha Chauhan

The paper explores advancements in accounting analytics using the logical analysis of data (LAD) approach by identifying fraudulent firms and transactions. The straightforward approach to fraud detection with an analytic model is to identify possible predictors of fraud associated with known fraudsters and their historical actions. LAD is a machine learning methodology that combines Boolean functions, optimization, and logic ideas in alignment with the traditional approach. The key characteristic of the LAD is discovering minimal sets of features necessary for explaining all observations and detecting hidden patterns in the data capable of distinguishing observations describing “positive” outcome events from “negative” outcome events. The combinatorial optimization model described in the paper represents a variation on the general theme of set covering and concludes with an outline of LAD applications to detect fraudulent firms and financial frauds. The dataset consists of Annual data of 777 firms from 14 different sectors. The results demonstrate 97.4% accuracy with an F1 score of 0.97. Another dataset on credit card transactions and finance is also used to test the effectiveness of LAD in finance. With the appearance of the immense growth of financial fraud cases, these promising results lead to future advancements in analytical audit fieldwork.

本文通过识别欺诈公司和交易,探讨了使用数据逻辑分析(LAD)方法进行会计分析的进展。使用分析模型进行欺诈检测的直接方法是识别与已知欺诈者及其历史行为相关的欺诈的可能预测因素。LAD是一种机器学习方法,它结合了布尔函数、优化和逻辑思想,与传统方法保持一致。LAD的关键特征是发现解释所有观察结果所需的最小特征集,并检测数据中的隐藏模式,从而能够区分描述“积极”结果事件和“消极”结果事件的观察结果。本文中描述的组合优化模型代表了集合覆盖这一主题的变体,并概述了LAD在检测欺诈企业和金融欺诈方面的应用。该数据集包括来自14个不同行业的777家公司的年度数据。结果显示准确率为97.4%,F1得分为0.97。另一个关于信用卡交易和金融的数据集也用于测试LAD在金融方面的有效性。随着财务欺诈案件的大幅增长,这些有希望的结果为分析审计领域的未来发展带来了希望。
{"title":"LAD in finance: accounting analytics and fraud detection","authors":"Aditi Kar Gangopadhyay,&nbsp;Tanay Sheth,&nbsp;Sneha Chauhan","doi":"10.1007/s43674-023-00052-4","DOIUrl":"10.1007/s43674-023-00052-4","url":null,"abstract":"<div><p>The paper explores advancements in accounting analytics using the logical analysis of data (LAD) approach by identifying fraudulent firms and transactions. The straightforward approach to fraud detection with an analytic model is to identify possible predictors of fraud associated with known fraudsters and their historical actions. LAD is a machine learning methodology that combines Boolean functions, optimization, and logic ideas in alignment with the traditional approach. The key characteristic of the LAD is discovering minimal sets of features necessary for explaining all observations and detecting hidden patterns in the data capable of distinguishing observations describing “positive” outcome events from “negative” outcome events. The combinatorial optimization model described in the paper represents a variation on the general theme of set covering and concludes with an outline of LAD applications to detect fraudulent firms and financial frauds. The dataset consists of Annual data of 777 firms from 14 different sectors. The results demonstrate 97.4% accuracy with an F1 score of 0.97. Another dataset on credit card transactions and finance is also used to test the effectiveness of LAD in finance. With the appearance of the immense growth of financial fraud cases, these promising results lead to future advancements in analytical audit fieldwork.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50526002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User structural information in priority-based ranking for top-N recommendation 基于优先级的前N推荐排序中的用户结构信息
Pub Date : 2023-01-06 DOI: 10.1007/s43674-022-00050-y
Mohammad Majid Fayezi, Alireza Hashemi Golpayegani

The recommender system is a set of data recovery tools and techniques used to recommend items to users based on their selection. To improve the accuracy of the recommendation, the use of additional information (e.g., social information, trust, item tags, etc.) in addition to user-item ranking data has been an active area of research for the past decade.

In this paper, we present a new method for recommending top-N items, which uses structural information and trust among users within the social network and extracts the implicit connections between users and uses them in the item recommendation process. The proposed method has seven main steps: (1) extract items liked by neighbors, (ii) constructing item features for neighbors, (iii) extract embedding trust features for neighbors, (iv) create user-feature matrix, (v) calculate user’s priority, (vi) calculate item’s priority and finally, (vii) recommend top-N items. We implement the proposed method with three datasets for recommendations. We compare our results with some advanced ranking methods and observe that the accuracy of our method for all users and cold-start users improves. Our method can also create more items for cold-start users in the list of recommended items.

推荐系统是一套数据恢复工具和技术,用于根据用户的选择向用户推荐项目。为了提高推荐的准确性,在过去十年中,除了用户项目排名数据之外,还使用额外的信息(例如,社交信息、信任、项目标签等)一直是一个活跃的研究领域。在本文中,我们提出了一种推荐前N个项目的新方法,该方法利用社交网络中用户之间的结构信息和信任,提取用户之间的隐含联系,并在项目推荐过程中使用。所提出的方法有七个主要步骤:(1)提取邻居喜欢的项目,(ii)为邻居构建项目特征,(iii)提取邻居的嵌入信任特征,(iv)创建用户特征矩阵,(v)计算用户的优先级,(vi)计算项目的优先级,最后,(vii)推荐前N个项目。我们用三个用于推荐的数据集来实现所提出的方法。我们将我们的结果与一些先进的排名方法进行了比较,并观察到我们的方法对所有用户和冷启动用户的准确性有所提高。我们的方法还可以在推荐项目列表中为冷启动用户创建更多项目。
{"title":"User structural information in priority-based ranking for top-N recommendation","authors":"Mohammad Majid Fayezi,&nbsp;Alireza Hashemi Golpayegani","doi":"10.1007/s43674-022-00050-y","DOIUrl":"10.1007/s43674-022-00050-y","url":null,"abstract":"<div><p>The recommender system is a set of data recovery tools and techniques used to recommend items to users based on their selection. To improve the accuracy of the recommendation, the use of additional information (e.g., social information, trust, item tags, etc.) in addition to user-item ranking data has been an active area of research for the past decade.</p><p>In this paper, we present a new method for recommending top-N items, which uses structural information and trust among users within the social network and extracts the implicit connections between users and uses them in the item recommendation process. The proposed method has seven main steps: (1) extract items liked by neighbors, (ii) constructing item features for neighbors, (iii) extract embedding trust features for neighbors, (iv) create user-feature matrix, (v) calculate user’s priority, (vi) calculate item’s priority and finally, (vii) recommend top-N items. We implement the proposed method with three datasets for recommendations. We compare our results with some advanced ranking methods and observe that the accuracy of our method for all users and cold-start users improves. Our method can also create more items for cold-start users in the list of recommended items.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43674-022-00050-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50455557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fish recognition model for fraud prevention using convolutional neural networks 基于卷积神经网络的防欺诈鱼类识别模型
Pub Date : 2022-12-19 DOI: 10.1007/s43674-022-00048-6
Rhayane S. Monteiro, Morgana C. O. Ribeiro, Calebi A. S. Viana, Mário W. L. Moreira, Glácio S. Araúo, Joel J. P. C. Rodrigues

Fraud, misidentification, and adulteration of food, whether unintentional or purposeful, are a worldwide and growing concern. Aquaculture and fisheries are recognized as one of the sectors most vulnerable to food fraud. Besides, a series of risks related to health and distrust between consumer and popular market makes this sector develop an effective solution for fraud control. Species identification is an essential aspect to expose commercial fraud. Convolutional neural networks (CNNs) are one of the most powerful tools for image recognition and classification tasks. Thus, the objective of this study is to propose a model of recognition of fish species based on CNNs. After the implementation and comparison of the results of the CNNs, it was found that the Xception architecture achieved better performance with 86% accuracy. It was also possible to build a web application mockup. The proposal is easily applied in other aquaculture areas such as the species recognition of lobsters, shrimp, among other seafood.

食品的欺诈、误认和掺假,无论是无意的还是有目的的,都是全世界日益关注的问题。水产养殖和渔业被公认为最容易受到粮食欺诈的部门之一。此外,一系列与健康相关的风险以及消费者和大众市场之间的不信任,使该行业成为控制欺诈的有效解决方案。物种鉴定是揭露商业欺诈的一个重要方面。卷积神经网络是图像识别和分类任务中最强大的工具之一。因此,本研究的目的是提出一种基于细胞神经网络的鱼类物种识别模型。在实现和比较CNNs的结果后,发现Xception架构实现了更好的性能,准确率为86%。还可以构建一个web应用程序模型。该提案很容易应用于其他水产养殖领域,如龙虾、虾和其他海鲜的物种识别。
{"title":"Fish recognition model for fraud prevention using convolutional neural networks","authors":"Rhayane S. Monteiro,&nbsp;Morgana C. O. Ribeiro,&nbsp;Calebi A. S. Viana,&nbsp;Mário W. L. Moreira,&nbsp;Glácio S. Araúo,&nbsp;Joel J. P. C. Rodrigues","doi":"10.1007/s43674-022-00048-6","DOIUrl":"10.1007/s43674-022-00048-6","url":null,"abstract":"<div><p>Fraud, misidentification, and adulteration of food, whether unintentional or purposeful, are a worldwide and growing concern. Aquaculture and fisheries are recognized as one of the sectors most vulnerable to food fraud. Besides, a series of risks related to health and distrust between consumer and popular market makes this sector develop an effective solution for fraud control. Species identification is an essential aspect to expose commercial fraud. Convolutional neural networks (CNNs) are one of the most powerful tools for image recognition and classification tasks. Thus, the objective of this study is to propose a model of recognition of fish species based on CNNs. After the implementation and comparison of the results of the CNNs, it was found that the Xception architecture achieved better performance with 86% accuracy. It was also possible to build a web application mockup. The proposal is easily applied in other aquaculture areas such as the species recognition of lobsters, shrimp, among other seafood.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50494855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Differentially private transferrable deep learning with membership-mappings 具有成员映射的差分私有可转移深度学习
Pub Date : 2022-12-15 DOI: 10.1007/s43674-022-00049-5
Mohit Kumar

Despite a recent surge of research interest in privacy and transferrable deep learning, optimizing the tradeoff between privacy requirements and performance of machine learning models remains a challenge. This motivates the development of an approach that optimizes both privacy-preservation mechanism and learning of the deep models for achieving a robust performance. This paper considers the problem of semi-supervised transfer and multi-task learning under differential privacy framework. An alternative conception of deep autoencoder, referred to as Conditionally Deep Membership-Mapping Autoencoder (CDMMA), is considered for transferrable deep learning. Under practice-oriented settings, an analytical solution for the learning of CDMMA can be derived by means of variational optimization. The paper proposes a transfer and multi-task learning approach that combines CDMMA with a tailored noise adding mechanism to transfer knowledge from source to target domain in a privacy-preserving manner.

尽管最近对隐私和可转移深度学习的研究兴趣激增,但优化机器学习模型的隐私要求和性能之间的权衡仍然是一个挑战。这促使开发一种方法,优化隐私保护机制和深度模型的学习,以实现稳健的性能。本文研究了差分隐私框架下的半监督迁移和多任务学习问题。深度自动编码器的另一个概念,称为条件深度成员映射自动编码器(CDMMA),被认为是可转移的深度学习。在面向实践的环境下,可以通过变分优化的方法导出CDMMA学习的解析解。本文提出了一种转移和多任务学习方法,该方法将CDMMA与定制的噪声添加机制相结合,以保护隐私的方式将知识从源域转移到目标域。
{"title":"Differentially private transferrable deep learning with membership-mappings","authors":"Mohit Kumar","doi":"10.1007/s43674-022-00049-5","DOIUrl":"10.1007/s43674-022-00049-5","url":null,"abstract":"<div><p>Despite a recent surge of research interest in privacy and transferrable deep learning, optimizing the tradeoff between privacy requirements and performance of machine learning models remains a challenge. This motivates the development of an approach that optimizes both privacy-preservation mechanism and learning of the deep models for achieving a robust performance. This paper considers the problem of semi-supervised transfer and multi-task learning under differential privacy framework. An alternative conception of deep autoencoder, referred to as <i>Conditionally Deep Membership-Mapping Autoencoder (CDMMA)</i>, is considered for transferrable deep learning. Under practice-oriented settings, an analytical solution for the learning of CDMMA can be derived by means of variational optimization. The paper proposes a transfer and multi-task learning approach that combines CDMMA with a tailored noise adding mechanism to transfer knowledge from source to target domain in a privacy-preserving manner.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50483727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A novel hybrid dimension reduction and deep learning-based classification for neuromuscular disorder 一种新的基于降维和深度学习的神经肌肉疾病分类方法
Pub Date : 2022-11-21 DOI: 10.1007/s43674-022-00047-7
Babita Pandey, Devendra Kumar Pandey, Aditya Khamparia, Seema Shukla

Correct classification of neuromuscular disorders is essential to provide accurate diagnosis. Presently, gene microarray technology is a widely accepted technology to monitor the expression level of a large number of genes simultaneously. The gene microarray data are a high dimensional data, which usually contains small samples having a large number of genes. Therefore, dimension reduction is a crucial task for correct classification of diseases. Dimension reduction eliminates those genes which are less expressive and enhances the efficiency of the classification model. In the present paper, we developed a novel hybrid dimension reduction method and a deep learning-based classification model for neuromuscular disorders. The hybrid dimension reduction method is deployed in three phase: in the first phase, the expressive genes are selected using F test method, and the mutual information method and the best one among them are selected for further processing. In second phase, the gene selected by the best model is further transformed to low dimension by PCA. In third phase, the deep learning-based classification model is deployed. For experimentation, two diseased and multi-diseased micro array data sets, which is publicly available, is used. The best accuracy by 50-100-50-25-13 deep learning architecture with hybrid dimension reduction, where 100 genes select by F test and PCA with 50 principal components is 89% for NMD data set. The best accuracy by 50-100-2 deep learning architecture with hybrid dimension reduction, where 100 genes select by F test and PCA with 50 principal components is 97% for FSHD data set. The proposed hybrid method gives better classification accuracy result and reduces the search space and time complexity as well for both two diseased and multi-diseased micro array data sets.

神经肌肉疾病的正确分类对于提供准确的诊断至关重要。目前,基因微阵列技术是一种广泛接受的同时监测大量基因表达水平的技术。基因微阵列数据是高维数据,其通常包含具有大量基因的小样本。因此,降维是正确分类疾病的关键任务。降维消除了那些表达能力较差的基因,提高了分类模型的效率。在本文中,我们开发了一种新的混合降维方法和一种基于深度学习的神经肌肉疾病分类模型。混合降维方法分为三个阶段:在第一阶段,使用F检验方法选择表达基因,并选择互信息方法和其中最好的方法进行进一步处理。在第二阶段,通过PCA将最佳模型选择的基因进一步转化为低维。在第三阶段,部署了基于深度学习的分类模型。为了进行实验,使用了公开可用的两个患病和多患病的微阵列数据集。对于NMD数据集,50-100-50-25-13具有混合降维的深度学习架构(其中通过F检验和具有50个主成分的PCA选择100个基因)的最佳准确率为89%。具有混合降维的50-100-2深度学习架构(其中通过F检验和具有50个主成分的PCA选择100个基因)的FSHD数据集的最佳准确率为97%。所提出的混合方法对两个病变和多病变的微阵列数据集都给出了更好的分类精度结果,并降低了搜索空间和时间的复杂性。
{"title":"A novel hybrid dimension reduction and deep learning-based classification for neuromuscular disorder","authors":"Babita Pandey,&nbsp;Devendra Kumar Pandey,&nbsp;Aditya Khamparia,&nbsp;Seema Shukla","doi":"10.1007/s43674-022-00047-7","DOIUrl":"10.1007/s43674-022-00047-7","url":null,"abstract":"<div><p>Correct classification of neuromuscular disorders is essential to provide accurate diagnosis. Presently, gene microarray technology is a widely accepted technology to monitor the expression level of a large number of genes simultaneously. The gene microarray data are a high dimensional data, which usually contains small samples having a large number of genes. Therefore, dimension reduction is a crucial task for correct classification of diseases. Dimension reduction eliminates those genes which are less expressive and enhances the efficiency of the classification model. In the present paper, we developed a novel hybrid dimension reduction method and a deep learning-based classification model for neuromuscular disorders. The hybrid dimension reduction method is deployed in three phase: in the first phase, the expressive genes are selected using <i>F</i> test method, and the mutual information method and the best one among them are selected for further processing. In second phase, the gene selected by the best model is further transformed to low dimension by PCA. In third phase, the deep learning-based classification model is deployed. For experimentation, two diseased and multi-diseased micro array data sets, which is publicly available, is used. The best accuracy by 50-100-50-25-13 deep learning architecture with hybrid dimension reduction, where 100 genes select by <i>F</i> test and PCA with 50 principal components is 89% for NMD data set. The best accuracy by 50-100-2 deep learning architecture with hybrid dimension reduction, where 100 genes select by <i>F</i> test and PCA with 50 principal components is 97% for FSHD data set. The proposed hybrid method gives better classification accuracy result and reduces the search space and time complexity as well for both two diseased and multi-diseased micro array data sets.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"2 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50503646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An effective Reinforcement Learning method for preventing the overfitting of Convolutional Neural Networks 一种防止卷积神经网络过拟合的有效强化学习方法
Pub Date : 2022-09-29 DOI: 10.1007/s43674-022-00046-8
Ali Mahdavi-Hormat, Mohammad Bagher Menhaj, Ashkan Shakarami

Convolutional Neural Networks are machine learning models that have proven abilities in many variants of tasks. This powerful machine learning model sometimes suffers from overfitting. This paper proposes a method based on Reinforcement Learning for addressing this problem. In this research, the parameters of a target layer in the Convolutional Neural Network take as a state for the Agent of the Reinforcement Learning section. Then the Agent gives some actions as forming parameters of a hyperbolic secant function. This function’s form is changed gradually and implicitly by the proposed method. The inputs of the function are the weights of the layer, and its outputs multiply by the same weights to updating them. In this study, the proposed method is inspired by the Deep Deterministic Policy Gradient model because the actions of the Agent are into a continuous domain. To show the proposed method’s effectiveness, the classification task is considered using Convolutional Neural Networks. In this study, 7 datasets have been used for evaluating the model; MNIST, Extended MNIST, small-notMNIST, Fashion-MNIST, sign language MNIST, CIFAR-10, and CIFAR-100.

卷积神经网络是一种机器学习模型,已被证明在许多任务变体中具有能力。这种强大的机器学习模型有时会受到过拟合的影响。针对这一问题,本文提出了一种基于强化学习的方法。在本研究中,卷积神经网络中目标层的参数作为强化学习部分的Agent的状态。然后,Agent给出了双曲割线函数的一些形成参数。通过所提出的方法,该函数的形式逐渐而隐含地发生了变化。函数的输入是层的权重,其输出乘以相同的权重以更新它们。在本研究中,所提出的方法受到了深度确定性策略梯度模型的启发,因为Agent的行为进入了一个连续的域。为了证明所提出的方法的有效性,使用卷积神经网络来考虑分类任务。在本研究中,使用了7个数据集来评估模型;MNIST、Extended MNIST,small notMNIST和Fashion MNIST。
{"title":"An effective Reinforcement Learning method for preventing the overfitting of Convolutional Neural Networks","authors":"Ali Mahdavi-Hormat,&nbsp;Mohammad Bagher Menhaj,&nbsp;Ashkan Shakarami","doi":"10.1007/s43674-022-00046-8","DOIUrl":"10.1007/s43674-022-00046-8","url":null,"abstract":"<div><p>Convolutional Neural Networks are machine learning models that have proven abilities in many variants of tasks. This powerful machine learning model sometimes suffers from overfitting. This paper proposes a method based on Reinforcement Learning for addressing this problem. In this research, the parameters of a target layer in the Convolutional Neural Network take as a state for the Agent of the Reinforcement Learning section. Then the Agent gives some actions as forming parameters of a hyperbolic secant function. This function’s form is changed gradually and implicitly by the proposed method. The inputs of the function are the weights of the layer, and its outputs multiply by the same weights to updating them. In this study, the proposed method is inspired by the Deep Deterministic Policy Gradient model because the actions of the Agent are into a continuous domain. To show the proposed method’s effectiveness, the classification task is considered using Convolutional Neural Networks. In this study, 7 datasets have been used for evaluating the model; MNIST, Extended MNIST, small-notMNIST, Fashion-MNIST, sign language MNIST, CIFAR-10, and CIFAR-100.\u0000</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"2 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50524501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Advances in computational intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1