Pub Date : 2023-09-22DOI: 10.1007/s10472-023-09895-6
Safae Rbihou, Nour-Eddine Joudar, Khalid Haddouch
The continuous Hopfield network (CHN) has provided a powerful approach to optimization problems and has shown good performance in different domains. However, two primary challenges still remain for this network: defining appropriate parameters and hyperparameters. In this study, our objective is to address these challenges and achieve optimal solutions for combinatorial optimization problems, thereby improving the overall performance of the continuous Hopfield network. To accomplish this, we propose a new technique for tuning the parameters of the CHN by considering its stability. To evaluate our approach, three well-known combinatorial optimization problems, namely, weighted constraint satisfaction problems, task assignment problems, and the traveling salesman problem, were employed. The experiments demonstrate that the proposed approach offers several advantages for CHN parameter tuning and the selection of optimal hyperparameter combinations.
{"title":"Parameter tuning of continuous Hopfield network applied to combinatorial optimization","authors":"Safae Rbihou, Nour-Eddine Joudar, Khalid Haddouch","doi":"10.1007/s10472-023-09895-6","DOIUrl":"10.1007/s10472-023-09895-6","url":null,"abstract":"<div><p>The continuous Hopfield network (CHN) has provided a powerful approach to optimization problems and has shown good performance in different domains. However, two primary challenges still remain for this network: defining appropriate parameters and hyperparameters. In this study, our objective is to address these challenges and achieve optimal solutions for combinatorial optimization problems, thereby improving the overall performance of the continuous Hopfield network. To accomplish this, we propose a new technique for tuning the parameters of the CHN by considering its stability. To evaluate our approach, three well-known combinatorial optimization problems, namely, weighted constraint satisfaction problems, task assignment problems, and the traveling salesman problem, were employed. The experiments demonstrate that the proposed approach offers several advantages for CHN parameter tuning and the selection of optimal hyperparameter combinations.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 2","pages":"257 - 275"},"PeriodicalIF":1.2,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136060942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-20DOI: 10.1007/s10472-023-09893-8
Vahide Bulut
Finding a collision-free feasible path for mobile robots is very important because they are essential in many fields such as healthcare, military, and industry. In this paper, a novel Clustering Obstacles (CO)-based path planning algorithm for mobile robots is presented using a quintic trigonometric Bézier curve and its two shape parameters. The CO-based algorithm forms clusters of geometrically shaped obstacles and finds the cluster centers. Moreover, the proposed waypoint algorithm (WP) finds the waypoints of the predefined skeleton path in addition to the start and destination points in an environment. Based on all these points, the predefined quintic trigonometric Bézier path candidates, taking the skeleton path as their convex hull, are then generated using the shape parameters of this curve. Moreover, the performance of the proposed algorithm is compared with K-Means and agglomerative hierarchical algorithms to obtain the quintic trigonometric Bézier paths desired by the user. The experimental results show that the CO-based path planning algorithm achieves better cluster centers and consequently better collision-free predefined paths.
为移动机器人寻找一条无碰撞的可行路径非常重要,因为它们在医疗保健、军事和工业等许多领域都不可或缺。本文提出了一种新颖的基于障碍物聚类(CO)的移动机器人路径规划算法,该算法使用了五次方三角贝塞尔曲线及其两个形状参数。基于 CO 的算法可形成几何形状的障碍物集群,并找到集群中心。此外,所提出的航点算法(WP)除了能在环境中找到起点和终点外,还能找到预定义骨架路径的航点。在所有这些点的基础上,以骨架路径为凸壳,利用该曲线的形状参数生成预定义的五次方三角贝塞尔路径候选路径。此外,为了获得用户所需的五次方三角贝齐尔路径,将所提出算法的性能与 K-Means 和聚类分层算法进行了比较。实验结果表明,基于 CO 的路径规划算法能获得更好的聚类中心,从而获得更好的无碰撞预定义路径。
{"title":"Path planning algorithm for mobile robots based on clustering-obstacles and quintic trigonometric Bézier curve","authors":"Vahide Bulut","doi":"10.1007/s10472-023-09893-8","DOIUrl":"10.1007/s10472-023-09893-8","url":null,"abstract":"<div><p>Finding a collision-free feasible path for mobile robots is very important because they are essential in many fields such as healthcare, military, and industry. In this paper, a novel Clustering Obstacles (CO)-based path planning algorithm for mobile robots is presented using a quintic trigonometric Bézier curve and its two shape parameters. The CO-based algorithm forms clusters of geometrically shaped obstacles and finds the cluster centers. Moreover, the proposed waypoint algorithm (WP) finds the waypoints of the predefined skeleton path in addition to the start and destination points in an environment. Based on all these points, the predefined quintic trigonometric Bézier path candidates, taking the skeleton path as their convex hull, are then generated using the shape parameters of this curve. Moreover, the performance of the proposed algorithm is compared with K-Means and agglomerative hierarchical algorithms to obtain the quintic trigonometric Bézier paths desired by the user. The experimental results show that the CO-based path planning algorithm achieves better cluster centers and consequently better collision-free predefined paths.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 2","pages":"235 - 256"},"PeriodicalIF":1.2,"publicationDate":"2023-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136313827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.1007/s10472-023-09882-x
Panos K. Syriopoulos, Nektarios G. Kalampalikis, Sotiris B. Kotsiantis, Michael N. Vrahatis
The k-nearest neighbors (k/NN) algorithm is a simple yet powerful non-parametric classifier that is robust to noisy data and easy to implement. However, with the growing literature on k/NN methods, it is increasingly challenging for new researchers and practitioners to navigate the field. This review paper aims to provide a comprehensive overview of the latest developments in the k/NN algorithm, including its strengths and weaknesses, applications, benchmarks, and available software with corresponding publications and citation analysis. The review also discusses the potential of k/NN in various data science tasks, such as anomaly detection, dimensionality reduction and missing value imputation. By offering an in-depth analysis of k/NN, this paper serves as a valuable resource for researchers and practitioners to make informed decisions and identify the best k/NN implementation for a given application.
{"title":"kNN Classification: a review","authors":"Panos K. Syriopoulos, Nektarios G. Kalampalikis, Sotiris B. Kotsiantis, Michael N. Vrahatis","doi":"10.1007/s10472-023-09882-x","DOIUrl":"10.1007/s10472-023-09882-x","url":null,"abstract":"<div><p>The <i>k-</i>nearest neighbors (<i>k</i>/NN) algorithm is a simple yet powerful non-parametric classifier that is robust to noisy data and easy to implement. However, with the growing literature on <i>k</i>/NN methods, it is increasingly challenging for new researchers and practitioners to navigate the field. This review paper aims to provide a comprehensive overview of the latest developments in the <i>k</i>/NN algorithm, including its strengths and weaknesses, applications, benchmarks, and available software with corresponding publications and citation analysis. The review also discusses the potential of <i>k</i>/NN in various data science tasks, such as anomaly detection, dimensionality reduction and missing value imputation. By offering an in-depth analysis of <i>k</i>/NN, this paper serves as a valuable resource for researchers and practitioners to make informed decisions and identify the best <i>k</i>/NN implementation for a given application.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"93 1","pages":"43 - 75"},"PeriodicalIF":1.2,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48667907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-29DOI: 10.1007/s10472-023-09891-w
Müge Saadetoğlu, Benedek Nagy, Aydın Avkan
A digitized rigid motion is called digitally continuous if two neighbor pixels still stay neighbors after the motion. This concept plays important role when people or computers (artificial intelligence, machine vision) need to recognize the object shown in the image. In this paper, digital rotations of a pixel with its closest neighbors are of our interest. We compare the neighborhood motion map results among the three regular grids, when the center of rotation is the midpoint of a main pixel, a grid point (corner of a pixel) or an edge midpoint. The first measure about the quality of digital rotations is based on bijectivity, e.g., measuring how many of the cases produce bijective and how many produce not bijective neighborhood motion maps (Avkan et. al, 2022). Now, a second measure is investigated, the quality of bijective digital rotations is measured by the digital continuity of the resulted image: we measure how many of the cases are bijective and also digitally continuous. We show that rotations on the triangular grid prove to be digitally continuous at many more real angles and also as a special case, many more integer angles compared to the square grid or to the hexagonal grid with respect to the three different rotation centers.
{"title":"Digital continuity of rotations in the 2D regular grids","authors":"Müge Saadetoğlu, Benedek Nagy, Aydın Avkan","doi":"10.1007/s10472-023-09891-w","DOIUrl":"10.1007/s10472-023-09891-w","url":null,"abstract":"<div><p>A digitized rigid motion is called digitally continuous if two neighbor pixels still stay neighbors after the motion. This concept plays important role when people or computers (artificial intelligence, machine vision) need to recognize the object shown in the image. In this paper, digital rotations of a pixel with its closest neighbors are of our interest. We compare the neighborhood motion map results among the three regular grids, when the center of rotation is the midpoint of a main pixel, a grid point (corner of a pixel) or an edge midpoint. The first measure about the quality of digital rotations is based on bijectivity, e.g., measuring how many of the cases produce bijective and how many produce not bijective neighborhood motion maps (Avkan et. al, 2022). Now, a second measure is investigated, the quality of bijective digital rotations is measured by the digital continuity of the resulted image: we measure how many of the cases are bijective and also digitally continuous. We show that rotations on the triangular grid prove to be digitally continuous at many more real angles and also as a special case, many more integer angles compared to the square grid or to the hexagonal grid with respect to the three different rotation centers.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 1","pages":"115 - 137"},"PeriodicalIF":1.2,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43672320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-28DOI: 10.1007/s10472-023-09889-4
Somrita Saha, Arindam Biswas
A digital plane is a digitization of a Euclidean plane. A plane is specified by its normal, which is a 3D vector with integer coordinates, as considered in this case. It is established here that a 3D digital straight line segment, shifted by an integer amount, can produce the digitized plane. 3D plane’s normals are classified based on the Greatest Common Divisor (GCD) of its components, and the net code is calculated separately for each case. Experimental results are provided for several normals. Also, we show that the digital plane segment generated is a connected digital plane. The proposed method mainly involves integer arithmetic.
{"title":"A combinatorial technique for generation of digital plane using GCD","authors":"Somrita Saha, Arindam Biswas","doi":"10.1007/s10472-023-09889-4","DOIUrl":"10.1007/s10472-023-09889-4","url":null,"abstract":"<div><p>A digital plane is a digitization of a Euclidean plane. A plane is specified by its normal, which is a 3D vector with integer coordinates, as considered in this case. It is established here that a 3D digital straight line segment, shifted by an integer amount, can produce the digitized plane. 3D plane’s normals are classified based on the Greatest Common Divisor (GCD) of its components, and the net code is calculated separately for each case. Experimental results are provided for several normals. Also, we show that the digital plane segment generated is a connected digital plane. The proposed method mainly involves integer arithmetic.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 1","pages":"139 - 167"},"PeriodicalIF":1.2,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45076952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-23DOI: 10.1007/s10472-023-09892-9
Bugra Caskurlu, Fatih Erdem Kizilkaya
Hedonic games are a prominent model of coalition formation, in which each agent’s utility only depends on the coalition she resides. The subclass of hedonic games that models the formation of general partnerships (Larson 2018), where all affiliates receive the same utility, is referred to as hedonic games with common ranking property (HGCRP). Aside from their economic motivation, HGCRP came into prominence since they are guaranteed to have core stable solutions that can be found efficiently (Farrell and Scotchmer Q. J. Econ. 103(2), 279–297 1988). We improve upon existing results by proving that every instance of HGCRP has a solution that is Pareto optimal, core stable, and individually stable. The economic significance of this result is that efficiency is not to be totally sacrificed for the sake of stability in HGCRP. We establish that finding such a solution is NP-hard even if the sizes of the coalitions are bounded above by 3; however, it is polynomial time solvable if the sizes of the coalitions are bounded above by 2. We show that the gap between the total utility of a core stable solution and that of the socially-optimal solution (OPT) is bounded above by n, where n is the number of agents, and that this bound is tight. Our investigations reveal that computing OPT is inapproximable within better than (O(n^{1-epsilon })) for any fixed (epsilon > 0), and that this inapproximability lower bound is polynomially tight. However, OPT can be computed in polynomial time if the sizes of the coalitions are bounded above by 2.
{"title":"On hedonic games with common ranking property","authors":"Bugra Caskurlu, Fatih Erdem Kizilkaya","doi":"10.1007/s10472-023-09892-9","DOIUrl":"10.1007/s10472-023-09892-9","url":null,"abstract":"<div><p>Hedonic games are a prominent model of coalition formation, in which each agent’s utility only depends on the coalition she resides. The subclass of hedonic games that models the formation of general partnerships (Larson 2018), where all affiliates receive the same utility, is referred to as hedonic games with common ranking property (HGCRP). Aside from their economic motivation, HGCRP came into prominence since they are guaranteed to have core stable solutions that can be found efficiently (Farrell and Scotchmer Q. J. Econ. <b>103</b>(2), 279–297 1988). We improve upon existing results by proving that every instance of HGCRP has a solution that is Pareto optimal, core stable, and individually stable. The economic significance of this result is that efficiency is not to be totally sacrificed for the sake of stability in HGCRP. We establish that finding such a solution is <b>NP-hard</b> even if the sizes of the coalitions are bounded above by 3; however, it is polynomial time solvable if the sizes of the coalitions are bounded above by 2. We show that the gap between the total utility of a core stable solution and that of the socially-optimal solution (OPT) is bounded above by <i>n</i>, where <i>n</i> is the number of agents, and that this bound is tight. Our investigations reveal that computing OPT is inapproximable within better than <span>(O(n^{1-epsilon }))</span> for any fixed <span>(epsilon > 0)</span>, and that this inapproximability lower bound is polynomially tight. However, OPT can be computed in polynomial time if the sizes of the coalitions are bounded above by 2.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 3","pages":"581 - 599"},"PeriodicalIF":1.2,"publicationDate":"2023-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135520475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-18DOI: 10.1007/s10472-023-09890-x
Dimitri Weiss, Kevin Tierney
A solver’s runtime and the quality of the solutions it generates are strongly influenced by its parameter settings. Finding good parameter configurations is a formidable challenge, even for fixed problem instance distributions. However, when the instance distribution can change over time, a once effective configuration may no longer provide adequate performance. Realtime algorithm configuration (RAC) offers assistance in finding high-quality configurations for such distributions by automatically adjusting the configurations it recommends based on instances seen so far. Existing RAC methods treat the solver as a black box, meaning the solver is given a configuration as input, and it outputs either a solution or runtime as an objective function for the configurator. However, analyzing intermediate output from the solver can enable configurators to avoid wasting time on poorly performing configurations. We propose a gray-box approach that utilizes intermediate output during evaluation and implement it within the RAC method Contextual Preselection with Plackett-Luce (CPPL blue). We apply cost-sensitive machine learning with pairwise comparisons to determine whether ongoing evaluations can be terminated to free resources. We compare our approach to a black-box equivalent on several experimental settings and show that our approach reduces the total solving time in several scenarios and improves solution quality in an additional scenario.
{"title":"Realtime gray-box algorithm configuration using cost-sensitive classification","authors":"Dimitri Weiss, Kevin Tierney","doi":"10.1007/s10472-023-09890-x","DOIUrl":"10.1007/s10472-023-09890-x","url":null,"abstract":"<div><p>A solver’s runtime and the quality of the solutions it generates are strongly influenced by its parameter settings. Finding good parameter configurations is a formidable challenge, even for fixed problem instance distributions. However, when the instance distribution can change over time, a once effective configuration may no longer provide adequate performance. Realtime algorithm configuration (RAC) offers assistance in finding high-quality configurations for such distributions by automatically adjusting the configurations it recommends based on instances seen so far. Existing RAC methods treat the solver as a black box, meaning the solver is given a configuration as input, and it outputs either a solution or runtime as an objective function for the configurator. However, analyzing intermediate output from the solver can enable configurators to avoid wasting time on poorly performing configurations. We propose a gray-box approach that utilizes intermediate output during evaluation and implement it within the RAC method Contextual Preselection with Plackett-Luce (CPPL blue). We apply cost-sensitive machine learning with pairwise comparisons to determine whether ongoing evaluations can be terminated to free resources. We compare our approach to a black-box equivalent on several experimental settings and show that our approach reduces the total solving time in several scenarios and improves solution quality in an additional scenario.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"93 1","pages":"109 - 130"},"PeriodicalIF":1.2,"publicationDate":"2023-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10472-023-09890-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42645690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-14DOI: 10.1007/s10472-023-09888-5
Ammon Washburn, Neng Fan, Hao Helen Zhang
In this paper, we develop two SVM-based classifiers named stable nested one-class support vector machines (SN-1SVMs) and decoupled margin-moment based SVMs (DMMB-SVMs), to predict the specific type of pancreatic carcinoma using quantitative histopathological signatures of images. For each patient, the diagnosis can produce hundreds of images, which can be used to classify the pancreatic tissues into three classes: chronic pancreatitis, intraductal papillary mucinous neoplasms, and pancreatic carcinoma. The proposed two approaches tackle the classification problems from two different perspectives: the SN-1SVM treats each image as a classification point in a nested fashion to predict malignancy of the tissues, while the DMMB-SVM treats each patient as a classification point by assembling information across images. One attractive feature of the DMMB-SVM is that, in addition to utilizing the mean information, it also takes into account the covariance of features extracted from images for each patient. We conduct numerical experiments to evaluate and compare performance of the two methods. It is observed that the SN-1SVM can take advantage of the data structure more effectively, while the DMMB-SVM demonstrates better computational efficiency and classification accuracy. To further improve interpretability of the final classifier, we also consider the (ell _1)-norm in the DMMB-SVM to handle feature selection.
{"title":"Novel SVM-based classification approaches for evaluating pancreatic carcinoma","authors":"Ammon Washburn, Neng Fan, Hao Helen Zhang","doi":"10.1007/s10472-023-09888-5","DOIUrl":"10.1007/s10472-023-09888-5","url":null,"abstract":"<div><p>In this paper, we develop two SVM-based classifiers named stable nested one-class support vector machines (SN-1SVMs) and decoupled margin-moment based SVMs (DMMB-SVMs), to predict the specific type of pancreatic carcinoma using quantitative histopathological signatures of images. For each patient, the diagnosis can produce hundreds of images, which can be used to classify the pancreatic tissues into three classes: chronic pancreatitis, intraductal papillary mucinous neoplasms, and pancreatic carcinoma. The proposed two approaches tackle the classification problems from two different perspectives: the SN-1SVM treats each image as a classification point in a nested fashion to predict malignancy of the tissues, while the DMMB-SVM treats each patient as a classification point by assembling information across images. One attractive feature of the DMMB-SVM is that, in addition to utilizing the mean information, it also takes into account the covariance of features extracted from images for each patient. We conduct numerical experiments to evaluate and compare performance of the two methods. It is observed that the SN-1SVM can take advantage of the data structure more effectively, while the DMMB-SVM demonstrates better computational efficiency and classification accuracy. To further improve interpretability of the final classifier, we also consider the <span>(ell _1)</span>-norm in the DMMB-SVM to handle feature selection.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"93 1","pages":"93 - 108"},"PeriodicalIF":1.2,"publicationDate":"2023-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47529434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-08DOI: 10.1007/s10472-023-09887-6
Xiaomeng Dong, Tao Tan, Michael Potter, Yun-Chan Tsai, Gaurav Kumar, V. Ratna Saripalli, Theodore Trafalis
There is a parameter ubiquitous throughout the deep learning world: learning rate. There is likewise a ubiquitous question: what should that learning rate be? The true answer to this question is often tedious and time consuming to obtain, and a great deal of arcane knowledge has accumulated in recent years over how to pick and modify learning rates to achieve optimal training performance. Moreover, the long hours spent carefully crafting the perfect learning rate can come to nothing the moment your network architecture, optimizer, dataset, or initial conditions change ever so slightly. But it need not be this way. We propose a new answer to the great learning rate question: the Autonomous Learning Rate Controller. Find it at https://github.com/fastestimator/ARC/tree/v2.0.
{"title":"To raise or not to raise: the autonomous learning rate question","authors":"Xiaomeng Dong, Tao Tan, Michael Potter, Yun-Chan Tsai, Gaurav Kumar, V. Ratna Saripalli, Theodore Trafalis","doi":"10.1007/s10472-023-09887-6","DOIUrl":"10.1007/s10472-023-09887-6","url":null,"abstract":"<div><p>There is a parameter ubiquitous throughout the deep learning world: learning rate. There is likewise a ubiquitous question: what should that learning rate be? The true answer to this question is often tedious and time consuming to obtain, and a great deal of arcane knowledge has accumulated in recent years over how to pick and modify learning rates to achieve optimal training performance. Moreover, the long hours spent carefully crafting the perfect learning rate can come to nothing the moment your network architecture, optimizer, dataset, or initial conditions change ever so slightly. But it need not be this way. We propose a new answer to the great learning rate question: the Autonomous Learning Rate Controller. Find it at https://github.com/fastestimator/ARC/tree/v2.0.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 6","pages":"1679 - 1698"},"PeriodicalIF":1.2,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89058610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-02DOI: 10.1007/s10472-023-09853-2
Mohadese Basirati, Romain Billot, Patrick Meyer
Strategic spatial planning is becoming more popular around the world as a decision-making way to build a unified vision for directing the medium- to long-term development of land/marine areas. Recently, the study of marine areas in terms of spatial planning such as Marine Spatial Planning (MSP) has received much attention. One of the challenging issues in MSP is to make a balance between determining the ideal zone for a new activity while also considering the locations of existing activities. This spatial zoning problem for multi-uses with multiple objectives could be formulated as optimization models. This paper presents and compares the results of two multi-objective evolutionary-based algorithms (MOEAs), Synchronous Hypervolume-based non-dominated sorting genetic algorithm-II (SH-NSGA-II) which is an extension of NSGA-II and a memetic algorithm (MA) in which SH-NSGA-II is enhanced with a local search. These proposed algorithms are used to solve the multi-objective spatial zoning optimization problem, which seeks to maximize the zone interest value assigned to the new activity while simultaneously maximizing its spatial compactness. We introduce several innovations in these proposed algorithms to address the problem constraints and to improve the robustness of the traditional NSGA-II and MA approaches. Unlike traditional ones, a different stop condition, multiple crossover, mutation, and repairing operators, and also a local search operator are developed. A comparative study is presented between the results obtained using both algorithms. To guarantee robust results for both algorithms, their parameters are calibrated and tuned using the Multi-Response Surface Methodology (MRSM) method. The effective and non-effective components, as well as the validity of the regression models, are determined using analysis of variance (ANOVA). Although SH-NSGA-II has revealed a good efficiency, its performance is still improved using a local search scheme within SH-NSGA-II, which is specially tailored to the problem characteristics. The two methods are designed for raster data.
{"title":"Two parameter-tuned multi-objective evolutionary-based algorithms for zoning management in marine spatial planning","authors":"Mohadese Basirati, Romain Billot, Patrick Meyer","doi":"10.1007/s10472-023-09853-2","DOIUrl":"10.1007/s10472-023-09853-2","url":null,"abstract":"<div><p>Strategic spatial planning is becoming more popular around the world as a decision-making way to build a unified vision for directing the medium- to long-term development of land/marine areas. Recently, the study of marine areas in terms of spatial planning such as Marine Spatial Planning (MSP) has received much attention. One of the challenging issues in MSP is to make a balance between determining the ideal zone for a new activity while also considering the locations of existing activities. This spatial zoning problem for multi-uses with multiple objectives could be formulated as optimization models. This paper presents and compares the results of two multi-objective evolutionary-based algorithms (MOEAs), Synchronous Hypervolume-based non-dominated sorting genetic algorithm-II (SH-NSGA-II) which is an extension of NSGA-II and a memetic algorithm (MA) in which SH-NSGA-II is enhanced with a local search. These proposed algorithms are used to solve the multi-objective spatial zoning optimization problem, which seeks to maximize the zone interest value assigned to the new activity while simultaneously maximizing its spatial compactness. We introduce several innovations in these proposed algorithms to address the problem constraints and to improve the robustness of the traditional NSGA-II and MA approaches. Unlike traditional ones, a different stop condition, multiple crossover, mutation, and repairing operators, and also a local search operator are developed. A comparative study is presented between the results obtained using both algorithms. To guarantee robust results for both algorithms, their parameters are calibrated and tuned using the Multi-Response Surface Methodology (MRSM) method. The effective and non-effective components, as well as the validity of the regression models, are determined using analysis of variance (ANOVA). Although SH-NSGA-II has revealed a good efficiency, its performance is still improved using a local search scheme within SH-NSGA-II, which is specially tailored to the problem characteristics. The two methods are designed for raster data.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"93 1","pages":"187 - 218"},"PeriodicalIF":1.2,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43203539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}