Johannes J. Pienaar, Anna S. Bosman, Katherine M. Malan
{"title":"Hilbert curves for efficient exploratory landscape analysis neighbourhood sampling","authors":"Johannes J. Pienaar, Anna S. Bosman, Katherine M. Malan","doi":"arxiv-2408.00526","DOIUrl":null,"url":null,"abstract":"Landscape analysis aims to characterise optimisation problems based on their\nobjective (or fitness) function landscape properties. The problem search space\nis typically sampled, and various landscape features are estimated based on the\nsamples. One particularly salient set of features is information content, which\nrequires the samples to be sequences of neighbouring solutions, such that the\nlocal relationships between consecutive sample points are preserved. Generating\nsuch spatially correlated samples that also provide good search space coverage\nis challenging. It is therefore common to first obtain an unordered sample with\ngood search space coverage, and then apply an ordering algorithm such as the\nnearest neighbour to minimise the distance between consecutive points in the\nsample. However, the nearest neighbour algorithm becomes computationally\nprohibitive in higher dimensions, thus there is a need for more efficient\nalternatives. In this study, Hilbert space-filling curves are proposed as a\nmethod to efficiently obtain high-quality ordered samples. Hilbert curves are a\nspecial case of fractal curves, and guarantee uniform coverage of a bounded\nsearch space while providing a spatially correlated sample. We study the\neffectiveness of Hilbert curves as samplers, and discover that they are capable\nof extracting salient features at a fraction of the computational cost compared\nto Latin hypercube sampling with post-factum ordering. Further, we investigate\nthe use of Hilbert curves as an ordering strategy, and find that they order the\nsample significantly faster than the nearest neighbour ordering, without\nsacrificing the saliency of the extracted features.","PeriodicalId":501347,"journal":{"name":"arXiv - CS - Neural and Evolutionary Computing","volume":"17 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Neural and Evolutionary Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.00526","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Landscape analysis aims to characterise optimisation problems based on their
objective (or fitness) function landscape properties. The problem search space
is typically sampled, and various landscape features are estimated based on the
samples. One particularly salient set of features is information content, which
requires the samples to be sequences of neighbouring solutions, such that the
local relationships between consecutive sample points are preserved. Generating
such spatially correlated samples that also provide good search space coverage
is challenging. It is therefore common to first obtain an unordered sample with
good search space coverage, and then apply an ordering algorithm such as the
nearest neighbour to minimise the distance between consecutive points in the
sample. However, the nearest neighbour algorithm becomes computationally
prohibitive in higher dimensions, thus there is a need for more efficient
alternatives. In this study, Hilbert space-filling curves are proposed as a
method to efficiently obtain high-quality ordered samples. Hilbert curves are a
special case of fractal curves, and guarantee uniform coverage of a bounded
search space while providing a spatially correlated sample. We study the
effectiveness of Hilbert curves as samplers, and discover that they are capable
of extracting salient features at a fraction of the computational cost compared
to Latin hypercube sampling with post-factum ordering. Further, we investigate
the use of Hilbert curves as an ordering strategy, and find that they order the
sample significantly faster than the nearest neighbour ordering, without
sacrificing the saliency of the extracted features.