首页 > 最新文献

Algorithms最新文献

英文 中文
Phylogeny and species diversity of the genus Helvella with emphasis on eighteen new species from China. Helvella 属的系统发育和物种多样性,重点是来自中国的 18 个新物种。
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-01 Epub Date: 2023-08-29 DOI: 10.3114/fuse.2023.12.08
N Mao, Y Y Xu, Y X Zhang, H Zhou, X B Huang, C L Hou, L Fan

Helvella is a widespread, frequently encountered fungal group appearing in forests, but the species diversity and molecular phylogeny of Helvella in China remains incompletely understood. In this work, we performed comprehensive phylogenetic analyses using multilocus sequence data. Six datasets were employed, including a five-locus concatenated dataset (ITS, nrLSU, tef1-α, rpb2, hsp), a two-locus concatenated dataset (ITS, nrLSU), and four single-locus datasets (ITS) that were divided based on the four different phylogenetic clades of Helvella recognized in this study. A total of I 946 sequences were used, of which 713 were newly generated, including 170 sequences of ITS, 174 sequences of nrLSU, 131 sequences of tef1-α, 107 sequences of rpb2 and 131 sequences of hsp. The phylogeny based on the five-locus concatenated dataset revealed that Helvellas. str. is monophyletic and four phylogenetic clades are clearly recognized, i.e., Acetabulum clade, Crispa clade, Elastica clade, and Lacunosa clade. A total of 24 lineages or subclades were recognized, II of which were new, the remaining 13 corresponding with previous studies. Chinese Helvella species are distributed in 22 lineages across four clades. Phylogenetic analyses based on the two-locus concatenated dataset and four single-locus datasets confirmed the presence of at least 93 phylogenetic species in China. Among them, 58 are identified as known species, including a species with a newly designated lectotype and epitype, 18 are newly described in this paper, and the remaining 17 taxa are putatively new to science but remain unnamed due to the paucity or absence of ascomatal materials. In addition, the Helvella species previously recorded in China are discussed. A list of 76 confirmed species, including newly proposed species, is provided. The occurrence of H. crispa and H. elastica are not confirmed although both are commonly recorded in China. Citation: Mao N, Xu YY, Zhang YX, Zhou H, Huang XB, Hou CL, Fan L (2023). Phylogeny and species diversity of the genus Helvella with emphasis on eighteen new species from China. Fungal Systematics and Evolution 12: 111-152. doi: 10.3114/fuse.2023.12.08.

赫尔维拉(Helvella)是一种广泛存在于森林中、经常出现的真菌类群,但对中国赫尔维拉的物种多样性和分子系统发育仍不完全了解。在这项工作中,我们利用多焦点序列数据进行了全面的系统发育分析。本研究采用了六个数据集,包括一个五分点序列数据集(ITS、nrLSU、tef1-α、rpb2、hsp)、一个双分点序列数据集(ITS、nrLSU)和四个单分点序列数据集(ITS)。共使用了 I 946 个序列,其中 713 个是新产生的,包括 170 个 ITS 序列、174 个 nrLSU 序列、131 个 tef1-α 序列、107 个 rpb2 序列和 131 个 hsp 序列。基于五焦点数据集的系统发育显示,Helvellas. str.是单系的,并清晰地识别出四个系统发育支系,即 Acetabulum 支系、Crispa 支系、Elastica 支系和 Lacunosa 支系。共确认了 24 个支系或亚支系,其中 2 个是新的支系或亚支系,其余 13 个支系或亚支系与以前的研究一致。中国的 Helvella 物种分布在 4 个支系的 22 个系中。基于双焦点数据集和四个单焦点数据集的系统发育分析证实,中国至少存在 93 个系统发育物种。其中,58 个被鉴定为已知种,包括 1 个新指定的讲座型和表型种,18 个为本文新描述的种,其余 17 个分类群被认为是科学上的新种,但由于缺乏或没有顶生材料而仍未命名。此外,本文还讨论了以前在中国记录的 Helvella 物种。本报告提供了 76 个已确认物种的清单,其中包括新提出的物种。虽然 H. crispa 和 H. elastica 在中国都有常见记录,但它们的出现未得到证实。引用:Mao N, Xu YY, Zhang YX, Zhou H, Huang XB, Hou CL, Fan L (2023).Helvella 属的系统发育和物种多样性,重点是来自中国的 18 个新种。Fungal Systematics and Evolution 12: 111-152. doi: 10.3114/fuse.2023.12.08.
{"title":"Phylogeny and species diversity of the genus <i>Helvella</i> with emphasis on eighteen new species from China.","authors":"N Mao, Y Y Xu, Y X Zhang, H Zhou, X B Huang, C L Hou, L Fan","doi":"10.3114/fuse.2023.12.08","DOIUrl":"10.3114/fuse.2023.12.08","url":null,"abstract":"<p><p><b></b> <i>Helvella</i> is a widespread, frequently encountered fungal group appearing in forests, but the species diversity and molecular phylogeny of <i>Helvella</i> in China remains incompletely understood. In this work, we performed comprehensive phylogenetic analyses using multilocus sequence data. Six datasets were employed, including a five-locus concatenated dataset (ITS, nrLSU, <i>tef1-α, rpb2, hsp)</i>, a two-locus concatenated dataset (ITS, nrLSU), and four single-locus datasets (ITS) that were divided based on the four different phylogenetic clades of <i>Helvella</i> recognized in this study. A total of I 946 sequences were used, of which 713 were newly generated, including 170 sequences of ITS, 174 sequences of nrLSU, 131 sequences of <i>tef1-α</i>, 107 sequences of <i>rpb2</i> and 131 sequences of <i>hsp.</i> The phylogeny based on the five-locus concatenated dataset revealed that <i>Helvellas. str</i>. is monophyletic and four phylogenetic clades are clearly recognized, <i>i.e., Acetabulum</i> clade, <i>Crispa</i> clade, <i>Elastica</i> clade, and <i>Lacunosa</i> clade. A total of 24 lineages or subclades were recognized, II of which were new, the remaining 13 corresponding with previous studies. Chinese <i>Helvella</i> species are distributed in 22 lineages across four clades. Phylogenetic analyses based on the two-locus concatenated dataset and four single-locus datasets confirmed the presence of at least 93 phylogenetic species in China. Among them, 58 are identified as known species, including a species with a newly designated lectotype and epitype, 18 are newly described in this paper, and the remaining 17 taxa are putatively new to science but remain unnamed due to the paucity or absence of ascomatal materials. In addition, the <i>Helvella</i> species previously recorded in China are discussed. A list of 76 confirmed species, including newly proposed species, is provided. The occurrence of <i>H. crispa</i> and <i>H. elastica</i> are not confirmed although both are commonly recorded in China. <b>Citation:</b> Mao N, Xu YY, Zhang YX, Zhou H, Huang XB, Hou CL, Fan L (2023). Phylogeny and species diversity of the genus <i>Helvella</i> with emphasis on eighteen new species from China. <i>Fungal Systematics and Evolution</i> <b>12</b>: 111-152. doi: 10.3114/fuse.2023.12.08.</p>","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"10 1","pages":"111-152"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10964050/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78899145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Finding Bottlenecks in Message Passing Interface Programs by Scalable Critical Path Analysis 利用可扩展关键路径分析发现消息传递接口程序中的瓶颈
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-31 DOI: 10.3390/a16110505
Vladimir Korkhov, Ivan Gankevich, Anton Gavrikov, Maria Mingazova, Ivan Petriakov, Dmitrii Tereshchenko, Artem Shatalin, Vitaly Slobodskoy
Bottlenecks and imbalance in parallel programs can significantly affect performance of parallel execution. Finding these bottlenecks is a key issue in performance analysis of MPI programs especially on a large scale. One of the ways to discover bottlenecks is to analyze the critical path of the parallel program: the longest execution path in the program activity graph. There are a number of methods of finding the critical path; however, most of them suffer a performance drop when scaled. In this paper, we analyze several methods of critical path finding based on classical Dijkstra and Delta-stepping algorithms along with the proposed algorithm based on topological sorting. Corresponding algorithms for each approach are presented including additional enhancements for increasing performance. The implementation of the algorithms and resulting performance for several benchmark applications (NAS Parallel Benchmarks, CP2K, OpenFOAM, LAMMPS, and MiniFE) are analyzed and discussed.
并行程序中的瓶颈和不平衡会严重影响并行执行的性能。发现这些瓶颈是MPI程序性能分析中的一个关键问题,特别是在大规模的MPI程序中。发现瓶颈的方法之一是分析并行程序的关键路径:程序活动图中最长的执行路径。找到关键路径的方法有很多种;然而,它们中的大多数在扩展时都会出现性能下降。本文分析了几种基于经典Dijkstra算法和delta步进算法的关键路径查找方法,并提出了基于拓扑排序的关键路径查找算法。提出了每种方法的相应算法,包括提高性能的附加增强。分析和讨论了算法的实现和几个基准测试应用程序(NAS Parallel benchmark、CP2K、OpenFOAM、LAMMPS和MiniFE)的性能。
{"title":"Finding Bottlenecks in Message Passing Interface Programs by Scalable Critical Path Analysis","authors":"Vladimir Korkhov, Ivan Gankevich, Anton Gavrikov, Maria Mingazova, Ivan Petriakov, Dmitrii Tereshchenko, Artem Shatalin, Vitaly Slobodskoy","doi":"10.3390/a16110505","DOIUrl":"https://doi.org/10.3390/a16110505","url":null,"abstract":"Bottlenecks and imbalance in parallel programs can significantly affect performance of parallel execution. Finding these bottlenecks is a key issue in performance analysis of MPI programs especially on a large scale. One of the ways to discover bottlenecks is to analyze the critical path of the parallel program: the longest execution path in the program activity graph. There are a number of methods of finding the critical path; however, most of them suffer a performance drop when scaled. In this paper, we analyze several methods of critical path finding based on classical Dijkstra and Delta-stepping algorithms along with the proposed algorithm based on topological sorting. Corresponding algorithms for each approach are presented including additional enhancements for increasing performance. The implementation of the algorithms and resulting performance for several benchmark applications (NAS Parallel Benchmarks, CP2K, OpenFOAM, LAMMPS, and MiniFE) are analyzed and discussed.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"181 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135869745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Demand-Responsive Feeder Bus Network Design for a Short Headway Trunk Line 基于动态需求响应的短距离干线接驳巴士网络设计
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-31 DOI: 10.3390/a16110506
Amirreza Nickkar, Young-Jae Lee
Recent advancements in technology have increased the potential for demand-responsive feeder transit services to enhance mobility in areas with limited public transit access. For long rail headways, feeder bus network algorithms are straightforward, as the maximum feeder service cycle time is determined by rail headway, and bus–train matching is unnecessary. However, for short rail headways, the algorithm must address both passenger–feeder-bus and feeder-bus–train matching. This study presents a simulated annealing (SA) algorithm for flexible feeder bus routing, accommodating short headway trunk lines and multiple bus relocations for various stations and trains. A 5 min headway rail trunk line example was utilized to test the algorithm. The algorithm effectively managed bus relocations when optimal routes were infeasible at specific stations. Additionally, the algorithm minimized total costs, accounting for vehicle operating expenses and passenger in-vehicle travel time costs, while considering multiple vehicle relocations.
最近技术的进步增加了需求响应支线运输服务的潜力,以提高公共交通有限地区的机动性。对于较长的轨道进尺,接驳巴士网络算法比较简单,因为最大接驳服务周期时间由轨道进尺决定,不需要进行车车匹配。然而,对于较短的轨道,该算法必须同时解决乘客-馈线-公共汽车和馈线-公共汽车-列车的匹配问题。本研究提出一种模拟退火(SA)演算法,用于灵活的接驳巴士路线,以适应不同车站和列车的短车头干线和多次巴士重新安置。以5分钟车头距轨道干线为例,对该算法进行了验证。该算法有效地管理了特定站点无法实现最优路线时的公交重新调度。此外,该算法在考虑多个车辆重新安置的同时,考虑了车辆运营费用和乘客车内旅行时间成本,使总成本最小化。
{"title":"Dynamic Demand-Responsive Feeder Bus Network Design for a Short Headway Trunk Line","authors":"Amirreza Nickkar, Young-Jae Lee","doi":"10.3390/a16110506","DOIUrl":"https://doi.org/10.3390/a16110506","url":null,"abstract":"Recent advancements in technology have increased the potential for demand-responsive feeder transit services to enhance mobility in areas with limited public transit access. For long rail headways, feeder bus network algorithms are straightforward, as the maximum feeder service cycle time is determined by rail headway, and bus–train matching is unnecessary. However, for short rail headways, the algorithm must address both passenger–feeder-bus and feeder-bus–train matching. This study presents a simulated annealing (SA) algorithm for flexible feeder bus routing, accommodating short headway trunk lines and multiple bus relocations for various stations and trains. A 5 min headway rail trunk line example was utilized to test the algorithm. The algorithm effectively managed bus relocations when optimal routes were infeasible at specific stations. Additionally, the algorithm minimized total costs, accounting for vehicle operating expenses and passenger in-vehicle travel time costs, while considering multiple vehicle relocations.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"2007 26","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135813923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decision-Maker’s Preference-Driven Dynamic Multi-Objective Optimization 决策者偏好驱动的动态多目标优化
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-30 DOI: 10.3390/a16110504
Adekunle Rotimi Adekoya, Mardé Helbig
Dynamic multi-objective optimization problems (DMOPs) are optimization problems where elements of the problems, such as the objective functions and/or constraints, change with time. These problems are characterized by two or more objective functions, where at least two objective functions are in conflict with one another. When solving real-world problems, the incorporation of human decision-makers (DMs)’ preferences or expert knowledge into the optimization process and thereby restricting the search to a specific region of the Pareto-optimal Front (POF) may result in more preferred or suitable solutions. This study proposes approaches that enable DMs to influence the search process with their preferences by reformulating the optimization problems as constrained problems. The subsequent constrained problems are solved using various constraint handling approaches, such as the penalization of infeasible solutions and the restriction of the search to the feasible region of the search space. The proposed constraint handling approaches are compared by incorporating the approaches into a differential evolution (DE) algorithm and measuring the algorithm’s performance using both standard performance measures for dynamic multi-objective optimization (DMOO), as well as newly proposed measures for constrained DMOPs. The new measures indicate how well an algorithm was able to find solutions in the objective space that best reflect the DM’s preferences and the Pareto-optimality goal of dynamic multi-objective optimization algorithms (DMOAs). The results indicate that the constraint handling approaches are effective in finding Pareto-optimal solutions that satisfy the preference constraints of a DM.
动态多目标优化问题(dops)是问题的要素(如目标函数和/或约束)随时间变化的优化问题。这些问题以两个或多个目标函数为特征,其中至少有两个目标函数相互冲突。在解决现实问题时,将人类决策者(DMs)的偏好或专家知识纳入优化过程,从而将搜索限制在帕累托最优前沿(POF)的特定区域,可能会产生更优选或更合适的解决方案。本研究提出了一些方法,通过将优化问题重新表述为约束问题,使决策经理能够用他们的偏好影响搜索过程。随后的约束问题采用各种约束处理方法来解决,如对不可行解的惩罚和将搜索限制在搜索空间的可行区域。通过将所提出的约束处理方法合并到差分进化(DE)算法中,并使用动态多目标优化(DMOO)的标准性能度量和约束dmop的新度量来衡量算法的性能,对所提出的约束处理方法进行了比较。这些新指标表明了算法在目标空间中找到最能反映决策制定者偏好和动态多目标优化算法(DMOAs)的帕累托最优目标的解决方案的能力。结果表明,约束处理方法可以有效地找到满足偏好约束的pareto最优解。
{"title":"Decision-Maker’s Preference-Driven Dynamic Multi-Objective Optimization","authors":"Adekunle Rotimi Adekoya, Mardé Helbig","doi":"10.3390/a16110504","DOIUrl":"https://doi.org/10.3390/a16110504","url":null,"abstract":"Dynamic multi-objective optimization problems (DMOPs) are optimization problems where elements of the problems, such as the objective functions and/or constraints, change with time. These problems are characterized by two or more objective functions, where at least two objective functions are in conflict with one another. When solving real-world problems, the incorporation of human decision-makers (DMs)’ preferences or expert knowledge into the optimization process and thereby restricting the search to a specific region of the Pareto-optimal Front (POF) may result in more preferred or suitable solutions. This study proposes approaches that enable DMs to influence the search process with their preferences by reformulating the optimization problems as constrained problems. The subsequent constrained problems are solved using various constraint handling approaches, such as the penalization of infeasible solutions and the restriction of the search to the feasible region of the search space. The proposed constraint handling approaches are compared by incorporating the approaches into a differential evolution (DE) algorithm and measuring the algorithm’s performance using both standard performance measures for dynamic multi-objective optimization (DMOO), as well as newly proposed measures for constrained DMOPs. The new measures indicate how well an algorithm was able to find solutions in the objective space that best reflect the DM’s preferences and the Pareto-optimality goal of dynamic multi-objective optimization algorithms (DMOAs). The results indicate that the constraint handling approaches are effective in finding Pareto-optimal solutions that satisfy the preference constraints of a DM.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"2023 7-8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136103636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automating Stimulation Frequency Selection for SSVEP-Based Brain-Computer Interfaces 基于ssvep的脑机接口刺激频率自动选择
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-29 DOI: 10.3390/a16110502
Alexey Kozin, Anton Gerasimov, Maxim Bakaev, Anton Pashkov, Olga Razumnikova
Brain–computer interfaces (BCIs) based on steady-state visually evoked potentials (SSVEPs) are inexpensive and do not require user training. However, the highly personalized reaction to visual stimulation is an obstacle to the wider application of this technique, as it can be ineffective, tiring, or even harmful at certain frequencies. In our experimental study, we proposed a new approach to the selection of optimal frequencies of photostimulation. By using a custom photostimulation device, we covered a frequency range from 5 to 25 Hz with 1 Hz increments, recording the subjects’ brainwave activity (EEG) and analyzing the signal-to-noise ratio (SNR) changes at the corresponding frequencies. The proposed set of SNR-based coefficients and the discomfort index, determined by the ratio of theta and beta rhythms in the EEG signal, enables the automation of obtaining the recommended stimulation frequencies for use in SSVEP-based BCIs.
基于稳态视觉诱发电位(ssvep)的脑机接口(bci)价格低廉,不需要用户培训。然而,对视觉刺激的高度个性化反应阻碍了这项技术的广泛应用,因为它在某些频率下可能无效、累人甚至有害。在我们的实验研究中,我们提出了一种选择最佳光刺激频率的新方法。我们使用定制的光刺激装置,在5 ~ 25 Hz的频率范围内,以1 Hz的增量记录受试者的脑电波活动,并分析相应频率下的信噪比变化。所提出的一组基于信噪比的系数和不适指数,由脑电图信号中θ和β节律的比值决定,可以自动获得推荐的刺激频率,用于基于ssvep的脑机接口。
{"title":"Automating Stimulation Frequency Selection for SSVEP-Based Brain-Computer Interfaces","authors":"Alexey Kozin, Anton Gerasimov, Maxim Bakaev, Anton Pashkov, Olga Razumnikova","doi":"10.3390/a16110502","DOIUrl":"https://doi.org/10.3390/a16110502","url":null,"abstract":"Brain–computer interfaces (BCIs) based on steady-state visually evoked potentials (SSVEPs) are inexpensive and do not require user training. However, the highly personalized reaction to visual stimulation is an obstacle to the wider application of this technique, as it can be ineffective, tiring, or even harmful at certain frequencies. In our experimental study, we proposed a new approach to the selection of optimal frequencies of photostimulation. By using a custom photostimulation device, we covered a frequency range from 5 to 25 Hz with 1 Hz increments, recording the subjects’ brainwave activity (EEG) and analyzing the signal-to-noise ratio (SNR) changes at the corresponding frequencies. The proposed set of SNR-based coefficients and the discomfort index, determined by the ratio of theta and beta rhythms in the EEG signal, enables the automation of obtaining the recommended stimulation frequencies for use in SSVEP-based BCIs.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"312 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136158114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning-Based Approach for Predicting Diabetes Employing Socio-Demographic Characteristics 利用社会人口统计学特征预测糖尿病的机器学习方法
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-29 DOI: 10.3390/a16110503
Md. Ashikur Rahman, Lway Faisal Abdulrazak, Md. Mamun Ali, Imran Mahmud, Kawsar Ahmed, Francis M. Bui
Diabetes is one of the fatal diseases that play a vital role in the growth of other diseases in the human body. From a clinical perspective, the most significant approach to mitigating the effects of diabetes is early-stage control and management, with the aim of a potential cure. However, lack of awareness and expensive clinical tests are the primary reasons why clinical diagnosis and preventive measures are neglected in lower-income countries like Bangladesh, Pakistan, and India. From this perspective, this study aims to build an automated machine learning (ML) model, which will predict diabetes at an early stage using socio-demographic characteristics rather than clinical attributes, due to the fact that clinical features are not always accessible to all people from lower-income countries. To find the best fit of the supervised ML classifier of the model, we applied six classification algorithms and found that RF outperformed with an accuracy of 99.36%. In addition, the most significant risk factors were found based on the SHAP value by all the applied classifiers. This study reveals that polyuria, polydipsia, and delayed healing are the most significant risk factors for developing diabetes. The findings indicate that the proposed model is highly capable of predicting diabetes in the early stages.
糖尿病是一种致命疾病,对人体其他疾病的发展起着至关重要的作用。从临床角度来看,减轻糖尿病影响的最重要方法是早期控制和管理,以潜在的治愈为目标。然而,缺乏认识和昂贵的临床检测是孟加拉国、巴基斯坦和印度等低收入国家忽视临床诊断和预防措施的主要原因。从这个角度来看,本研究旨在建立一个自动机器学习(ML)模型,该模型将使用社会人口统计学特征而不是临床属性来预测早期阶段的糖尿病,因为临床特征并不总是适用于低收入国家的所有人。为了找到模型的有监督ML分类器的最佳拟合,我们应用了六种分类算法,发现RF以99.36%的准确率优于模型。此外,所有应用的分类器根据SHAP值发现了最显著的危险因素。本研究表明,多尿、多饮和延迟愈合是发展为糖尿病的最重要的危险因素。研究结果表明,该模型对早期糖尿病的预测能力很强。
{"title":"Machine Learning-Based Approach for Predicting Diabetes Employing Socio-Demographic Characteristics","authors":"Md. Ashikur Rahman, Lway Faisal Abdulrazak, Md. Mamun Ali, Imran Mahmud, Kawsar Ahmed, Francis M. Bui","doi":"10.3390/a16110503","DOIUrl":"https://doi.org/10.3390/a16110503","url":null,"abstract":"Diabetes is one of the fatal diseases that play a vital role in the growth of other diseases in the human body. From a clinical perspective, the most significant approach to mitigating the effects of diabetes is early-stage control and management, with the aim of a potential cure. However, lack of awareness and expensive clinical tests are the primary reasons why clinical diagnosis and preventive measures are neglected in lower-income countries like Bangladesh, Pakistan, and India. From this perspective, this study aims to build an automated machine learning (ML) model, which will predict diabetes at an early stage using socio-demographic characteristics rather than clinical attributes, due to the fact that clinical features are not always accessible to all people from lower-income countries. To find the best fit of the supervised ML classifier of the model, we applied six classification algorithms and found that RF outperformed with an accuracy of 99.36%. In addition, the most significant risk factors were found based on the SHAP value by all the applied classifiers. This study reveals that polyuria, polydipsia, and delayed healing are the most significant risk factors for developing diabetes. The findings indicate that the proposed model is highly capable of predicting diabetes in the early stages.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"59 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136136169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Denoising Diffusion Models on Model-Based Latent Space 基于模型潜在空间的扩散模型去噪
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-28 DOI: 10.3390/a16110501
Carmelo Scribano, Danilo Pezzi, Giorgia Franchini, Marco Prato
With the recent advancements in the field of diffusion generative models, it has been shown that defining the generative process in the latent space of a powerful pretrained autoencoder can offer substantial advantages. This approach, by abstracting away imperceptible image details and introducing substantial spatial compression, renders the learning of the generative process more manageable while significantly reducing computational and memory demands. In this work, we propose to replace autoencoder coding with a model-based coding scheme based on traditional lossy image compression techniques; this choice not only further diminishes computational expenses but also allows us to probe the boundaries of latent-space image generation. Our objectives culminate in the proposal of a valuable approximation for training continuous diffusion models within a discrete space, accompanied by enhancements to the generative model for categorical values. Beyond the good results obtained for the problem at hand, we believe that the proposed work holds promise for enhancing the adaptability of generative diffusion models across diverse data types beyond the realm of imagery.
随着扩散生成模型领域的最新进展,已经证明在强大的预训练自编码器的潜在空间中定义生成过程可以提供实质性的优势。这种方法通过抽象掉难以察觉的图像细节并引入大量的空间压缩,使生成过程的学习更易于管理,同时显着减少了计算和内存需求。在这项工作中,我们提出用基于模型的编码方案取代自动编码器编码,该方案基于传统的有损图像压缩技术;这种选择不仅进一步减少了计算费用,而且还允许我们探索潜在空间图像生成的边界。我们的最终目标是提出一个有价值的近似,用于在离散空间内训练连续扩散模型,同时增强分类值的生成模型。除了为手头的问题获得的良好结果之外,我们相信所提出的工作有望增强生成扩散模型在图像领域之外的不同数据类型之间的适应性。
{"title":"Denoising Diffusion Models on Model-Based Latent Space","authors":"Carmelo Scribano, Danilo Pezzi, Giorgia Franchini, Marco Prato","doi":"10.3390/a16110501","DOIUrl":"https://doi.org/10.3390/a16110501","url":null,"abstract":"With the recent advancements in the field of diffusion generative models, it has been shown that defining the generative process in the latent space of a powerful pretrained autoencoder can offer substantial advantages. This approach, by abstracting away imperceptible image details and introducing substantial spatial compression, renders the learning of the generative process more manageable while significantly reducing computational and memory demands. In this work, we propose to replace autoencoder coding with a model-based coding scheme based on traditional lossy image compression techniques; this choice not only further diminishes computational expenses but also allows us to probe the boundaries of latent-space image generation. Our objectives culminate in the proposal of a valuable approximation for training continuous diffusion models within a discrete space, accompanied by enhancements to the generative model for categorical values. Beyond the good results obtained for the problem at hand, we believe that the proposed work holds promise for enhancing the adaptability of generative diffusion models across diverse data types beyond the realm of imagery.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136231666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-Way Linear Probing Revisited 双向线性探测
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-28 DOI: 10.3390/a16110500
Ketan Dalal, Luc Devroye, Ebrahim Malalla
Linear probing continues to be one of the best practical hashing algorithms due to its good average performance, efficiency, and simplicity of implementation. However, the worst-case performance of linear probing seems to degrade with high load factors due to a primary-clustering tendency of one collision to cause more nearby collisions. It is known that the maximum cluster size produced by linear probing, and hence the length of the longest probe sequence needed to insert or search for a key in a hash table of size n, is Ω(logn), in probability. In this article, we introduce linear probing hashing schemes that employ two linear probe sequences to find empty cells for the keys. Our results show that two-way linear probing is a promising alternative to linear probing for hash tables. We show that two-way linear probing has an asymptotically almost surely O(loglogn) maximum cluster size when the load factor is constant. Matching lower bounds on the maximum cluster size produced by any two-way linear probing algorithm are obtained as well. Our analysis is based on a novel approach that uses the multiple-choice paradigm and witness trees.
线性探测仍然是最实用的散列算法之一,因为它具有良好的平均性能、效率和实现的简单性。然而,在高负载因素下,线性探测的最坏情况性能似乎会下降,因为一个碰撞的主要聚类倾向会导致更多的附近碰撞。众所周知,线性探测产生的最大簇大小,以及在大小为n的哈希表中插入或搜索键所需的最长探测序列的长度,在概率上是Ω(logn)。在本文中,我们将介绍线性探测散列方案,该方案使用两个线性探测序列来查找键的空单元格。我们的结果表明,双向线性探测是哈希表线性探测的一个很有前途的替代方案。我们表明,当负载因子恒定时,双向线性探测具有渐近几乎肯定的O(对数)最大簇大小。得到了任意双向线性探测算法所产生的最大簇大小的匹配下界。我们的分析是基于一种使用多项选择范例和见证树的新方法。
{"title":"Two-Way Linear Probing Revisited","authors":"Ketan Dalal, Luc Devroye, Ebrahim Malalla","doi":"10.3390/a16110500","DOIUrl":"https://doi.org/10.3390/a16110500","url":null,"abstract":"Linear probing continues to be one of the best practical hashing algorithms due to its good average performance, efficiency, and simplicity of implementation. However, the worst-case performance of linear probing seems to degrade with high load factors due to a primary-clustering tendency of one collision to cause more nearby collisions. It is known that the maximum cluster size produced by linear probing, and hence the length of the longest probe sequence needed to insert or search for a key in a hash table of size n, is Ω(logn), in probability. In this article, we introduce linear probing hashing schemes that employ two linear probe sequences to find empty cells for the keys. Our results show that two-way linear probing is a promising alternative to linear probing for hash tables. We show that two-way linear probing has an asymptotically almost surely O(loglogn) maximum cluster size when the load factor is constant. Matching lower bounds on the maximum cluster size produced by any two-way linear probing algorithm are obtained as well. Our analysis is based on a novel approach that uses the multiple-choice paradigm and witness trees.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"48 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136231798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering Non-Linear Boolean Functions by Evolving Walsh Transforms with Genetic Programming 用遗传规划进化Walsh变换发现非线性布尔函数
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-27 DOI: 10.3390/a16110499
Luigi Rovito, Andrea De Lorenzo, Luca Manzoni
Stream ciphers usually rely on highly secure Boolean functions to ensure safe communication within unsafe channels. However, discovering secure Boolean functions is a non-trivial optimization problem that has been addressed by many optimization techniques: in particular by evolutionary algorithms. We investigate in this article the employment of Genetic Programming (GP) for evolving Boolean functions with large non-linearity by examining the search space consisting of Walsh transforms. Especially, we build generic Walsh spectra starting from the evolution of Walsh transform coefficients. Then, by leveraging spectral inversion, we build pseudo-Boolean functions from which we are able to determine the corresponding nearest Boolean functions, whose computation involves filling via a random criterion a certain amount of “uncertain” positions in the final truth table. We show that by using a balancedness-preserving strategy, it is possible to exploit those positions to obtain a function that is as balanced as possible. We perform experiments by comparing different types of symbolic representations for the Walsh transform, and we analyze the percentage of uncertain positions. We systematically review the outcomes of these comparisons to highlight the best type of setting for this problem. We evolve Boolean functions from 6 to 16 bits and compare the GP-based evolution with random search to show that evolving Walsh transforms leads to highly non-linear functions that are balanced as well.
流密码通常依靠高度安全的布尔函数来确保不安全通道内的安全通信。然而,发现安全布尔函数是一个重要的优化问题,许多优化技术已经解决了这个问题:特别是进化算法。本文通过研究由Walsh变换组成的搜索空间,研究遗传规划(GP)在演化具有大非线性的布尔函数中的应用。特别地,我们从Walsh变换系数的演化出发,建立了通用的Walsh谱。然后,通过利用谱反演,我们构建伪布尔函数,从中我们能够确定相应的最接近的布尔函数,其计算涉及通过随机标准填充最终真值表中一定数量的“不确定”位置。我们表明,通过使用平衡保持策略,可以利用这些位置来获得尽可能平衡的函数。我们通过比较不同类型的沃尔什变换符号表示来进行实验,并分析了不确定位置的百分比。我们系统地回顾了这些比较的结果,以突出该问题的最佳设置类型。我们将布尔函数从6位进化到16位,并将基于gp的进化与随机搜索进行比较,以表明进化的沃尔什变换也会导致高度非线性的平衡函数。
{"title":"Discovering Non-Linear Boolean Functions by Evolving Walsh Transforms with Genetic Programming","authors":"Luigi Rovito, Andrea De Lorenzo, Luca Manzoni","doi":"10.3390/a16110499","DOIUrl":"https://doi.org/10.3390/a16110499","url":null,"abstract":"Stream ciphers usually rely on highly secure Boolean functions to ensure safe communication within unsafe channels. However, discovering secure Boolean functions is a non-trivial optimization problem that has been addressed by many optimization techniques: in particular by evolutionary algorithms. We investigate in this article the employment of Genetic Programming (GP) for evolving Boolean functions with large non-linearity by examining the search space consisting of Walsh transforms. Especially, we build generic Walsh spectra starting from the evolution of Walsh transform coefficients. Then, by leveraging spectral inversion, we build pseudo-Boolean functions from which we are able to determine the corresponding nearest Boolean functions, whose computation involves filling via a random criterion a certain amount of “uncertain” positions in the final truth table. We show that by using a balancedness-preserving strategy, it is possible to exploit those positions to obtain a function that is as balanced as possible. We perform experiments by comparing different types of symbolic representations for the Walsh transform, and we analyze the percentage of uncertain positions. We systematically review the outcomes of these comparisons to highlight the best type of setting for this problem. We evolve Boolean functions from 6 to 16 bits and compare the GP-based evolution with random search to show that evolving Walsh transforms leads to highly non-linear functions that are balanced as well.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136235709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Intersection of Computational Geometry Algorithms with Mobile Robot Path Planning 计算几何算法与移动机器人路径规划的交叉研究
Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-27 DOI: 10.3390/a16110498
Ehsan Latif, Ramviyas Parasuraman
In the mathematical discipline of computational geometry (CG), practical algorithms for resolving geometric input and output issues are designed, analyzed, and put into practice. It is sometimes used to refer to pattern recognition and to define the solid modeling methods for manipulating curves and surfaces. CG is a rich field encompassing theories to solve complex optimization problems, such as path planning for mobile robot systems and extension to distributed multi-robot systems. This brief review discusses the fundamentals of CG and its application in solving well-known automated path-planning problems in single- and multi-robot systems. We also discuss three winning algorithms from the CG-SHOP (Computational Geometry: Solving Hard Optimization Problems) 2021 competition to evidence the practicality of CG in multi-robotic systems. We also mention some open problems at the intersection of CG and robotics. This review provides insights into the potential use of CG in robotics and future research directions at their intersection.
在计算几何(CG)的数学学科中,设计、分析和实施解决几何输入和输出问题的实用算法。它有时被用来指模式识别和定义实体建模方法来操纵曲线和曲面。CG是一个内容丰富的领域,涵盖了解决复杂优化问题的理论,如移动机器人系统的路径规划和扩展到分布式多机器人系统。本文简要讨论了CG的基本原理及其在解决单机器人和多机器人系统中众所周知的自动路径规划问题中的应用。我们还讨论了2021年CG- shop(计算几何:解决困难优化问题)竞赛中的三种获奖算法,以证明CG在多机器人系统中的实用性。我们还提到了CG和机器人交叉领域的一些开放问题。本文综述了CG在机器人技术中的潜在应用以及未来的研究方向。
{"title":"On the Intersection of Computational Geometry Algorithms with Mobile Robot Path Planning","authors":"Ehsan Latif, Ramviyas Parasuraman","doi":"10.3390/a16110498","DOIUrl":"https://doi.org/10.3390/a16110498","url":null,"abstract":"In the mathematical discipline of computational geometry (CG), practical algorithms for resolving geometric input and output issues are designed, analyzed, and put into practice. It is sometimes used to refer to pattern recognition and to define the solid modeling methods for manipulating curves and surfaces. CG is a rich field encompassing theories to solve complex optimization problems, such as path planning for mobile robot systems and extension to distributed multi-robot systems. This brief review discusses the fundamentals of CG and its application in solving well-known automated path-planning problems in single- and multi-robot systems. We also discuss three winning algorithms from the CG-SHOP (Computational Geometry: Solving Hard Optimization Problems) 2021 competition to evidence the practicality of CG in multi-robotic systems. We also mention some open problems at the intersection of CG and robotics. This review provides insights into the potential use of CG in robotics and future research directions at their intersection.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"15 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136235587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Algorithms
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1