Journal of Algorithms & Computational Technology最新文献

英文中文

Inexact fixed-point iteration method for nonlinear complementarity problems 非线性互补问题的不精确不动点迭代法

Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Algorithms & Computational Technology

Pub Date : 2023-01-01 DOI: 10.1177/17483026231191264

Xiaobo Song, Xu Zhang, Yuhua Zeng, Zheng Peng

Based on the modulus decomposition, the structured nonlinear complementarity problem is reformulated as an implicit fixed-point system of nonlinear equations. Distinguishing from some existing modulus-based matrix splitting methods, we present a flexible modulus-based inexact fixed-point iteration method for the resulting system, in which the subproblem can be solved approximately by a linear system-solver. The global convergence of the proposed method is established by assuming that the system matrix is positive definite. Some numerical comparisons are reported to illustrate the applicability of the proposed method, especially for large-scale problems.

基于模分解，将结构非线性互补问题重新表述为隐式不动点非线性方程组。区别于现有的基于模的矩阵分裂方法，我们提出了一种基于柔性模的不精确不动点迭代方法，其中子问题可以用线性系统求解器近似求解。在系统矩阵为正定的前提下，证明了该方法的全局收敛性。通过数值比较说明了该方法的适用性，特别是对大规模问题的适用性。

引用次数: 0

Modelling influenza and SARS-CoV-2 interaction: Analysis for Catalonia region 模拟流感和SARS-CoV-2相互作用:加泰罗尼亚地区分析

Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Algorithms & Computational Technology

Pub Date : 2023-01-01 DOI: 10.1177/17483026231186012

Pau Fonseca i Casas, Victor Garcia i Carrasco, Joan Garcia i Subirana

The aim is to analyse that, during the current pandemic situation, the reduction in the number of cases of influenza suggests that the non-pharmaceutical interventions (NPIs) applied to contain the expansion of SARS-CoV-2 also affect the influenza expansion. We analyse the interaction of influenza and SARS-CoV-2 spread based on an extended SEIRD model for the Catalonia region in Spain. We show that the dynamic evolution of the spread of SARS-CoV-2 and influenza generates a small interference. This interference causes a small reduction in the number of cases of seasonal influenza, reducing its expansion over the population. Other elements like the face mask mandates, social distancing and hand cleaning boost the reduction in both expansions. Influenza expansion will present a small reduction in the number of cases due to the interaction with SARS-CoV-2 expansion but mainly because the NPIs applied to the population.

目的是分析，在当前大流行形势下，流感病例数的减少表明，用于遏制SARS-CoV-2扩散的非药物干预措施(npi)也会影响流感的扩散。我们基于西班牙加泰罗尼亚地区的扩展SEIRD模型分析了流感和SARS-CoV-2传播的相互作用。我们的研究表明，SARS-CoV-2和流感传播的动态演变产生了小的干扰。这种干预使季节性流感病例数量略有减少，减少了其在人口中的蔓延。其他因素，如口罩规定、保持社交距离和洗手，促进了这两种扩张的减少。由于与SARS-CoV-2扩大的相互作用，流感扩大将导致病例数小幅减少，但主要是因为npi适用于人群。

引用次数: 0

Statistical analysis of various splitting criteria for decision trees 决策树各种分裂准则的统计分析

Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Algorithms & Computational Technology

Pub Date : 2023-01-01 DOI: 10.1177/17483026231198181

Fadwa Aaboub, Hasna Chamlal, Tayeb Ouaderhman

Decision trees are frequently used to overcome classification problems in the fields of data mining and machine learning, owing to their many perks, including their clear and simple architecture, excellent quality, and resilience. Various decision tree algorithms are developed using a variety of attribute selection criteria, following the top-down partitioning strategy. However, their effectiveness is influenced by the choice of the splitting method. Therefore, in this work, six decision tree algorithms that are based on six different attribute evaluation metrics are gathered in order to compare their performances. The choice of the decision trees that will be compared is done based on four different categories of the splitting criteria that are criteria based on information theory, criteria based on distance, statistical-based criteria, and other splitting criteria. These approaches include iterative dichotomizer 3 (first category), C[Formula: see text] (first category), classification and regression trees (second category), Pearson’s correlation coefficient based decision tree (third category), dispersion ratio (third category), and feature weight based decision tree algorithm (last category). On eleven data sets, the six procedures are assessed in terms of classification accuracy, tree depth, leaf nodes, and tree construction time. Furthermore, the Friedman and post hoc Nemenyi tests are used to examine the results that were obtained. The results of these two tests indicate that the iterative dichotomizer 3 and classification and regression trees decision tree methods perform better than the other decision tree methodologies.

决策树由于其清晰简单的体系结构、优良的质量和弹性等优点，经常被用于克服数据挖掘和机器学习领域的分类问题。根据自顶向下的划分策略，使用各种属性选择标准开发了各种决策树算法。但是，分割方法的选择会影响其有效性。因此，在这项工作中，为了比较它们的性能，我们收集了基于六种不同属性评价指标的六种决策树算法。要比较的决策树的选择是基于四种不同类别的分割标准完成的，这四种标准是基于信息论的标准、基于距离的标准、基于统计的标准和其他分割标准。这些方法包括迭代二分类器3(第一类)、C[公式:见文本](第一类)、分类和回归树(第二类)、基于Pearson相关系数的决策树(第三类)、离散比(第三类)和基于特征权重的决策树算法(最后一类)。在11个数据集上，从分类精度、树深度、叶节点和树构建时间等方面对这6种方法进行了评估。此外，弗里德曼和事后Nemenyi测试被用来检查所获得的结果。这两个测试的结果表明，迭代二分类器3和分类回归树决策树方法的性能优于其他决策树方法。

{"title":"Statistical analysis of various splitting criteria for decision trees","authors":"Fadwa Aaboub, Hasna Chamlal, Tayeb Ouaderhman","doi":"10.1177/17483026231198181","DOIUrl":"https://doi.org/10.1177/17483026231198181","url":null,"abstract":"Decision trees are frequently used to overcome classification problems in the fields of data mining and machine learning, owing to their many perks, including their clear and simple architecture, excellent quality, and resilience. Various decision tree algorithms are developed using a variety of attribute selection criteria, following the top-down partitioning strategy. However, their effectiveness is influenced by the choice of the splitting method. Therefore, in this work, six decision tree algorithms that are based on six different attribute evaluation metrics are gathered in order to compare their performances. The choice of the decision trees that will be compared is done based on four different categories of the splitting criteria that are criteria based on information theory, criteria based on distance, statistical-based criteria, and other splitting criteria. These approaches include iterative dichotomizer 3 (first category), C[Formula: see text] (first category), classification and regression trees (second category), Pearson’s correlation coefficient based decision tree (third category), dispersion ratio (third category), and feature weight based decision tree algorithm (last category). On eleven data sets, the six procedures are assessed in terms of classification accuracy, tree depth, leaf nodes, and tree construction time. Furthermore, the Friedman and post hoc Nemenyi tests are used to examine the results that were obtained. The results of these two tests indicate that the iterative dichotomizer 3 and classification and regression trees decision tree methods perform better than the other decision tree methodologies.","PeriodicalId":45079,"journal":{"name":"Journal of Algorithms & Computational Technology","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135002104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Pretrained back propagation based adaptive resonance theory network for adaptive learning 基于预训练反向传播的自适应共振理论网络自适应学习

Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Algorithms & Computational Technology

Pub Date : 2023-01-01 DOI: 10.1177/17483026231205009

Caixia Zhang, Cong Jiang, Qingyang Xu

The deep convolutional neural network performs well in current computer vision tasks. However, most of these models are trained on an aforehand complete dataset. New application scenario data sets should be added to the original training data set for model retraining when application scenarios change significantly. When the scenario changes only slightly, the transfer learning can be used for network training by a small data set of new scenarios to adapt it to the new scenario. In actual application, we hope that our model has bio-like intelligence and can adaptively learn new knowledge. This paper proposes a pretrained adaptive resonance network (PAN) based on the CNN and an intra-node back propagation ART network, which can adaptively learn new knowledge using prior information. The PAN network explores the difference between the new data and the stored information and learns this difference to realize the adaptive growth of the network. The model is testified on the MNIST and Omniglot data set, which show the effectiveness of PAN in adaptive incremental learning and its competitive classification accuracy.

深度卷积神经网络在当前的计算机视觉任务中表现良好。然而，这些模型大多是在事先完整的数据集上训练的。当应用场景发生重大变化时，需要在原有训练数据集中增加新的应用场景数据集，进行模型再训练。当场景变化很小时，迁移学习可以通过一个小的新场景数据集进行网络训练，使其适应新的场景。在实际应用中，我们希望我们的模型具有生物智能，能够自适应地学习新知识。本文提出了一种基于CNN和节点内反向传播ART网络的预训练自适应共振网络(PAN)，该网络可以利用先验信息自适应学习新知识。PAN网络探索新数据与存储信息之间的差异，并学习这种差异，实现网络的自适应增长。在MNIST和Omniglot数据集上对该模型进行了验证，证明了PAN在自适应增量学习方面的有效性和具有竞争力的分类精度。

引用次数: 0

DataSifter II: Partially synthetic data sharing of sensitive information containing time-varying correlated observations. DataSifter II：包含时变相关观测值的敏感信息的部分合成数据共享。

IF 0.9 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Algorithms & Computational Technology

Pub Date : 2022-01-01 Epub Date: 2022-01-20 DOI: 10.1177/17483026211065379

Nina Zhou, Lu Wang, Simeone Marino, Yi Zhao, Ivo D Dinov

There is a significant public demand for rapid data-driven scientific investigations using aggregated sensitive information. However, many technical challenges and regulatory policies hinder efficient data sharing. In this study, we describe a partially synthetic data generation technique for creating anonymized data archives whose joint distributions closely resemble those of the original (sensitive) data. Specifically, we introduce the DataSifter technique for time-varying correlated data (DataSifter II), which relies on an iterative model-based imputation using generalized linear mixed model and random effects-expectation maximization tree. DataSifter II can be used to generate synthetic repeated measures data for testing and validating new analytical techniques. Compared to the multiple imputation method, DataSifter II application on simulated and real clinical data demonstrates that the new method provides extensive reduction of re-identification risk (data privacy) while preserving the analytical value (data utility) in the obfuscated data. The performance of the DataSifter II on a simulation involving 20% artificially missingness in the data, shows at least 80% reduction of the disclosure risk, compared to the multiple imputation method, without a substantial impact on the data analytical value. In a separate clinical data (Medical Information Mart for Intensive Care III) validation, a model-based statistical inference drawn from the original data agrees with an analogous analytical inference obtained using the DataSifter II obfuscated (sifted) data. For large time-varying datasets containing sensitive information, the proposed technique provides an automated tool for alleviating the barriers of data sharing and facilitating effective, advanced, and collaborative analytics.

公众对利用汇总的敏感信息进行快速数据驱动科学调查的需求很大。然而，许多技术挑战和监管政策阻碍了高效的数据共享。在本研究中，我们介绍了一种部分合成数据生成技术，用于创建联合分布与原始（敏感）数据非常相似的匿名数据档案。具体来说，我们介绍了针对时变相关数据的 DataSifter 技术（DataSifter II），该技术依赖于使用广义线性混合模型和随机效应期望最大化树的基于模型的迭代估算。DataSifter II 可用于生成合成重复测量数据，以测试和验证新的分析技术。与多重估算方法相比，DataSifter II 在模拟和真实临床数据上的应用表明，新方法在保留混淆数据的分析价值（数据效用）的同时，还大大降低了再识别风险（数据隐私）。与多重估算方法相比，DataSifter II 在模拟数据中 20% 的人为缺失率上的表现至少降低了 80% 的泄露风险，同时对数据分析价值没有实质性影响。在单独的临床数据（重症监护医学信息库 III）验证中，从原始数据中得出的基于模型的统计推断与使用 DataSifter II 混淆（筛选）数据得出的类似分析推断一致。对于包含敏感信息的大型时变数据集，所提出的技术提供了一种自动化工具，可用于减轻数据共享的障碍，促进有效、先进和协作分析。

{"title":"DataSifter II: Partially synthetic data sharing of sensitive information containing time-varying correlated observations.","authors":"Nina Zhou, Lu Wang, Simeone Marino, Yi Zhao, Ivo D Dinov","doi":"10.1177/17483026211065379","DOIUrl":"10.1177/17483026211065379","url":null,"abstract":"There is a significant public demand for rapid data-driven scientific investigations using aggregated sensitive information. However, many technical challenges and regulatory policies hinder efficient data sharing. In this study, we describe a partially synthetic data generation technique for creating anonymized data archives whose joint distributions closely resemble those of the original (sensitive) data. Specifically, we introduce the DataSifter technique for time-varying correlated data (DataSifter II), which relies on an iterative model-based imputation using generalized linear mixed model and random effects-expectation maximization tree. DataSifter II can be used to generate synthetic repeated measures data for testing and validating new analytical techniques. Compared to the multiple imputation method, DataSifter II application on simulated and real clinical data demonstrates that the new method provides extensive reduction of re-identification risk (data privacy) while preserving the analytical value (data utility) in the obfuscated data. The performance of the DataSifter II on a simulation involving 20% artificially missingness in the data, shows at least 80% reduction of the disclosure risk, compared to the multiple imputation method, without a substantial impact on the data analytical value. In a separate clinical data (Medical Information Mart for Intensive Care III) validation, a model-based statistical inference drawn from the original data agrees with an analogous analytical inference obtained using the DataSifter II obfuscated (sifted) data. For large time-varying datasets containing sensitive information, the proposed technique provides an automated tool for alleviating the barriers of data sharing and facilitating effective, advanced, and collaborative analytics.","PeriodicalId":45079,"journal":{"name":"Journal of Algorithms & Computational Technology","volume":"16 ","pages":""},"PeriodicalIF":0.9,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/76/65/nihms-1800751.PMC9585991.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10553433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Message-passing implementation of the data diffusion communication model in fast multipole methods: large scale biomolecular simulations. 快速多极方法中数据扩散通信模型的消息传递实现：大规模生物分子模拟。

IF 0.8 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Algorithms & Computational Technology

Pub Date : 2008-01-01 DOI: 10.1260/174830108786231722

Jakub Kurzak, B Montgomery Pettitt

Biomolecular simulations require increasingly efficient parallel codes. We present an efficient communication algorithm for irregular problems exhibiting an all-to-many communication pattern. The algorithm is developed using message passing on distributed memory machines and assumes explicit knowledge of the interconnection topology. The algorithm maximizes locality of interprocessor communication by adopting to an arbitrary interconnection topology and at the same time takes multiprocessor nodes into account. The solution is incorporated into our implementation of the fast multipole method with periodic boundary conditions used for molecular dynamics simulations, but we believe it generalizes to many algorithms demonstrating an all-to-many communication pattern. We show that an irregular algorithm can be forced to behave like a systolic algorithm.

生物分子模拟需要越来越高效的并行代码。提出了一种基于多对多通信模式的不规则问题的高效通信算法。该算法使用分布式存储机器上的消息传递来开发，并假设对互连拓扑有明确的了解。该算法采用任意互连拓扑结构，最大限度地提高了处理器间通信的局部性，同时考虑了多处理器节点。该解决方案被纳入我们用于分子动力学模拟的具有周期性边界条件的快速多极方法的实现中，但我们相信它可以推广到许多展示多对多通信模式的算法。我们证明了一个不规则的算法可以被迫表现得像一个收缩算法。

引用次数: 0

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Journal of Algorithms & Computational Technology

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀