首页 > 最新文献

Doklady Mathematics最新文献

英文 中文
Intransitively Winning Chess Players’ Positions 非传递性地赢得棋手的位置
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-28 DOI: 10.1134/S1064562424702417
A. Poddiakov

Chess players’ positions in intransitive (rock-paper-scissors) relations are considered. Intransitivity of chess players’ positions means that: position A of White is preferable (it should be chosen if choice is possible) to position B of Black, if A and B are on a chessboard; position B of Black is preferable to position C of White, if B and C are on the chessboard; position C of White is preferable to position D of Black, if C and D are on the chessboard; but position D of Black is preferable to position A of White, if A and D are on the chessboard. Intransitivity of winningness of chess players’ positions is considered to be a consequence of complexity of the chess environment—in contrast with simpler games with transitive positions only. The space of relations between winningness of chess players’ positions is non-Euclidean. The Zermelo-von Neumann theorem is complemented by statements about possibility vs. impossibility of building pure winning strategies based on the assumption of transitivity of players’ positions. Questions about the possibility of intransitive players’ positions in other positional games are raised.

象棋选手在不及物关系(石头-剪刀-布)中的位置被考虑。棋手位置的不可及性意味着:如果A和B在一个棋盘上,白棋的位置A比黑棋的位置B更可取(如果可能的话,应该选择A);如果棋盘上有B和C,黑棋的位置B比白棋的位置C更可取;如果棋盘上有C和D,白棋的C位置比黑棋的D位置更可取;但如果棋盘上有A和D,黑棋的D位置比白棋的A位置更可取。与只有传递位置的简单游戏相比,棋手获胜的非传递性被认为是国际象棋环境复杂性的结果。棋手的胜负关系空间是非欧几里得的。Zermelo-von Neumann定理还补充了基于玩家位置传递性假设而构建纯粹获胜策略的可能性与不可能性。对其他位置博弈中不及物棋手位置的可能性提出了疑问。
{"title":"Intransitively Winning Chess Players’ Positions","authors":"A. Poddiakov","doi":"10.1134/S1064562424702417","DOIUrl":"10.1134/S1064562424702417","url":null,"abstract":"<p>Chess players’ positions in intransitive (rock-paper-scissors) relations are considered. Intransitivity of chess players’ positions means that: position A of White is preferable (it should be chosen if choice is possible) to position B of Black, if A and B are on a chessboard; position B of Black is preferable to position C of White, if B and C are on the chessboard; position C of White is preferable to position D of Black, if C and D are on the chessboard; but position D of Black is preferable to position A of White, if A and D are on the chessboard. Intransitivity of winningness of chess players’ positions is considered to be a consequence of complexity of the chess environment—in contrast with simpler games with transitive positions only. The space of relations between winningness of chess players’ positions is non-Euclidean. The Zermelo-von Neumann theorem is complemented by statements about possibility <i>vs</i>. impossibility of building pure winning strategies based on the assumption of transitivity of players’ positions. Questions about the possibility of intransitive players’ positions in other positional games are raised.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 2 supplement","pages":"S391 - S398"},"PeriodicalIF":0.5,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How to Maximize the Total Strength of Survivors in a Battle and Tournament in Gladiator Game Models 如何在角斗士游戏模式的战斗和锦标赛中最大化幸存者的总力量
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-28 DOI: 10.1134/S1064562424602725
M. A. Khodiakova

In 1984, Kaminsky, Luks, and Nelson formulated the gladiator game model of two teams with given strengths. Suppose that a team wants to maximize its expected strength at the end of a battle. We consider an optimization problem: how to distribute the team’s strength among its gladiators. In the above we suppose that the teams distribute their strengths at the beginning of a battle. We also consider Nash equilibria when the teams may change gladiators’ strengths before every fight. We consider two cases. In both, the first team wants to maximize its strength. The second team wants to maximize its strength too in the first case or wants to minimize the first team’s strength in the second case.

在1984年,Kaminsky, Luks和Nelson提出了角斗士游戏模型,即两个具有特定优势的团队。假设一个团队希望在战斗结束时最大化其预期力量。我们考虑一个优化问题:如何在角斗士之间分配团队的力量。在上面的例子中,我们假设两支队伍在战斗开始时分配他们的力量。我们还考虑纳什均衡,当团队在每次战斗前可能会改变角斗士的力量。我们考虑两种情况。在这两种情况下,一线队都希望最大限度地发挥自己的实力。在第一种情况下,第二支队伍想要最大化自己的实力,或者在第二种情况下,想要最小化第一支队伍的实力。
{"title":"How to Maximize the Total Strength of Survivors in a Battle and Tournament in Gladiator Game Models","authors":"M. A. Khodiakova","doi":"10.1134/S1064562424602725","DOIUrl":"10.1134/S1064562424602725","url":null,"abstract":"<p>In 1984, Kaminsky, Luks, and Nelson formulated the gladiator game model of two teams with given strengths. Suppose that a team wants to maximize its expected strength at the end of a battle. We consider an optimization problem: how to distribute the team’s strength among its gladiators. In the above we suppose that the teams distribute their strengths at the beginning of a battle. We also consider Nash equilibria when the teams may change gladiators’ strengths before every fight. We consider two cases. In both, the first team wants to maximize its strength. The second team wants to maximize its strength too in the first case or wants to minimize the first team’s strength in the second case.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 2 supplement","pages":"S452 - S462"},"PeriodicalIF":0.5,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Models of Competition and Cooperation in Cournot Oligopoly Taking into Account the Environmental Impact 考虑环境影响的古诺寡头竞争与合作动态模型
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-28 DOI: 10.1134/S1064562424602610
P. D. Demchuk, A. V. Korolev, G. A. Ugolnitsky

The basic model of the Cournot oligopoly taking into account competition-cooperation and environmental pollution as a differential game in a normal form is described. The numerical analysis for independent and cooperative behavior is carried out for an example used in the future. Games in the form of the characteristic von Neumann–Morgenstern, Petrosyan–Zaccour, and Gromova–Petrosyan functions are constructed, and the Shapley values are calculated. Hierarchical games with information regulations for direct and reverse Stackelberg games are analyzed, payoffs’ comparative analysis for all methods of organization is provided. All the results are presented for the dynamic game with three players.

本文描述了古诺寡头垄断的基本模型,将竞争-合作和环境污染作为一种标准形式的微分博弈来考虑。并对独立行为和合作行为进行了数值分析。构造了特征von Neumann-Morgenstern、Petrosyan-Zaccour和Gromova-Petrosyan函数形式的博弈,并计算了Shapley值。分析了具有信息规则的直接和反向Stackelberg博弈的层次博弈,并对各种组织方式的收益进行了比较分析。本文给出了三人动态博弈的所有结果。
{"title":"Dynamic Models of Competition and Cooperation in Cournot Oligopoly Taking into Account the Environmental Impact","authors":"P. D. Demchuk,&nbsp;A. V. Korolev,&nbsp;G. A. Ugolnitsky","doi":"10.1134/S1064562424602610","DOIUrl":"10.1134/S1064562424602610","url":null,"abstract":"<p>The basic model of the Cournot oligopoly taking into account competition-cooperation and environmental pollution as a differential game in a normal form is described. The numerical analysis for independent and cooperative behavior is carried out for an example used in the future. Games in the form of the characteristic von Neumann–Morgenstern, Petrosyan–Zaccour, and Gromova–Petrosyan functions are constructed, and the Shapley values are calculated. Hierarchical games with information regulations for direct and reverse Stackelberg games are analyzed, payoffs’ comparative analysis for all methods of organization is provided. All the results are presented for the dynamic game with three players.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 2 supplement","pages":"S473 - S486"},"PeriodicalIF":0.5,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Models of Competition with Endogenous Network Formation: The Case of Constant Output 具有内生网络形成的动态竞争模型:以恒定产出为例
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-28 DOI: 10.1134/S1064562424702375
V. A. Kochevadov, A. A. Sedakov

The paper examines discrete-time network models of competition with a finite planning horizon. Firms produce a homogeneous product in constant quantities and sell it in a common market. In a nonterminal period, the behavior of each firm is characterized by a multicomponent profile that includes, among other things, the amount of investment and the structure of bilateral links with partner firms. The latter affects the technological state of the firm and allows it to reduce its current costs. The endogenous structure of partner firms is described by a network. For the models under study, an open-loop Nash equilibrium is characterized.

本文研究具有有限规划视界的离散时间竞争网络模型。企业以恒定的数量生产同质产品,并在共同市场上销售。在非终结期,每个企业的行为都具有多要素特征,其中包括投资额和与合作伙伴企业的双边关系结构。后者影响企业的技术状态,使其能够降低当前成本。伙伴企业的内生结构用网络来描述。对于所研究的模型,具有开环纳什均衡的特征。
{"title":"Dynamic Models of Competition with Endogenous Network Formation: The Case of Constant Output","authors":"V. A. Kochevadov,&nbsp;A. A. Sedakov","doi":"10.1134/S1064562424702375","DOIUrl":"10.1134/S1064562424702375","url":null,"abstract":"<p>The paper examines discrete-time network models of competition with a finite planning horizon. Firms produce a homogeneous product in constant quantities and sell it in a common market. In a nonterminal period, the behavior of each firm is characterized by a multicomponent profile that includes, among other things, the amount of investment and the structure of bilateral links with partner firms. The latter affects the technological state of the firm and allows it to reduce its current costs. The endogenous structure of partner firms is described by a network. For the models under study, an open-loop Nash equilibrium is characterized.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 2 supplement","pages":"S357 - S366"},"PeriodicalIF":0.5,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Zero Order Algorithm for Decentralized Optimization Problems 分散优化问题的零阶算法
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-22 DOI: 10.1134/S1064562424602336
A. S. Veprikov, E. D. Petrov, G. V. Evseev, A. N. Beznosikov

In this paper we consider a distributed optimization problem in the black-box formulation. This means that the target function   f is decomposed into the sum of (n) functions ({{f}_{i}}), where (n) is the number of workers, it is assumed that each worker has access only to the zero-order noisy oracle, i.e., only to the values of ({{f}_{i}}(x)) with added noise. In this paper, we propose a new method ZO-MARINA based on the state-of-the-art distributed optimization algorithm MARINA. In particular, the following modifications are made to solve the problem in the black-box formulation: (i) we use approximations of the gradient instead of the true value, (ii) the difference of two approximated gradients at some coordinates is used instead of the compression operator. In this paper, a theoretical convergence analysis is provided for non-convex functions and functions satisfying the PL condition. The convergence rate of the proposed algorithm is correlated with the results for the algorithm that uses the first-order oracle. The theoretical results are validated in computational experiments to find optimal hyperparameters for the Resnet-18 neural network, that is trained on the CIFAR-10 dataset and the SVM model on the LibSVM library dataset and on the Mnist-784 dataset.

本文考虑了黑箱公式中的一个分布式优化问题。这意味着目标函数f被分解为(n)函数({{f}_{i}})的和,其中(n)是工作人员的数量,假设每个工作人员只能访问零阶有噪声的oracle,即只能访问带有附加噪声的({{f}_{i}}(x))的值。本文在最先进的分布式优化算法MARINA的基础上,提出了一种新的算法ZO-MARINA。特别地,为了解决黑箱公式中的问题,我们做了以下修改:(i)我们使用梯度的近似值而不是真值,(ii)使用两个近似梯度在某些坐标处的差值而不是压缩算子。本文给出了非凸函数和满足PL条件的函数的理论收敛性分析。该算法的收敛速度与使用一阶oracle的算法的结果相关。利用CIFAR-10数据集和LibSVM库数据集和mist -784数据集上的SVM模型对Resnet-18神经网络进行训练,并通过计算实验验证了理论结果,找到了最优超参数。
{"title":"Zero Order Algorithm for Decentralized Optimization Problems","authors":"A. S. Veprikov,&nbsp;E. D. Petrov,&nbsp;G. V. Evseev,&nbsp;A. N. Beznosikov","doi":"10.1134/S1064562424602336","DOIUrl":"10.1134/S1064562424602336","url":null,"abstract":"<p>In this paper we consider a distributed optimization problem in the black-box formulation. This means that the target function   <i>f</i> is decomposed into the sum of <span>(n)</span> functions <span>({{f}_{i}})</span>, where <span>(n)</span> is the number of workers, it is assumed that each worker has access only to the zero-order noisy oracle, i.e., only to the values of <span>({{f}_{i}}(x))</span> with added noise. In this paper, we propose a new method <span>ZO-MARINA</span> based on the state-of-the-art distributed optimization algorithm <i><span>MARINA</span></i>. In particular, the following modifications are made to solve the problem in the black-box formulation: (i) we use approximations of the gradient instead of the true value, (ii) the difference of two approximated gradients at some coordinates is used instead of the compression operator. In this paper, a theoretical convergence analysis is provided for non-convex functions and functions satisfying the PL condition. The convergence rate of the proposed algorithm is correlated with the results for the algorithm that uses the first-order oracle. The theoretical results are validated in computational experiments to find optimal hyperparameters for the Resnet-18 neural network, that is trained on the CIFAR-10 dataset and the SVM model on the LibSVM library dataset and on the Mnist-784 dataset.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 1 supplement","pages":"S261 - S277"},"PeriodicalIF":0.5,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1134/S1064562424602336.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heterogeneous Computational Scheduling Using Adaptive Neural Hyper-Heuristic 基于自适应神经超启发式的异构计算调度
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-22 DOI: 10.1134/S106456242460221X
A. Allahverdyan, A. Zhadan, I. Kondratov, O. Petrosian, A. Romanovskii, V. Kharin, Yin Li

In heterogeneous computing environments, efficiently scheduling tasks, especially those forming Directed Acyclic Graphs (DAGs), is critical. This is particularly true for various Cloud and Edge computing tasks, as well as training Large Language Models (LLMs). This paper introduces a new scheduling approach using an Adaptive Neural Hyper-heuristic. By integrating a neural network trained with genetic algorithms, our method aims to minimize makespan. The approach uses a two-level algorithm: the first level prioritizes tasks using adaptive heuristic and the second level assigns resources based on the Earliest Finish Time (EFT) algorithm. Our tests show that this method significantly improves over traditional scheduling heuristics and other machine learning-based approaches. It reduces the makespan by 6.7% for small-scale DAGs and 28.49% for large-scale DAGs compared to the leading DONF algorithm. Additionally, it achieves a proximity of 84.08% to 96.43% to the optimal solutions found using Mixed-Integer Linear Programming (MILP), demonstrating its effectiveness in diverse computational settings.

在异构计算环境中,高效调度任务,尤其是那些形成有向无环图(DAG)的任务,至关重要。对于各种云计算和边缘计算任务以及大型语言模型(LLM)的训练而言,尤其如此。本文介绍了一种使用自适应神经超启发式的新调度方法。通过将经过遗传算法训练的神经网络整合在一起,我们的方法旨在最大限度地减少时间跨度。该方法使用两级算法:第一级使用自适应启发式确定任务的优先级,第二级根据最早完成时间(EFT)算法分配资源。我们的测试表明,与传统的调度启发式方法和其他基于机器学习的方法相比,这种方法有明显改善。与领先的 DONF 算法相比,小规模 DAG 的 makepan 降低了 6.7%,大规模 DAG 的 makespan 降低了 28.49%。此外,它还实现了 84.08% 至 96.43% 的接近度,接近于使用混合整数线性规划(MILP)找到的最优解,证明了它在各种计算环境中的有效性。
{"title":"Heterogeneous Computational Scheduling Using Adaptive Neural Hyper-Heuristic","authors":"A. Allahverdyan,&nbsp;A. Zhadan,&nbsp;I. Kondratov,&nbsp;O. Petrosian,&nbsp;A. Romanovskii,&nbsp;V. Kharin,&nbsp;Yin Li","doi":"10.1134/S106456242460221X","DOIUrl":"10.1134/S106456242460221X","url":null,"abstract":"<p>In heterogeneous computing environments, efficiently scheduling tasks, especially those forming Directed Acyclic Graphs (DAGs), is critical. This is particularly true for various Cloud and Edge computing tasks, as well as training Large Language Models (LLMs). This paper introduces a new scheduling approach using an Adaptive Neural Hyper-heuristic. By integrating a neural network trained with genetic algorithms, our method aims to minimize makespan. The approach uses a two-level algorithm: the first level prioritizes tasks using adaptive heuristic and the second level assigns resources based on the Earliest Finish Time (EFT) algorithm. Our tests show that this method significantly improves over traditional scheduling heuristics and other machine learning-based approaches. It reduces the makespan by 6.7% for small-scale DAGs and 28.49% for large-scale DAGs compared to the leading DONF algorithm. Additionally, it achieves a proximity of 84.08% to 96.43% to the optimal solutions found using Mixed-Integer Linear Programming (MILP), demonstrating its effectiveness in diverse computational settings.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 1 supplement","pages":"S151 - S161"},"PeriodicalIF":0.5,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1134/S106456242460221X.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Industrial Cyber Attacks Using Normalizing Flows 利用规范化流预测工业网络攻击
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-22 DOI: 10.1134/S1064562424602269
V. P. Stepashkina, M. I. Hushchyn

This paper presents the development and evaluation of methods for detecting cyberattacks on industrial systems using neural network approaches. The focus is on the task of detecting anomalies in multivariate time series, where the diversity and complexity of potential attack scenarios require the use of advanced models. To address these challenges, a transformer-based autoencoder architecture was used, which was further enhanced by transitioning to a variational autoencoder (VAE) and integrating normalizing flows. These modifications allowed the model to better capture the data distribution, enabling effective anomaly detection, including those not present in the training set. As a result, high performance was achieved, with an F1 score of 0.93 and a ROC-AUC of 0.87. The results underscore the effectiveness of the proposed methodology and provide valuable contributions to the field of anomaly detection and cybersecurity in industrial systems.

本文介绍了利用神经网络方法检测工业系统网络攻击的方法的开发和评估。重点是检测多变量时间序列中的异常情况,潜在攻击场景的多样性和复杂性要求使用先进的模型。为了应对这些挑战,我们使用了基于变压器的自动编码器架构,并通过过渡到变异自动编码器(VAE)和整合归一化流量进一步增强了该架构。这些修改使模型能够更好地捕捉数据分布,从而实现有效的异常检测,包括训练集中不存在的异常。因此,该模型取得了很高的性能,F1 得分为 0.93,ROC-AUC 为 0.87。结果凸显了所提方法的有效性,为工业系统的异常检测和网络安全领域做出了宝贵贡献。
{"title":"Prediction of Industrial Cyber Attacks Using Normalizing Flows","authors":"V. P. Stepashkina,&nbsp;M. I. Hushchyn","doi":"10.1134/S1064562424602269","DOIUrl":"10.1134/S1064562424602269","url":null,"abstract":"<p>This paper presents the development and evaluation of methods for detecting cyberattacks on industrial systems using neural network approaches. The focus is on the task of detecting anomalies in multivariate time series, where the diversity and complexity of potential attack scenarios require the use of advanced models. To address these challenges, a transformer-based autoencoder architecture was used, which was further enhanced by transitioning to a variational autoencoder (VAE) and integrating normalizing flows. These modifications allowed the model to better capture the data distribution, enabling effective anomaly detection, including those not present in the training set. As a result, high performance was achieved, with an F1 score of 0.93 and a ROC-AUC of 0.87. The results underscore the effectiveness of the proposed methodology and provide valuable contributions to the field of anomaly detection and cybersecurity in industrial systems.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 1 supplement","pages":"S95 - S102"},"PeriodicalIF":0.5,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning-Driven Approach for Handwritten Chinese Character Classification 基于深度学习的手写体汉字分类方法
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-22 DOI: 10.1134/S1064562424602245
B. Kriuk, F. Kriuk

Handwritten character recognition (HCR) is a challenging problem for machine learning researchers. Unlike printed text data, handwritten character datasets have more variation due to human-introduced bias. With numerous unique character classes present, some data, such as Logographic Scripts or Sino-Korean character sequences, bring new complications to the HCR problem. The classification task on such datasets requires the model to learn high-complexity details of the images that share similar features. With recent advances in computational resource availability and further computer vision theory development, some research teams have effectively addressed the arising challenges. Although known for achieving high accuracy while keeping the number of parameters small, many common approaches are still not generalizable and use dataset-specific solutions to achieve better results. Due to complex structure, existing methods frequently prevent the solutions from gaining popularity. This paper proposes a highly scalable approach for detailed character image classification by introducing the model architecture, data preprocessing steps, and testing design instructions. We also perform experiments to compare the performance of our method with that of existing ones to show the improvements achieved.

手写字符识别(HCR)是机器学习研究人员面临的一个具有挑战性的问题。与印刷文本数据不同,由于人为引入的偏差,手写字符数据集有更多的变化。由于存在许多独特的字符类,一些数据,如Logographic Scripts或Sino-Korean字符序列,给HCR问题带来了新的复杂性。在这些数据集上的分类任务需要模型学习具有相似特征的图像的高复杂性细节。随着计算资源的可用性和计算机视觉理论的进一步发展,一些研究团队已经有效地解决了出现的挑战。虽然以在保持参数数量较少的情况下实现高精度而闻名,但许多常见方法仍然无法推广,并且使用特定于数据集的解决方案来获得更好的结果。现有方法由于结构复杂,往往阻碍了解决方案的普及。本文通过介绍模型架构、数据预处理步骤和测试设计说明,提出了一种高度可扩展的精细字符图像分类方法。我们还进行了实验,将我们的方法与现有方法的性能进行比较,以显示所取得的改进。
{"title":"Deep Learning-Driven Approach for Handwritten Chinese Character Classification","authors":"B. Kriuk,&nbsp;F. Kriuk","doi":"10.1134/S1064562424602245","DOIUrl":"10.1134/S1064562424602245","url":null,"abstract":"<p>Handwritten character recognition (HCR) is a challenging problem for machine learning researchers. Unlike printed text data, handwritten character datasets have more variation due to human-introduced bias. With numerous unique character classes present, some data, such as Logographic Scripts or Sino-Korean character sequences, bring new complications to the HCR problem. The classification task on such datasets requires the model to learn high-complexity details of the images that share similar features. With recent advances in computational resource availability and further computer vision theory development, some research teams have effectively addressed the arising challenges. Although known for achieving high accuracy while keeping the number of parameters small, many common approaches are still not generalizable and use dataset-specific solutions to achieve better results. Due to complex structure, existing methods frequently prevent the solutions from gaining popularity. This paper proposes a highly scalable approach for detailed character image classification by introducing the model architecture, data preprocessing steps, and testing design instructions. We also perform experiments to compare the performance of our method with that of existing ones to show the improvements achieved.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 1 supplement","pages":"S278 - S287"},"PeriodicalIF":0.5,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MDS-ViTNet: Improving Saliency Prediction for Eye-Tracking with Vision Transformer MDS-ViTNet:基于视觉变换器的眼动追踪显著性预测改进
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-22 DOI: 10.1134/S1064562424602117
I. Polezhaev, I. Goncharenko, N. Iurina

In this paper, we present a novel methodology we call MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network) for enhancing visual saliency prediction or eye-tracking. This approach holds significant potential for diverse fields, including marketing, medicine, robotics, and retail. We propose a network architecture that leverages the Vision Transformer, moving beyond the conventional ImageNet backbone. The framework adopts an encoder-decoder structure, with the encoder utilizing a Swin transformer to efficiently embed most important features. This process involves a Transfer Learning method, wherein layers from the Vision Transformer are converted by the Encoder Transformer and seamlessly integrated into a CNN Decoder. This methodology ensures minimal information loss from the original input image. The decoder employs a multi-decoding technique, utilizing dual decoders to generate two distinct attention maps. These maps are subsequently combined into a singular output via an additional CNN model. Our trained model MDS-ViTNet achieves state-of-the-art results across several benchmarks. Committed to fostering further collaboration, we intend to make our code, models, and datasets accessible to the public.

在本文中,我们提出了一种新的方法,我们称之为MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network),用于增强视觉显著性预测或眼动追踪。这种方法在不同的领域具有巨大的潜力,包括营销、医药、机器人和零售。我们提出了一种利用视觉转换器的网络架构,超越了传统的ImageNet主干。该框架采用编码器-解码器结构,编码器利用Swin变压器有效嵌入最重要的特性。这个过程涉及一种迁移学习方法,其中来自视觉转换器的层由编码器转换器转换并无缝集成到CNN解码器中。这种方法确保了原始输入图像的最小信息损失。解码器采用多重解码技术,利用双解码器生成两个不同的注意图。这些地图随后通过一个额外的CNN模型组合成一个单一的输出。我们训练有素的MDS-ViTNet模型在几个基准测试中取得了最先进的结果。为了促进进一步的合作,我们打算让我们的代码、模型和数据集对公众开放。
{"title":"MDS-ViTNet: Improving Saliency Prediction for Eye-Tracking with Vision Transformer","authors":"I. Polezhaev,&nbsp;I. Goncharenko,&nbsp;N. Iurina","doi":"10.1134/S1064562424602117","DOIUrl":"10.1134/S1064562424602117","url":null,"abstract":"<p>In this paper, we present a novel methodology we call MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network) for enhancing visual saliency prediction or eye-tracking. This approach holds significant potential for diverse fields, including marketing, medicine, robotics, and retail. We propose a network architecture that leverages the Vision Transformer, moving beyond the conventional ImageNet backbone. The framework adopts an encoder-decoder structure, with the encoder utilizing a Swin transformer to efficiently embed most important features. This process involves a Transfer Learning method, wherein layers from the Vision Transformer are converted by the Encoder Transformer and seamlessly integrated into a CNN Decoder. This methodology ensures minimal information loss from the original input image. The decoder employs a multi-decoding technique, utilizing dual decoders to generate two distinct attention maps. These maps are subsequently combined into a singular output via an additional CNN model. Our trained model MDS-ViTNet achieves state-of-the-art results across several benchmarks. Committed to fostering further collaboration, we intend to make our code, models, and datasets accessible to the public.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 1 supplement","pages":"S230 - S235"},"PeriodicalIF":0.5,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1134/S1064562424602117.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Network-Based Coronary Dominance Classification of RCA Angiograms 基于神经网络的RCA血管造影冠状动脉优势分类
IF 0.5 4区 数学 Q3 MATHEMATICS Pub Date : 2025-03-22 DOI: 10.1134/S1064562424602026
I. Kruzhilov, E. Ikryannikov, A. Shadrin, R. Utegenov, G. Zubkova, I. Bessonov

Coronary arterial dominance classification is essential for SYNTAX score estimation, which is a tool used to determine the complexity of coronary artery disease and guide patient selection toward optimal revascularization strategy. We developed coronary dominance classification algorithm based on the analysis of right coronary artery (RCA) angiograms using neural network.

We employed convolutional neural network ConvNext and Swin transformer for 2D image (frames) classification, along with a majority vote for cardio angiographic view classification. An auxiliary network was also used to detect irrelevant images which were then excluded from the data set.

5-fold cross validation gave the following dominance classification metrics (p = 95%): macro recall = 93.1% ± 4.3%, accuracy = 93.5% ± 3.8%, macro F1 = 89.2% ± 5.6%. The most common case in which the model regularly failed was RCA occlusion, as it requires utilization of left coronary artery (LCA) information.

The use of machine learning approaches to classify coronary dominance based on RCA alone has been shown to be successful with satisfactory accuracy. However, for higher accuracy, it is necessary to utilize LCA information in the case of an occluded RCA and detect cases where there is high uncertainty.

冠状动脉优势分类对于SYNTAX评分至关重要,它是确定冠状动脉疾病复杂性和指导患者选择最佳血运重建策略的工具。在对右冠状动脉(RCA)血管造影进行分析的基础上,提出了一种基于神经网络的冠状动脉优势度分类算法。我们使用卷积神经网络ConvNext和Swin变压器进行二维图像(帧)分类,并使用多数投票进行心血管造影视图分类。一个辅助网络也被用来检测不相关的图像,然后从数据集中排除。5倍交叉验证的优势分类指标为:宏观召回率= 93.1%±4.3%,准确率= 93.5%±3.8%,宏观F1 = 89.2%±5.6%。模型经常失败的最常见情况是RCA闭塞,因为它需要利用左冠状动脉(LCA)信息。使用机器学习方法单独基于RCA对冠状动脉优势进行分类已被证明是成功的,具有令人满意的准确性。然而,为了提高准确性,有必要在RCA闭塞的情况下利用LCA信息,并检测具有高不确定性的情况。
{"title":"Neural Network-Based Coronary Dominance Classification of RCA Angiograms","authors":"I. Kruzhilov,&nbsp;E. Ikryannikov,&nbsp;A. Shadrin,&nbsp;R. Utegenov,&nbsp;G. Zubkova,&nbsp;I. Bessonov","doi":"10.1134/S1064562424602026","DOIUrl":"10.1134/S1064562424602026","url":null,"abstract":"<p>Coronary arterial dominance classification is essential for SYNTAX score estimation, which is a tool used to determine the complexity of coronary artery disease and guide patient selection toward optimal revascularization strategy. We developed coronary dominance classification algorithm based on the analysis of right coronary artery (RCA) angiograms using neural network.</p><p>We employed convolutional neural network ConvNext and Swin transformer for 2D image (frames) classification, along with a majority vote for cardio angiographic view classification. An auxiliary network was also used to detect irrelevant images which were then excluded from the data set.</p><p>5-fold cross validation gave the following dominance classification metrics (<i>p</i> = 95%): macro recall = 93.1% ± 4.3%, accuracy = 93.5% ± 3.8%, macro F1 = 89.2% ± 5.6%. The most common case in which the model regularly failed was RCA occlusion, as it requires utilization of left coronary artery (LCA) information.</p><p>The use of machine learning approaches to classify coronary dominance based on RCA alone has been shown to be successful with satisfactory accuracy. However, for higher accuracy, it is necessary to utilize LCA information in the case of an occluded RCA and detect cases where there is high uncertainty.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":"110 1 supplement","pages":"S212 - S222"},"PeriodicalIF":0.5,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1134/S1064562424602026.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Doklady Mathematics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1