
Doklady Mathematics: Latest Articles

No Two Users Are Alike: Generating Audiences with Neural Clustering for Temporal Point Processes
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-25 DOI: 10.1134/S1064562423701661
V. Zhuzhel, V. Grabar, N. Kaploukhaya, R. Rivera-Castro, L. Mironova, A. Zaytsev, E. Burnaev

Identifying the right user to target is a common problem for Internet platforms. Although numerous systems address this task, they are heavily tailored to specific environments and settings, which makes it challenging for practitioners to apply these findings to their own problems. The reason is that most systems are designed for settings with millions of highly active users and with personal information, as is the case in social networks or other services with high virality. There is a gap in the literature for systems that target medium-sized data where the only data available are the event sequences of a user. This motivates us to present Look-A-Liker (LAL), an unsupervised deep clustering system. It uses temporal point processes to identify similar users for targeting tasks. For experiments, we use data from the leading Internet marketplace for the gastronomic sector. LAL generalizes beyond proprietary data. Using event sequences of users, it is possible to obtain state-of-the-art results compared to recent methods such as Transformer architectures and multimodal learning. Our approach improves the ROC AUC score on real-world datasets by up to 20%, from 0.803 to 0.959. Although LAL focuses on hundreds of thousands of sequences, we show how it quickly scales to millions of user sequences. We provide a fully reproducible implementation with code and datasets at https://github.com/adasegroup/sequence_clusterers.
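
The ROC AUC figures quoted in the abstract can be put in context with a small sketch. The function below computes ROC AUC in its pairwise form (the probability that a randomly chosen positive example is scored above a randomly chosen negative one). The toy labels and scores are invented for illustration and are not data from the paper; the names roc_auc and relative_gain are hypothetical helpers, not part of the LAL code base. The second helper shows that the reported "up to 20%" appears consistent with a relative gain between the two quoted scores.

```python
def roc_auc(labels, scores):
    """ROC AUC as the probability that a random positive outranks a random negative."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    # Count pairwise wins of positives over negatives; ties contribute one half.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


def relative_gain(baseline, improved):
    """Relative improvement of `improved` over `baseline`."""
    return (improved - baseline) / baseline


# Toy example (invented data): three of the four positive/negative pairs are ordered correctly.
toy_auc = roc_auc([1, 0, 1, 0], [0.9, 0.8, 0.4, 0.1])

# The two ROC AUC values quoted in the abstract.
gain = relative_gain(0.803, 0.959)

print(toy_auc, round(100 * gain, 1))
```

On the toy data the score is 0.75, and the relative gain between 0.803 and 0.959 comes to about 19.4%, which is 15.6 points in absolute terms.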

Citations: 0
Barcodes as Summary of Loss Function Topology
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-25 DOI: 10.1134/S1064562423701570
S. A. Barannikov, A. A. Korotin, D. A. Oganesyan, D. I. Emtsev, E. V. Burnaev

We propose to study neural networks’ loss surfaces by methods of topological data analysis. We suggest applying barcodes of Morse complexes to explore the topology of loss surfaces. An algorithm for calculating the barcodes of local minima of a loss function is described. We have conducted experiments calculating barcodes of local minima for benchmark functions and for loss surfaces of small neural networks. Our experiments confirm our two principal observations for neural networks’ loss surfaces. First, the barcodes of local minima are located in a small lower part of the range of values of the neural network’s loss function. Second, increasing the neural network’s depth and width lowers the barcodes of local minima. This has some natural implications for the neural network’s learning and for its generalization properties.
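
To make the notion of a barcode of local minima concrete, here is a minimal one-dimensional sketch. It is not the algorithm from the paper (which is not given in the abstract); it is the standard 0-dimensional sublevel-set persistence computed with a union-find structure on a sampled function. Each local minimum gives a bar that starts at its value and ends at the value where its basin merges into a basin with a lower minimum; the global minimum gets an infinite bar. The function name minima_barcode and the sample values are assumptions made for illustration.

```python
import math


def minima_barcode(values):
    """Bars (birth, death) of local minima for a function sampled on a 1D grid."""
    n = len(values)
    if n == 0:
        return []
    order = sorted(range(n), key=lambda i: values[i])
    parent = [None] * n  # None marks grid points not yet added to the sublevel set

    def find(i):
        # Union-find lookup with path halving; the root is the basin's minimum.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    bars = []
    for i in order:  # sweep the level upwards
        parent[i] = i
        for j in (i - 1, i + 1):
            if 0 <= j < n and parent[j] is not None:
                ri, rj = find(i), find(j)
                if ri == rj:
                    continue
                # Elder rule: the basin with the higher minimum dies at level values[i].
                old, young = (ri, rj) if values[ri] <= values[rj] else (rj, ri)
                if values[young] < values[i]:  # skip zero-length bars
                    bars.append((values[young], values[i]))
                parent[young] = old
    bars.append((values[order[0]], math.inf))  # the global minimum never dies
    return sorted(bars)


# Toy profile with two local minima (values 1 and 0) separated by a barrier of height 4.
print(minima_barcode([3, 1, 4, 0, 2, 5]))
```

For the toy profile the result is one infinite bar starting at 0 and one finite bar from 1 to 4. In this picture, a barcode "lying low" means that the birth values of the bars sit near the bottom of the function's range.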

Citations: 0
Min-Max Optimization over Slowly Time-Varying Graphs
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-25 DOI: 10.1134/S1064562423701533
Nhat Trung Nguyen, A. Rogozin, D. Metelev, A. Gasnikov

Distributed optimization is an important research direction in modern optimization theory. Its applications include large-scale machine learning, distributed signal processing, and many others. The paper studies decentralized min-max optimization for saddle point problems. Saddle point problems arise in training adversarial networks and in robust machine learning. The focus of the work is optimization over (slowly) time-varying networks: the topology of the network changes from time to time, and the rate of change is limited. We show that, analogously to decentralized optimization, changing only two edges per iteration is sufficient to slow convergence down to that of the arbitrary time-varying case. At the same time, we investigate several classes of time-varying graphs for which the communication complexity can be reduced.

Citations: 0
Automated System for Analysis of OCT Retina Images Development and Testing
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-25 DOI: 10.1134/S1064562423701545
L. E. Aksenova, K. D. Aksenov, E. V. Kozina, V. V. Myasnikova

Neovascular age-related macular degeneration (n-AMD) is a form of AMD that is responsible for most cases of severe vision loss. Anti-VEGF therapy, which is the gold standard for the treatment of this pathology, is accompanied by optical coherence tomography (OCT) monitoring. However, this process is hampered by the lack of methods for accurately quantifying OCT images. The aim of this study is to develop an automated calculation of the quantitative characteristics of three biomarkers, pigment epithelial detachment (PED), subretinal fluid (SRF), and intraretinal fluid (IRF), and to evaluate its accuracy. A neural network with the U-Net architecture was trained on a manually annotated dataset that included 385 OCT images. The Dice coefficient measured on the validation dataset was 0.9, 0.72, and 0.69 for PED, SRF, and IRF, respectively. The results of the quantitative calculation of these biomarkers did not differ statistically from the measurements of an ophthalmologist. Comparison of groups with respect to the anatomical outcome of therapy showed that PED height, extent, and area differ between groups with adherence and non-adherence PED, and that PED height, PED area, and IRF area differ between groups with non-adherence and tear PED. Thus, the algorithm for the quantitative calculation of biomarkers provides more information for assessing the results of therapy, which can improve the outcomes of treatment in patients with n-AMD.
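
The segmentation scores above are Dice coefficients, defined for a predicted mask A and a reference mask B as 2|A ∩ B| / (|A| + |B|). The sketch below computes this for flattened binary masks; it is an illustration of the metric only, not the authors' evaluation code, and the function name and toy masks are hypothetical. Returning 1.0 when both masks are empty is one common convention, assumed here.

```python
def dice_coefficient(pred, target):
    """Dice overlap between two flattened binary masks (sequences of 0/1)."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    # Pixels marked in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total marked pixels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    if total == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * intersection / total


# Toy masks: the prediction marks two pixels, the reference marks one of them.
print(dice_coefficient([1, 1, 0, 0], [1, 0, 0, 0]))
```

The toy prediction shares one pixel with the reference out of three marked pixels in total, so the score is 2/3.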

Citations: 0
Possibility of Recovering Message Segments Based on Side Information about Original Characters
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-25 DOI: 10.1134/S106456242370151X
A. G. Malashina

To provide secure information exchange in communication channels, the correct operation of the relevant information protection systems must be studied in advance. The mathematical algorithms used in such systems are correct and can theoretically provide the correct statistical properties of the output stream relative to the input. However, when these protection algorithms are implemented (programmed), or when the final equipment is assembled (using hardware, making adjustments) and operated in real conditions, distortions can be introduced that disrupt the operation of certain elements of information security tools (for example, a random number generator). As a result, the nature of the transmitted signal makes it possible to determine that, for a number of characteristics, the output stream consistently differs from the ideal encrypted stream that in theory should have come from the equipment and appeared at the output of the communication channel. In this situation, it is necessary to understand how the introduction of certain distortions affects the degree of security of the system being created. For this purpose, the parameters of various message sources that simulate receiving an output stream with distortions are described. It is proposed to determine the degree of security of the corresponding communication channel by estimating the proportion of the input stream that can be restored from the output using side information resulting from the introduction of such distortions into the operation of the system.

Citations: 0
ESGify: Automated Classification of Environmental, Social, and Corporate Governance Risks
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-25 DOI: 10.1134/S1064562423701673
A. Kazakov, S. Denisova, I. Barsola, E. Kalugina, I. Molchanova, I. Egorov, A. Kosterina, E. Tereshchenko, L. Shutikhina, I. Doroshchenko, N. Sotiriadi, S. Budennyy

The growing recognition of environmental, social, and governance (ESG) factors in financial decision-making has spurred the need for effective and comprehensive ESG risk assessment tools. In this study, we introduce an open-source Natural Language Processing (NLP) model, “ESGify”, based on the MPNet-base architecture and aimed at classifying texts within the framework of ESG risks. We also present a hierarchical and detailed methodology for ESG risk classification, leveraging the expertise of ESG professionals and global best practices. Anchored by a manually annotated multilabel dataset of 2000 news articles and domain adaptation with texts of sustainability reports, ESGify is developed to automate ESG risk classification following the established methodology. We compare augmentation techniques based on back translation and Large Language Models (LLMs) to improve the model quality and achieve a weighted F1 score of 0.5 on the dataset with 47 classes. This result outperforms ChatGPT 3.5 with a simple prompt. The model weights and documentation are hosted on GitHub at https://github.com/sb-ai-lab/ESGify under the Apache 2.0 license.
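
The headline metric here is a weighted F1 score over many classes. The abstract does not spell out the exact computation, so the sketch below assumes the usual definition: compute F1 per class and average the per-class scores weighted by each class's support (its number of true occurrences). The function name, the three toy classes E, S, and G, and the toy labels are hypothetical and unrelated to the ESGify dataset.

```python
def weighted_f1(y_true, y_pred):
    """Support-weighted mean of per-class F1 for multilabel data (lists of label sets)."""
    classes = set().union(*y_true, *y_pred)
    weighted_sum = 0.0
    total_support = 0
    for c in classes:
        tp = sum(1 for t, p in zip(y_true, y_pred) if c in t and c in p)
        fp = sum(1 for t, p in zip(y_true, y_pred) if c not in t and c in p)
        fn = sum(1 for t, p in zip(y_true, y_pred) if c in t and c not in p)
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        support = tp + fn  # number of true occurrences of class c
        weighted_sum += f1 * support
        total_support += support
    return weighted_sum / total_support if total_support else 0.0


# Toy multilabel example with three documents.
truth = [{"E"}, {"E", "S"}, {"G"}]
preds = [{"E"}, {"S"}, {"E"}]
print(weighted_f1(truth, preds))
```

In the toy example the per-class F1 values are 0.5 for E (support 2), 1.0 for S (support 1), and 0.0 for G (support 1), which gives a weighted score of 0.5. Frequent classes dominate this average, which matters when 47 classes are unevenly represented.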

Citations: 0
Investigation of Neural Network Algorithms for Human Movement Prediction Based on LSTM and Transformers
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-25 DOI: 10.1134/S1064562423701624
S. V. Zhiganov, Y. S. Ivanov, D. M. Grabar

The problem of predicting the position of a person on future frames of a video stream is solved, and in-depth experimental studies on the application of traditional and SOTA blocks for this task are carried out. An original architecture of KeyFNet and its modifications based on transform blocks is presented, which is able to predict coordinates in the video stream for 30, 60, 90, and 120 frames ahead with high accuracy. The novelty lies in the application of a combined algorithm based on multiple FNet blocks with fast Fourier transform as an attention mechanism concatenating the coordinates of key points. Experiments on Human3.6M and on our own real data confirmed the effectiveness of the proposed approach based on FNet blocks, compared to the traditional approach based on LSTM. The proposed algorithm matches the accuracy of advanced models, but outperforms them in terms of speed, uses less computational resources, and thus can be applied in collaborative robotic solutions.
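
The core idea of an FNet block is to replace learned self-attention with a fixed Fourier transform: a discrete Fourier transform is applied along the feature dimension and along the sequence dimension, and only the real part is kept. The sketch below illustrates that mixing step alone. It is not the KeyFNet architecture, whose details are not given in the abstract. For self-containment it uses a naive DFT rather than a fast Fourier transform routine, and the input layout (one row per frame, one column per key-point coordinate) is an assumption made for illustration.

```python
import cmath


def dft(vec):
    """Naive discrete Fourier transform of a 1D sequence."""
    n = len(vec)
    return [
        sum(vec[k] * cmath.exp(-2j * cmath.pi * m * k / n) for k in range(n))
        for m in range(n)
    ]


def fourier_mixing(x):
    """FNet-style token mixing: real part of the 2D DFT of a (seq_len x hidden) matrix."""
    # Transform each row, i.e. along the hidden (feature) dimension.
    rows = [dft(row) for row in x]
    # Transform each column of the result, i.e. along the sequence dimension.
    cols = [dft(list(col)) for col in zip(*rows)]
    # Reassemble as seq_len x hidden and keep only the real part.
    return [[cols[h][t].real for h in range(len(cols))] for t in range(len(x))]


# Toy input: two frames, two coordinates per frame (hypothetical values).
mixed = fourier_mixing([[1.0, 2.0], [3.0, 4.0]])
print(mixed)
```

For the 2 x 2 toy input the result is approximately [[10, -2], [-4, 0]] up to floating-point rounding. The mixing step has no trainable parameters, which is consistent with the speed and resource savings the abstract reports relative to attention-based models.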

Citations: 0
On Barenblatt–Zeldovich Intermediate Asymptotics
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-14 DOI: 10.1134/S1064562423701351
V. A. Kostin, D. V. Kostin, A. V. Kostin

The concept of intermediate asymptotics for the solution of an evolution equation with initial data and a related solution obtained without initial conditions was introduced by G.N. Barenblatt and Ya.B. Zeldovich in the context of extending the concept of strict determinism in statistical physics and quantum mechanics. Here, according to V.P. Maslov, to axiomatize the mathematical theory, we need to know the conditions satisfied by the initial data of the problem. We show that the correct solvability of a problem without initial conditions for fractional differential equations in a Banach space is a necessary, but not sufficient, condition for intermediate asymptotics. Examples of intermediate asymptotics are given.

Citations: 0
Upper Bound for the Competitive Facility Location Problem with Demand Uncertainty
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-14 DOI: 10.1134/S1064562423600318
V. L. Beresnev, A. A. Melnikov

We consider a competitive facility location problem with two competing parties operating in a situation of uncertain demand scenarios. The problem of finding the best solutions for the parties is formulated as a discrete bilevel mathematical programming problem. A procedure for computing an upper bound for the objective function on solution subsets is suggested. The procedure could be employed in implicit enumeration schemes capable of computing an optimal solution for the problem under study. Within the procedure, additional constraints (cuts) iteratively augment the high-point relaxation of the initial bilevel problem, which strengthens the relaxation and improves the upper bound’s quality. A new procedure for generating such cuts is proposed, which allows us to construct the strongest cuts without enumerating the parameters encoding them.

Citations: 0
Invariants of Fifth-Order Homogeneous Systems with Dissipation
IF 0.5 Zone 4 Mathematics Q3 MATHEMATICS Pub Date: 2024-03-14 DOI: 10.1134/S1064562423701466
M. V. Shamolin

New cases of integrable fifth-order dynamical systems that are homogeneous with respect to some of the variables are obtained, in which a system on the tangent bundle of a two-dimensional manifold can be distinguished. In this case, the force field is divided into an internal (conservative) and an external one, which has dissipation of different signs. The external field is introduced using some unimodular transformation and generalizes previously considered fields. Complete sets of both first integrals and invariant differential forms are given.

Citations: 0