Pub Date : 2024-03-11DOI: 10.1134/S106456242470176X
N. E. Kalenov, A. N. Sotnikov
The goals, objectives, and structure of the ontology of the Common Digital Space of Scientific Knowledge (CDSSK) are considered. The CDSSK is an integrated information structure that combines state scientific information systems presented on the Internet (the Great Russian Encyclopedia, the National Electronic Library, the State Catalog of Geographical Names, etc.) with industry information systems, databases, and electronic libraries (MathNet, Socionet, Scientific Heritage of Russia, etc.). CDSSK can be considered as an information basis for solving artificial intelligence problems. The article presents the unified structure of the CDSSK ontology developed at the Joint Supercomputer Center of the Russian Academy of Sciences and its modeling on an example of ten subject classes and eight auxiliary classes of objects of the CDSSK universal subspace.
{"title":"Common Digital Space of Scientific Knowledge as an Integrator of Polythematic Information Resources","authors":"N. E. Kalenov, A. N. Sotnikov","doi":"10.1134/S106456242470176X","DOIUrl":"10.1134/S106456242470176X","url":null,"abstract":"<p>The goals, objectives, and structure of the ontology of the Common Digital Space of Scientific Knowledge (CDSSK) are considered. The CDSSK is an integrated information structure that combines state scientific information systems presented on the Internet (the Great Russian Encyclopedia, the National Electronic Library, the State Catalog of Geographical Names, etc.) with industry information systems, databases, and electronic libraries (MathNet, Socionet, Scientific Heritage of Russia, etc.). CDSSK can be considered as an information basis for solving artificial intelligence problems. The article presents the unified structure of the CDSSK ontology developed at the Joint Supercomputer Center of the Russian Academy of Sciences and its modeling on an example of ten subject classes and eight auxiliary classes of objects of the CDSSK universal subspace.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S1064562423701193
A. V. Medvedev, A. G. Djakonov
For prediction tasks using tabular data, it is possible to extract additional information about the target variable by examining the relationships between the objects. Specifically, if it is possible to receive agraph in which the objects are represented as vertices and the relationships are expressed as edges, then it is likely that the graph structure contains valuable information. Recent research has indicated that jointly training graph neural networks and gradient boostings on this type of data can increase the accuracy of predictions. This article proposes new methods for learning on tabular data that incorporates a graph structure, in an attempt to combine modern multilayer techniques for processing tabular data and graph neural networks. In addition, we discuss ways to mitigate the computational complexity of the proposed models and conduct experiments in both inductive and transductive settings. Our findings demonstrate tha the proposed approaches provide comparable quality to modern methods.
{"title":"Towards Efficient Learning of GNNs on High-Dimensional Multilayered Representations of Tabular Data","authors":"A. V. Medvedev, A. G. Djakonov","doi":"10.1134/S1064562423701193","DOIUrl":"10.1134/S1064562423701193","url":null,"abstract":"<p>For prediction tasks using tabular data, it is possible to extract additional information about the target variable by examining the relationships between the objects. Specifically, if it is possible to receive agraph in which the objects are represented as vertices and the relationships are expressed as edges, then it is likely that the graph structure contains valuable information. Recent research has indicated that jointly training graph neural networks and gradient boostings on this type of data can increase the accuracy of predictions. This article proposes new methods for learning on tabular data that incorporates a graph structure, in an attempt to combine modern multilayer techniques for processing tabular data and graph neural networks. In addition, we discuss ways to mitigate the computational complexity of the proposed models and conduct experiments in both inductive and transductive settings. Our findings demonstrate tha the proposed approaches provide comparable quality to modern methods.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S106456242370117X
D. P. Kuznetsov, D. R. Ledneva
The paper introduces a novel methodology for predicting intentions in dialog systems through a graph-based approach. This methodology involves constructing graph structures that represent dialogs, thus capturing contextual information effectively. By analyzing results from various open and closed domain datasets, the authors demonstrate the substantial enhancement in intention prediction accuracy achieved by combining graph models with text encoders. The primary focus of the study revolves around assessing the impact of diverse graph architectures and encoders on the performance of the proposed technique. Through empirical evaluation, the experimental outcomes affirm the superiority of graph neural networks in terms of both (Recall@k) (MAR) metric and computational resources when compared to alternative methods. This research uncovers a novel avenue for intention prediction in dialog systems by leveraging graph-based representations.
{"title":"Graph Models for Contextual Intention Prediction in Dialog Systems","authors":"D. P. Kuznetsov, D. R. Ledneva","doi":"10.1134/S106456242370117X","DOIUrl":"10.1134/S106456242370117X","url":null,"abstract":"<p>The paper introduces a novel methodology for predicting intentions in dialog systems through a graph-based approach. This methodology involves constructing graph structures that represent dialogs, thus capturing contextual information effectively. By analyzing results from various open and closed domain datasets, the authors demonstrate the substantial enhancement in intention prediction accuracy achieved by combining graph models with text encoders. The primary focus of the study revolves around assessing the impact of diverse graph architectures and encoders on the performance of the proposed technique. Through empirical evaluation, the experimental outcomes affirm the superiority of graph neural networks in terms of both <span>(Recall@k)</span> (MAR) metric and computational resources when compared to alternative methods. This research uncovers a novel avenue for intention prediction in dialog systems by leveraging graph-based representations.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S1064562423701144
M. Danilova
This article provides a survey of the results of several research studies [12–14, 26], in which open questions related to the high-probability convergence analysis of stochastic first-order optimization methods under mild assumptions on the noise were gradually addressed. In the beginning, we introduce the concept of gradient clipping, which plays a pivotal role in the development of stochastic methods for successful operation in the case of heavy-tailed distributions. Next, we examine the importance of obtaining the high-probability convergence guarantees and their connection with in-expectation convergence guarantees. The concluding sections of the article are dedicated to presenting the primary findings related to minimization problems and the results of numerical experiments.
{"title":"Algorithms with Gradient Clipping for Stochastic Optimization with Heavy-Tailed Noise","authors":"M. Danilova","doi":"10.1134/S1064562423701144","DOIUrl":"10.1134/S1064562423701144","url":null,"abstract":"<p>This article provides a survey of the results of several research studies [12–14, 26], in which open questions related to the high-probability convergence analysis of stochastic first-order optimization methods under mild assumptions on the noise were gradually addressed. In the beginning, we introduce the concept of gradient clipping, which plays a pivotal role in the development of stochastic methods for successful operation in the case of heavy-tailed distributions. Next, we examine the importance of obtaining the high-probability convergence guarantees and their connection with in-expectation convergence guarantees. The concluding sections of the article are dedicated to presenting the primary findings related to minimization problems and the results of numerical experiments.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S1064562423701156
A. A. Hvatov, R. V. Titov
Differential equation discovery, a machine learning subfield, is used to develop interpretable models, particularly, in nature-related applications. By expertly incorporating the general parametric form of the equation of motion and appropriate differential terms, algorithms can autonomously uncover equations from data. This paper explores the prerequisites and tools for independent equation discovery without expert input, eliminating the need for equation form assumptions. We focus on addressing the challenge of assessing the adequacy of discovered equations when the correct equation is unknown, with the aim of providing insights for reliable equation discovery without prior knowledge of the equation form.
{"title":"Towards Discovery of the Differential Equations","authors":"A. A. Hvatov, R. V. Titov","doi":"10.1134/S1064562423701156","DOIUrl":"10.1134/S1064562423701156","url":null,"abstract":"<p>Differential equation discovery, a machine learning subfield, is used to develop interpretable models, particularly, in nature-related applications. By expertly incorporating the general parametric form of the equation of motion and appropriate differential terms, algorithms can autonomously uncover equations from data. This paper explores the prerequisites and tools for independent equation discovery without expert input, eliminating the need for equation form assumptions. We focus on addressing the challenge of assessing the adequacy of discovered equations when the correct equation is unknown, with the aim of providing insights for reliable equation discovery without prior knowledge of the equation form.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S106456242355001X
A. L. Semenov
This article is the author’s review of the singularity in which events in the field of artificial intelligence (AI) are developing. A general view is offered on the role of revolutions in information technology as they expand the human personality. The current stage of personal expansion is considered, covering the last decade, especially 2023. The most important and common socially significant documents expressing concern about AI, as well as those that assert an optimistic view of events, are considered: ethical principles, important directions, requirements, and restrictions. It takes a closer look at the pending European AI Act (AIA) and how different groups are reacting to it. Cultural and historical factors are highlighted that can counteract the negative and catastrophic developments that may result from AI. Possible mechanisms for preserving genuine knowledge among professionals and disseminating it among the general public are analyzed.
{"title":"Artificial Intelligence in Society","authors":"A. L. Semenov","doi":"10.1134/S106456242355001X","DOIUrl":"10.1134/S106456242355001X","url":null,"abstract":"<p>This article is the author’s review of the singularity in which events in the field of artificial intelligence (AI) are developing. A general view is offered on the role of revolutions in information technology as they expand the human personality. The current stage of personal expansion is considered, covering the last decade, especially 2023. The most important and common socially significant documents expressing concern about AI, as well as those that assert an optimistic view of events, are considered: ethical principles, important directions, requirements, and restrictions. It takes a closer look at the pending European AI Act (AIA) and how different groups are reacting to it. Cultural and historical factors are highlighted that can counteract the negative and catastrophic developments that may result from AI. Possible mechanisms for preserving genuine knowledge among professionals and disseminating it among the general public are analyzed.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S1064562424701758
D. I. Borisov, R. R. Suleimanov
We consider a system of second-order semilinear elliptic equations in a multidimensional domain with an arbitrarily curved boundary contained in a narrow layer along the unperturbed boundary. The Dirichlet or Neumann condition is imposed on the curved boundary. In the case of the Neumann condition, rather natural and weak conditions are additionally imposed on the structure of the curving. Under these conditions, we show that the homogenized problem is one for the same system of equations in the unperturbed problem with a boundary condition of the same kind as on the perturbed boundary. The main result is operator (W_{2}^{1})- and L2- estimates.
{"title":"Operator Estimates for Problems in Domains with Singularly Curved Boundary: Dirichlet and Neumann Conditions","authors":"D. I. Borisov, R. R. Suleimanov","doi":"10.1134/S1064562424701758","DOIUrl":"10.1134/S1064562424701758","url":null,"abstract":"<p>We consider a system of second-order semilinear elliptic equations in a multidimensional domain with an arbitrarily curved boundary contained in a narrow layer along the unperturbed boundary. The Dirichlet or Neumann condition is imposed on the curved boundary. In the case of the Neumann condition, rather natural and weak conditions are additionally imposed on the structure of the curving. Under these conditions, we show that the homogenized problem is one for the same system of equations in the unperturbed problem with a boundary condition of the same kind as on the perturbed boundary. The main result is operator <span>(W_{2}^{1})</span>- and <i>L</i><sub>2</sub>- estimates.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S1064562423900015
The AI Journey Team
{"title":"Introductory Words of AI Journey Team","authors":"The AI Journey Team","doi":"10.1134/S1064562423900015","DOIUrl":"10.1134/S1064562423900015","url":null,"abstract":"","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S1064562423701181
A. I. Predelina, S. Yu. Dulikov, A. M. Alekseev
This paper is dedicated to the development of a novel method for coordination analysis (CA) in English using the neural (deep learning) methods. An efficient solution for the task allows identifying potentially valuable links and relationships between specific parts of a sentence, making the extraction of coordinate structures an important text preprocessing tool. In this study, a number of ideas for approaching the task within the framework of one-stage detectors were tested. The achieved results are comparable in quality to the current most advanced CA methods while allowing to process more than three-fold more sentences per unit time.
摘要 本文致力于开发一种使用神经(深度学习)方法进行英语协调分析(CA)的新方法。对这一任务的有效解决方案可以识别句子特定部分之间潜在的有价值的联系和关系,从而使提取坐标结构成为重要的文本预处理工具。在本研究中,测试了在单级检测器框架内处理该任务的若干想法。所取得的结果在质量上可与目前最先进的 CA 方法相媲美,同时单位时间内可处理的句子数量增加了三倍以上。
{"title":"Neural Networks for Coordination Analysis","authors":"A. I. Predelina, S. Yu. Dulikov, A. M. Alekseev","doi":"10.1134/S1064562423701181","DOIUrl":"10.1134/S1064562423701181","url":null,"abstract":"<p>This paper is dedicated to the development of a novel method for coordination analysis (CA) in English using the neural (deep learning) methods. An efficient solution for the task allows identifying potentially valuable links and relationships between specific parts of a sentence, making the extraction of coordinate structures an important text preprocessing tool. In this study, a number of ideas for approaching the task within the framework of one-stage detectors were tested. The achieved results are comparable in quality to the current most advanced CA methods while allowing to process more than three-fold more sentences per unit time.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-11DOI: 10.1134/S1064562423701211
G. M. Gritsay, A. V. Grabovoy, A. S. Kildyakov, Yu. V. Chekhovich
Recent advances in text generative models make it possible to create artificial texts that look like human-written texts. A large number of methods for detecting texts obtained using large language models have already been developed. However, improvement of detection methods occurs simultaneously with the improvement of generation methods. Therefore, it is necessary to explore new generative models and modernize existing approaches to their detection. In this paper, we present a large analysis of existing detection methods, as well as a study of lexical, syntactic, and stylistic features of the generated fragments. Taking into account the developments, we have tested the most qualitative, in our opinion, methods of detecting machine-generated documents for their further application in the scientific domain. Experiments were conducted for Russian and English languages on the collected datasets. The developed methods improved the detection quality to a value of 0.968 on the F1-score metric for Russian and 0.825 for English, respectively. The described techniques can be applied to detect generated fragments in scientific, research, and graduate papers.
摘要 文本生成模型方面的最新进展使创建与人类书写文本相似的人工文本成为可能。目前已开发出大量使用大型语言模型检测文本的方法。然而,检测方法的改进与生成方法的改进是同步进行的。因此,有必要探索新的生成模型,并更新现有的检测方法。在本文中,我们对现有的检测方法进行了大量分析,并对生成片段的词法、句法和文体特征进行了研究。考虑到发展情况,我们测试了我们认为最有质量的机器生成文档检测方法,以便在科学领域进一步应用。我们在收集到的数据集上对俄语和英语进行了实验。所开发的方法提高了检测质量,俄语的 F1 分数指标值为 0.968,英语的 F1 分数指标值为 0.825。所述技术可用于检测科学、研究和研究生论文中生成的片段。
{"title":"Artificially Generated Text Fragments Search in Academic Documents","authors":"G. M. Gritsay, A. V. Grabovoy, A. S. Kildyakov, Yu. V. Chekhovich","doi":"10.1134/S1064562423701211","DOIUrl":"10.1134/S1064562423701211","url":null,"abstract":"<p>Recent advances in text generative models make it possible to create artificial texts that look like human-written texts. A large number of methods for detecting texts obtained using large language models have already been developed. However, improvement of detection methods occurs simultaneously with the improvement of generation methods. Therefore, it is necessary to explore new generative models and modernize existing approaches to their detection. In this paper, we present a large analysis of existing detection methods, as well as a study of lexical, syntactic, and stylistic features of the generated fragments. Taking into account the developments, we have tested the most qualitative, in our opinion, methods of detecting machine-generated documents for their further application in the scientific domain. Experiments were conducted for Russian and English languages on the collected datasets. The developed methods improved the detection quality to a value of 0.968 on the F1-score metric for Russian and 0.825 for English, respectively. The described techniques can be applied to detect generated fragments in scientific, research, and graduate papers.</p>","PeriodicalId":531,"journal":{"name":"Doklady Mathematics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140099334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}