首页 > 最新文献

AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS最新文献

英文 中文
Development of an Expert System Based on Fuzzy Logic for Pneumonia Diagnostics 基于模糊逻辑的肺炎诊断专家系统的开发
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-26 DOI: 10.3103/S000510552570027X
A. I. Enikeeva, R. A. Burnashev, R. R. Farahov

The paper is devoted to the development of an expert system for the diagnosis of pneumonia based on fuzzy logic implemented using the Mamdani algorithm. It discusses the main stages of system development, including the fuzzification of input data, definition of fuzzy rules based on medical expert knowledge, the aggregation of fuzzy inferences, and their defuzzification to obtain the final diagnostic result. The system’s web interface is implemented using the Django framework, which ensures the ease of interaction for users. The use of a medical expert system for diagnosing pneumonia can reduce the time required to establish a diagnosis and improve its quality of integrating the experience of medical experts and modern information technologies.

本文致力于开发一个基于模糊逻辑的肺炎诊断专家系统,该系统采用Mamdani算法实现。讨论了系统开发的主要阶段,包括输入数据的模糊化、基于医学专家知识的模糊规则的定义、模糊推理的聚合以及它们的去模糊化以获得最终诊断结果。系统的web界面使用Django框架实现,保证了用户交互的便利性。利用医学专家系统进行肺炎诊断,结合医学专家的经验和现代信息技术,可以缩短建立诊断所需的时间,提高诊断质量。
{"title":"Development of an Expert System Based on Fuzzy Logic for Pneumonia Diagnostics","authors":"A. I. Enikeeva,&nbsp;R. A. Burnashev,&nbsp;R. R. Farahov","doi":"10.3103/S000510552570027X","DOIUrl":"10.3103/S000510552570027X","url":null,"abstract":"<p>The paper is devoted to the development of an expert system for the diagnosis of pneumonia based on fuzzy logic implemented using the Mamdani algorithm. It discusses the main stages of system development, including the fuzzification of input data, definition of fuzzy rules based on medical expert knowledge, the aggregation of fuzzy inferences, and their defuzzification to obtain the final diagnostic result. The system’s web interface is implemented using the Django framework, which ensures the ease of interaction for users. The use of a medical expert system for diagnosing pneumonia can reduce the time required to establish a diagnosis and improve its quality of integrating the experience of medical experts and modern information technologies.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S202 - S215"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of Computer Vision Methods to Old Tatar Text Recognition 计算机视觉方法在旧鞑靼语文本识别中的应用
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-26 DOI: 10.3103/S0005105525700256
I. A. Valishin

A tool is developed that recognizes strings, words, and Arabic characters from scanned images. The possibilities and prospects for using this tool in research activities are considered. The results of experiments on the operational performance of the instrument are presented with the example of digitized images of Old Tatar writing.

开发了一种从扫描图像中识别字符串、单词和阿拉伯字符的工具。讨论了在研究活动中使用该工具的可能性和前景。以古鞑靼文字数字化图像为例,给出了仪器工作性能的实验结果。
{"title":"Application of Computer Vision Methods to Old Tatar Text Recognition","authors":"I. A. Valishin","doi":"10.3103/S0005105525700256","DOIUrl":"10.3103/S0005105525700256","url":null,"abstract":"<p>A tool is developed that recognizes strings, words, and Arabic characters from scanned images. The possibilities and prospects for using this tool in research activities are considered. The results of experiments on the operational performance of the instrument are presented with the example of digitized images of Old Tatar writing.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S177 - S191"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applying the Douglas–Peucker Algorithm in Online Authentication of Remote Work Tools for Specialist Training in 10.00.00 “Information Security” Integrated Group of Specialties Douglas-Peucker算法在10.00.00“信息安全”综合专业群专家培训远程工作工具在线认证中的应用
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-26 DOI: 10.3103/S0005105525700323
A. G. Uymin, V. S. Grekov

With educational systems shifting to distance learning and the trend towards remote work growing, an urgent need has arisen to develop reliable biometric identification and authentication technologies to verify employees working remotely. Such technologies can provide a high degree of protection and usability, making their development and optimization extremely important. The issue is that accuracy and efficiency of mouse gesture recognition systems need to be improved without any specialized devices used and in the shortest possible time. This requires efficient preprocessing of such gestures to simplify their trajectories while preserving their key features. The Douglas–Peucker algorithm is proposed to be used for preliminary processing of mouse gesture trajectory data. This algorithm allows significantly reducing the number of points in the trajectories, simplifying them while preserving the principal shape of the gestures. The data with simplified trajectories are then used to train neural networks. The experimental part of the work showed that, when applied, the Douglas–Peucker algorithm allows for a 60% reduction in the number of points on the trajectories, increasing the gesture recognition accuracy from 70 to 82%. Such data simplification contributes to speeding up the neural networks' training process and improving their operational efficiency. The study confirmed the efficiency of using the Douglas–Peucker algorithm for preliminary data processing in mouse gesture recognition problems. The results can be applied to develop more intuitive and adaptive user interfaces. In addition, directions for further research, including optimization of the algorithm’s parameters for different types of gestures and exploring the possibility of combining it with other machine learning methods, are proposed.

随着教育系统转向远程学习和远程工作趋势的增长,迫切需要开发可靠的生物识别和认证技术来验证远程工作的员工。这些技术可以提供高度的保护和可用性,因此它们的开发和优化非常重要。问题是,鼠标手势识别系统的准确性和效率需要在不使用任何专门设备的情况下,在尽可能短的时间内得到提高。这需要对这些手势进行有效的预处理,以简化它们的轨迹,同时保留它们的关键特征。提出采用Douglas-Peucker算法对鼠标手势轨迹数据进行初步处理。该算法允许显著减少轨迹中点的数量,简化它们,同时保留手势的主要形状。然后使用简化轨迹的数据来训练神经网络。实验部分的工作表明,当应用Douglas-Peucker算法时,可以将轨迹上的点数量减少60%,将手势识别准确率从70%提高到82%。这种数据简化有助于加快神经网络的训练过程,提高其运行效率。该研究证实了使用Douglas-Peucker算法对鼠标手势识别问题进行初步数据处理的效率。研究结果可用于开发更直观、适应性更强的用户界面。此外,还提出了进一步研究的方向,包括针对不同类型的手势优化算法的参数,探索与其他机器学习方法结合的可能性。
{"title":"Applying the Douglas–Peucker Algorithm in Online Authentication of Remote Work Tools for Specialist Training in 10.00.00 “Information Security” Integrated Group of Specialties","authors":"A. G. Uymin,&nbsp;V. S. Grekov","doi":"10.3103/S0005105525700323","DOIUrl":"10.3103/S0005105525700323","url":null,"abstract":"<p>With educational systems shifting to distance learning and the trend towards remote work growing, an urgent need has arisen to develop reliable biometric identification and authentication technologies to verify employees working remotely. Such technologies can provide a high degree of protection and usability, making their development and optimization extremely important. The issue is that accuracy and efficiency of mouse gesture recognition systems need to be improved without any specialized devices used and in the shortest possible time. This requires efficient preprocessing of such gestures to simplify their trajectories while preserving their key features. The Douglas–Peucker algorithm is proposed to be used for preliminary processing of mouse gesture trajectory data. This algorithm allows significantly reducing the number of points in the trajectories, simplifying them while preserving the principal shape of the gestures. The data with simplified trajectories are then used to train neural networks. The experimental part of the work showed that, when applied, the Douglas–Peucker algorithm allows for a 60% reduction in the number of points on the trajectories, increasing the gesture recognition accuracy from 70 to 82%. Such data simplification contributes to speeding up the neural networks' training process and improving their operational efficiency. The study confirmed the efficiency of using the Douglas–Peucker algorithm for preliminary data processing in mouse gesture recognition problems. The results can be applied to develop more intuitive and adaptive user interfaces. In addition, directions for further research, including optimization of the algorithm’s parameters for different types of gestures and exploring the possibility of combining it with other machine learning methods, are proposed.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S265 - S268"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Network Architecture of Embodied Intelligence 具身智能的神经网络架构
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-26 DOI: 10.3103/S0005105525700311
A. R. Nurutdinov

In recent years, advances in artificial intelligence and machine learning have been driven by advances in the development of large language models (LLMs) that are based on deep neural networks. At the same time, in spite of their substantial capabilities, LLMs have fundamental limitations, such as their spontaneous unreliability in facts and judgments; commission of simple errors that are dissonant with high competence in general; credulity, manifested by a willingness to accept a user’s false claims as true; and lack of knowledge concerning events occurring after training has been completed. Probably the key reason for these limitations is that bioinspired intelligence learning takes place through an assimilation of implicit knowledge in terms of an embodied form of intelligence to solve interactive real-world physical problems. Bioinspired studies of the nervous systems of organisms suggest that the cerebellum, which coordinates movement and maintains balance in human beings, is a prime candidate for uncovering methods of realizing embodied physical intelligence. Its simple, repetitive structure and ability to control complex movements offer hope for the possibility of creating an analog to adaptive neural networks. This paper explores the bioinspired architecture of the cerebellum as a form of analog computational networks that are capable of modeling complex, real-world physical systems. For a simple example, a realization of embodied AI in the form of a multicomponent model of an octopus tentacle is presented that demonstrates the potential for creating adaptive physical systems that learn from and interact with the environment.

近年来,基于深度神经网络的大型语言模型(llm)的发展进步推动了人工智能和机器学习的进步。与此同时,法学硕士虽然有很大的能力,但也有根本性的局限性,比如他们在事实和判断上的自发不可靠性;犯与一般高能力不一致的简单错误;轻信,表现为愿意将用户的虚假声明视为真实;对培训结束后发生的事件缺乏了解。产生这些限制的关键原因可能是,受生物启发的智能学习是通过将隐含知识同化为具体形式的智能来解决互动的现实世界物理问题。生物神经系统的生物启发研究表明,协调运动和保持人体平衡的小脑,是揭示实现身体智能的方法的主要候选。它简单、重复的结构和控制复杂运动的能力,为创造一种模拟自适应神经网络的可能性带来了希望。本文探讨了小脑的生物启发架构,作为一种模拟计算网络的形式,能够模拟复杂的,现实世界的物理系统。举一个简单的例子,以章鱼触手的多组件模型的形式实现了嵌入式人工智能,该模型展示了创建自适应物理系统的潜力,该系统可以从环境中学习并与环境交互。
{"title":"Neural Network Architecture of Embodied Intelligence","authors":"A. R. Nurutdinov","doi":"10.3103/S0005105525700311","DOIUrl":"10.3103/S0005105525700311","url":null,"abstract":"<p>In recent years, advances in artificial intelligence and machine learning have been driven by advances in the development of large language models (LLMs) that are based on deep neural networks. At the same time, in spite of their substantial capabilities, LLMs have fundamental limitations, such as their spontaneous unreliability in facts and judgments; commission of simple errors that are dissonant with high competence in general; credulity, manifested by a willingness to accept a user’s false claims as true; and lack of knowledge concerning events occurring after training has been completed. Probably the key reason for these limitations is that bioinspired intelligence learning takes place through an assimilation of implicit knowledge in terms of an embodied form of intelligence to solve interactive real-world physical problems. Bioinspired studies of the nervous systems of organisms suggest that the cerebellum, which coordinates movement and maintains balance in human beings, is a prime candidate for uncovering methods of realizing embodied physical intelligence. Its simple, repetitive structure and ability to control complex movements offer hope for the possibility of creating an analog to adaptive neural networks. This paper explores the bioinspired architecture of the cerebellum as a form of analog computational networks that are capable of modeling complex, real-world physical systems. For a simple example, a realization of embodied AI in the form of a multicomponent model of an octopus tentacle is presented that demonstrates the potential for creating adaptive physical systems that learn from and interact with the environment.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S241 - S264"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Grading of Students’ Short Answers Using Language Models 利用语言模型对学生的简答题进行自动评分
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-23 DOI: 10.3103/S0005105525700177
Ch. B. Minnegalieva, I. I. Kashapov, O. D. Morozova

Methods of assessing student answers using language models are currently being studied by various specialists. The results of automated assessment depend on the subject area and characteristics of the academic discipline. This paper analyzes students’ answers received during a Computer Graphics and Design course. It is proposed to determine the cosine similarity of document vectors obtained using language models and refine the estimates by checking keywords. The results obtained can be used for preliminary assessment of students’ answers and form the basis for further research.

目前,不同专家正在研究利用语言模型评估学生答案的方法。自动评估的结果取决于学科领域和学科特点。本文分析了在计算机图形学与设计课程中收到的学生答案。建议确定使用语言模型获得的文档向量的余弦相似度,并通过检查关键词来完善估计值。所得结果可用于对学生答案进行初步评估,并为进一步研究奠定基础。
{"title":"Automated Grading of Students’ Short Answers Using Language Models","authors":"Ch. B. Minnegalieva,&nbsp;I. I. Kashapov,&nbsp;O. D. Morozova","doi":"10.3103/S0005105525700177","DOIUrl":"10.3103/S0005105525700177","url":null,"abstract":"<p>Methods of assessing student answers using language models are currently being studied by various specialists. The results of automated assessment depend on the subject area and characteristics of the academic discipline. This paper analyzes students’ answers received during a Computer Graphics and Design course. It is proposed to determine the cosine similarity of document vectors obtained using language models and refine the estimates by checking keywords. The results obtained can be used for preliminary assessment of students’ answers and form the basis for further research.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 3 supplement","pages":"S109 - S114"},"PeriodicalIF":0.5,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental Study of Cognitive Function of Generating Elliptical Sentences in Planimetric Tasks 平面任务中生成省略句认知功能的实验研究
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-23 DOI: 10.3103/S0005105525700190
V. A. Parkhomenko, X. A. Naidenova, T. A. Martirova, A. V. Schukin

The paper is devoted to the study of the cognitive function associated with the generation of elliptical sentences in the Russian language. The study is conducted by testing this cognitive ability using a computer system specially developed by the authors for this purpose. Testing of this cognitive ability is proposed and implemented for the first time. The system is an extension of Moodle and is openly hosted in the github repository. Elliptical constructions are limited to verbal and nominal ellipses, which are theoretically possible to be completely reconstructed based on the context of the sentence. The study is conducted with the participation of SPbPU students as respondents. The texts of planimetric tasks are chosen as the subject area. As a result of the analysis of testing data, the following results are obtained: the influence of the respondent’s knowledge of the subject area (planimetry) on the test results is established; a tendency towards self-study of respondents was discovered, which is manifested in a reduction in time and an increase in scores as they pass tests; it is shown that respondents are poorly motivated if they do not see feedback on the answer to the completed task. The paper discusses the problems of further development of the testing system and its use in adapting questionnaires (tasks) to assess the knowledge of SPbPU students in the field of automation of bug detection in programs, as well as for diagnosing the functional state of operator specialists and express diagnosis of dementia. It also seems promising to use the system to improve the processes of syntactic parsing of elliptic sentences and automate the restoration of ellipses in the subject area of planimetry.

本文主要研究俄语中省略句生成的认知功能。这项研究是通过使用作者为此目的专门开发的计算机系统测试这种认知能力来进行的。本文首次提出并实现了对这种认知能力的测试。该系统是Moodle的扩展,并公开托管在github存储库中。省略句结构仅限于动词性省略句和名词性省略句,理论上可以根据句子的上下文完全重构。本研究以上海理工大学学生为调查对象进行。选择平面任务的文本作为主题区域。通过对测试数据的分析,得到以下结果:确定了被调查者对学科领域(平面测量学)的知识对测试结果的影响;发现应答者有自学的倾向,这表现在他们通过考试时时间减少而分数增加;研究表明,如果受访者没有看到对完成任务的答案的反馈,他们的动机就会很差。本文讨论了该测试系统的进一步开发问题,以及该测试系统在应用问卷(任务)评估SPbPU学生在程序自动化错误检测领域的知识、诊断操作员专家的功能状态和痴呆的快速诊断方面的问题。在平面测量学科领域,利用该系统改进省略句的句法解析过程,实现省略句的自动还原也很有前景。
{"title":"Experimental Study of Cognitive Function of Generating Elliptical Sentences in Planimetric Tasks","authors":"V. A. Parkhomenko,&nbsp;X. A. Naidenova,&nbsp;T. A. Martirova,&nbsp;A. V. Schukin","doi":"10.3103/S0005105525700190","DOIUrl":"10.3103/S0005105525700190","url":null,"abstract":"<p>The paper is devoted to the study of the cognitive function associated with the generation of elliptical sentences in the Russian language. The study is conducted by testing this cognitive ability using a computer system specially developed by the authors for this purpose. Testing of this cognitive ability is proposed and implemented for the first time. The system is an extension of Moodle and is openly hosted in the github repository. Elliptical constructions are limited to verbal and nominal ellipses, which are theoretically possible to be completely reconstructed based on the context of the sentence. The study is conducted with the participation of SPbPU students as respondents. The texts of planimetric tasks are chosen as the subject area. As a result of the analysis of testing data, the following results are obtained: the influence of the respondent’s knowledge of the subject area (planimetry) on the test results is established; a tendency towards self-study of respondents was discovered, which is manifested in a reduction in time and an increase in scores as they pass tests; it is shown that respondents are poorly motivated if they do not see feedback on the answer to the completed task. The paper discusses the problems of further development of the testing system and its use in adapting questionnaires (tasks) to assess the knowledge of SPbPU students in the field of automation of bug detection in programs, as well as for diagnosing the functional state of operator specialists and express diagnosis of dementia. It also seems promising to use the system to improve the processes of syntactic parsing of elliptic sentences and automate the restoration of ellipses in the subject area of planimetry.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 3 supplement","pages":"S123 - S130"},"PeriodicalIF":0.5,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subjective Notes on Search Engines 关于搜索引擎的主观笔记
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-23 DOI: 10.3103/S0005105525700086
Y. E. Polak

This work commemorates the 25th anniversary of the establishment of the main search engines for our country, Yandex and Google, which happened one after another. This article attempts to describe some events from the history of the development of internet navigation tools from the point of view of a witness (and partly a participant).

这部作品是为了纪念我国的主要搜索引擎Yandex和b谷歌相继成立25周年。本文试图从一个目击者(部分参与者)的角度来描述互联网导航工具发展史上的一些事件。
{"title":"Subjective Notes on Search Engines","authors":"Y. E. Polak","doi":"10.3103/S0005105525700086","DOIUrl":"10.3103/S0005105525700086","url":null,"abstract":"<p>This work commemorates the 25th anniversary of the establishment of the main search engines for our country, Yandex and Google, which happened one after another. This article attempts to describe some events from the history of the development of internet navigation tools from the point of view of a witness (and partly a participant).</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 1 supplement","pages":"S27 - S42"},"PeriodicalIF":0.5,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of Synthetic Data to the Problem of Anomaly Detection in the Field of Information Security 合成数据在信息安全领域异常检测问题中的应用
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-23 DOI: 10.3103/S0005105525700128
A. I. Gurianov

Synthetic data are highly relevant for machine learning. Modern algorithms to generate synthetic data make it possible to generate data that are very similar in their statistical properties to the original data. Synthetic data is used in practice in a wide range of tasks, including those related to data augmentation. The author of the article proposes a method of data augmentation combining the approaches of increasing the sample size using synthetic data and synthetic anomaly generation. This method has been used to address the information security problem of anomaly detection in server logs to detect attacks. The model trained for the task presents high results. This demonstrates the effectiveness of the use of synthetic data to increase sample size and generate anomalies, as well as the ability to use these approaches together with high efficiency.

合成数据与机器学习密切相关。生成合成数据的现代算法可以生成在统计特性上与原始数据非常相似的数据。合成数据在实践中被广泛用于各种任务,包括与数据增强相关的任务。文章作者提出了一种数据扩增方法,它结合了使用合成数据增加样本量和合成异常生成两种方法。这种方法已被用于解决服务器日志中的异常检测这一信息安全问题,以检测攻击行为。为该任务训练的模型取得了很好的效果。这证明了使用合成数据来增加样本量和生成异常数据的有效性,以及同时高效使用这些方法的能力。
{"title":"Application of Synthetic Data to the Problem of Anomaly Detection in the Field of Information Security","authors":"A. I. Gurianov","doi":"10.3103/S0005105525700128","DOIUrl":"10.3103/S0005105525700128","url":null,"abstract":"<div><p>Synthetic data are highly relevant for machine learning. Modern algorithms to generate synthetic data make it possible to generate data that are very similar in their statistical properties to the original data. Synthetic data is used in practice in a wide range of tasks, including those related to data augmentation. The author of the article proposes a method of data augmentation combining the approaches of increasing the sample size using synthetic data and synthetic anomaly generation. This method has been used to address the information security problem of anomaly detection in server logs to detect attacks. The model trained for the task presents high results. This demonstrates the effectiveness of the use of synthetic data to increase sample size and generate anomalies, as well as the ability to use these approaches together with high efficiency.</p></div>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 2 supplement","pages":"S68 - S72"},"PeriodicalIF":0.5,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing Machine Learning Models Based on Explainable Artificial Intelligence Methods in Educational Analytics 在教育分析中分析基于可解释人工智能方法的机器学习模型
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-23 DOI: 10.3103/S0005105525700189
D. A. Minullin, F. M. Gafarov

The problem of predicting early dropout of students of Russian universities is urgent and requires the development of new innovative approaches to address it. To do so, it is possible to develop predictive systems based on the use of student data that are available in the information systems of universities. This paper investigates machine learning models for the prediction of early student dropout, trained on the basis of student characteristics and performance data. The main scientific novelty of this work lies in the use of explainable artificial intelligence (AI) methods to interpret and explain the performance of the trained machine learning models. Explainable AI methods allow us to understand which of the input features (student characteristics) have the greatest influence on the results of the machine learning models and can also help understand why models make certain decisions. The findings expand the understanding of the influence of various factors on early dropout of students.

预测俄罗斯大学学生早期辍学的问题十分紧迫,需要开发新的创新方法来解决这一问题。为此,可以利用大学信息系统中提供的学生数据开发预测系统。本文研究了预测早期学生辍学的机器学习模型,该模型基于学生特征和表现数据进行训练。这项工作的主要科学新颖之处在于使用可解释的人工智能(AI)方法来解释和解释训练有素的机器学习模型的性能。可解释的人工智能方法使我们能够理解哪些输入特征(学生特征)对机器学习模型的结果影响最大,也可以帮助理解为什么模型做出某些决策。研究结果拓展了对各种因素对学生早期辍学影响的认识。
{"title":"Analyzing Machine Learning Models Based on Explainable Artificial Intelligence Methods in Educational Analytics","authors":"D. A. Minullin,&nbsp;F. M. Gafarov","doi":"10.3103/S0005105525700189","DOIUrl":"10.3103/S0005105525700189","url":null,"abstract":"<p>The problem of predicting early dropout of students of Russian universities is urgent and requires the development of new innovative approaches to address it. To do so, it is possible to develop predictive systems based on the use of student data that are available in the information systems of universities. This paper investigates machine learning models for the prediction of early student dropout, trained on the basis of student characteristics and performance data. The main scientific novelty of this work lies in the use of explainable artificial intelligence (AI) methods to interpret and explain the performance of the trained machine learning models. Explainable AI methods allow us to understand which of the input features (student characteristics) have the greatest influence on the results of the machine learning models and can also help understand why models make certain decisions. The findings expand the understanding of the influence of various factors on early dropout of students.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 3 supplement","pages":"S115 - S122"},"PeriodicalIF":0.5,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Development of a Training Blockchain Simulator 训练区块链模拟器的设计与开发
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-03-23 DOI: 10.3103/S0005105525700165
O. M. Mekhovnikov, A. S. Toschev

This article presents an educational blockchain simulator that is intended to train students and beginning blockchain developers. This simulator was created to provide users with an intuitive and accessible tool for learning the basic concepts and mechanisms of blockchain functioning. The article discusses the main aspects of the design and architecture of the simulator and provides a demonstration of the application. In addition, the possibilities for further development of the simulator and its potential as a teaching tool for research in the field of blockchain technologies are discussed. The resulting simulator contributes to the field of education and science, helping increase the level of competence of specialists and the development of innovative solutions in blockchain.

本文介绍一个教育区块链模拟器,旨在培训学生和初学区块链的开发人员。创建该模拟器的目的是为用户学习区块链运行的基本概念和机制提供一个直观、易用的工具。文章讨论了该模拟器设计和架构的主要方面,并提供了应用演示。此外,还讨论了进一步开发该模拟器的可能性及其作为区块链技术研究领域教学工具的潜力。由此产生的模拟器为教育和科学领域做出了贡献,有助于提高专家的能力水平和区块链创新解决方案的开发。
{"title":"Design and Development of a Training Blockchain Simulator","authors":"O. M. Mekhovnikov,&nbsp;A. S. Toschev","doi":"10.3103/S0005105525700165","DOIUrl":"10.3103/S0005105525700165","url":null,"abstract":"<p>This article presents an educational blockchain simulator that is intended to train students and beginning blockchain developers. This simulator was created to provide users with an intuitive and accessible tool for learning the basic concepts and mechanisms of blockchain functioning. The article discusses the main aspects of the design and architecture of the simulator and provides a demonstration of the application. In addition, the possibilities for further development of the simulator and its potential as a teaching tool for research in the field of blockchain technologies are discussed. The resulting simulator contributes to the field of education and science, helping increase the level of competence of specialists and the development of innovative solutions in blockchain.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 3 supplement","pages":"S102 - S108"},"PeriodicalIF":0.5,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143676499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1