AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS: Latest Articles

Analytical Statistics on Scientific Publications of the Kazan Federal University on Scilit
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700438
A. V. Ermakov

This paper examines issues related to the presentation of information concerning publications by Kazan Federal University researchers, teachers, graduate students, and students, as well as concerning the university’s scientific sources in the information and analytical materials of the Scilit system. Specific examples present the advantages of the complete and correct setting of metadata for scientific publications, as well as problems that arise when handling bibliographic information carelessly.

Volume 58, issue 6 supplement, pp. S343-S351
Citations: 0
Automatic Annotation of HTML Documents Using the Microdata Standard
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700359
T. F. Ibragimov, A. A. Ferenets

This paper describes the development of an application, based on machine learning methods, for the automatic annotation of web pages according to the Microdata standard, with the possibility of extension to other standards and of injecting data into JSX files. Datasets were collected and prepared for training machine learning (ML) models. The ML model metrics were collected and analyzed.
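
For readers unfamiliar with the Microdata standard, the following minimal Python sketch shows the kind of annotated markup such a tool would need to emit. It is not the authors' ML application: the function name `to_microdata` and the choice of a schema.org `Person` item in the usage note are assumptions made purely for illustration, and the sketch handles only flat key/value properties.

```python
from html import escape


def to_microdata(item_type: str, props: dict[str, str]) -> str:
    """Render a flat dict as an HTML fragment carrying Microdata attributes.

    `itemscope`, `itemtype`, and `itemprop` are the attribute names defined by
    the Microdata standard; everything else here is a hypothetical helper.
    """
    lines = [f'<div itemscope itemtype="{escape(item_type, quote=True)}">']
    for name, value in props.items():
        # Escape both the property name and the visible text.
        lines.append(
            f'  <span itemprop="{escape(name, quote=True)}">{escape(value)}</span>'
        )
    lines.append("</div>")
    return "\n".join(lines)
```

Calling `to_microdata("https://schema.org/Person", {"name": "T. F. Ibragimov"})` yields a `div` with `itemscope` and `itemtype` set and one `span` marked `itemprop="name"`. An automatic annotator would have to predict the item type and the property names; this helper only serializes them once they are known.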

Volume 58, issue 5 supplement, pp. S283-S288
Citations: 0
Alive Publications Are Gaining Popularity
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700402
M. M. Gorbunov-Posadov

Alive publications are exemplars of a new genre for presenting the results of scientific research; in this type, scientific work is published online and then is constantly developed and improved by its author. Serious errors and typos are no longer fatal, nor do they haunt the author for the rest of his or her life. The reader of an alive publication knows that the author is constantly monitoring changes in this branch of science. A Russian author who supports an alive publication is currently hopelessly losing in many bibliometric indicators favored by conservative science officials. An alive publication encourages the development of the bibliographical apparatus. Each citation will soon have to be continually updated to the date of the last revision of the alive publication. It is to be expected that alive publications will spread over to the scientific world, and the author’s concern for the publication’s evolution will become like a parent’s care for the development of a child, and, to the delight of the reader, the internet will be filled with scientific publications that do not lose their relevance over time.

Volume 58, issue 6 supplement, pp. S318-S322
Citations: 0
Review of Technologies for Ensuring Security and Protection of E-Mail Systems in a Scientific Organization
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700463
G. M. Mikhaylov, A. M. Chernetsov

This paper gives an overview and description of the modern technologies used in processing e-mail messages to address the problem of receiving trusted e-mail. Recommended settings for successful operation are provided.

Volume 58, issue 6 supplement, pp. S373-S375
Citations: 0
An Ontology-Based Approach for Distributed Multiagent Modeling of Radio-Technical Systems
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700499
A. O. Schiriy

The ontology-based approach to multiagent modeling implements a modeling system through the creation of ontologies. The IEEE 1516 Standard for Modeling and Simulation High Level Architecture is an example of a holistic implementation of an ontology-based approach to agent-based modeling. This work is devoted to a multiagent modeling system designed for modeling complex radio engineering systems (especially radar systems). This is relevant because part of the field testing of radio engineering systems needs to be replaced with simulation experiments. One motivation, among others, for moving a heavy multiagent modeling system to the IEEE 1516 standard is to ensure scalability, openness, and repeated reuse of the developed agent models; this is a logical choice, since it relies on a well-developed and proven standard that establishes rules for model interaction and for the development of software interfaces. The general principles of the construction and architecture of the modeling system are given. The basic requirements for the main modeling agents, and their role and place in the complex modeling system, are shown; a special place among them is occupied by the simulator of the background-target environment. The possibility of combining two simulation schemes, discrete-event and step-by-step, is also discussed. The step-by-step scheme has the advantages of simplicity and clarity and is convenient for modeling the processing algorithms and components of radio engineering systems, but true autonomy and asynchrony of agents cannot be implemented in it. Combining the two schemes makes it possible to combine their advantages.
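
The abstract does not say how the two schemes are combined. One common way, shown here only as a hedged Python sketch and not as the paper's design or an IEEE 1516 API, is to let a fixed-step model re-schedule its own "tick" as an ordinary event in a discrete-event queue, so that irregular events from asynchronous agents interleave with it in time order. The function name `run`, the event labels, and the priority field are all hypothetical.

```python
import heapq


def run(until: float, step: float, async_events: list[tuple[float, str]]):
    """Process fixed-step ticks and irregular events from one time-ordered queue."""
    # Queue entries are (time, priority, label); priority 0 lets a tick run
    # before an asynchronous event that happens to share its timestamp.
    queue = [(t, 1, label) for t, label in async_events]
    queue.append((0.0, 0, "tick"))
    heapq.heapify(queue)

    log = []
    while queue:
        time, _, label = heapq.heappop(queue)
        if time > until:
            break
        log.append((time, label))
        if label == "tick":
            # The step-by-step model re-schedules itself as an ordinary event.
            heapq.heappush(queue, (time + step, 0, "tick"))
    return log
```

With `run(2.0, 1.0, [(0.5, "detect"), (1.5, "track")])`, the log alternates ticks at 0.0, 1.0, and 2.0 with the two irregular events. The step-driven component keeps its simple loop structure while other agents remain free to act at arbitrary times.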

Volume 58, issue 6 supplement, pp. S398-S402
Citations: 0
A Two-Level Information and Analytical Control System for Intelligent Traffic Lights
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700335
M. V. Bobyr, N. I. Khrapova

Problems that arise in the field of traffic are of great importance. To solve existing problems, various intelligent systems are being developed, one of which is the Smart City system. This work is devoted to the development of an information and analytical system (IAS) for controlling an intelligent traffic light. The presented system consists of two levels, each of which contains a set of specific operations. The first level is responsible for detecting objects, in particular pedestrians and cars at an intersection, and the second level calculates the operating time of traffic light signals for the control signal that is transmitted to the device. For comparative analysis, the combined histogram of oriented gradients + support vector machines (HOG+SVM) method was chosen. HOG is based on counting the number of gradient directions in individual image areas, and SVMs were used to construct hyperplanes in n-dimensional space to separate objects belonging to different classes. The results of an experimental study, in which the recognition of objects in images was carried out, showed the superiority of the developed information and analytical system over existing methods. The average accuracy of detecting pedestrians and cars through the IAS was 69.4%. In addition, the experiment showed that the accuracy of detecting objects in images is directly proportional to the distance from the video camera to the object.
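
To make the "counting gradient directions on individual image areas" step concrete, here is a minimal pure-Python sketch of an orientation histogram for a single image cell. It illustrates the general HOG idea only and is not the authors' implementation. The function name, the 9-bin layout, and the central-difference gradient are assumptions. A full HOG descriptor also normalizes histograms over blocks of cells before the vectors are passed to an SVM.

```python
import math


def orientation_histogram(cell: list[list[float]], bins: int = 9) -> list[float]:
    """Magnitude-weighted histogram of unsigned gradient orientations (0-180 degrees)."""
    height, width = len(cell), len(cell[0])
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    # Border pixels are skipped so that central differences are always defined.
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]
            gy = cell[y + 1][x] - cell[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue
            # Fold the direction into [0, 180): opposite gradients share a bin.
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            hist[int(angle // bin_width) % bins] += magnitude
    return hist
```

For a 4x4 cell containing a single vertical edge (each row `[0, 0, 1, 1]`), all of the gradient energy lands in the first bin, which covers near-horizontal gradient directions.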

Volume 58, issue 5 supplement, pp. S269-S278
Citations: 0
Development of Lightweight Parsers with Different Go Language Granularity
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700426
D. S. Drozdov, S. S. Mikhalkovich

This article considers an approach to creating a family of lightweight grammars for the Go language, with a special symbol Any denoting the skipped part of the program [1]. A formal definition of a more granular grammar is given, along with examples illustrating the increase in grammar rule granularity. The efficiency of the constructed lightweight parsers is analyzed in terms of memory usage and runtime on seven industrial repositories. It is shown that increasing the granularity of the grammar does not lead to a significant increase in parser resource consumption and varies only slightly depending on repository type and coding style in Go. Furthermore, the advantages of using lightweight grammars with Any over full grammars are summarized. An example of using a lightweight grammar to determine code complexity is presented. The results can also be applied to estimate the parser’s contribution to overall resource consumption, e.g., in tasks such as code binding and project markup.
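
The idea of a lightweight grammar with a skip symbol can be illustrated with a small Python sketch that recognizes only Go function declarations and treats everything between them as `Any`. This is a hypothetical illustration, not the authors' grammar or parser generator. The regular expression and the brace matching are simplifying assumptions that ignore braces inside strings, comments, and parameter types.

```python
import re

# Matches "func name(" and "func (recv T) name(". Nothing else is recognized.
FUNC_HEADER = re.compile(r"\bfunc\s+(?:\([^)]*\)\s*)?([A-Za-z_]\w*)\s*\(")


def find_functions(source: str) -> list[tuple[str, str]]:
    """Return (name, body) pairs; text outside function declarations is skipped as Any."""
    results = []
    pos = 0
    while (match := FUNC_HEADER.search(source, pos)):
        open_brace = source.find("{", match.end())
        if open_brace == -1:
            break
        depth, i = 0, open_brace
        # Scan forward to the brace that closes the function body.
        while i < len(source):
            if source[i] == "{":
                depth += 1
            elif source[i] == "}":
                depth -= 1
                if depth == 0:
                    break
            i += 1
        results.append((match.group(1), source[open_brace + 1:i]))
        pos = i + 1
    return results
```

A more granular variant in the same spirit would also recognize selected constructs inside each body, such as `if` and `for`, which is the kind of information a code-complexity estimate needs, while still skipping the rest.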

Volume 58, issue 6 supplement, pp. S333-S342
Citations: 0
Approach to Creating an HTML Version of a Scientific Article from a Manuscript in MS Word Format for a Low-Budget Publisher
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700475
R. Y. Skornyakova

The most common approach to creating an HTML version of a journal article used by scientific publishers is to first create an XML version of the article in accordance with the NISO Journal Article Tag Suite (JATS) standard, followed by automatic conversion to HTML and PDF. However, obtaining an XML version from a manuscript in the MS Word .docx format, often used by authors, is a difficult task when the manuscript contains a large number of complex formulas and tables. Existing software either cannot cope with it in full or is expensive and inaccessible to small publishers on a limited budget. This paper proposes an approach to creating an HTML version of a journal article from a manuscript in .docx format containing formulas in MathType format, which does not require significant financial or time costs from the publisher. It also describes a prototype converter from .docx format to HTML and JATS XML formats that implements this approach and is applicable to KIAM preprints.
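
As background on why this conversion is feasible without expensive tooling: a .docx file is a ZIP archive whose main text lives in `word/document.xml`. The following Python sketch, which uses only the standard library, extracts paragraph text and wraps it in `<p>` elements. It is a deliberately minimal illustration under stated assumptions, not the prototype described in the paper. It ignores formulas (including MathType objects), tables, styles, and JATS output, and the function name is hypothetical.

```python
import zipfile
import xml.etree.ElementTree as ET
from html import escape

# WordprocessingML namespace used by elements in word/document.xml.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_paragraphs_to_html(docx_file) -> str:
    """Convert the plain paragraph text of a .docx (path or file object) to HTML."""
    with zipfile.ZipFile(docx_file) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    parts = []
    for para in root.iter(W + "p"):
        # A paragraph's text is split across runs; join all text nodes.
        text = "".join(node.text or "" for node in para.iter(W + "t"))
        if text:
            parts.append(f"<p>{escape(text)}</p>")
    return "\n".join(parts)
```

The hard part the paper addresses, complex formulas and tables, is exactly what this sketch leaves out, which is why a dedicated converter is still needed.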

Volume 58, issue 6 supplement, pp. S376-S388
Citations: 0
What Should the Educational Programming Language Be 教育编程语言应该是什么
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700414
L. V. Gorodnyaya

This article is devoted to the development of solutions in the project of a simulator for teaching programming, intended for initial familiarization with the basic concepts of process interaction and calculation management. In the transition to multiprocessor architectures, the relevance of a special language and information support for the introduction to programming is growing. No matter how complex the world of parallelism is, a programmer training system will have to master it and create a methodology for full familiarization with its nonobvious phenomena. This is a sufficient reason for developing an educational programming language aimed at the initial training of primary and secondary school students, as well as junior students and nonprofessionals, for operating interacting processes and programming parallel computations. The given language has been developed through many years of experience in managing the interaction of toy robots moving on a checkered board. The material of this article is of interest to programmers, students, and graduate students specializing in the field of systems and theoretical programming and to all those interested in the problems of modern computer science, programming, and information technology, especially the problems of parallel computing, supercomputers, and the use of multiprocessor complexes and computer networks in general.

Volume 58, issue 6 supplement, pp. S323-S332
Citations: 0
Automatic Annotation of Training Datasets in Computer Vision Using Machine Learning Methods
IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-04-16 DOI: 10.3103/S0005105525700347
A. K. Zhuravlyov, K. A. Grigorian

This paper addresses the automatic annotation of training datasets in the field of computer vision using machine learning methods. Data annotation is a key stage in the development and training of deep learning models, but creating labeled data often requires significant time and labor. This paper proposes a mechanism for automatic annotation based on the use of convolutional neural networks and active learning methods. The proposed methodology includes the analysis and evaluation of existing approaches to automatic annotation. The effectiveness of the proposed solutions is assessed using publicly available datasets. The results demonstrate that the proposed method significantly reduces the time required for data annotation, although operator intervention is still necessary. The literature review presents an analysis of modern annotation methods and existing automatic systems, providing a better understanding of the context and advantages of the proposed approach. The conclusion discusses the study achievements, its limitations, and possible directions for future research in this field.
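
The abstract notes that operator intervention remains necessary. A common pattern in active-learning-style annotation pipelines, sketched below in Python as an assumption and not as the authors' mechanism, is to accept the model's label automatically when its confidence is high and to queue the least confident items for human review first. The function name, the 0.9 threshold, and the input format are all hypothetical.

```python
def split_by_confidence(
    predictions: dict[str, dict[str, float]], threshold: float = 0.9
) -> tuple[dict[str, str], list[str]]:
    """Split model predictions into auto-accepted labels and a human review queue.

    `predictions` maps an item id to its per-class probabilities.
    """
    auto_labels: dict[str, str] = {}
    review: list[tuple[float, str]] = []
    for item_id, class_probs in predictions.items():
        label, confidence = max(class_probs.items(), key=lambda kv: kv[1])
        if confidence >= threshold:
            auto_labels[item_id] = label
        else:
            review.append((confidence, item_id))
    # Least confident items come first: they are the most informative to label.
    review.sort()
    return auto_labels, [item_id for _, item_id in review]
```

Labels confirmed by the operator would then be added to the training set and the model retrained, which gradually shrinks the review queue.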

Volume 58, issue 5 supplement, pp. S279-S282
Citations: 0