Pub Date : 2022-06-12DOI: 10.26907/1562-5419-2022-25-2-137-147
Daniil Andreevic Klinov, K. Grigorian
The article is devoted to the creation of an effective solution for user segmentation. The article presents an analysis of existing user segmentation services, an analysis of approaches to user segmentation (ABCDx segmentation, demographic segmentation, segmentation based on a user journey map), an analysis of clustering algorithms (K-means, Mini-Batch K-means, DBSCAN, Agglomerative Clustering, Spectral Clustering). The study of these areas is aimed at creating a “flexible” segmentation solution that adapts to each user sample. Dispersion analysis (ANOVA test), analysis of clustering metrics is also used to assess the quality of user segmentation. With the help of these areas, an effective solution for user segmentation has been developed using advanced analytics and machine learning technology.
{"title":"Development of a Method for User Segmentation using Clustering Algorithms and Advanced Analytics","authors":"Daniil Andreevic Klinov, K. Grigorian","doi":"10.26907/1562-5419-2022-25-2-137-147","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-2-137-147","url":null,"abstract":"The article is devoted to the creation of an effective solution for user segmentation. The article presents an analysis of existing user segmentation services, an analysis of approaches to user segmentation (ABCDx segmentation, demographic segmentation, segmentation based on a user journey map), an analysis of clustering algorithms (K-means, Mini-Batch K-means, DBSCAN, Agglomerative Clustering, Spectral Clustering). The study of these areas is aimed at creating a “flexible” segmentation solution that adapts to each user sample. Dispersion analysis (ANOVA test), analysis of clustering metrics is also used to assess the quality of user segmentation. With the help of these areas, an effective solution for user segmentation has been developed using advanced analytics and machine learning technology.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125323724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-06-12DOI: 10.26907/1562-5419-2022-25-2-121-147
Andrey Evgenyevic Grishin, K. Grigorian
The article is devoted to automation of the software design stage. In the course of the study, the reasons for the high importance of this stage and the relevance of its automation were analyzed. The main stages of this stage were also considered and the existing systems that allow automating each of them were considered. In addition, an own solution was proposed within the framework of the problem of class structure refactoring based on the combinatorial optimization method. A solution method has been developed to improve the quality of the class hierarchy and tested on a real model.
{"title":"Development of the Expert System for Building the Architecture of Software Products","authors":"Andrey Evgenyevic Grishin, K. Grigorian","doi":"10.26907/1562-5419-2022-25-2-121-147","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-2-121-147","url":null,"abstract":"The article is devoted to automation of the software design stage. In the course of the study, the reasons for the high importance of this stage and the relevance of its automation were analyzed. The main stages of this stage were also considered and the existing systems that allow automating each of them were considered. In addition, an own solution was proposed within the framework of the problem of class structure refactoring based on the combinatorial optimization method. A solution method has been developed to improve the quality of the class hierarchy and tested on a real model.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129616658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-06-12DOI: 10.26907/1562-5419-2022-25-2-159-178
Aigul Ildarovna Sibgatullina, Azat Shavkatovich Yakupov
Every year the size of the global big data market is growing. Analysing these data is essential for good decision-making. Big data technologies lead to a significant cost reduction with use of cloud services, distributed file systems, when there is a need to store large amounts of information. The quality of data analytics is dependent on the quality of the data themselves. This is especially important if the data has a retention policy and migrates from one source to another, increasing the risk of a data loss. Prevention of negative consequences from data migration is achieved through the process of data reconciliation – a comprehensive verification of large amounts of information in order to confirm their consistency. This article discusses probabilistic data structures that can be used to solve the problem, and suggests an implementation – data integrity verification module using a Counting Bloom filter. This module is integrated into Apache Airflow to automate its invocation.
{"title":"Development a Data Validation Module to Satisfy the Retention Policy Metric","authors":"Aigul Ildarovna Sibgatullina, Azat Shavkatovich Yakupov","doi":"10.26907/1562-5419-2022-25-2-159-178","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-2-159-178","url":null,"abstract":"Every year the size of the global big data market is growing. Analysing these data is essential for good decision-making. Big data technologies lead to a significant cost reduction with use of cloud services, distributed file systems, when there is a need to store large amounts of information. The quality of data analytics is dependent on the quality of the data themselves. This is especially important if the data has a retention policy and migrates from one source to another, increasing the risk of a data loss. Prevention of negative consequences from data migration is achieved through the process of data reconciliation – a comprehensive verification of large amounts of information in order to confirm their consistency. \u0000This article discusses probabilistic data structures that can be used to solve the problem, and suggests an implementation – data integrity verification module using a Counting Bloom filter. This module is integrated into Apache Airflow to automate its invocation.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127497339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-05-12DOI: 10.26907/1562-5419-2022-25-2-177-196
Rustem Raficovich Yamikov, K. Grigorian
The growth in the number of IT products with machine-learning features is increasing the relevance of automating machine-learning processes. The use of MLOps techniques is aimed at providing training and efficient deployment of applications in a production environment by automating side infrastructure issues that are not directly related to model development. In this paper, we review the components, principles, and approaches of MLOps and analyze existing platforms and solutions for building machine learning pipelines. In addition, we propose an approach to build a machine learning pipeline based on basic DevOps tools and open-source libraries.
{"title":"Analysis and Development of the MLOps Pipeline for ML Model Deployment","authors":"Rustem Raficovich Yamikov, K. Grigorian","doi":"10.26907/1562-5419-2022-25-2-177-196","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-2-177-196","url":null,"abstract":"The growth in the number of IT products with machine-learning features is increasing the relevance of automating machine-learning processes. The use of MLOps techniques is aimed at providing training and efficient deployment of applications in a production environment by automating side infrastructure issues that are not directly related to model development. \u0000In this paper, we review the components, principles, and approaches of MLOps and analyze existing platforms and solutions for building machine learning pipelines. In addition, we propose an approach to build a machine learning pipeline based on basic DevOps tools and open-source libraries.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133914967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-04-29DOI: 10.26907/1562-5419-2022-25-1-2-14
R. Galiev, Karen Albertovich Grigoryan
Cloud platforms and services have become an important factor in the explosive development of the digital economy in the last decade. The ability to quickly scale the service, coupled with a reduction in investment costs at the start of projects within the framework of the Iaas, PaaS, SaaS approaches, gave positive results and formed the basis of new business models and corporate solutions. In this article, we discuss the reasons for the importance of multicloud and explore approaches to integrating cloud services in a multicloud architecture. The article also proposes a way to solve the problem of cloud migration - developing a system for migrating cloud resources between cloud services.
{"title":"Development of a Multicloud Service for Cloud Resource Migration","authors":"R. Galiev, Karen Albertovich Grigoryan","doi":"10.26907/1562-5419-2022-25-1-2-14","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-1-2-14","url":null,"abstract":"Cloud platforms and services have become an important factor in the explosive development of the digital economy in the last decade. The ability to quickly scale the service, coupled with a reduction in investment costs at the start of projects within the framework of the Iaas, PaaS, SaaS approaches, gave positive results and formed the basis of new business models and corporate solutions. \u0000 \u0000In this article, we discuss the reasons for the importance of multicloud and explore approaches to integrating cloud services in a multicloud architecture. The article also proposes a way to solve the problem of cloud migration - developing a system for migrating cloud resources between cloud services.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131968171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-04-29DOI: 10.26907/1562-5419-2022-25-1-64-74
Maxim Yurevich Novikov
The article is devoted to the issue of the use of digital educational services in the process of teaching students of higher educational institutions. Based on the experience of using various teaching methods and digital resources in teaching students, new digital solutions are proposed that solve some of the didactic tasks and provide new opportunities for organizing the learning process. The main issue under discussion is the possibility of individualization and control of the learning process in the case of the use of teaching video materials by the teacher. By comparing the traditional face-to-face lecture and the training video, the positive and negative effects of each of the approaches to the delivery of lecture material are described. Based on the results of such a comparison, a video lecture model is proposed that includes interactive feedback elements.
{"title":"An Approach to Individualization and Control of Students’ Activities During Wathing a Video Lecture","authors":"Maxim Yurevich Novikov","doi":"10.26907/1562-5419-2022-25-1-64-74","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-1-64-74","url":null,"abstract":"The article is devoted to the issue of the use of digital educational services in the process of teaching students of higher educational institutions. Based on the experience of using various teaching methods and digital resources in teaching students, new digital solutions are proposed that solve some of the didactic tasks and provide new opportunities for organizing the learning process. The main issue under discussion is the possibility of individualization and control of the learning process in the case of the use of teaching video materials by the teacher. By comparing the traditional face-to-face lecture and the training video, the positive and negative effects of each of the approaches to the delivery of lecture material are described. Based on the results of such a comparison, a video lecture model is proposed that includes interactive feedback elements.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128586228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-04-29DOI: 10.26907/1562-5419-2022-25-1-75-86
B. Starichenko, Lubov Vladimirovna Sardak
The article identifies four categories of goals for studying mathematics at school: mastering theoretical positions at the evidentiary level, mastering algorithms for solving mathematical problems that illustrate the provisions of the theory, using the studied algorithms to solve applied and research problems, using mathematical methods and algorithms for teaching programming. Such a selection makes it possible to substantiate and specify the use of digital technologies in the course of mathematics, as well as to ensure their continuity, including with subsequent higher education courses. The importance of using mobile and cloud technologies and applications in independent (home) work of students in mathematics is emphasized.
{"title":"Digitalization of School Mathematics – from Learning Goals to Technologies","authors":"B. Starichenko, Lubov Vladimirovna Sardak","doi":"10.26907/1562-5419-2022-25-1-75-86","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-1-75-86","url":null,"abstract":"The article identifies four categories of goals for studying mathematics at school: mastering theoretical positions at the evidentiary level, mastering algorithms for solving mathematical problems that illustrate the provisions of the theory, using the studied algorithms to solve applied and research problems, using mathematical methods and algorithms for teaching programming. Such a selection makes it possible to substantiate and specify the use of digital technologies in the course of mathematics, as well as to ensure their continuity, including with subsequent higher education courses. The importance of using mobile and cloud technologies and applications in independent (home) work of students in mathematics is emphasized.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"04 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129939931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-04-29DOI: 10.26907/1562-5419-2022-25-1-42-63
O. Nevzorova, K. Nikolaev
The article provides an overview of existing digital publishing systems, existing ways to expand the functionality of such systems, and also proposes a project for a set of services to expand the functionality of the Open Journal Systems publishing system on the platform of the Lobachevskii-DML digital mathematical library. The proposed set of services includes services aimed at the authors of articles and intended for the editorial staff of the journal. The existing developments in individual parts of the project are described, and the main ideas for the development of all services are proposed.
{"title":"Scientific and Publishing Services on the Lobachevskii-DML Platform","authors":"O. Nevzorova, K. Nikolaev","doi":"10.26907/1562-5419-2022-25-1-42-63","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-1-42-63","url":null,"abstract":"The article provides an overview of existing digital publishing systems, existing ways to expand the functionality of such systems, and also proposes a project for a set of services to expand the functionality of the Open Journal Systems publishing system on the platform of the Lobachevskii-DML digital mathematical library. The proposed set of services includes services aimed at the authors of articles and intended for the editorial staff of the journal. The existing developments in individual parts of the project are described, and the main ideas for the development of all services are proposed.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"361 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132207309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-04-29DOI: 10.26907/1562-5419-2022-25-1-15-41
V. Naumova, V. S. Eremenko, A. S. Eremenko, A. A. Zagumennov, M. I. Patuk
The article describes all stages of the development of an Information system for the creation, development and support of scientific research in geology: from the Information and analytical environment to the Unified Digital Space of geological scientific knowledge. The current stage of the Project under development is a step forward from the traditional geographically distributed geological system to the digital environment of the future.
{"title":"From an Information and Analytical Environment to Support Scientific Research in Geology to a Single Digital Space of Geological Scientific Knowledge","authors":"V. Naumova, V. S. Eremenko, A. S. Eremenko, A. A. Zagumennov, M. I. Patuk","doi":"10.26907/1562-5419-2022-25-1-15-41","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-1-15-41","url":null,"abstract":"The article describes all stages of the development of an Information system for the creation, development and support of scientific research in geology: from the Information and analytical environment to the Unified Digital Space of geological scientific knowledge. The current stage of the Project under development is a step forward from the traditional geographically distributed geological system to the digital environment of the future.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129176736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-04-29DOI: 10.26907/1562-5419-2022-25-1-87-102
Chingiz Irekovich Fatikhov, K. Grigorian
The COVID-19 pandemic makes the problem of monitoring and analyzing the movement of people indoors more urgent in order to timely identify those who have been in contact with the sick and prevent further spread of the infection. The article proposes one of the ways to solve this problem - the development of a system for determining and saving the history of the location of people inside the premises. The article also discusses methods, parameters and technologies that can be used to solve the problem of indoor localization.
{"title":"Development of a System for Collecting Data on the Movement of People Indoors","authors":"Chingiz Irekovich Fatikhov, K. Grigorian","doi":"10.26907/1562-5419-2022-25-1-87-102","DOIUrl":"https://doi.org/10.26907/1562-5419-2022-25-1-87-102","url":null,"abstract":"The COVID-19 pandemic makes the problem of monitoring and analyzing the movement of people indoors more urgent in order to timely identify those who have been in contact with the sick and prevent further spread of the infection. \u0000The article proposes one of the ways to solve this problem - the development of a system for determining and saving the history of the location of people inside the premises. The article also discusses methods, parameters and technologies that can be used to solve the problem of indoor localization.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127915539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}