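Formalization of Basic Processes and Mathematical Model of the System for Monitoring and Analysis of Publications of Electronic Media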

V. Komarov, S.M. Roschin
Journal: Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics
Publication type: Journal Article
Publication date: 2021-11-01
DOI: 10.14529/ctcr210403 (https://doi.org/10.14529/ctcr210403)

Abstract

The article describes an approach to formalizing the basic processes of a system that collects and analyzes data from electronic media, and to building a mathematical model of that system. As part of a research project, the authors are creating such a system, which includes developing new algorithms, methods, and approaches for collecting and analyzing textual information from Internet news sources. The study focuses on applying text data mining methods based on artificial neural networks, natural language processing, machine learning, and big data processing.

Purpose of the study. To develop a formalized description of the model of a system for monitoring and analyzing the textual information of electronic news media, using the methods of mathematical modeling.

Research methods and tools. The authors propose using the toolkit of mathematical modeling methodology together with the methods of system analysis. The system was studied using such methods of system analysis as abstraction, formalization, composition and decomposition, structuring and restructuring, modeling, recognition, and identification. The system is treated as a formalized model of an automatic classifier and clusterer for a set of natural-language text documents, expressed as an algebraic system. To solve the text classification and clustering problems, the authors propose machine learning methods based on neural network approaches. The structure of the system and its constituent processes, as well as the external processes that interact with the system, are presented as a formalized mathematical description.

Results. The resulting formalized mathematical description of the system model shows the interconnections among the system components, as well as the system's internal processes. The approach makes it possible to detail the representation of the system by decomposing it into subsystems and modules. This in turn makes it possible to order the stages of creating the system and to split them into separate stages of work.

Conclusion. The results of the study allow the authors to move on to the next stage in the life cycle of the information system under development: its software development.