{"title":"Analysis and Evaluation of Text Comparison Based on Intelligent Optimization Algorithm","authors":"Zixian Fang, Jiayi Wang, Fenglan Luo","doi":"10.56397/ist.2023.09.02","DOIUrl":null,"url":null,"abstract":"Text transcription is crucial in Chinese information processing. Text transcription has always existed since ancient times, but no matter whether it is manual transcription in ancient times or modern transcription using communication and storage devices, random errors cannot be avoided when a message has been forwarded and transcribed many times. In this paper, we study how to measure the size of differences between different versions of texts, how to estimate the number of transmissions experienced between two texts, and how to design an effective and fast algorithm for the calculation of the first two types of problems in the study of text transcription, with respect to the characteristics of text transcription. This paper proposes the concept of text similarity, constructs the TF-IDF similarity evaluation model of text, the text transmission evaluation model based on Gaussian process (i.e., GFCT Model), and the model based on the immune frog jumping algorithm to analyze the comparative processing of text, so as to achieve accurate and effective information processing, with a view to providing a new method for text data processing, and improving the accuracy and effectiveness of text data processing.","PeriodicalId":20688,"journal":{"name":"Proceedings of The 6th International Conference on Innovation in Science and Technology","volume":"138 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of The 6th International Conference on Innovation in Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56397/ist.2023.09.02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Text transcription is crucial in Chinese information processing. Text transcription has always existed since ancient times, but no matter whether it is manual transcription in ancient times or modern transcription using communication and storage devices, random errors cannot be avoided when a message has been forwarded and transcribed many times. In this paper, we study how to measure the size of differences between different versions of texts, how to estimate the number of transmissions experienced between two texts, and how to design an effective and fast algorithm for the calculation of the first two types of problems in the study of text transcription, with respect to the characteristics of text transcription. This paper proposes the concept of text similarity, constructs the TF-IDF similarity evaluation model of text, the text transmission evaluation model based on Gaussian process (i.e., GFCT Model), and the model based on the immune frog jumping algorithm to analyze the comparative processing of text, so as to achieve accurate and effective information processing, with a view to providing a new method for text data processing, and improving the accuracy and effectiveness of text data processing.