{"title":"创建聊天机器人时的文本处理方法","authors":"A. Borodin, R. Veynberg, O. Litvishko","doi":"10.34671/sch.hbr.2019.0303.0026","DOIUrl":null,"url":null,"abstract":". As part of the development of a chatbot, a necessary and sufficient condition for working with text is the use of various methods of text analysis as an input element of communication with the bot and its training. The article deals with a number of solutions used for text analysis and construction of text data analysis models: lemmatization methods, text vectorization, various machine learning models. The main focus of the article is on the methods of text processing in different formats and using different technologies, which provides scalability and versatility of the proposed technology and the effectiveness of the future chatbot as a whole. The article will be interesting for programmers, text analysts and anyone interested in working with text and developing systems for working with text information.","PeriodicalId":34335,"journal":{"name":"Khumanitarni Balkanski izsledvaniia","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"METHODS OF TEXT PROCESSING WHEN CREATING CHATBOTS\",\"authors\":\"A. Borodin, R. Veynberg, O. Litvishko\",\"doi\":\"10.34671/sch.hbr.2019.0303.0026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\". As part of the development of a chatbot, a necessary and sufficient condition for working with text is the use of various methods of text analysis as an input element of communication with the bot and its training. The article deals with a number of solutions used for text analysis and construction of text data analysis models: lemmatization methods, text vectorization, various machine learning models. The main focus of the article is on the methods of text processing in different formats and using different technologies, which provides scalability and versatility of the proposed technology and the effectiveness of the future chatbot as a whole. The article will be interesting for programmers, text analysts and anyone interested in working with text and developing systems for working with text information.\",\"PeriodicalId\":34335,\"journal\":{\"name\":\"Khumanitarni Balkanski izsledvaniia\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Khumanitarni Balkanski izsledvaniia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.34671/sch.hbr.2019.0303.0026\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Khumanitarni Balkanski izsledvaniia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34671/sch.hbr.2019.0303.0026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
. As part of the development of a chatbot, a necessary and sufficient condition for working with text is the use of various methods of text analysis as an input element of communication with the bot and its training. The article deals with a number of solutions used for text analysis and construction of text data analysis models: lemmatization methods, text vectorization, various machine learning models. The main focus of the article is on the methods of text processing in different formats and using different technologies, which provides scalability and versatility of the proposed technology and the effectiveness of the future chatbot as a whole. The article will be interesting for programmers, text analysts and anyone interested in working with text and developing systems for working with text information.