Le Thi Hong Hanh, Nguyen Ngoc Nam, N. T. Linh, Nguyen Linh Diep, Nguyen Ngoc Hai
VNU Journal of Economics and Business. Published 2022-04-25. DOI: 10.25073/2588-1108/vnueab.4715 (https://doi.org/10.25073/2588-1108/vnueab.4715)
Stock Market Prediction: The Application of Text-Mining in Vietnam
There are very few studies in Vietnam on the application of text mining in finance or on Vietnamese language processing. This study originates from one of the leading studies on the use of machine learning to analyze text data, and uses articles from four well-known online newspapers in Vietnam to forecast, one day in advance, whether the VN-Index will increase, decrease, or remain neutral. Nearly 70,000 articles from these four reputable online newspapers served as input data for four machine learning models: decision trees, random forests, k-nearest neighbors (KNN), and support vector machines (SVM). After the best model (SVM) and the best dataset (Vietstock) were selected, further analysis and refinement raised the accuracy to 60.1%. The end result is solid evidence that financial and stock market news in the popular press affects the price movements of the VN-Index and the Vietnamese stock market.
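The three-way target described above (increase, decrease, neutral, one day in advance) can be illustrated with a minimal sketch. The abstract does not say how "neutral" is defined, so the 0.2% band below is purely an assumption, as are the function name `label_next_day_moves` and the made-up closing prices; this is not the authors' labeling procedure.

```python
from typing import List


def label_next_day_moves(closes: List[float], neutral_band: float = 0.002) -> List[str]:
    """Label each day by the index's relative move on the following trading day.

    neutral_band is a hypothetical threshold: moves within +/- 0.2% count as
    "neutral". The source does not specify the actual definition.
    """
    labels = []
    # Pair each close with the next day's close; the last day has no label.
    for today, tomorrow in zip(closes, closes[1:]):
        change = (tomorrow - today) / today
        if change > neutral_band:
            labels.append("increase")
        elif change < -neutral_band:
            labels.append("decrease")
        else:
            labels.append("neutral")
    return labels


# Made-up closing values, for illustration only.
closes = [1000.0, 1010.0, 1010.5, 995.0]
print(label_next_day_moves(closes))  # ['increase', 'neutral', 'decrease']
```

In a setup like the one the abstract describes, each day's label would be paired with the articles published on that day, and a classifier such as an SVM would then be trained to predict the label from the text.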