Authors: Yan Liu, Ying Li, Chengcheng Hu, Yongbin Wang
Venue: 2019 6th International Conference on Dependable Systems and Their Applications (DSA)
DOI: 10.1109/dsa.2019.00068 (https://doi.org/10.1109/dsa.2019.00068)
Publication date: 2020-01-01
An Improved HLDA-Based Method for Multi-document Automatic Summarization of Chinese News
There is a great deal of Chinese news about the same topic on the Internet today, and much of it is similar or repetitive. It is hard for readers to find exactly what they need. Multi-document news summarization aims to extract information from multiple news texts on the same topic and automatically generate a summary report for readers. Our paper uses news about the Great Wall as an example to illustrate the method of automatic summary generation. In our method, the HLDA topic importance calculation model is improved to suit the characteristics of a news corpus. Building on the summary-related features of the model, news-related features such as headline words, topic-sensitive words, and TF-IDF are added. Summary sentences are then extracted and fused to generate the summary automatically. Experimental results show that the proposed algorithm scores higher on the evaluation index than the traditional method, indicating the accuracy of combining the news features of the corpus with the improved HLDA algorithm.
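The abstract does not give the scoring formula, so the following is only a minimal sketch of how news features such as headline words, topic-sensitive words, and TF-IDF could be combined into a sentence score for extraction. Everything in it is an assumption made for illustration: the function names, the linear weighting, the default weights, and the smoothed IDF are hypothetical rather than the authors' method, the HLDA topic importance itself is not modeled (topic-sensitive words are simply passed in as a set), and the input is assumed to be already segmented into words.

```python
import math
from collections import Counter


def tfidf_weights(docs):
    """Corpus-level TF-IDF per word.

    docs: list of documents, each a list of sentences, each a list of words.
    """
    n_docs = len(docs)
    term_freq = Counter()
    doc_freq = Counter()
    for doc in docs:
        words = [w for sent in doc for w in sent]
        term_freq.update(words)
        doc_freq.update(set(words))
    total = sum(term_freq.values())
    # Smoothed IDF (an assumption): words found in every document keep a positive weight.
    return {
        w: (term_freq[w] / total) * (math.log((1 + n_docs) / (1 + doc_freq[w])) + 1.0)
        for w in term_freq
    }


def score_sentence(sentence, tfidf, headline_words, topic_words,
                   w_tfidf=1.0, w_headline=0.5, w_topic=0.5):
    """Hypothetical linear combination of the three news features."""
    if not sentence:
        return 0.0
    n = len(sentence)
    tfidf_part = sum(tfidf.get(w, 0.0) for w in sentence) / n
    headline_part = sum(w in headline_words for w in sentence) / n
    topic_part = sum(w in topic_words for w in sentence) / n
    return w_tfidf * tfidf_part + w_headline * headline_part + w_topic * topic_part


def rank_sentences(docs, headline_words, topic_words, top_k=2):
    """Return the top_k highest-scoring sentences across all documents."""
    tfidf = tfidf_weights(docs)
    sentences = [sent for doc in docs for sent in doc]
    ranked = sorted(
        sentences,
        key=lambda s: score_sentence(s, tfidf, headline_words, topic_words),
        reverse=True,
    )
    return ranked[:top_k]


# Toy, pre-tokenized example; the data is invented for illustration only.
docs = [
    [["great", "wall", "repair", "begins"], ["weather", "is", "fine"]],
    [["great", "wall", "tourism", "grows"], ["officials", "met", "today"]],
]
top = rank_sentences(docs, headline_words={"great", "wall"}, topic_words={"repair"}, top_k=1)
```

In this toy run the sentence that contains both headline words and the topic-sensitive word ranks first. A fuller pipeline would still need Chinese word segmentation, the HLDA-derived topic importance, and the sentence fusion step that the abstract mentions.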