{"title":"基于层次聚类的印地语文本抽取与抽象摘要","authors":"Cheshta Kwatra, K. Gupta","doi":"10.1109/ICSES52305.2021.9633789","DOIUrl":null,"url":null,"abstract":"Text Summarization is a widely researched and successful area of Natural Language Processing application. However, it remains limited to established languages such as English, French, etc. In this paper, we propose and compare extractive and abstractive summarization techniques for Hindi text documents. For either summarization, we first propose ward hierarchical agglomerative clustering. This is followed by the PageRank algorithm for extractive summarization while in abstractive summarization, we present an approach based on multi-sentence compression which only requires a POS tagger to generate Hindi text summaries.","PeriodicalId":6777,"journal":{"name":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","volume":"23 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Extractive and Abstractive Summarization for Hindi Text using Hierarchical Clustering\",\"authors\":\"Cheshta Kwatra, K. Gupta\",\"doi\":\"10.1109/ICSES52305.2021.9633789\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text Summarization is a widely researched and successful area of Natural Language Processing application. However, it remains limited to established languages such as English, French, etc. In this paper, we propose and compare extractive and abstractive summarization techniques for Hindi text documents. For either summarization, we first propose ward hierarchical agglomerative clustering. This is followed by the PageRank algorithm for extractive summarization while in abstractive summarization, we present an approach based on multi-sentence compression which only requires a POS tagger to generate Hindi text summaries.\",\"PeriodicalId\":6777,\"journal\":{\"name\":\"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)\",\"volume\":\"23 1\",\"pages\":\"1-6\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSES52305.2021.9633789\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSES52305.2021.9633789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Extractive and Abstractive Summarization for Hindi Text using Hierarchical Clustering
Text Summarization is a widely researched and successful area of Natural Language Processing application. However, it remains limited to established languages such as English, French, etc. In this paper, we propose and compare extractive and abstractive summarization techniques for Hindi text documents. For either summarization, we first propose ward hierarchical agglomerative clustering. This is followed by the PageRank algorithm for extractive summarization while in abstractive summarization, we present an approach based on multi-sentence compression which only requires a POS tagger to generate Hindi text summaries.