{"title":"A Framework for Automatic Generation of FAQs from Email Repositories","authors":"Shiney Jeyaraj, Raghuveera Tripuraribhatla","doi":"10.1109/IEMCON.2018.8614894","DOIUrl":null,"url":null,"abstract":"In many organizations, enquiry emails from customers remain unanswered due to lack of patience and availability of a respondent. Generating FAQs from email repositories with lot of enquiry emails will be beneficial. However, manual generation of FAQs by experts is a time consuming and strenous job. Hence automatic generation of FAQs is a necessity. Automatic generation of FAQs require effective categorization of emails which is challenging since the emails are written by different people with heterogenous cognition levels. In this paper, we propose a framework using Non-negative Matrix Factorization (NMF) and k-means that groups emails into clusters which can be used for FAQ generation. The proposed framework determines not only the broad topic under which the emails have to be tagged but also categorizes the emails into clusters with similar sub contents. The number of clusters was determined by the elbow method whereas the number of topics was fixed by calculating the percentage of relevant topics. The average Silhouette coefficient score of the resulting clusters was found to be 0.52 indicating reasonably good clusters. Also, the Silhouette coefficient score of the proposed method increased by 36.82 % compared to k-means.","PeriodicalId":368939,"journal":{"name":"2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON.2018.8614894","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In many organizations, enquiry emails from customers remain unanswered due to lack of patience and availability of a respondent. Generating FAQs from email repositories with lot of enquiry emails will be beneficial. However, manual generation of FAQs by experts is a time consuming and strenous job. Hence automatic generation of FAQs is a necessity. Automatic generation of FAQs require effective categorization of emails which is challenging since the emails are written by different people with heterogenous cognition levels. In this paper, we propose a framework using Non-negative Matrix Factorization (NMF) and k-means that groups emails into clusters which can be used for FAQ generation. The proposed framework determines not only the broad topic under which the emails have to be tagged but also categorizes the emails into clusters with similar sub contents. The number of clusters was determined by the elbow method whereas the number of topics was fixed by calculating the percentage of relevant topics. The average Silhouette coefficient score of the resulting clusters was found to be 0.52 indicating reasonably good clusters. Also, the Silhouette coefficient score of the proposed method increased by 36.82 % compared to k-means.