Wha Wha Adytoma, A. Huda, D. Maylawati, Nunik Destria Arianti, W. Darmalaksana, A. Rahman, M. Ramdhani
{"title":"Automatic Text Summarization for Hadith with Indonesian Text using Bellman-Ford Algorithm","authors":"Wha Wha Adytoma, A. Huda, D. Maylawati, Nunik Destria Arianti, W. Darmalaksana, A. Rahman, M. Ramdhani","doi":"10.1109/ICCED51276.2020.9415864","DOIUrl":null,"url":null,"abstract":"Automatic text summarization is one of Natural Language Processing technology to create a summary automatically by not changing the core or main idea of a summarized document. According to the agreement of the majority of Muslim scholars, Hadith is a second source of Muslim's life guideline. This study aims to extract the main meaning of Hadith document using automatic text summarization. The method used in this study is Bellman-Ford Algorithm from graph theory to extract and score the sentence based on the closeness and interconnection between sentences. The Hadith summary which resulted from Bellman-Ford algorithm is evaluated using Recall-Oriented Understudy for Gisting Evaluation - Longest Common Subsequence (ROUGE-L) metrics. Based on experiment using 20 Hadith documents with Indonesian text and ROUGE-L metrics evaluation, the summary results from sentence extraction are influenced by the type of similarity formula between sentences. The result shows that an average value of precision is 46.5%, recall value is 56% recall and f-score 49.5%. The evaluation result is not good enough because the summary result is extraction summary, while the summary evaluation database is abstraction summary. However, from human evaluation, this research contributes to understand the contents of the Hadith with a shorter text but does not eliminate the essence of the Hadith itself.","PeriodicalId":344981,"journal":{"name":"2020 6th International Conference on Computing Engineering and Design (ICCED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th International Conference on Computing Engineering and Design (ICCED)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCED51276.2020.9415864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Automatic text summarization is one of Natural Language Processing technology to create a summary automatically by not changing the core or main idea of a summarized document. According to the agreement of the majority of Muslim scholars, Hadith is a second source of Muslim's life guideline. This study aims to extract the main meaning of Hadith document using automatic text summarization. The method used in this study is Bellman-Ford Algorithm from graph theory to extract and score the sentence based on the closeness and interconnection between sentences. The Hadith summary which resulted from Bellman-Ford algorithm is evaluated using Recall-Oriented Understudy for Gisting Evaluation - Longest Common Subsequence (ROUGE-L) metrics. Based on experiment using 20 Hadith documents with Indonesian text and ROUGE-L metrics evaluation, the summary results from sentence extraction are influenced by the type of similarity formula between sentences. The result shows that an average value of precision is 46.5%, recall value is 56% recall and f-score 49.5%. The evaluation result is not good enough because the summary result is extraction summary, while the summary evaluation database is abstraction summary. However, from human evaluation, this research contributes to understand the contents of the Hadith with a shorter text but does not eliminate the essence of the Hadith itself.