K. Hasan, Md. Sakhawat Hossain, Md. Abdulla Al-Sun, Md. Mostafizur Rahman
{"title":"基于确定性有限自动机的TRIE孟加拉语解析器设计","authors":"K. Hasan, Md. Sakhawat Hossain, Md. Abdulla Al-Sun, Md. Mostafizur Rahman","doi":"10.1109/ICCIT57492.2022.10056008","DOIUrl":null,"url":null,"abstract":"We describe a new method of parsing Bangla language based on Deterministic Finite Automata (DFA) and implement the parser using a TRIE data structure. Hence we call the parser as TRIE parser. TRIE parser successfully parses sentences faster than other important parsing schemes as it needs no formal rules, no parameters and no Context Free Grammars (CFG). the scheme stores the Bangla grammar symbols or Pasts Of Speech (POS) as a state of the DFA and process a sentence following the operations of a DFA. If the set of POS symbols reaches to final state, then parsing is successful otherwise unsuccessful. The parser uses the grammar rules in compressed form hence it becomes very less space consuming. Therefore, it can be implemented in light weight fashion in main memory. The TRIE parser is compared with two other parsers and it shows that the proposed TRIE parser outperforms others in terms of processing time with an increasing number of sentences in the input paragraph. Necessary figures and examples are used to properly explain the proposed TRIE parser.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Designing a Bangla Parser using TRIE Based on Deterministic Finite Automata\",\"authors\":\"K. Hasan, Md. Sakhawat Hossain, Md. Abdulla Al-Sun, Md. Mostafizur Rahman\",\"doi\":\"10.1109/ICCIT57492.2022.10056008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We describe a new method of parsing Bangla language based on Deterministic Finite Automata (DFA) and implement the parser using a TRIE data structure. Hence we call the parser as TRIE parser. TRIE parser successfully parses sentences faster than other important parsing schemes as it needs no formal rules, no parameters and no Context Free Grammars (CFG). the scheme stores the Bangla grammar symbols or Pasts Of Speech (POS) as a state of the DFA and process a sentence following the operations of a DFA. If the set of POS symbols reaches to final state, then parsing is successful otherwise unsuccessful. The parser uses the grammar rules in compressed form hence it becomes very less space consuming. Therefore, it can be implemented in light weight fashion in main memory. The TRIE parser is compared with two other parsers and it shows that the proposed TRIE parser outperforms others in terms of processing time with an increasing number of sentences in the input paragraph. Necessary figures and examples are used to properly explain the proposed TRIE parser.\",\"PeriodicalId\":255498,\"journal\":{\"name\":\"2022 25th International Conference on Computer and Information Technology (ICCIT)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 25th International Conference on Computer and Information Technology (ICCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIT57492.2022.10056008\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 25th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT57492.2022.10056008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Designing a Bangla Parser using TRIE Based on Deterministic Finite Automata
We describe a new method of parsing Bangla language based on Deterministic Finite Automata (DFA) and implement the parser using a TRIE data structure. Hence we call the parser as TRIE parser. TRIE parser successfully parses sentences faster than other important parsing schemes as it needs no formal rules, no parameters and no Context Free Grammars (CFG). the scheme stores the Bangla grammar symbols or Pasts Of Speech (POS) as a state of the DFA and process a sentence following the operations of a DFA. If the set of POS symbols reaches to final state, then parsing is successful otherwise unsuccessful. The parser uses the grammar rules in compressed form hence it becomes very less space consuming. Therefore, it can be implemented in light weight fashion in main memory. The TRIE parser is compared with two other parsers and it shows that the proposed TRIE parser outperforms others in terms of processing time with an increasing number of sentences in the input paragraph. Necessary figures and examples are used to properly explain the proposed TRIE parser.