{"title":"Implementation Approach of Indian Language Gujarati Grammar's Concept “sandhi” using the Concepts of Rule-based NLP","authors":"N. Patel, Dhiren R. Patel","doi":"10.1109/INDIACom51348.2021.00085","DOIUrl":null,"url":null,"abstract":"The term ‘language’ in NLP has to be understood as natural languages like Gujarati, Hindi, English etc., which we use in daily life to communicate. Most of the NLP research has been centered on English & other European Languages. NLP research concerning the Indian language like Gujarati is commenced in the last few years. The centre of attention of this paper is to demonstrate the road map of implementation of Gujarati grammar's concept “sandhi ”. In our words sandhi is a word segmentation process & it is present in most of the South Asian language, such as Devnagri, Sanskrit, Hindi, and Gujarati & even in Chinese & Thai languages.” Sandhi leads to phonetic transformation at word boundaries of a written chunk (small part), and the sounds at the end of word join together to form a single chunk of the character sequence.” Our main spotlight is on rule-based implementation of “sandhi”. Similar to every Indian scripting language Gujarati language (Grammar) also has its own specified rules of composition for combining the consonants, vowels and modifiers. We have identified certain rules by which we accomplish the practical implementation of “sandhi ”. There are many sandhi rules available, each denoting a unique combination of phonetic transformations, documented in the grammatical tradition of Gujarati. The Sandhi does not make any syntactic or semantic changes to the words implicated. Sandhi is an elective operation that depends only on the alertness of the writer.","PeriodicalId":415594,"journal":{"name":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIACom51348.2021.00085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The term ‘language’ in NLP has to be understood as natural languages like Gujarati, Hindi, English etc., which we use in daily life to communicate. Most of the NLP research has been centered on English & other European Languages. NLP research concerning the Indian language like Gujarati is commenced in the last few years. The centre of attention of this paper is to demonstrate the road map of implementation of Gujarati grammar's concept “sandhi ”. In our words sandhi is a word segmentation process & it is present in most of the South Asian language, such as Devnagri, Sanskrit, Hindi, and Gujarati & even in Chinese & Thai languages.” Sandhi leads to phonetic transformation at word boundaries of a written chunk (small part), and the sounds at the end of word join together to form a single chunk of the character sequence.” Our main spotlight is on rule-based implementation of “sandhi”. Similar to every Indian scripting language Gujarati language (Grammar) also has its own specified rules of composition for combining the consonants, vowels and modifiers. We have identified certain rules by which we accomplish the practical implementation of “sandhi ”. There are many sandhi rules available, each denoting a unique combination of phonetic transformations, documented in the grammatical tradition of Gujarati. The Sandhi does not make any syntactic or semantic changes to the words implicated. Sandhi is an elective operation that depends only on the alertness of the writer.