Content development using N-gram model in custom writing style
J. Dhar, Vipul Gandhi. Journal: 下一代, vol. 219 1, pp. 271-275. Published 2016-10-01. DOI: 10.1109/INCITE.2016.7857630 (https://doi.org/10.1109/INCITE.2016.7857630)
Amateur writers often struggle, and make errors, when they write in a style different from their own. This can cost them readers' interest and sometimes leads to misinterpretation of what the author meant to convey. This work addresses the problem by ranking the candidate words that best fit the writing style the author wants to adopt. Our methodology lets authors choose among default, formal, and literary styles of writing. Authors can also draw words and stylistic traces from their own past writing by building a custom corpus of their write-ups. To achieve these objectives, we propose a spell checker and an N-gram based statistical model combined with a corpus-based technique. An N-gram model of order 4 with backoff smoothing gave the best results in our work. To demonstrate the effectiveness of the method, we tested it on real-time data, and the performance evaluation gave satisfactory results.
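To illustrate the kind of ranking the abstract describes, here is a minimal Python sketch of an order-4 N-gram model that scores candidate words with a backoff scheme. The abstract does not say which backoff variant was used, so this sketch assumes the simple "stupid backoff" scheme with a fixed penalty of 0.4. The function names (`train_ngrams`, `backoff_score`, `rank_candidates`) and the toy corpus are hypothetical and are not taken from the paper.

```python
from collections import Counter


def train_ngrams(tokens, max_order=4):
    """Count every n-gram of order 1..max_order in a list of tokens."""
    counts = Counter()
    for n in range(1, max_order + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def backoff_score(counts, context, word, total_tokens, alpha=0.4):
    """Score `word` after `context`, shortening the context when unseen.

    Assumption: "stupid backoff" with a fixed penalty `alpha` per
    backoff step. The result is a relative score, not a probability.
    """
    context = tuple(context)
    penalty = 1.0
    while context:
        ngram = context + (word,)
        if counts[ngram] > 0:
            return penalty * counts[ngram] / counts[context]
        context = context[1:]  # drop the oldest word and retry
        penalty *= alpha
    # No context left: fall back to the unigram relative frequency.
    return penalty * counts[(word,)] / total_tokens


def rank_candidates(counts, context, candidates, total_tokens, max_order=4):
    """Return (word, score) pairs sorted from best to worst fit."""
    keep = max_order - 1  # an order-4 model conditions on 3 words
    context = tuple(context)[-keep:] if keep > 0 else ()
    scored = [(w, backoff_score(counts, context, w, total_tokens))
              for w in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy stand-in for a style corpus (for example, a "formal" corpus).
    corpus = ("we would like to thank the committee . "
              "we would like to request a review . "
              "we would like to thank the author .").split()
    counts = train_ngrams(corpus, max_order=4)
    ranked = rank_candidates(counts, ["would", "like", "to"],
                             ["thank", "request", "meet"], len(corpus))
    for word, score in ranked:
        print(f"{word}: {score:.3f}")
```

Under these assumptions, choosing a writing style would amount to training `counts` on a different corpus (default, formal, literary, or the author's own write-ups) and ranking the same candidates against it.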