使用自定义写作风格的N-gram模型进行内容开发

J. Dhar, Vipul Gandhi
{"title":"使用自定义写作风格的N-gram模型进行内容开发","authors":"J. Dhar, Vipul Gandhi","doi":"10.1109/INCITE.2016.7857630","DOIUrl":null,"url":null,"abstract":"Amateur writers usually find it difficult and often make errors while building up content when they are doing so in a style different from their own writing style. This causes loss of interest by the readers and sometimes even misinterpretations of actual thoughts desired to be conveyed by author. This work attempts to embark upon this problem statement by ranking the best available choices of words fitting the style of writing that the author would like to adopt. Our methodology allows authors to choose from amongst default, formal and literature style of writing. Also, authors can infer words and traces from his own past writings by developing a custom corpus of his own write-ups. A spell checker and N-gram based statistical model along with a corpus based technique is proposed to achieve above objectives. Rank 4 N-gram along with backoff smoothing provided optimum results for our work. To showcase the effectiveness of this method, we have tested it on real time data and performance evaluation fetched satisfactory results.","PeriodicalId":59618,"journal":{"name":"下一代","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Content development using N-gram model in custom writing style\",\"authors\":\"J. Dhar, Vipul Gandhi\",\"doi\":\"10.1109/INCITE.2016.7857630\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Amateur writers usually find it difficult and often make errors while building up content when they are doing so in a style different from their own writing style. This causes loss of interest by the readers and sometimes even misinterpretations of actual thoughts desired to be conveyed by author. This work attempts to embark upon this problem statement by ranking the best available choices of words fitting the style of writing that the author would like to adopt. Our methodology allows authors to choose from amongst default, formal and literature style of writing. Also, authors can infer words and traces from his own past writings by developing a custom corpus of his own write-ups. A spell checker and N-gram based statistical model along with a corpus based technique is proposed to achieve above objectives. Rank 4 N-gram along with backoff smoothing provided optimum results for our work. To showcase the effectiveness of this method, we have tested it on real time data and performance evaluation fetched satisfactory results.\",\"PeriodicalId\":59618,\"journal\":{\"name\":\"下一代\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"下一代\",\"FirstCategoryId\":\"1092\",\"ListUrlMain\":\"https://doi.org/10.1109/INCITE.2016.7857630\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"下一代","FirstCategoryId":"1092","ListUrlMain":"https://doi.org/10.1109/INCITE.2016.7857630","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

业余作家通常会发现,当他们用一种与自己写作风格不同的风格来构建内容时,这很难,而且经常会犯错误。这会导致读者失去兴趣,有时甚至会误解作者想要传达的实际思想。这项工作试图通过对适合作者想要采用的写作风格的最佳可用选择的单词进行排名来开始这个问题陈述。我们的方法允许作者从默认,正式和文学写作风格中进行选择。此外,作者可以通过开发自己的文章的自定义语料库,从自己过去的作品中推断出单词和痕迹。为了实现上述目标,本文提出了一个基于拼写检查器和n图的统计模型以及基于语料库的技术。Rank 4 N-gram和backoff平滑为我们的工作提供了最佳结果。为了证明该方法的有效性,我们在实时数据上进行了测试,性能评估取得了满意的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Content development using N-gram model in custom writing style
Amateur writers usually find it difficult and often make errors while building up content when they are doing so in a style different from their own writing style. This causes loss of interest by the readers and sometimes even misinterpretations of actual thoughts desired to be conveyed by author. This work attempts to embark upon this problem statement by ranking the best available choices of words fitting the style of writing that the author would like to adopt. Our methodology allows authors to choose from amongst default, formal and literature style of writing. Also, authors can infer words and traces from his own past writings by developing a custom corpus of his own write-ups. A spell checker and N-gram based statistical model along with a corpus based technique is proposed to achieve above objectives. Rank 4 N-gram along with backoff smoothing provided optimum results for our work. To showcase the effectiveness of this method, we have tested it on real time data and performance evaluation fetched satisfactory results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
6212
期刊最新文献
Proceedings of the 4th International Workshop on Software Engineering Education for the Next Generation Parental Divorce and Children's Interpersonal Relationships: A Meta-Analysis How Young Adults Perceive Parental Divorce: The Role of Their Relationships with Their Fathers and Mothers Relationships Between Parents' Marital Status and University Students' Mental Health, Views of Mothers and Views of Fathers: A Study in Bulgaria Gender Schematization in Adolescents: Differences Based on Rearing in Single-Parent and Intact Families
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1