Natural Language Watermarking Based on Syntactic Displacement and Morphological Division

Mi-Young Kim, Osmar R Zaiane, R. Goebel
Published in: 2010 IEEE 34th Annual Computer Software and Applications Conference Workshops, 2010-07-19. DOI: 10.1109/COMPSACW.2010.37 (https://doi.org/10.1109/COMPSACW.2010.37)
Citations: 15

Abstract

This paper explores a method for Korean text watermarking based on morphemic and syntactic analysis. In this scheme, a predicate nominal is separated into its nominal and its predicate, and a syntactic adverbial is displaced. Korean, as an agglutinative language, provides a good basis for morpheme-based natural language watermarking because a word consists of several morphemes. A Korean word usually consists of a content morpheme and a function morpheme. A predicate nominal is an exception: it has two content morphemes (a nominal and a predicate) and one function morpheme, so it can be divided into a nominal and a predicate. We also perform syntax-based watermarking: we displace syntactic adverbials, exploiting the fact that most languages permit a syntactic adverbial to be displaced within its clause. Combining these morphemic and syntactic characteristics, we propose a method of language watermarking based on syntactic displacement and morphological division. To make the system more secure, we also include a sentence weight value and encode the weight value with a watermark bit. Our watermarking method does not change the meaning of most marked sentences, and it preserves their naturalness. The experimental results show that the rate of unnatural sentences in the marked text is reasonable and that the watermarking capacity is higher than that of previous systems. The coverage of marked sentences is also reasonable. The results further show that the marked text retains the same style and carries the same information, without semantic distortion.
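As a rough illustration of the displacement idea only, the sketch below hides one bit in the position of an adverbial within a clause. It is not the authors' algorithm: the abstract does not specify the encoding, so the state convention (clause-initial adverbial = 1, adverbial directly before the final token = 0), the HMAC-based `weight_bit`, the XOR combination, and all function names are assumptions made for this example. Morphological division is not covered, and the tokens are English glosses standing in for a parsed Korean clause.

```python
import hashlib
import hmac


def weight_bit(tokens, key):
    """Hypothetical keyed sentence weight: one bit from an HMAC of the tokens.

    The tokens are sorted first, so the weight does not depend on word order
    and is unchanged when the adverbial is displaced.
    """
    message = " ".join(sorted(tokens)).encode("utf-8")
    digest = hmac.new(key, message, hashlib.sha256).digest()
    return digest[0] & 1


def embed_bit(tokens, adverbial, bit, key):
    """Return a reordered copy of `tokens` that carries one watermark bit.

    Assumed convention: adverbial in clause-initial position = state 1;
    adverbial immediately before the final token (taken to be the predicate
    of a verb-final clause) = state 0. The state is bit XOR weight bit.
    """
    rest = [t for t in tokens if t != adverbial]
    if len(rest) < 2:
        # With fewer than two other tokens the two positions coincide.
        raise ValueError("clause too short to carry a bit")
    state = bit ^ weight_bit(tokens, key)
    if state == 1:
        return [adverbial] + rest
    return rest[:-1] + [adverbial] + rest[-1:]


def extract_bit(tokens, adverbial, key):
    """Recover the bit from the adverbial position and the keyed weight."""
    state = 1 if tokens[0] == adverbial else 0
    return state ^ weight_bit(tokens, key)


if __name__ == "__main__":
    clause = ["I", "yesterday", "school-to", "went"]  # placeholder glosses
    secret = b"demo-key"
    for b in (0, 1):
        marked = embed_bit(clause, "yesterday", b, secret)
        print(b, marked, extract_bit(marked, "yesterday", secret))
```

In this toy version, a reader without the key can see where the adverbial sits but cannot tell which bit that position encodes, which is one plausible reading of how a sentence weight could add security.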