A Framework for Exploring Computational Models of Novelty in Unstructured Text

M. Mohseni, M. Maher
{"title":"A Framework for Exploring Computational Models of Novelty in Unstructured Text","authors":"M. Mohseni, M. Maher","doi":"10.1145/3546157.3546164","DOIUrl":null,"url":null,"abstract":"Novelty modeling in unstructured text data is a research topic within the Natural Language Processing (NLP) Community. Effective novelty models can play a key role in providing relevant and interesting content to the users which is the central goal in many applications including education and recommender systems. This paper presents a framework for comparing different approaches and applications of computational models of novelty in unstructured text data. We focus on computational models that apply methods such as natural language processing and information theory. The framework provides an ontology for computational novelty with respect to the source of text data, methods for representing the data, and models for measuring novelty. We explore the value of the framework by applying it to research on computational novelty in news articles, research publications, books, and recipes. This framework is independent of the type of data in the items and can be used as a tool for researchers to study, compare, and extend existing computational novelty models and applications.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Information System and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3546157.3546164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Novelty modeling in unstructured text data is a research topic within the Natural Language Processing (NLP) Community. Effective novelty models can play a key role in providing relevant and interesting content to the users which is the central goal in many applications including education and recommender systems. This paper presents a framework for comparing different approaches and applications of computational models of novelty in unstructured text data. We focus on computational models that apply methods such as natural language processing and information theory. The framework provides an ontology for computational novelty with respect to the source of text data, methods for representing the data, and models for measuring novelty. We explore the value of the framework by applying it to research on computational novelty in news articles, research publications, books, and recipes. This framework is independent of the type of data in the items and can be used as a tool for researchers to study, compare, and extend existing computational novelty models and applications.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
探索非结构化文本新颖性计算模型的框架
非结构化文本数据的新颖性建模是自然语言处理(NLP)领域的一个研究课题。有效的新颖性模型可以在向用户提供相关和有趣的内容方面发挥关键作用,这是包括教育和推荐系统在内的许多应用程序的中心目标。本文提出了一个框架,用于比较非结构化文本数据中新颖性计算模型的不同方法和应用。我们专注于应用自然语言处理和信息理论等方法的计算模型。该框架提供了关于文本数据源的计算新颖性的本体、表示数据的方法和测量新颖性的模型。我们通过将其应用于新闻文章、研究出版物、书籍和食谱中计算新颖性的研究来探索该框架的价值。该框架独立于项目中的数据类型,可以作为研究人员研究、比较和扩展现有计算新颖性模型和应用程序的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Towards Simplifying and Formalizing UML Class Diagram Generalization/Specialization Relationship with Mathematical Set Theory A Nonsynaptic Memory Based Neural Network for Hand-Written Digit Classification Using an Explainable Feature Extraction Method Docker Container based Crowd Control Analysis Using Dask Hadoop Framework Engaging Undergraduate Students in an Introductory A.I. Course through a Knowledge-Based Chatbot Workshop Examining User Acceptance and Adoption of the Internet of Things in Indonesia
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1