The Summarization of Creative Content

Olivier Toubia
{"title":"The Summarization of Creative Content","authors":"Olivier Toubia","doi":"10.2139/ssrn.3020131","DOIUrl":null,"url":null,"abstract":"We study and model the process by which humans summarize creative documents (e.g., from a movie script to a synopsis). We develop a customized topic model based on Poisson Factorization and inspired by the creativity literature, which links the text in a summary to the text in the original document. Traditional Poisson Factorization approximates documents as positive combinations of topics, i.e., as points in the cone defined by a set of topics (in the Euclidean space defined by the words in the vocabulary). The model proposed here captures not only this “inside the cone” portion of a document, but also the “outside the cone” portion that is not explained by a combination of common topics. The model captures how these two types of content are weighed in summaries as compared to full documents. In addition, it captures writing norms that influence the extent to which each topic appears in summaries compared to full documents. We apply this model to a dataset of marketing academic papers and their abstracts, and to a dataset of movie scripts and their synopses. We illustrate a practical application of our research by creating a public, online interactive tool meant to serve as a “sounding board” for users interested in writing summaries of creative documents.","PeriodicalId":250074,"journal":{"name":"IRPN: Anthropology & Cultural Studies Innovation (Topic)","volume":"315 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IRPN: Anthropology & Cultural Studies Innovation (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3020131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

We study and model the process by which humans summarize creative documents (e.g., from a movie script to a synopsis). We develop a customized topic model based on Poisson Factorization and inspired by the creativity literature, which links the text in a summary to the text in the original document. Traditional Poisson Factorization approximates documents as positive combinations of topics, i.e., as points in the cone defined by a set of topics (in the Euclidean space defined by the words in the vocabulary). The model proposed here captures not only this “inside the cone” portion of a document, but also the “outside the cone” portion that is not explained by a combination of common topics. The model captures how these two types of content are weighed in summaries as compared to full documents. In addition, it captures writing norms that influence the extent to which each topic appears in summaries compared to full documents. We apply this model to a dataset of marketing academic papers and their abstracts, and to a dataset of movie scripts and their synopses. We illustrate a practical application of our research by creating a public, online interactive tool meant to serve as a “sounding board” for users interested in writing summaries of creative documents.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
创意内容的总结
我们研究并模拟人类总结创造性文档(例如,从电影剧本到摘要)的过程。我们开发了一个基于泊松分解的定制主题模型,并受到创意文献的启发,该模型将摘要中的文本链接到原始文档中的文本。传统的泊松分解将文档近似为主题的正组合,即由一组主题定义的圆锥体中的点(在由词汇表中的单词定义的欧几里德空间中)。这里提出的模型不仅捕获了文档的“锥内”部分,还捕获了未由共同主题组合解释的“锥外”部分。该模型捕获了与完整文档相比,如何在摘要中权衡这两种类型的内容。此外,它还捕获了影响每个主题在摘要中出现的程度的写作规范,而不是完整的文档。我们将此模型应用于营销学术论文及其摘要的数据集,以及电影剧本及其大纲的数据集。我们通过创建一个公共的在线互动工具来说明我们的研究的实际应用,该工具旨在为有兴趣撰写创意文档摘要的用户提供“声音板”。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Creative Corporate Culture and Innovation Vanguards, Laggards, and Isolates: Outliers and Demographic Research The Summarization of Creative Content The Double Binds of Indigeneity and Indigenous Resistance The Challenges and Enhancing Opportunities of Global Project Management: Evidence from Chinese and Dutch Cross-Cultural Project Management
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1