{"title":"基于 BERTopic 和 UGC 挖掘中国动画电影受众关注主题","authors":"Wei Ding, Hongjie Yuan, Sheng Zhang","doi":"10.62051/adargr12","DOIUrl":null,"url":null,"abstract":"[Objective] To explore how Chinese animated movies can effectively utilize hot topics to feed the box office and improve the competitiveness of the movies by mining hot topics of users' concern through User-Generated Content (UGC). [Methods] Based on the UGC text of Douban reviews of the Chinese animated movie \"30,000 Leagues in Chang'an\", we adopt the BERTopic algorithm to cluster topics based on the category-based TF-IDF (Word Frequency-Inverse Text Frequency)-weighted clustering, and introduce the semantic fine-tuning of the topics by the ChatGLM2-6B model to excavate the hot topics of users' attention. [Results] Users of the Chinese animated movie \"30,000 Leagues in Chang'an\" pay attention to five major topic clusters, including Chinese culture, word-of-mouth communication, poetic narrative, production technology, and the controversy of new historical facts. [Limitations] The country differences in the topic concerns of Chinese animated films were not analyzed in comparison with foreign UGC texts. [Conclusion] This study focuses on UGC text analysis to discover the social attention and evolution law of related topics. These findings provide some reference value for future Chinese animated movie production and production, box office prediction and marketing.","PeriodicalId":512428,"journal":{"name":"Transactions on Social Science, Education and Humanities Research","volume":"49 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mining Chinese Animation Movie Audience Concern Themes Based on BERTopic and UGC\",\"authors\":\"Wei Ding, Hongjie Yuan, Sheng Zhang\",\"doi\":\"10.62051/adargr12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"[Objective] To explore how Chinese animated movies can effectively utilize hot topics to feed the box office and improve the competitiveness of the movies by mining hot topics of users' concern through User-Generated Content (UGC). [Methods] Based on the UGC text of Douban reviews of the Chinese animated movie \\\"30,000 Leagues in Chang'an\\\", we adopt the BERTopic algorithm to cluster topics based on the category-based TF-IDF (Word Frequency-Inverse Text Frequency)-weighted clustering, and introduce the semantic fine-tuning of the topics by the ChatGLM2-6B model to excavate the hot topics of users' attention. [Results] Users of the Chinese animated movie \\\"30,000 Leagues in Chang'an\\\" pay attention to five major topic clusters, including Chinese culture, word-of-mouth communication, poetic narrative, production technology, and the controversy of new historical facts. [Limitations] The country differences in the topic concerns of Chinese animated films were not analyzed in comparison with foreign UGC texts. [Conclusion] This study focuses on UGC text analysis to discover the social attention and evolution law of related topics. These findings provide some reference value for future Chinese animated movie production and production, box office prediction and marketing.\",\"PeriodicalId\":512428,\"journal\":{\"name\":\"Transactions on Social Science, Education and Humanities Research\",\"volume\":\"49 6\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transactions on Social Science, Education and Humanities Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.62051/adargr12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions on Social Science, Education and Humanities Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.62051/adargr12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
[目的] 通过用户生成内容(UGC)挖掘用户关注的热点话题,探讨中国动画电影如何有效利用热点话题反哺票房,提高电影竞争力。[方法]基于中国动画电影《长安三万里》豆瓣评论的UGC文本,采用BERTopic算法,基于分类的TF-IDF(词频-反向文本频率)加权聚类对话题进行聚类,并引入ChatGLM2-6B模型对话题进行语义微调,挖掘用户关注的热点话题。[结果]中国动画电影《长安三万里》的用户关注五大话题集群,包括中国文化、口碑传播、诗意叙事、制作技术和新史实争议。[局限性]与国外 UGC 文本相比,本研究没有分析中国动画电影话题关注的国别差异。[结论]本研究主要通过对 UGC 文本的分析,发现相关话题的社会关注度和演变规律。这些发现为未来中国动画电影的制作与生产、票房预测与营销提供了一定的参考价值。
Mining Chinese Animation Movie Audience Concern Themes Based on BERTopic and UGC
[Objective] To explore how Chinese animated movies can effectively utilize hot topics to feed the box office and improve the competitiveness of the movies by mining hot topics of users' concern through User-Generated Content (UGC). [Methods] Based on the UGC text of Douban reviews of the Chinese animated movie "30,000 Leagues in Chang'an", we adopt the BERTopic algorithm to cluster topics based on the category-based TF-IDF (Word Frequency-Inverse Text Frequency)-weighted clustering, and introduce the semantic fine-tuning of the topics by the ChatGLM2-6B model to excavate the hot topics of users' attention. [Results] Users of the Chinese animated movie "30,000 Leagues in Chang'an" pay attention to five major topic clusters, including Chinese culture, word-of-mouth communication, poetic narrative, production technology, and the controversy of new historical facts. [Limitations] The country differences in the topic concerns of Chinese animated films were not analyzed in comparison with foreign UGC texts. [Conclusion] This study focuses on UGC text analysis to discover the social attention and evolution law of related topics. These findings provide some reference value for future Chinese animated movie production and production, box office prediction and marketing.