Image Synthesis from Themes Captured in Poems using Latent Diffusion Models

Mayank Virmani, A. M. Michael, Manjiri Pathak, K. S. Pai, V. B. Prasad
{"title":"Image Synthesis from Themes Captured in Poems using Latent Diffusion Models","authors":"Mayank Virmani, A. M. Michael, Manjiri Pathak, K. S. Pai, V. B. Prasad","doi":"10.1109/ICAAIC56838.2023.10141274","DOIUrl":null,"url":null,"abstract":"Due to the presence of complex literary devices such as metaphors and imagery, poetry can be difficult to comprehend. This is especially true for literary works of classical poets like Kaalidasa that employ intricate, often conflicting themes which tend to be particularly tedious to interpret and decipher. The beauty of these works of art tends to get lost in translation. A visual representation in the form of images corresponding to the various themes in the poetry, greatly aids in providing a clearer understanding of the meaning and imagery described. The main aim here is to make classical poetry more accessible by generating detailed images that capture and depict the metaphors and themes used in various works of literature. The core task in this paper is to employ novel machine learning (NLP) techniques to detect and extract the central themes and keywords from the poems that encapsulate the essence of the literary work. This is done using transformer models fine-tuned specifically on a summarization dataset, that generate an abstractive summary of the segment of input text. Maintaining context while doing so is essential to the accuracy of the images being generated. Further, this summary is then provided as an input to a Latent Diffusion Model to generate detailed images corresponding to the poetry. The goal of the project is to make it easier to consume and enjoy classical works of literature by providing additional context and information in the form of images complementing the poetry.","PeriodicalId":267906,"journal":{"name":"2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAAIC56838.2023.10141274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Due to the presence of complex literary devices such as metaphors and imagery, poetry can be difficult to comprehend. This is especially true for literary works of classical poets like Kaalidasa that employ intricate, often conflicting themes which tend to be particularly tedious to interpret and decipher. The beauty of these works of art tends to get lost in translation. A visual representation in the form of images corresponding to the various themes in the poetry, greatly aids in providing a clearer understanding of the meaning and imagery described. The main aim here is to make classical poetry more accessible by generating detailed images that capture and depict the metaphors and themes used in various works of literature. The core task in this paper is to employ novel machine learning (NLP) techniques to detect and extract the central themes and keywords from the poems that encapsulate the essence of the literary work. This is done using transformer models fine-tuned specifically on a summarization dataset, that generate an abstractive summary of the segment of input text. Maintaining context while doing so is essential to the accuracy of the images being generated. Further, this summary is then provided as an input to a Latent Diffusion Model to generate detailed images corresponding to the poetry. The goal of the project is to make it easier to consume and enjoy classical works of literature by providing additional context and information in the form of images complementing the poetry.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用潜在扩散模型从诗歌中捕获的主题中合成图像
由于存在复杂的文学手段,如隐喻和意象,诗歌可能很难理解。对于像Kaalidasa这样的古典诗人的文学作品来说尤其如此,这些作品采用了复杂的,经常是相互矛盾的主题,这些主题往往特别乏味,难以解释和破译。这些艺术作品的美往往在翻译中消失了。与诗歌主题相对应的图像形式的视觉表现,极大地有助于提供对所描述的意义和意象的更清晰理解。这里的主要目的是通过生成捕捉和描绘各种文学作品中使用的隐喻和主题的详细图像,使古典诗歌更容易理解。本文的核心任务是采用新颖的机器学习(NLP)技术从诗歌中检测和提取中心主题和关键词,这些主题和关键词概括了文学作品的本质。这是使用专门针对摘要数据集进行微调的转换器模型来完成的,该模型生成输入文本片段的抽象摘要。在这样做的同时保持上下文对生成的图像的准确性至关重要。此外,该摘要随后作为潜在扩散模型的输入提供,以生成与诗歌对应的详细图像。该项目的目标是通过以图像的形式提供额外的背景和信息来补充诗歌,从而使古典文学作品更容易消费和享受。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Mosquitoes Classification using EfficientNetB4 Transfer Learning Model A Novel Framework in Scheduling Packets for Energy-Efficient Bandwidth Allocation in Wireless Networks Malware Classification using Malware Visualization and Deep Learning Automatic Vehicle Classification and Speed Tracking Predicting and Analyzing Air Quality Features Effectively using a Hybrid Machine Learning Model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1