使用高级条件生成对抗网络生成带有服装的人体图像

News. Phi Delta Epsilon Pub Date : 2020-01-01 DOI:10.5220/0009832200300041

Sheela Raju Kurupathi, Pramod Murthy, D. Stricker

{"title":"使用高级条件生成对抗网络生成带有服装的人体图像","authors":"Sheela Raju Kurupathi, Pramod Murthy, D. Stricker","doi":"10.5220/0009832200300041","DOIUrl":null,"url":null,"abstract":"One of the main challenges of human-image generation is generating a person along with pose and clothing details. However, it is still a difficult task due to challenging backgrounds and appearance variance. Recently, various deep learning models like Stacked Hourglass networks, Variational Auto Encoders (VAE), and Generative Adversarial Networks (GANs) have been used to solve this problem. However, still, they do not generalize well to the real-world human-image generation task qualitatively. The main goal is to use the Spectral Normalization (SN) technique for training GAN to synthesize the human-image along with the perfect pose and appearance details of the person. In this paper, we have investigated how Conditional GANs, along with Spectral Normalization (SN), could synthesize the new image of the target person given the image of the person and the target (novel) pose desired. The model uses 2D keypoints to represent human poses. We also use adversarial hinge loss and present an ablation study. The proposed model variants have generated promising results on both the Market-1501 and DeepFashion Datasets. We supported our claims by benchmarking the proposed model with recent state-of-the-art models. Finally, we show how the Spectral Normalization (SN) technique influences the process of human-image synthesis.","PeriodicalId":88612,"journal":{"name":"News. Phi Delta Epsilon","volume":"40 1","pages":"30-41"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Generation of Human Images with Clothing using Advanced Conditional Generative Adversarial Networks\",\"authors\":\"Sheela Raju Kurupathi, Pramod Murthy, D. Stricker\",\"doi\":\"10.5220/0009832200300041\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the main challenges of human-image generation is generating a person along with pose and clothing details. However, it is still a difficult task due to challenging backgrounds and appearance variance. Recently, various deep learning models like Stacked Hourglass networks, Variational Auto Encoders (VAE), and Generative Adversarial Networks (GANs) have been used to solve this problem. However, still, they do not generalize well to the real-world human-image generation task qualitatively. The main goal is to use the Spectral Normalization (SN) technique for training GAN to synthesize the human-image along with the perfect pose and appearance details of the person. In this paper, we have investigated how Conditional GANs, along with Spectral Normalization (SN), could synthesize the new image of the target person given the image of the person and the target (novel) pose desired. The model uses 2D keypoints to represent human poses. We also use adversarial hinge loss and present an ablation study. The proposed model variants have generated promising results on both the Market-1501 and DeepFashion Datasets. We supported our claims by benchmarking the proposed model with recent state-of-the-art models. Finally, we show how the Spectral Normalization (SN) technique influences the process of human-image synthesis.\",\"PeriodicalId\":88612,\"journal\":{\"name\":\"News. Phi Delta Epsilon\",\"volume\":\"40 1\",\"pages\":\"30-41\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"News. Phi Delta Epsilon\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5220/0009832200300041\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"News. Phi Delta Epsilon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0009832200300041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

人类图像生成的主要挑战之一是生成一个人以及姿势和服装细节。然而，由于具有挑战性的背景和外观差异，这仍然是一项艰巨的任务。最近，各种深度学习模型，如堆叠沙漏网络，变分自动编码器(VAE)和生成对抗网络(gan)被用来解决这个问题。然而，它们仍然不能很好地定性地推广到现实世界的人类图像生成任务。主要目标是使用光谱归一化(SN)技术训练GAN来合成人类图像以及人的完美姿势和外观细节。在本文中，我们研究了条件gan以及谱归一化(SN)如何在给定人物图像和目标(新)姿势的情况下合成目标人物的新图像。该模型使用2D关键点来表示人体姿势。我们也使用了对抗性铰链损失，并提出了消融研究。所提出的模型变体在Market-1501和DeepFashion数据集上都产生了有希望的结果。我们通过用最新的最先进的模型对所提出的模型进行基准测试来支持我们的主张。最后，我们展示了光谱归一化(SN)技术对人体图像合成过程的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Generation of Human Images with Clothing using Advanced Conditional Generative Adversarial Networks

One of the main challenges of human-image generation is generating a person along with pose and clothing details. However, it is still a difficult task due to challenging backgrounds and appearance variance. Recently, various deep learning models like Stacked Hourglass networks, Variational Auto Encoders (VAE), and Generative Adversarial Networks (GANs) have been used to solve this problem. However, still, they do not generalize well to the real-world human-image generation task qualitatively. The main goal is to use the Spectral Normalization (SN) technique for training GAN to synthesize the human-image along with the perfect pose and appearance details of the person. In this paper, we have investigated how Conditional GANs, along with Spectral Normalization (SN), could synthesize the new image of the target person given the image of the person and the target (novel) pose desired. The model uses 2D keypoints to represent human poses. We also use adversarial hinge loss and present an ablation study. The proposed model variants have generated promising results on both the Market-1501 and DeepFashion Datasets. We supported our claims by benchmarking the proposed model with recent state-of-the-art models. Finally, we show how the Spectral Normalization (SN) technique influences the process of human-image synthesis.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

News. Phi Delta Epsilon

自引率

0.00%

发文量

期刊最新文献

GAN-Based LiDAR Intensity Simulation Improving Primate Sounds Classification using Binary Presorting for Deep Learning Towards exploring adversarial learning for anomaly detection in complex driving scenes A Study of Neural Collapse for Text Classification Using Artificial Intelligence to Reduce the Risk of Transfusion Hemolytic Reactions