{"title":"Aesthetic Image Synthesis Using Multiple-Aesthetic-Aware GAN","authors":"Yaya Setiyadi, J. Santoso, K. Surendro","doi":"10.1145/3587828.3587852","DOIUrl":null,"url":null,"abstract":"The use of synthesized images in fields that prioritize visual appearance, particularly the field of art, places a high value on aesthetics, with the system's output images required to adhere to several aesthetic rules. There are two approaches to aesthetic image synthesis with architecture based on GAN. The first strategy is to modify the loss function on the GAN so that, in addition to the loss from the GAN architecture, aesthetic loss and content/semantic loss are also calculated and the total loss is the sum of the three loss calculations. However, the outcomes still fall short of the expected natural image appearance. The second strategy involves modifying the GAN architecture by adding a new layer to the GAN generator and discriminator network, while the loss function calculation remains unchanged. The results of this second approach to image synthesis have not been optimized in terms of producing meaningful images across multiple semantic classes. This study proposes a method for increasing the aesthetic value of the synthesized image by modifying the two approaches and employing the multiple-aesthetic-aware GAN method. The proposed method takes conditional semantic information and conditional aesthetic information into account not only in the GAN architecture, but also in the loss function value calculation. The proposed method is the result of ongoing research and will be evaluated using the Inception Score (IS), the Frechet Inception Distance (FID), and an aesthetic value metric.","PeriodicalId":340917,"journal":{"name":"Proceedings of the 2023 12th International Conference on Software and Computer Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 12th International Conference on Software and Computer Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3587828.3587852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The use of synthesized images in fields that prioritize visual appearance, particularly the field of art, places a high value on aesthetics, with the system's output images required to adhere to several aesthetic rules. There are two approaches to aesthetic image synthesis with architecture based on GAN. The first strategy is to modify the loss function on the GAN so that, in addition to the loss from the GAN architecture, aesthetic loss and content/semantic loss are also calculated and the total loss is the sum of the three loss calculations. However, the outcomes still fall short of the expected natural image appearance. The second strategy involves modifying the GAN architecture by adding a new layer to the GAN generator and discriminator network, while the loss function calculation remains unchanged. The results of this second approach to image synthesis have not been optimized in terms of producing meaningful images across multiple semantic classes. This study proposes a method for increasing the aesthetic value of the synthesized image by modifying the two approaches and employing the multiple-aesthetic-aware GAN method. The proposed method takes conditional semantic information and conditional aesthetic information into account not only in the GAN architecture, but also in the loss function value calculation. The proposed method is the result of ongoing research and will be evaluated using the Inception Score (IS), the Frechet Inception Distance (FID), and an aesthetic value metric.