Overview of human-facial-related age synthesis based generative adversarial network methods
Wang Yibo, Zhang Ke, Kong Yinghui, Yu Tingting, Zhao Shiwei
中国图象图形学报 (Journal of Image and Graphics), published 2023-01-01
DOI: 10.11834/jig.220842 (https://doi.org/10.11834/jig.220842)
Citations: 0
Abstract
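As an important component of human biometric recognition, age information has broad application prospects in areas such as social security and digital entertainment. Because of its wide practical value, face age synthesis has attracted growing attention from researchers and has become one of the important research directions in computer vision. With the rapid development of deep learning, face age synthesis based on generative adversarial networks (GANs) has become a research hotspot. Although GAN-based face age synthesis methods have achieved good results, the generated face age images still suffer from poor image quality, low realism, and insufficient age transformation effect and diversity. This is mainly because current research on face age synthesis still faces the following difficulties: 1) the limitations of existing face age synthesis datasets; 2) insufficient prior knowledge introduced into face age synthesis; 3) neglect of the fine-grained nature of face age images; 4) face age synthesis at high resolution; and 5) the lack of standardized evaluation criteria for current face age synthesis methods. This paper presents a comprehensive survey of current face age synthesis technology, taking face age synthesis methods as the object of study and describing the state of research. Based on a literature survey, face age synthesis methods are classified, with an emphasis on GAN-based methods. In addition, the paper discusses commonly used face age synthesis datasets and evaluation metrics, analyzes the basic ideas, characteristics, and limitations of the various methods, compares the performance of some representative methods, identifies the current challenges in the field, and suggests some promising research directions to help researchers address the open problems.
Human biometric age information is widely used in domains such as public security and digital entertainment. Face age synthesis methods are mainly divided into traditional image processing methods and machine learning-based methods. Traditional image processing methods are divided into physics-based methods and prototype-based methods. Machine learning-based methods center on model-based methods, which can be divided into parametric linear model methods, deep generative model methods based on time frames, and generative adversarial network (GAN)-based methods. Physics-based methods focus only on intuitive facial features, so some subtle changes are inevitably ignored, which makes the synthetic images implausible. In addition, they require a large number of facial samples of the same person at several ages, which are costly and labor-intensive to collect. The aging patterns generated by prototype-based methods are obtained by averaging faces, so some important personalized features may be averaged out, resulting in the loss of personal identity. Although some dictionary learning-based methods preserve personalized features to some extent, severe ghosting artifacts appear in their synthetic images.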
Parametric linear model methods and time-frame-based deep generative model methods still struggle to find a general model suitable for a specific age group, and the resulting models are still linear, so the quality of their synthetic images is also deficient. The emerging GAN-based methods train models using deep convolutional networks. Aging patterns across age groups are learned through the generative adversarial learning mechanism, different types of loss functions are introduced to address various problems appearing in the images, and the perceptual loss with respect to the original image is minimized. In this way, the aging pattern can be applied to the input face image while identity information is preserved. The GAN framework has since given rise to a series of variant models and continues to be optimized. GAN-based age synthesis methods can be divided into four categories: classical GAN, sequential GAN, translational GAN, and conditional GAN. Classical GAN methods can simulate face aging. However, the input information is not fully considered, which affects identity retention; all age mappings and networks are constrained by the age condition; and the age accuracy of the generated images needs further improvement. Sequential GAN methods focus on the sequential relationship within datasets, which creates a strong dependency: if the output of one model goes wrong, the performance of the whole model is affected. Additionally, they require consistent and complete images for each age group. The advantage of translational GAN is that it does not require a large number of photos of the same person at different ages; it only needs sufficient images for each age group in the dataset. Conditional GAN requires clear and correct labels for datasets.
Compared with methods based on translational GAN and sequential GAN, conditional GAN is tightly bound to the limited labels given in the datasets, which makes finer-grained control difficult. GAN-based methods can improve the quality of generated images, but some challenging problems remain. Although various GAN-based face age synthesis methods have achieved considerable progress, the generated face age images still suffer from poor image quality, low realism, and insufficient age transition effect and diversity. At present, research on face age synthesis still faces the following problems and challenges: 1) the limitations of existing face age synthesis datasets; 2) the lack of prior knowledge of face age synthesis; 3) the neglected fine granularity of face age images; 4) face age synthesis at high resolution; and 5) the non-standardized evaluation of face age synthesis methods. This paper reviews current face age synthesis technology and the state of research on face age synthesis methods. The methods are classified, with a focus on GAN-based methods. The commonly used face age synthesis datasets and evaluation metrics are discussed, and the basic ideas, characteristics, and limitations of the various methods are analyzed. We also compare the performance of several representative methods on popular age synthesis datasets. Finally, we point out some potential research directions and the further development of related technologies.
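The abstract's remarks on conditional GAN labels and on combining several loss functions can be illustrated with a small sketch. It is not taken from the surveyed paper. The age-group boundaries, the function names, and the loss weights below are all assumptions chosen for illustration, and the loss values are placeholder numbers, not outputs of a trained network.

```python
from typing import Dict, List

# Hypothetical age-group upper bounds (inclusive); real datasets and papers
# choose their own grouping. Groups here: <=30, 31-40, 41-50, 51+.
AGE_GROUP_UPPER_BOUNDS: List[int] = [30, 40, 50]
NUM_GROUPS: int = len(AGE_GROUP_UPPER_BOUNDS) + 1


def age_to_group(age: int) -> int:
    """Map an age in years to the index of its (assumed) age group."""
    for index, upper in enumerate(AGE_GROUP_UPPER_BOUNDS):
        if age <= upper:
            return index
    return NUM_GROUPS - 1  # older than every bound: last group


def one_hot_condition(group: int) -> List[float]:
    """Encode an age-group index as a one-hot condition vector."""
    if not 0 <= group < NUM_GROUPS:
        raise ValueError(f"group must be in [0, {NUM_GROUPS}), got {group}")
    return [1.0 if i == group else 0.0 for i in range(NUM_GROUPS)]


def total_generator_loss(terms: Dict[str, float],
                         weights: Dict[str, float]) -> float:
    """Weighted sum of already-computed loss terms (illustrative only)."""
    return sum(weights[name] * value for name, value in terms.items())


if __name__ == "__main__":
    # Ages 31 and 39 fall into the same group, so they get the same
    # condition vector: the label granularity caps how finely age
    # can be controlled.
    print(one_hot_condition(age_to_group(33)))   # [0.0, 1.0, 0.0, 0.0]
    print(age_to_group(31) == age_to_group(39))  # True

    # Placeholder loss values and weights, assumed for illustration.
    terms = {"adversarial": 0.7, "identity": 0.2, "age": 0.5}
    weights = {"adversarial": 1.0, "identity": 10.0, "age": 2.0}
    print(total_generator_loss(terms, weights))
```

Under these assumptions, any two ages inside one group are indistinguishable to the generator, which is one way to read the point that conditional GANs are bound to the labels the dataset provides. The weighted sum shows only how separate objectives (realism, identity preservation, age accuracy) might be traded off against each other.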
中国图象图形学报 (Journal of Image and Graphics), Computer Science: Computer Graphics and Computer-Aided Design
CiteScore: 1.20
Self-citation rate: 0.00%
Articles published: 6776
About the journal:
Journal of Image and Graphics (ISSN 1006-8961, CN 11-3758/TB, CODEN ZTTXFZ) is an authoritative academic journal supervised by the Chinese Academy of Sciences and co-sponsored by the Institute of Space and Astronautical Information Innovation of the Chinese Academy of Sciences (ISIAS), the Chinese Society of Image and Graphics (CSIG), and the Beijing Institute of Applied Physics and Computational Mathematics (BIAPM). The journal integrates high-tech theories, technical methods and industrialisation of applied research results in computer image graphics, and mainly publishes innovative and high-level scientific research papers on basic and applied research in image graphics science and its closely related fields. The form of papers includes reviews, technical reports, project progress, academic news, new technology reviews, new product introduction and industrialisation research. The content covers a wide range of fields such as image analysis and recognition, image understanding and computer vision, computer graphics, virtual reality and augmented reality, system simulation, animation, etc., and theme columns are opened according to the research hotspots and cutting-edge topics.
Journal of Image and Graphics reaches a wide range of readers, including scientific and technical personnel, enterprise supervisors, and university postgraduates and undergraduates engaged in the fields of national defence, military, aviation, aerospace, communications, electronics, automotive, agriculture, meteorology, environmental protection, remote sensing, mapping, oil field, construction, transportation, finance, telecommunications, education, medical care, film and television, and art.
Journal of Image and Graphics is included in many important domestic and international scientific literature database systems, including the EBSCO database in the United States, the JST database in Japan, the Scopus database in the Netherlands, China Science and Technology Thesis Statistics and Analysis (Annual Research Report), China Science Citation Database (CSCD), China Academic Journal Network Publishing Database (CAJD), China Academic Journal Abstracts, Chinese Science Abstracts (Series A), China Electronic Science Abstracts, Chinese Core Journals Abstracts, Chinese Academic Journals on CD-ROM, and China Academic Journals Comprehensive Evaluation Database.