LAMA-WAVELET: IMAGE IMPAINTING WITH HIGH QUALITY OF FINE DETAILS AND OBJECT EDGES

D. O. Kolodochka, M. V. Polyakova
{"title":"LAMA-WAVELET: IMAGE IMPAINTING WITH HIGH QUALITY OF FINE DETAILS AND OBJECT EDGES","authors":"D. O. Kolodochka, M. V. Polyakova","doi":"10.15588/1607-3274-2024-1-19","DOIUrl":null,"url":null,"abstract":"Context. The problem of the image impainting in computer graphic and computer vision systems is considered. The subject of the research is deep learning convolutional neural networks for image inpainting. \nObjective. The objective of the research is to improve the image inpainting performance in computer vision and computer graphics systems by applying wavelet transform in the LaMa-Fourier network architecture. \nMethod. The basic LaMa-Fourier network decomposes the image into global and local texture. Then it is proposed to improve the network block, processing the global context of the image, namely, the spectral transform block. To improve the block of spectral transform, instead of Fourier Unit Structure the Simple Wavelet Convolution Block elaborated by the authors is used. In this block, 3D wavelet transform of the image on two levels was initially performed using the Daubechies wavelet db4. The obtained coefficients of 3D wavelet transform are splitted so that each subband represents a separate feature of the image. Convolutional layer, batch normalization and ReLU activation function are sequentially applied to the results of splitting of coefficients on each level of wavelet transform. The obtained subbands of wavelet coefficients are concatenated and the inverse wavelet transform is applied to them, the result of which is the output of the block. Note that the wavelet coefficients at different levels were processed separately. This reduces the computational complexity of calculating the network outputs while preserving the influence of the context of each level on image inpainting. The obtained neural network is named LaMa-Wavelet. The FID, PSNR, SSIM indexes and visual analysis were used to estimate the quality of images inpainted with LaMa-Wavelet network. \nResults. The proposed LaMa-Wavelet network has been implemented in software and researched for solving the problem of image inpainting. The PSNR of images inpainted using the LaMa-Wavelet exceeds the results obtained using the LaMa-Fourier network for narrow and medium masks in average by 4.5%, for large masks in average by 6%. The LaMa-Wavelet applying can enhance SSIM by 2–4% depending on a mask size. But it takes 3 times longer to inpaint one image with LaMa-Wavelet than with LaMa-Fourier network. Analysis of specific images demonstrates that both networks show similar results of inpainting of a homogeneous background. On complex backgrounds with repeating elements the LaMa-Wavelet is often more effective in restoring textures. \nConclusions. The obtained LaMa-Wavelet network allows to improve the image inpainting with large masks due to applying wavelet transform in the LaMa network architecture. Namely, the quality of reconstruction of image edges and fine details is increased.","PeriodicalId":518330,"journal":{"name":"Radio Electronics, Computer Science, Control","volume":"96 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radio Electronics, Computer Science, Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15588/1607-3274-2024-1-19","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Context. The problem of the image impainting in computer graphic and computer vision systems is considered. The subject of the research is deep learning convolutional neural networks for image inpainting. Objective. The objective of the research is to improve the image inpainting performance in computer vision and computer graphics systems by applying wavelet transform in the LaMa-Fourier network architecture. Method. The basic LaMa-Fourier network decomposes the image into global and local texture. Then it is proposed to improve the network block, processing the global context of the image, namely, the spectral transform block. To improve the block of spectral transform, instead of Fourier Unit Structure the Simple Wavelet Convolution Block elaborated by the authors is used. In this block, 3D wavelet transform of the image on two levels was initially performed using the Daubechies wavelet db4. The obtained coefficients of 3D wavelet transform are splitted so that each subband represents a separate feature of the image. Convolutional layer, batch normalization and ReLU activation function are sequentially applied to the results of splitting of coefficients on each level of wavelet transform. The obtained subbands of wavelet coefficients are concatenated and the inverse wavelet transform is applied to them, the result of which is the output of the block. Note that the wavelet coefficients at different levels were processed separately. This reduces the computational complexity of calculating the network outputs while preserving the influence of the context of each level on image inpainting. The obtained neural network is named LaMa-Wavelet. The FID, PSNR, SSIM indexes and visual analysis were used to estimate the quality of images inpainted with LaMa-Wavelet network. Results. The proposed LaMa-Wavelet network has been implemented in software and researched for solving the problem of image inpainting. The PSNR of images inpainted using the LaMa-Wavelet exceeds the results obtained using the LaMa-Fourier network for narrow and medium masks in average by 4.5%, for large masks in average by 6%. The LaMa-Wavelet applying can enhance SSIM by 2–4% depending on a mask size. But it takes 3 times longer to inpaint one image with LaMa-Wavelet than with LaMa-Fourier network. Analysis of specific images demonstrates that both networks show similar results of inpainting of a homogeneous background. On complex backgrounds with repeating elements the LaMa-Wavelet is often more effective in restoring textures. Conclusions. The obtained LaMa-Wavelet network allows to improve the image inpainting with large masks due to applying wavelet transform in the LaMa network architecture. Namely, the quality of reconstruction of image edges and fine details is increased.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
拉玛-小波:高质量的图像细节和物体边缘着色
背景。本研究考虑了计算机图形和计算机视觉系统中的图像绘制问题。研究主题是用于图像着色的深度学习卷积神经网络。目标。研究目的是通过在 LaMa-Fourier 网络架构中应用小波变换,提高计算机视觉和计算机图形系统中的图像绘制性能。方法。基本的 LaMa-Fourier 网络将图像分解为全局和局部纹理。然后建议改进网络块,处理图像的全局背景,即频谱变换块。为了改进频谱变换模块,我们使用了作者精心设计的简单小波卷积模块来代替傅里叶单元结构。在这个区块中,最初使用 Daubechies 小波 db4 对图像进行两级三维小波变换。对三维小波变换得到的系数进行分割,使每个子带代表图像的一个单独特征。卷积层、批量归一化和 ReLU 激活函数依次应用于小波变换各层次的系数分割结果。将获得的小波系数子带合并,并对其进行小波逆变换,其结果即为该块的输出。请注意,不同级别的小波系数是分开处理的。这样既降低了计算网络输出的计算复杂度,又保留了各层次背景对图像绘制的影响。得到的神经网络被命名为 LaMa-Wavelet。使用 FID、PSNR、SSIM 指数和视觉分析来评估使用 LaMa-Wavelet 网络绘制的图像的质量。结果所提出的 LaMa-Wavelet 网络已在软件中实现,并用于解决图像涂色问题的研究。使用 LaMa-Wavelet网络绘制的图像的PSNR比使用 LaMa-Fourier网络绘制的图像的PSNR平均高出4.5%(窄掩膜和中掩膜),比使用 LaMa-Fourier网络绘制的图像的PSNR平均高出6%。LaMa 小波应用可将 SSIM 提高 2-4%,具体取决于掩膜的大小。但是,使用 LaMa-Wavelet(小波)网络绘制一幅图像所需的时间是 LaMa-Fourier 网络的 3 倍。对具体图像的分析表明,这两种网络对均匀背景的内绘效果相似。在具有重复元素的复杂背景中,LaMa-Wavelet 通常能更有效地还原纹理。结论由于在 LaMa 网络架构中应用了小波变换,因此所获得的 LaMa-小波网络可以改善大掩膜图像的内绘效果。也就是说,图像边缘和细节的重建质量得到了提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
REALIZATION OF THE DECISION-MAKING SUPPORT SYSTEM FOR TWITTER USERS’ PUBLICATIONS ANALYSIS IN-MEMORY INTELLIGENT COMPUTING METHOD FOR DETERMINING THE BIT GRID OVERFLOW OF A COMPUTER SYSTEM OPERATING IN THE SYSTEM OF RESIDUAL CLASSES THE DESIGN OF THE PIPELINED RISC-V PROCESSOR WITH THE HARDWARE COPROCESSOR OF DIGITAL SIGNAL PROCESSING UA-LLM: ADVANCING CONTEXT-BASED QUESTION ANSWERING IN UKRAINIAN THROUGH LARGE LANGUAGE MODELS
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1