Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1241
A.I. Novikov, A.V. Pronkin
The paper shows that it is expedient to use vector masks for solving some types of digital image processing problems. The main advantage of vector masks compared to matrix masks is that they reduce the computational complexity of algorithms while maintaining, and in some problems even improving, quality indicators. The article demonstrates examples of the use of vector masks in the problem of estimating the level of discrete white noise in an image, forming a basis for constructing a correctly working sigma filter, which are used for obtaining smoothed partial derivative estimates in the problem of edge detection and detecting straight lines in a contour image. The work uses results obtained by the authors in their earlier publications.
{"title":"Linear operators with vector masks in digital image processing problems","authors":"A.I. Novikov, A.V. Pronkin","doi":"10.18287/2412-6179-co-1241","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1241","url":null,"abstract":"The paper shows that it is expedient to use vector masks for solving some types of digital image processing problems. The main advantage of vector masks compared to matrix masks is that they reduce the computational complexity of algorithms while maintaining, and in some problems even improving, quality indicators. The article demonstrates examples of the use of vector masks in the problem of estimating the level of discrete white noise in an image, forming a basis for constructing a correctly working sigma filter, which are used for obtaining smoothed partial derivative estimates in the problem of edge detection and detecting straight lines in a contour image. The work uses results obtained by the authors in their earlier publications.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In the process of development of computer technologies, the number of areas of their application naturally grows and, along with it, the complexity of the tasks to be solved, which entails the need for new research. Similar tasks include digital filtering of images in the field of medical technologies and active-pulse television measuring systems. There are many methods and algorithms of digital filtering designed to solve the problem of improving the quality; algorithms that can improve the quality of images while reducing computational costs are widely used. High demands, which are made due to the constant growth in the size of the generated images, as well as the requirement for modern television systems, is real-time operation. When solving practical problems, it is required to use different filter aperture sizes, which provide an increase in quality and preservation of image details. The solution of these problems was the reason for the emergence of adaptive filters that are able to change the parameters in the process of processing the received data, while not spending additional time on processing with an increase in the size of the aperture. The paper presents the principles of constructing adaptive image processing filters, which, by obtaining an input parameter indicating the required dimension of a multi-element aperture, are able to implement the construction of the required aperture. The Laplacian “Truncated Pyramid” filter and the “double pyramid” Laplacian were modified. A feature of these filters is the oddness of the multi-element aperture, so the coefficient used to build the mask is always set to odd. When using these filters, it is possible to use two coefficients that are responsible for increasing the filtration efficiency, since, in their original form, the Laplacian filters have a sum of coefficients equal to zero. The experiment shows a comparison with high-dimensional filters that work when using classical two-dimensional convolution. The next stage of the presented research will be the application of parallel computing techniques, which will increase the speed of the developed filters.
{"title":"High-speed recursive-separable image processing filters with variable scanning aperture sizes","authors":"A.V. Kamenskiy, M.I. Kuryachiy, A.S. Krasnoperova, Yu.V. Ilyin, T.M. Akaeva, S.E. Boyarkin","doi":"10.18287/2412-6179-co-1240","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1240","url":null,"abstract":"In the process of development of computer technologies, the number of areas of their application naturally grows and, along with it, the complexity of the tasks to be solved, which entails the need for new research. Similar tasks include digital filtering of images in the field of medical technologies and active-pulse television measuring systems. There are many methods and algorithms of digital filtering designed to solve the problem of improving the quality; algorithms that can improve the quality of images while reducing computational costs are widely used. High demands, which are made due to the constant growth in the size of the generated images, as well as the requirement for modern television systems, is real-time operation. When solving practical problems, it is required to use different filter aperture sizes, which provide an increase in quality and preservation of image details. The solution of these problems was the reason for the emergence of adaptive filters that are able to change the parameters in the process of processing the received data, while not spending additional time on processing with an increase in the size of the aperture. The paper presents the principles of constructing adaptive image processing filters, which, by obtaining an input parameter indicating the required dimension of a multi-element aperture, are able to implement the construction of the required aperture. The Laplacian “Truncated Pyramid” filter and the “double pyramid” Laplacian were modified. A feature of these filters is the oddness of the multi-element aperture, so the coefficient used to build the mask is always set to odd. When using these filters, it is possible to use two coefficients that are responsible for increasing the filtration efficiency, since, in their original form, the Laplacian filters have a sum of coefficients equal to zero. The experiment shows a comparison with high-dimensional filters that work when using classical two-dimensional convolution. The next stage of the presented research will be the application of parallel computing techniques, which will increase the speed of the developed filters.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1211
T.T. Bui, D.T. Le, T.H.L. Nguyen, T.T. Le
We propose a new on-chip optical neural network (OONN) based on multimode interference-microring resonators (MMI-RRs). The suggested structure eliminates the need for wavelength division multiplexers (WDM) to create an optical neuron on a single chip. New microring resonator structure based on 4×4 MMI coupler with a size of 24µm × 2900 µm is used for the basic elements of the computation matrix, as a result a higher bandwidth and free spectral range (FSR) can be achieved. The Si3N4 platform along with the graphene sheet is designed to modulate the signals and weights of the neural networks at a very high speed. The Si3N4 can provide wide range of operating wavelengths and can work directly with the wavelengths of color images. The structure's benefits include rapid computing speed, little loss, and the ability to handle both positive and negative values. The OONN has been applied to the MNIST dataset with a speed faster than 2.8 to 14x times compared with the conventional GPU methods.
{"title":"On chip optical neural networks based on MMI microring resonators for image classification","authors":"T.T. Bui, D.T. Le, T.H.L. Nguyen, T.T. Le","doi":"10.18287/2412-6179-co-1211","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1211","url":null,"abstract":"We propose a new on-chip optical neural network (OONN) based on multimode interference-microring resonators (MMI-RRs). The suggested structure eliminates the need for wavelength division multiplexers (WDM) to create an optical neuron on a single chip. New microring resonator structure based on 4×4 MMI coupler with a size of 24µm × 2900 µm is used for the basic elements of the computation matrix, as a result a higher bandwidth and free spectral range (FSR) can be achieved. The Si3N4 platform along with the graphene sheet is designed to modulate the signals and weights of the neural networks at a very high speed. The Si3N4 can provide wide range of operating wavelengths and can work directly with the wavelengths of color images. The structure's benefits include rapid computing speed, little loss, and the ability to handle both positive and negative values. The OONN has been applied to the MNIST dataset with a speed faster than 2.8 to 14x times compared with the conventional GPU methods.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1289
V.V. Kotlyar, A.A. Kovalev, A.M. Telegin
Based on the Richards-Wolf formalism, we obtain two different exact expressions for the angular momentum (AM) density in the focus of a vortex beam with the topological charge n and with right circular polarization. One expression for the AM density is derived as the cross product of the position vector and the Poynting vector and has a nonzero value at the focus for an arbitrary integer number n. The other expression for the AM density is deduced as a sum of the orbital angular momentum (OAM) and the spin angular momentum (SAM). We reveal that at the focus of the light field under analysis, the latter turns zero at n = –1. While both these expressions are not equal to each other at each point of space, 3D integrals thereof are equal. Thus, exact expressions are obtained for densities of AM, SAM and OAM at the focus of a vortex beam with right-hand circular polarization and the identity for the densities AM = SAM + OAM is shown to be violated. Besides, it is shown that the expressions for the strength vectors of the electric and magnetic fields near the sharp focus, obtained by adopting the Richards-Wolf formalism, are exact solutions of the Maxwell's equations. Thus, Richards–Wolf theory exactly describes the behavior of light near the sharp focus in free space.
基于Richards-Wolf形式,我们得到了拓扑电荷为n且具有右圆极化的涡旋光束聚焦处角动量密度的两种精确表达式。其中,调幅密度的表达式为位置矢量与坡印廷矢量的叉积,且对于任意整数n,该表达式在焦点处具有非零值。另一种表达式为轨道角动量(OAM)与自旋角动量(SAM)之和。我们发现,在分析光场的焦点处,后者在n = -1时变为零。虽然这两个表达式在空间的每个点上并不相等,但它们的三维积分是相等的。由此,得到了右圆偏振涡旋光束焦点处AM、SAM和OAM密度的精确表达式,证明了AM = SAM + OAM密度的恒等式不成立。此外,还证明了采用Richards-Wolf形式得到的尖锐焦点附近电场和磁场的强度矢量表达式是麦克斯韦方程组的精确解。因此,理查兹-沃尔夫理论准确地描述了光在自由空间中靠近尖锐焦点的行为。
{"title":"Focusing a vortex beam with circular polarization: angular momentum","authors":"V.V. Kotlyar, A.A. Kovalev, A.M. Telegin","doi":"10.18287/2412-6179-co-1289","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1289","url":null,"abstract":"Based on the Richards-Wolf formalism, we obtain two different exact expressions for the angular momentum (AM) density in the focus of a vortex beam with the topological charge n and with right circular polarization. One expression for the AM density is derived as the cross product of the position vector and the Poynting vector and has a nonzero value at the focus for an arbitrary integer number n. The other expression for the AM density is deduced as a sum of the orbital angular momentum (OAM) and the spin angular momentum (SAM). We reveal that at the focus of the light field under analysis, the latter turns zero at n = –1. While both these expressions are not equal to each other at each point of space, 3D integrals thereof are equal. Thus, exact expressions are obtained for densities of AM, SAM and OAM at the focus of a vortex beam with right-hand circular polarization and the identity for the densities AM = SAM + OAM is shown to be violated. Besides, it is shown that the expressions for the strength vectors of the electric and magnetic fields near the sharp focus, obtained by adopting the Richards-Wolf formalism, are exact solutions of the Maxwell's equations. Thus, Richards–Wolf theory exactly describes the behavior of light near the sharp focus in free space.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We propose an improved structure of an optical biosensor based on a photonic crystal with a defect layer, which can detect the concentration of organic contaminants in water by defect mode shift. We investigated 4 types of defective photonic crystals with different arrangements of layers inside the perfect photonic crystals and their impact on the performance of the sensor. The sensitivity and amplitude of defect mode were examined as a function of defect layer thickness. Also, the peculiarities of edge modes in the presence of defect layer were investigated. Finally, we obtained a characteristic equation to determine the wavelengths of defect modes for an arbitrary 1D photonic crystal with an isotropic defect inside.
{"title":"Sensor with enhanced performance based on photonic crystal with a defect layer","authors":"I.M. Efimov, N.A. Vanyushkin, A.H. Gevorgyan, S.S. Golik","doi":"10.18287/2412-6179-co-1245","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1245","url":null,"abstract":"We propose an improved structure of an optical biosensor based on a photonic crystal with a defect layer, which can detect the concentration of organic contaminants in water by defect mode shift. We investigated 4 types of defective photonic crystals with different arrangements of layers inside the perfect photonic crystals and their impact on the performance of the sensor. The sensitivity and amplitude of defect mode were examined as a function of defect layer thickness. Also, the peculiarities of edge modes in the presence of defect layer were investigated. Finally, we obtained a characteristic equation to determine the wavelengths of defect modes for an arbitrary 1D photonic crystal with an isotropic defect inside.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134997846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1272
T. Cheng, T. Chenchen
In the conventional single-molecule localizations and super-resolution microscopy, the pixel size of a raw image is approximately equal to the standard deviation of the point spread function. Such a raw image is referred to herein as a conventional raw image, based on which better single molecule localization effect and efficiency can be achieved. It is found that both interpolation and de-noising can effectively improve the Signal to Noise Ratio of the conventional raw image. The conventional raw image, the de-noised, the interpolated and the de-noised interpolated are compared and analyzed and compressed sensing is used for super-resolution reconstruction. The simulation results show that both the highest Signal to Noise Ratio and the best super-resolution reconstruction can be obtained by de-noising the interpolated conventional raw image. This method also renders the best super-resolution reconstruction and minimum gradient in the real experiment. De-noising the interpolated conventional raw image is an effective method to improve the super-resolution microscopy.
{"title":"Super-resolution microscopy based on interpolation and wide spectrum de-noising","authors":"T. Cheng, T. Chenchen","doi":"10.18287/2412-6179-co-1272","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1272","url":null,"abstract":"In the conventional single-molecule localizations and super-resolution microscopy, the pixel size of a raw image is approximately equal to the standard deviation of the point spread function. Such a raw image is referred to herein as a conventional raw image, based on which better single molecule localization effect and efficiency can be achieved. It is found that both interpolation and de-noising can effectively improve the Signal to Noise Ratio of the conventional raw image. The conventional raw image, the de-noised, the interpolated and the de-noised interpolated are compared and analyzed and compressed sensing is used for super-resolution reconstruction. The simulation results show that both the highest Signal to Noise Ratio and the best super-resolution reconstruction can be obtained by de-noising the interpolated conventional raw image. This method also renders the best super-resolution reconstruction and minimum gradient in the real experiment. De-noising the interpolated conventional raw image is an effective method to improve the super-resolution microscopy.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134997847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1282
A.A. Kovalev, V.V. Kotlyar
We investigate the spin angular momentum (SAM) of two-order cylindrical vector beams at the tight focus. Such beams are a generalization of the conventional cylindrical vector beams since the transverse field components on the Cartesian axes have different polarization orders. Using the Richards-Wolf approximation, we derive an expression for the longitudinal component of the SAM density distribution. We show that if the polarization indices are of different parity then, an optical spin Hall effect arises at the tight focus, meaning that alternating areas with positive and negative SAM are originated although the initial light field is linearly polarized. We study the angular harmonics spectrum of all the components of the focused light field and determine the predominant angular harmonics. Then, neglecting the insignificant harmonics, we define the shape of the longitudinal component of the SAM density distribution and demonstrate the possibility of generating a focal SAM distribution where the areas with positive and negative SAM reside on a circle as alternating pairs or separated into different half-circles.
{"title":"Spin angular momentum and angular harmonics spectrum of two-order polarization vortices at the tight focus","authors":"A.A. Kovalev, V.V. Kotlyar","doi":"10.18287/2412-6179-co-1282","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1282","url":null,"abstract":"We investigate the spin angular momentum (SAM) of two-order cylindrical vector beams at the tight focus. Such beams are a generalization of the conventional cylindrical vector beams since the transverse field components on the Cartesian axes have different polarization orders. Using the Richards-Wolf approximation, we derive an expression for the longitudinal component of the SAM density distribution. We show that if the polarization indices are of different parity then, an optical spin Hall effect arises at the tight focus, meaning that alternating areas with positive and negative SAM are originated although the initial light field is linearly polarized. We study the angular harmonics spectrum of all the components of the focused light field and determine the predominant angular harmonics. Then, neglecting the insignificant harmonics, we define the shape of the longitudinal component of the SAM density distribution and demonstrate the possibility of generating a focal SAM distribution where the areas with positive and negative SAM reside on a circle as alternating pairs or separated into different half-circles.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1238
K.B. Bulatov, A.S. Ingacheva, M.I. Gilmanov, K. Kutukova, Z.V. Soldatova, A.V. Buzmakov, M.V. Chukalina, E. Zschech, V.V. Arlazarov
The monitored tomographic reconstruction (MTR) with optimized photon flux technique is a pioneering method for X-ray computed tomography (XCT) that reduces the time for data acquisition and the radiation dose. The capturing of the projections in the MTR technique is guided by a scanning protocol built on similar experiments to reach the predetermined quality of the reconstruction. This method allows achieving a similar average reconstruction quality as in ordinary tomography while using lower mean numbers of projections. In this paper, we, for the first time, systematically study the MTR technique under several conditions: reconstruction algorithm (FBP, SIRT, SIRT-TV, and others), type of tomography setup (micro-XCT and nano-XCT), and objects with different morphology. It was shown that a mean dose reduction for reconstruction with a given quality only slightlyvaries with choice of reconstruction algorithm, and reach up to 12.5 % in case of micro-XCT and 8.5 % for nano-XCT. The obtained results allow to conclude that the monitored tomographic reconstruction approach can be universally combined with an algorithm of choice to perform a controlled trade-off between radiation dose and image quality. Validation of the protocol on independent common ground truth demonstrated a good convergence of all reconstruction algorithms within the MTR protocol.
{"title":"Towards monitored tomographic reconstruction: algorithm-dependence and convergence","authors":"K.B. Bulatov, A.S. Ingacheva, M.I. Gilmanov, K. Kutukova, Z.V. Soldatova, A.V. Buzmakov, M.V. Chukalina, E. Zschech, V.V. Arlazarov","doi":"10.18287/2412-6179-co-1238","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1238","url":null,"abstract":"The monitored tomographic reconstruction (MTR) with optimized photon flux technique is a pioneering method for X-ray computed tomography (XCT) that reduces the time for data acquisition and the radiation dose. The capturing of the projections in the MTR technique is guided by a scanning protocol built on similar experiments to reach the predetermined quality of the reconstruction. This method allows achieving a similar average reconstruction quality as in ordinary tomography while using lower mean numbers of projections. In this paper, we, for the first time, systematically study the MTR technique under several conditions: reconstruction algorithm (FBP, SIRT, SIRT-TV, and others), type of tomography setup (micro-XCT and nano-XCT), and objects with different morphology. It was shown that a mean dose reduction for reconstruction with a given quality only slightlyvaries with choice of reconstruction algorithm, and reach up to 12.5 % in case of micro-XCT and 8.5 % for nano-XCT. The obtained results allow to conclude that the monitored tomographic reconstruction approach can be universally combined with an algorithm of choice to perform a controlled trade-off between radiation dose and image quality. Validation of the protocol on independent common ground truth demonstrated a good convergence of all reconstruction algorithms within the MTR protocol.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1218
N.V. Selina
A method of analytical study of a metal film perforated by cylindrical nanoholes as the basis of a metasurface has been proposed. The formulas deduced completely determine the electromagnetic field of a wave scattered by the hole at any distance from the surface. The calculation of the transmission and absorption spectra of the studied structure, as well as the dependence of the phase of the scattered wave on the radius of the hole, is carried out using analytical formulas. The calculation results are consistent with the literature data for calculating the efficiency of a quantum photodetector and the phase change interval of light scattered by meta-atoms required for lens design.
{"title":"Analytical approach to the problem of electromagnetic wave scattering by a cylindrical hole in a thin metal screen","authors":"N.V. Selina","doi":"10.18287/2412-6179-co-1218","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1218","url":null,"abstract":"A method of analytical study of a metal film perforated by cylindrical nanoholes as the basis of a metasurface has been proposed. The formulas deduced completely determine the electromagnetic field of a wave scattered by the hole at any distance from the surface. The calculation of the transmission and absorption spectra of the studied structure, as well as the dependence of the phase of the scattered wave on the radius of the hole, is carried out using analytical formulas. The calculation results are consistent with the literature data for calculating the efficiency of a quantum photodetector and the phase change interval of light scattered by meta-atoms required for lens design.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01DOI: 10.18287/2412-6179-co-1207
R. Sánchez-Rivero, P.V. Bezmaternykh, A.V. Gayer, A. Morales-González, F. José Silva-Mata, K.B. Bulatov
Text recognition has benefited considerably from deep learning research, as well as the preprocessing methods included in its workflow. Identity documents are critical in the field of document analysis and should be thoroughly researched in relation to this workflow. We propose to examine the link between deep learning-based binarization and recognition algorithms for this sort of documents on the MIDV-500 and MIDV-2020 datasets. We provide a series of experiments to illustrate the relation between the quality of the collected images with respect to the binarization results, as well as the influence of its output on final recognition performance. We show that deep learning-based binarization solutions are affected by the capture quality, which implies that they still need significant improvements. We also show that proper binarization results can improve the performance for many recognition methods. Our retrained U-Net-bin outperformed all other binarization methods, and the best result in recognition was obtained by Paddle Paddle OCR v2.
{"title":"A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition","authors":"R. Sánchez-Rivero, P.V. Bezmaternykh, A.V. Gayer, A. Morales-González, F. José Silva-Mata, K.B. Bulatov","doi":"10.18287/2412-6179-co-1207","DOIUrl":"https://doi.org/10.18287/2412-6179-co-1207","url":null,"abstract":"Text recognition has benefited considerably from deep learning research, as well as the preprocessing methods included in its workflow. Identity documents are critical in the field of document analysis and should be thoroughly researched in relation to this workflow. We propose to examine the link between deep learning-based binarization and recognition algorithms for this sort of documents on the MIDV-500 and MIDV-2020 datasets. We provide a series of experiments to illustrate the relation between the quality of the collected images with respect to the binarization results, as well as the influence of its output on final recognition performance. We show that deep learning-based binarization solutions are affected by the capture quality, which implies that they still need significant improvements. We also show that proper binarization results can improve the performance for many recognition methods. Our retrained U-Net-bin outperformed all other binarization methods, and the best result in recognition was obtained by Paddle Paddle OCR v2.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134998063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}