{"title":"[Foreword] Welcome to the Special Section on Advanced Image Sensor Technology","authors":"J. Akita","doi":"10.3169/MTA.9.113","DOIUrl":"https://doi.org/10.3169/MTA.9.113","url":null,"abstract":"","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69652197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tadaaki Isobe, Y. Takimoto, Ryosuke Harakawa, M. Iwahashi
This paper develops a system to visually inspect cutlery based on a simple machine learning algorithm using image features that are robust against overexposure. First, we develop an image acquisition apparatus comprising a laser and a screen that produces speckle images of unique shapes depending on the degree to which the photographed cutlery has been polished. The contribution of this study is to produce speckle images in this way. This enables accurate classification without newly deriving a sophisticated machine learning algorithm in the subsequent processing. We use the speckle images to develop moment-related features that represent the unique shapes and avoid the problem of overexposure. Second, we apply the extreme learning machine, a simple but representative machine learning algorithm, to the obtained features. Experimental results using real cutlery show that our developed system achieved good accuracy and precision regardless of exposure time.
{"title":"[Paper] Development of System to Classify Speckle Images for Visual Inspection of Cutlery","authors":"Tadaaki Isobe, Y. Takimoto, Ryosuke Harakawa, M. Iwahashi","doi":"10.3169/mta.9.169","DOIUrl":"https://doi.org/10.3169/mta.9.169","url":null,"abstract":"This paper develops a system to visually inspect cutlery based on a simple machine learning algorithm using image features that are robust against overexposure. First, we develop an image acquisition apparatus comprising a laser and a screen that produces speckle images of unique shapes depending on the degree to which the photographed cutlery has been polished. The contribution of this study is to produce speckle images in this way. This enables accurate classification without newly deriving a sophisticated machine learning algorithm in the subsequent processing. We use the speckle images to develop moment-related features that represent the unique shapes and avoid the problem of overexposure. Second, we apply the extreme learning machine, a simple but representative machine learning algorithm, to the obtained features. Experimental results using real cutlery show that our developed system achieved good accuracy and precision regardless of exposure time.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69653241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hiroki Tsurusaki, Keisuke Nonaka, R. Watanabe, Tomoaki Konno, S. Naito
Sports scene analysis is an important technology to quantify a player’s action and visualize game statistics. To realize such technology, camera calibration is required to recognize the player’s position from a video. In this paper, we propose an automatic camera calibration method by using intersection resorting and refinement. Our contributions are 1) flexible intersection selection and 2) intersection refinement to improve accuracy in calibration. A homography matrix is used to convert world coordinates to image coordinates for the calibration. Sports scenes can be estimated by using a priori information such as length and position of field lines and their intersections. Conventional methods using the field lines and intersections cannot realize sufficient calibration accuracy because the intersections are selected from the combination of horizontal and vertical lines. Moreover, displacement at the intersections occurs between the detected position and a real one on the input image. Our proposed method can solve these problems by flexible intersection selection and refinement. As a result, a player’s position in the real world is identified from the video by using the estimated homography matrix. Our experimental results show that the proposed method achieves higher accuracy than that by conventional methods.
{"title":"[Paper] Sports Camera Calibration using Flexible Intersection Selection and Refinement","authors":"Hiroki Tsurusaki, Keisuke Nonaka, R. Watanabe, Tomoaki Konno, S. Naito","doi":"10.3169/mta.9.95","DOIUrl":"https://doi.org/10.3169/mta.9.95","url":null,"abstract":"Sports scene analysis is an important technology to quantify a player’s action and visualize game statistics. To realize such technology, camera calibration is required to recognize the player’s position from a video. In this paper, we propose an automatic camera calibration method by using intersection resorting and refinement. Our contributions are 1) flexible intersection selection and 2) intersection refinement to improve accuracy in calibration. A homography matrix is used to convert world coordinates to image coordinates for the calibration. Sports scenes can be estimated by using a priori information such as length and position of field lines and their intersections. Conventional methods using the field lines and intersections cannot realize sufficient calibration accuracy because the intersections are selected from the combination of horizontal and vertical lines. Moreover, displacement at the intersections occurs between the detected position and a real one on the input image. Our proposed method can solve these problems by flexible intersection selection and refinement. As a result, a player’s position in the real world is identified from the video by using the estimated homography matrix. Our experimental results show that the proposed method achieves higher accuracy than that by conventional methods.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69653267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kentaro Kusama, T. Ishinabe, B. Katagiri, H. Fujikake
changes; however, if the diffusion area is too wide, the overall brightness will be reduced and the image will be blurred. Therefore, we optimized the diffuse reflection characteristics while taking into account lowering Abstract We examined anisotropic light-diffusing films for reflective displays to realize a wide viewing angle for a rotating screen such as a tablet. By orthogonally laminating 60-µm-thick anisotropic light diffusing films having bent columnar structure, we were able to demonstrate uniform diffuse reflection characteristics for a point light source polar angle of 30° from various azimuth angles. Then, we showed that image blur could be suppressed by narrowing the gap between the reflector and the diffusing layer, such as by grinding the top cell glass.
{"title":"[Paper] Design of Anisotropic Light-diffusing Film for Rotational Use of Reflective Displays","authors":"Kentaro Kusama, T. Ishinabe, B. Katagiri, H. Fujikake","doi":"10.3169/mta.9.210","DOIUrl":"https://doi.org/10.3169/mta.9.210","url":null,"abstract":"changes; however, if the diffusion area is too wide, the overall brightness will be reduced and the image will be blurred. Therefore, we optimized the diffuse reflection characteristics while taking into account lowering Abstract We examined anisotropic light-diffusing films for reflective displays to realize a wide viewing angle for a rotating screen such as a tablet. By orthogonally laminating 60-µm-thick anisotropic light diffusing films having bent columnar structure, we were able to demonstrate uniform diffuse reflection characteristics for a point light source polar angle of 30° from various azimuth angles. Then, we showed that image blur could be suppressed by narrowing the gap between the reflector and the diffusing layer, such as by grinding the top cell glass.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69653353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"[Paper] Spatial Uniformity Improvement at Low Luminance Levels for Organic Light-Emitting Diode Displays","authors":"T. Okada, T. Usui, Y. Nakajima","doi":"10.3169/mta.9.234","DOIUrl":"https://doi.org/10.3169/mta.9.234","url":null,"abstract":"","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69653415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Takumi Nagasawa, Ryo Takahashi, Keiko Ogawa-Ochiai, N. Tsumura
stress estimation methods using camera have been investigated, which involve capturing biological information in the form of facial and skin videos. Specific biological information that can be Abstract In this paper, we propose a method for estimating stress levels using multibiological information obtained from red-green-blue (RGB) facial videos. The most popular method for estimating stress is by questionnaire, which is not highly objective or accurate. To estimate stress levels more accurately, it is desirable to establish the method which can estimate with objective and accurate information. In recent years, many methods for using biological information have been proposed. In this study, we used three types of biological information to estimate stress: pulse rate variability, blink rate, and pupil size, which were captured using an RGB camera. Results showed that the use of these biological signals improved the accuracy of stress estimation. Moreover, using biological information in a multimodal manner enabled the most accurate and detailed stress estimation.
{"title":"[Paper] Multimodal Stress Estimation Using Multibiological Information: Towards More Accurate and Detailed Stress Estimation","authors":"Takumi Nagasawa, Ryo Takahashi, Keiko Ogawa-Ochiai, N. Tsumura","doi":"10.3169/mta.9.276","DOIUrl":"https://doi.org/10.3169/mta.9.276","url":null,"abstract":"stress estimation methods using camera have been investigated, which involve capturing biological information in the form of facial and skin videos. Specific biological information that can be Abstract In this paper, we propose a method for estimating stress levels using multibiological information obtained from red-green-blue (RGB) facial videos. The most popular method for estimating stress is by questionnaire, which is not highly objective or accurate. To estimate stress levels more accurately, it is desirable to establish the method which can estimate with objective and accurate information. In recent years, many methods for using biological information have been proposed. In this study, we used three types of biological information to estimate stress: pulse rate variability, blink rate, and pupil size, which were captured using an RGB camera. Results showed that the use of these biological signals improved the accuracy of stress estimation. Moreover, using biological information in a multimodal manner enabled the most accurate and detailed stress estimation.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69653002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
For photosynthesis promotion in the field of agriculture, we report on solution-processed molecular alignment control of organic fluorescent dye using guest-host effect and photo-polymerization reaction in this paper. We investigated the relation between molecular conformation and polarized photoluminescence characteristics in dye/polymer blend films. As the result, we observed polarized photoluminescence with reduction of concentration quenching effect and disordered region of dye molecules. We discussed the solution process and materials for highly-ordered solid-state films using liquid crystalline monomer and fluorescent dye.
{"title":"[Paper] Polarized Photoluminescence Characteristics of Uniaxially-Aligned Fluorescent Dye with Liquid Crystalline Polymer for Agricultural Applications","authors":"Y. Shibata, R. Tsunaki, T. Ishinabe, H. Fujikake","doi":"10.3169/mta.9.203","DOIUrl":"https://doi.org/10.3169/mta.9.203","url":null,"abstract":"For photosynthesis promotion in the field of agriculture, we report on solution-processed molecular alignment control of organic fluorescent dye using guest-host effect and photo-polymerization reaction in this paper. We investigated the relation between molecular conformation and polarized photoluminescence characteristics in dye/polymer blend films. As the result, we observed polarized photoluminescence with reduction of concentration quenching effect and disordered region of dye molecules. We discussed the solution process and materials for highly-ordered solid-state films using liquid crystalline monomer and fluorescent dye.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69653341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"[Foreword] Welcome to the Special Section on Data Storage and Related Technologies","authors":"Y. Kawata","doi":"10.3169/mta.9.143","DOIUrl":"https://doi.org/10.3169/mta.9.143","url":null,"abstract":"","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69652505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tetsuhiko Muroi, Yutaro Katano, N. Kinoshita, Ishii Norihiko
{"title":"[Paper] Spatial Filter and Combination of Angle and Peristrophic Multiplexings to Achieve Recording Density of 1 Tbit/inch2 in Holographic Data Storage","authors":"Tetsuhiko Muroi, Yutaro Katano, N. Kinoshita, Ishii Norihiko","doi":"10.3169/mta.9.153","DOIUrl":"https://doi.org/10.3169/mta.9.153","url":null,"abstract":"","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69652755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yutaro Katano, Teruyoshi Nobukawa, Tetsuhiko Muroi, N. Kinoshita, Ishii Norihiko
LDPC (SC-LDPC) code 15) is one of the strongest error correction codes that approaches the Shannon limit, based on the LDPC code 16) . We confirmed that the capability of error correction of the SC-LDPC code outperforms that of the LDPC code in the HDS 17) . This study presents an effective data-decoding method by combining the CNN demodulation and SC-LDPC code to enable a more powerful error correction by using the likelihood information obtained as the output from the CNN. We evaluated the characteristics of the demodulation and error correction method using the reproduced data with numerically added noise. Abstract In this study, we propose an effective data-decoding method for holographic data storage (HDS) by combining convolutional neural network (CNN) and spatially coupled low-density parity-check (SC-LDPC) code. The trained CNN provides output class probabilities and accurately demodulates the reproduced data from HDS. We focus on these probabilities, wherein only the untrainable noise components such as white Gaussian noise remain. These are used for calculating the log likelihood ratio in the sum-product decoding for the SC-LDPC code. We demonstrate an improvement of approximately 10 dB in the required signal-to-noise ratio for an error-free decoding in numerical simulations.
{"title":"[Paper] Efficient Decoding Method for Holographic Data Storage Combining Convolutional Neural Network and Spatially Coupled Low-Density Parity-Check Code","authors":"Yutaro Katano, Teruyoshi Nobukawa, Tetsuhiko Muroi, N. Kinoshita, Ishii Norihiko","doi":"10.3169/mta.9.161","DOIUrl":"https://doi.org/10.3169/mta.9.161","url":null,"abstract":"LDPC (SC-LDPC) code 15) is one of the strongest error correction codes that approaches the Shannon limit, based on the LDPC code 16) . We confirmed that the capability of error correction of the SC-LDPC code outperforms that of the LDPC code in the HDS 17) . This study presents an effective data-decoding method by combining the CNN demodulation and SC-LDPC code to enable a more powerful error correction by using the likelihood information obtained as the output from the CNN. We evaluated the characteristics of the demodulation and error correction method using the reproduced data with numerically added noise. Abstract In this study, we propose an effective data-decoding method for holographic data storage (HDS) by combining convolutional neural network (CNN) and spatially coupled low-density parity-check (SC-LDPC) code. The trained CNN provides output class probabilities and accurately demodulates the reproduced data from HDS. We focus on these probabilities, wherein only the untrainable noise components such as white Gaussian noise remain. These are used for calculating the log likelihood ratio in the sum-product decoding for the SC-LDPC code. We demonstrate an improvement of approximately 10 dB in the required signal-to-noise ratio for an error-free decoding in numerical simulations.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69653143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}