首页 > 最新文献

中国图象图形学报最新文献

英文 中文
A Hybrid DWT-SVD Based Adaptive Image Watermarking Scheme 基于 DWT-SVD 的混合自适应图像水印方案
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.414-427
Sachin Gaur, Navneet Tripathi, Jyoti Pandey
In the digital age, protecting the ownership and data veracity of digital documents is a major challenge. To address the issues concerning copyright protection and data verification of digital media, digital watermarking has emerged as a solution. In this paper, we aspire to make a modest contribution to this emerging and exciting field by presenting our proposed adaptive hybrid image watermarking approach that combines Discrete Wavelet Transform (DWT) and Singular Value Decomposition (SVD). Our method involves applying DWT to both the host image and watermark, followed by singular decomposition using SVD on the Low-Low (LL) component of both images. Now modify the singular values of the host image by the singular values of the watermark, and then inverse SVD is applied, followed by inverse DWT, to obtain the watermarked image. After that, the reverse process is applied to obtain the watermark image. Finally, we evaluate our approach’s performance by measuring the Peak Signal-to-Noise Ratio (PSNR) between the original and watermarked image as well as the Normalized Cross-Correlation (NCC) between the original and extracted watermark. Simulation results indicate that the proposed method is rich in terms of robustness, imperceptibility and capacity than the previously presented schemes.
在数字时代,保护数字文档的所有权和数据真实性是一项重大挑战。为了解决数字媒体的版权保护和数据验证问题,数字水印作为一种解决方案应运而生。在本文中,我们希望通过提出我们提出的结合离散小波变换(DWT)和奇异值分解(SVD)的自适应混合图像水印方法,为这个新兴和令人兴奋的领域做出适度的贡献。我们的方法包括对主图像和水印应用DWT,然后对两幅图像的Low-Low (LL)分量使用SVD进行奇异分解。然后通过水印的奇异值对主机图像的奇异值进行修改,然后进行反奇异值分解,再进行反小波变换,得到水印图像。然后,应用反向过程获得水印图像。最后,我们通过测量原始图像和水印图像之间的峰值信噪比(PSNR)以及原始图像和提取的水印之间的归一化互相关(NCC)来评估我们的方法的性能。仿真结果表明,该方法在鲁棒性、不可感知性和容量等方面都优于已有的方法。
{"title":"A Hybrid DWT-SVD Based Adaptive Image Watermarking Scheme","authors":"Sachin Gaur, Navneet Tripathi, Jyoti Pandey","doi":"10.18178/joig.11.4.414-427","DOIUrl":"https://doi.org/10.18178/joig.11.4.414-427","url":null,"abstract":"In the digital age, protecting the ownership and data veracity of digital documents is a major challenge. To address the issues concerning copyright protection and data verification of digital media, digital watermarking has emerged as a solution. In this paper, we aspire to make a modest contribution to this emerging and exciting field by presenting our proposed adaptive hybrid image watermarking approach that combines Discrete Wavelet Transform (DWT) and Singular Value Decomposition (SVD). Our method involves applying DWT to both the host image and watermark, followed by singular decomposition using SVD on the Low-Low (LL) component of both images. Now modify the singular values of the host image by the singular values of the watermark, and then inverse SVD is applied, followed by inverse DWT, to obtain the watermarked image. After that, the reverse process is applied to obtain the watermark image. Finally, we evaluate our approach’s performance by measuring the Peak Signal-to-Noise Ratio (PSNR) between the original and watermarked image as well as the Normalized Cross-Correlation (NCC) between the original and extracted watermark. Simulation results indicate that the proposed method is rich in terms of robustness, imperceptibility and capacity than the previously presented schemes.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":" 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138620978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reg-PointNet++: A CNN Network Based on PointNet++ Architecture for 3D Reconstruction of 3D Objects Modeled by Supershapes Reg-PointNet++:基于 PointNet++ 架构的 CNN 网络,用于超形状建模的三维物体的三维重建
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.405-413
Hassnae Remmach, Raja Mouachi, M. Sadgal, Aziz El Fazziki
The use of 3D reconstruction in computer vision applications has opened up new avenues for research and development. It has a significant impact on a range of industries, from healthcare to robotics, by improving the performance and abilities of computer vision systems. In this paper we aim to improve 3D reconstruction quality and accuracy. The objective is to develop a model that can learn to extract features, estimate a Supershape parameters and reconstruct 3D directly from input points cloud. In this regard, we present a continuity of our latest works, using a CNN-based Multi-Output and Multi-Task Regressor, for 3D reconstruction from 3D point cloud. We propose another new approach in order to refine our previous methodology and expand our findings. It is about “Reg-PointNet++”, which is mainly based on a PointNet++ architecture adapted for multi-task regression, with the goal of reconstructing a 3D object modeled by Supershapes from 3D point cloud. Given the difficulties encountered in applying convolution to point clouds, our approach is based on the PointNet ++ architecture. It is used to extract features from the 3D point cloud, which are then fed into a Multi-task Regressor for predicting the Supershape parameters needed to reconstruct the shape. The approach has shown promising results in reconstructing 3D objects modeled by Supershapes, demonstrating improved accuracy and robustness to noise and outperforming existing techniques. Visually, the predicted shapes have a high likelihood with the real shapes, as well as a high accuracy rate in a very reasonable number of iterations. Overall, the approach presented in the paper has the potential to significantly improve the accuracy and efficiency of 3D reconstruction, enabling its use in a wider range of applications.
在计算机视觉应用中使用3D重建为研究和开发开辟了新的途径。它通过提高计算机视觉系统的性能和能力,对从医疗保健到机器人等一系列行业产生了重大影响。本文旨在提高三维重建的质量和精度。目标是开发一个可以学习提取特征,估计Supershape参数并直接从输入点云重建3D的模型。在这方面,我们展示了我们最新工作的连续性,使用基于cnn的多输出和多任务回归器,从3D点云进行3D重建。我们提出了另一种新方法,以完善我们以前的方法并扩展我们的发现。它是关于“regg -PointNet++”的,它主要基于适合多任务回归的PointNet++架构,目标是从3D点云中重建由Supershapes建模的3D对象。考虑到在将卷积应用于点云时遇到的困难,我们的方法是基于pointnet++架构的。它用于从3D点云中提取特征,然后将这些特征输入到多任务回归器中,用于预测重建形状所需的Supershape参数。该方法在重建由Supershapes建模的3D物体方面显示出有希望的结果,证明了提高的准确性和对噪声的鲁棒性,优于现有技术。从视觉上看,预测的形状与实际形状具有很高的可能性,并且在非常合理的迭代次数下具有很高的准确率。总体而言,本文提出的方法具有显著提高3D重建精度和效率的潜力,使其能够在更广泛的应用中使用。
{"title":"Reg-PointNet++: A CNN Network Based on PointNet++ Architecture for 3D Reconstruction of 3D Objects Modeled by Supershapes","authors":"Hassnae Remmach, Raja Mouachi, M. Sadgal, Aziz El Fazziki","doi":"10.18178/joig.11.4.405-413","DOIUrl":"https://doi.org/10.18178/joig.11.4.405-413","url":null,"abstract":"The use of 3D reconstruction in computer vision applications has opened up new avenues for research and development. It has a significant impact on a range of industries, from healthcare to robotics, by improving the performance and abilities of computer vision systems. In this paper we aim to improve 3D reconstruction quality and accuracy. The objective is to develop a model that can learn to extract features, estimate a Supershape parameters and reconstruct 3D directly from input points cloud. In this regard, we present a continuity of our latest works, using a CNN-based Multi-Output and Multi-Task Regressor, for 3D reconstruction from 3D point cloud. We propose another new approach in order to refine our previous methodology and expand our findings. It is about “Reg-PointNet++”, which is mainly based on a PointNet++ architecture adapted for multi-task regression, with the goal of reconstructing a 3D object modeled by Supershapes from 3D point cloud. Given the difficulties encountered in applying convolution to point clouds, our approach is based on the PointNet ++ architecture. It is used to extract features from the 3D point cloud, which are then fed into a Multi-task Regressor for predicting the Supershape parameters needed to reconstruct the shape. The approach has shown promising results in reconstructing 3D objects modeled by Supershapes, demonstrating improved accuracy and robustness to noise and outperforming existing techniques. Visually, the predicted shapes have a high likelihood with the real shapes, as well as a high accuracy rate in a very reasonable number of iterations. Overall, the approach presented in the paper has the potential to significantly improve the accuracy and efficiency of 3D reconstruction, enabling its use in a wider range of applications.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":"60 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138627535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Roselle Pest Detection and Classification Using Threshold and Template Matching 利用阈值和模板匹配进行洛泽尔害虫检测和分类
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.330-342
Ade Bastian, Adie Iman Nurzaman, Tri Ferga Prasetyo, Sri Fatimah
Roselle is a fiber-producing plant that has broad benefits for health food, so many farmers are interested in starting to cultivate it. This study aims to design a rosella plant pest detection system to reduce the risk of crop failure or reduced yields of rosella calyx. The design of a system for detecting and classifying rosella pests uses the threshold method as a digital image processing method connected via the internet with information media applications and template matching to detect and classify pests on rosella plants. Detection of pests on rosella plants has been successfully built using a detection system using thresholding and template matching methods. Datasets of rosella plant pests that are not yet widely available encourage the detection of rosella plant pests with datasets from rosella plant objects and limited data testing. Testing with 75% accuracy, the detection process is affected by light and camera quality.
玫瑰是一种纤维生产植物,对健康食品有广泛的好处,所以许多农民都有兴趣开始种植它。本研究旨在设计一套玫瑰花萼病虫害检测系统,以降低玫瑰花萼歉收或减产的风险。采用阈值法作为数字图像处理方法,通过互联网与信息媒体应用和模板匹配相连接,设计了一套玫瑰属植物害虫检测与分类系统。应用阈值法和模板匹配法成功建立了玫瑰属植物害虫检测系统。尚未广泛获得的玫瑰植物有害生物数据集鼓励使用来自玫瑰植物目标的数据集和有限的数据测试来检测玫瑰植物有害生物。测试精度为75%,检测过程受光线和相机质量的影响。
{"title":"Roselle Pest Detection and Classification Using Threshold and Template Matching","authors":"Ade Bastian, Adie Iman Nurzaman, Tri Ferga Prasetyo, Sri Fatimah","doi":"10.18178/joig.11.4.330-342","DOIUrl":"https://doi.org/10.18178/joig.11.4.330-342","url":null,"abstract":"Roselle is a fiber-producing plant that has broad benefits for health food, so many farmers are interested in starting to cultivate it. This study aims to design a rosella plant pest detection system to reduce the risk of crop failure or reduced yields of rosella calyx. The design of a system for detecting and classifying rosella pests uses the threshold method as a digital image processing method connected via the internet with information media applications and template matching to detect and classify pests on rosella plants. Detection of pests on rosella plants has been successfully built using a detection system using thresholding and template matching methods. Datasets of rosella plant pests that are not yet widely available encourage the detection of rosella plant pests with datasets from rosella plant objects and limited data testing. Testing with 75% accuracy, the detection process is affected by light and camera quality.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":" 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138612885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Enhanced Security in Medical Image Encryption Using Dynamic Chaotic Fuzzy Based Technique 利用基于动态混沌模糊技术提高医学图像加密的安全性
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.376-383
Snehashish Bhattacharjee, Mousumi Gupta, Biswajoy Chatterjee
As IoT and cloud computing have grown in popularity, medical images are now often transmitted between devices or accessed directly from the cloud. With this, the security is always a concern as these images are prone to many types of attack. We have proposed a proven method that is efficient in terms of security, time complexity, and integrity in order to be cloud-friendly so that it may be launched into the cloud and made accessible to users at any time. The goal of the work is to create a dynamic key that, depending on fuzzy values, alters the reproduction rate parameters with each repetition. By applying the last chaotic value created from the previous iteration, the fuzzy triangular membership function has been used in this manner to generate the reproduction rate parameter. The uniqueness and major benefit of the suggested strategy are that it can increase the security of the algorithm that makes use of a chaotic map and a static key. The method has been put forth when designing algorithms so that it should not only demonstrate security against different attacks but also provide efficiency towards computational complexity. The technique has been tested against a set of images and an existing algorithm using a variety of security metrics, including the correlation coefficient, Number of Pixel Change Rate (NPCR), Unified Average Changing Intensity (UACI), and entropy. It has been determined from the comparative analysis that the proposed approach can make the existing algorithm more secure.
随着物联网和云计算的普及,医疗图像现在经常在设备之间传输或直接从云中访问。因此,安全性总是一个问题,因为这些图像容易受到多种类型的攻击。我们已经提出了一种经过验证的方法,该方法在安全性、时间复杂性和完整性方面都是有效的,以便于云友好,以便可以将其启动到云中并随时供用户访问。这项工作的目标是创建一个动态键,根据模糊值,每次重复都会改变复制速率参数。利用前一次迭代产生的最后一个混沌值,利用模糊三角隶属函数生成再现率参数。所建议的策略的惟一性和主要优点是,它可以提高使用混沌映射和静态密钥的算法的安全性。在设计算法时提出了这种方法,既要证明对不同攻击的安全性,又要对计算复杂度提供效率。该技术已经针对一组图像和使用各种安全度量的现有算法进行了测试,包括相关系数、像素变化率(NPCR)、统一平均变化强度(UACI)和熵。对比分析表明,本文提出的方法可以提高现有算法的安全性。
{"title":"An Enhanced Security in Medical Image Encryption Using Dynamic Chaotic Fuzzy Based Technique","authors":"Snehashish Bhattacharjee, Mousumi Gupta, Biswajoy Chatterjee","doi":"10.18178/joig.11.4.376-383","DOIUrl":"https://doi.org/10.18178/joig.11.4.376-383","url":null,"abstract":"As IoT and cloud computing have grown in popularity, medical images are now often transmitted between devices or accessed directly from the cloud. With this, the security is always a concern as these images are prone to many types of attack. We have proposed a proven method that is efficient in terms of security, time complexity, and integrity in order to be cloud-friendly so that it may be launched into the cloud and made accessible to users at any time. The goal of the work is to create a dynamic key that, depending on fuzzy values, alters the reproduction rate parameters with each repetition. By applying the last chaotic value created from the previous iteration, the fuzzy triangular membership function has been used in this manner to generate the reproduction rate parameter. The uniqueness and major benefit of the suggested strategy are that it can increase the security of the algorithm that makes use of a chaotic map and a static key. The method has been put forth when designing algorithms so that it should not only demonstrate security against different attacks but also provide efficiency towards computational complexity. The technique has been tested against a set of images and an existing algorithm using a variety of security metrics, including the correlation coefficient, Number of Pixel Change Rate (NPCR), Unified Average Changing Intensity (UACI), and entropy. It has been determined from the comparative analysis that the proposed approach can make the existing algorithm more secure.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":"121 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138615226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Melanoma Detection Based on SVM Using MATLAB 使用 MATLAB 基于 SVM 检测黑色素瘤
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.353-358
Radhwan M. W. Khaleel, N. M. Basheer
Skin cancer has become the fifth-most dangerous type of cancer. Melanoma, the most ferocious type of skin cancer, should be detected and treated to reduce the risk of spreading to the rest of the body’s organs. This study aims to provide fast and painless detection of skin cancer using image processing, including enhancement and extraction of interesting features for the characterization and classification of infected skin images into melanoma or nonmelanoma in MATLAB. The features used for texture analysis of inserted images are the Gray Level Co-occurrence Matrix (GLCM) and Local Binary Pattern (LBP). The classification of melanoma and non-melanoma is done by training a Support Vector Machine (SVM) using the radial basis function kernel. The accuracy of testing is 94.87%.
皮肤癌已经成为第五种最危险的癌症。黑色素瘤是一种最凶猛的皮肤癌,应该及时发现并治疗,以降低扩散到身体其他器官的风险。本研究旨在通过图像处理提供快速无痛的皮肤癌检测,包括增强和提取有趣的特征,在MATLAB中对感染的皮肤图像进行表征和分类为黑色素瘤或非黑色素瘤。用于插入图像纹理分析的特征是灰度共生矩阵(GLCM)和局部二值模式(LBP)。利用径向基函数核训练支持向量机(SVM)进行黑色素瘤和非黑色素瘤的分类。检测准确率为94.87%。
{"title":"Melanoma Detection Based on SVM Using MATLAB","authors":"Radhwan M. W. Khaleel, N. M. Basheer","doi":"10.18178/joig.11.4.353-358","DOIUrl":"https://doi.org/10.18178/joig.11.4.353-358","url":null,"abstract":"Skin cancer has become the fifth-most dangerous type of cancer. Melanoma, the most ferocious type of skin cancer, should be detected and treated to reduce the risk of spreading to the rest of the body’s organs. This study aims to provide fast and painless detection of skin cancer using image processing, including enhancement and extraction of interesting features for the characterization and classification of infected skin images into melanoma or nonmelanoma in MATLAB. The features used for texture analysis of inserted images are the Gray Level Co-occurrence Matrix (GLCM) and Local Binary Pattern (LBP). The classification of melanoma and non-melanoma is done by training a Support Vector Machine (SVM) using the radial basis function kernel. The accuracy of testing is 94.87%.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":" 65","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138614083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Random Forest Algorithm to Grading Mango's Quality Based on External Features Extracted from Captured Images 使用随机森林算法根据从拍摄图像中提取的外部特征对芒果的质量进行分级
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.391-396
Nguyen Minh Trieu, Nguyen Truong Thinh
The grading of mango is still a manual process in agriculture. Nowadays, mangoes are classified based on human experience, which makes the grade not uniform for agricultural product export establishments. Therefore, the automated grading of mango is very important to solve these problems. In this study, a random forest algorithm is proposed for an automated mango grading system based on quality attributes such as density, surface defect, and weight. The internal features including dimensions and surface defects are extracted via the captured image. These features are combined with the weight to estimate density. This study uses 732 mangoes that are collected from several local farms. The experiment of the grading system has high accuracy with 98.3%. Instead of using Non-Destructive Testing (NDT) equipment, this grading method can be used to apply to evaluate the quality of other tropical fruits.
在农业中,芒果的分级仍然是一个人工过程。目前,芒果的分类是根据人的经验进行的,这使得农产品出口机构的等级不统一。因此,芒果的自动分级对于解决这些问题是非常重要的。在这项研究中,提出了一种随机森林算法,用于基于密度、表面缺陷和重量等质量属性的芒果自动分级系统。通过捕获的图像提取内部特征,包括尺寸和表面缺陷。这些特征与权重相结合来估计密度。这项研究使用了从几个当地农场收集的732个芒果。实验结果表明,该分级系统的准确率高达98.3%。该分级方法可以代替无损检测设备,应用于其他热带水果的质量评价。
{"title":"Using Random Forest Algorithm to Grading Mango's Quality Based on External Features Extracted from Captured Images","authors":"Nguyen Minh Trieu, Nguyen Truong Thinh","doi":"10.18178/joig.11.4.391-396","DOIUrl":"https://doi.org/10.18178/joig.11.4.391-396","url":null,"abstract":"The grading of mango is still a manual process in agriculture. Nowadays, mangoes are classified based on human experience, which makes the grade not uniform for agricultural product export establishments. Therefore, the automated grading of mango is very important to solve these problems. In this study, a random forest algorithm is proposed for an automated mango grading system based on quality attributes such as density, surface defect, and weight. The internal features including dimensions and surface defects are extracted via the captured image. These features are combined with the weight to estimate density. This study uses 732 mangoes that are collected from several local farms. The experiment of the grading system has high accuracy with 98.3%. Instead of using Non-Destructive Testing (NDT) equipment, this grading method can be used to apply to evaluate the quality of other tropical fruits.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":"99 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138626163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human Action Recognition with Skeleton and Infrared Fusion Model 利用骨架和红外融合模型识别人体动作
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.309-320
Amine Mansouri, Toufik Bakir, S. Femmam
Skeleton-based human action recognition conveys interesting information about the dynamics of a human body. In this work, we develop a method that uses a multi-stream model with connections between the parallel streams. This work is inspired by a state-of-the-art method called FUSIONCPA that merges different modalities: infrared input and skeleton input. Because we are interested in investigating improvements related to the skeleton-branch backbone, we used the Spatial-Temporal Graph Convolutional Networks (ST-GCN) model and an EfficientGCN attention module. We aim to provide improvements when capturing spatial and temporal features. In addition, we exploited a Graph Convolutional Network (GCN) implemented in the ST-GCN model to capture the graphic connectivity in skeletons. This paper reports interesting accuracy on a large-scale dataset (NTU-RGB+D 60), over 91% and 93% on respectively crosssubject, and cross-view benchmarks. This proposed model is lighter by 9 million training parameters compared with the model FUSION-CPA.
基于骨骼的人体动作识别传达了关于人体动态的有趣信息。在这项工作中,我们开发了一种使用并行流之间连接的多流模型的方法。这项工作的灵感来自于一种称为FUSIONCPA的最先进的方法,它融合了不同的模式:红外输入和骨骼输入。因为我们对研究与骨骼-分支主干相关的改进感兴趣,我们使用了时空图卷积网络(ST-GCN)模型和高效gcn注意力模块。我们的目标是在捕捉空间和时间特征时提供改进。此外,我们利用在ST-GCN模型中实现的图形卷积网络(GCN)来捕获骨架中的图形连通性。本文报告了在大规模数据集(NTU-RGB+ d60)上有趣的准确性,在交叉主题和交叉视角基准上分别超过91%和93%。与FUSION-CPA模型相比,该模型减少了900万个训练参数。
{"title":"Human Action Recognition with Skeleton and Infrared Fusion Model","authors":"Amine Mansouri, Toufik Bakir, S. Femmam","doi":"10.18178/joig.11.4.309-320","DOIUrl":"https://doi.org/10.18178/joig.11.4.309-320","url":null,"abstract":"Skeleton-based human action recognition conveys interesting information about the dynamics of a human body. In this work, we develop a method that uses a multi-stream model with connections between the parallel streams. This work is inspired by a state-of-the-art method called FUSIONCPA that merges different modalities: infrared input and skeleton input. Because we are interested in investigating improvements related to the skeleton-branch backbone, we used the Spatial-Temporal Graph Convolutional Networks (ST-GCN) model and an EfficientGCN attention module. We aim to provide improvements when capturing spatial and temporal features. In addition, we exploited a Graph Convolutional Network (GCN) implemented in the ST-GCN model to capture the graphic connectivity in skeletons. This paper reports interesting accuracy on a large-scale dataset (NTU-RGB+D 60), over 91% and 93% on respectively crosssubject, and cross-view benchmarks. This proposed model is lighter by 9 million training parameters compared with the model FUSION-CPA.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":" 89","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138613767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Intra- and Inter-Modality Fusion Model with Invariant- and Specific-Constraints Using MR Images for Prediction of Glioma Isocitrate Dehydrogenase Mutation Status 使用磁共振图像预测胶质瘤异柠檬酸脱氢酶突变状态的具有无创和特异性限制的模内和模间融合模型
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.321-329
Xiaoyu Shi, Yinhao Li, Yen-wei Chen, Jingliang Cheng, J. Bai, Guohua Zhao
In the 2021 World Health Organization classification of gliomas, it is proposed that Isocitrate Dehydrogenase (IDH) plays a key role. The prognosis of glioma is largely affected by IDH mutation status. Therefore, IDH mutation status needs to be predicted in advance before surgery. In the past decade, with the development of machine learning, more and more machine learning methods, especially deep learning methods, have been applied to the development of computer-aided diagnosis systems. At present, in this field, many deep learning and radiomics based methods have been proposed for IDH prediction using multimodal Magnetic Resonance Imaging (MRI). In this study, we proposed an intra- and inter-modality fusion model with invariant- and specific- constraints to improve the performance of IDH status prediction. First, MRI-based radiomics features were fused with deep learning features in each modality (intra-modality fusion) and then the features extracted from each modality of brain MRI were fused by using an inter-modality fusion model with invariant and specific constraints. We experimented our proposed method on the dataset provided by the Affiliated Hospital of Zhengzhou University in Zhengzhou, China and demonstrated the effectiveness of the proposed method. In our study, we propose two inter-modality fusion models, and our experimental results show that our best proposed method outperformed state-of-the-art methods with an accuracy of 0.79, precision of 0.80, recall of 0.75, and F1 score of 0.78. Thus, we predicted the IDH mutation status for glioma treatment with a 2% increase in accuracy and 4% increase in precision to predict the IDH mutation status for glioma treatment.
在2021年世界卫生组织的胶质瘤分类中,提出异柠檬酸脱氢酶(IDH)起着关键作用。胶质瘤的预后在很大程度上受IDH突变状态的影响。因此术前需提前预测IDH突变状态。近十年来,随着机器学习的发展,越来越多的机器学习方法,特别是深度学习方法被应用到计算机辅助诊断系统的开发中。目前,在这一领域,已经提出了许多基于深度学习和放射组学的方法来利用多模态磁共振成像(MRI)预测IDH。在这项研究中,我们提出了一种具有不变约束和特定约束的模态内和模态间融合模型,以提高IDH状态预测的性能。首先,将基于MRI的放射组学特征与各模态的深度学习特征进行融合(模态内融合),然后使用具有不变约束和特定约束的模态间融合模型对脑MRI各模态提取的特征进行融合。我们在中国郑州郑州大学附属医院提供的数据集上进行了实验,验证了所提出方法的有效性。在我们的研究中,我们提出了两种多模态融合模型,我们的实验结果表明,我们提出的最佳方法优于现有的方法,准确率为0.79,精密度为0.80,召回率为0.75,F1分数为0.78。因此,我们预测胶质瘤治疗中IDH突变状态的准确性提高了2%,预测胶质瘤治疗中IDH突变状态的精度提高了4%。
{"title":"An Intra- and Inter-Modality Fusion Model with Invariant- and Specific-Constraints Using MR Images for Prediction of Glioma Isocitrate Dehydrogenase Mutation Status","authors":"Xiaoyu Shi, Yinhao Li, Yen-wei Chen, Jingliang Cheng, J. Bai, Guohua Zhao","doi":"10.18178/joig.11.4.321-329","DOIUrl":"https://doi.org/10.18178/joig.11.4.321-329","url":null,"abstract":"In the 2021 World Health Organization classification of gliomas, it is proposed that Isocitrate Dehydrogenase (IDH) plays a key role. The prognosis of glioma is largely affected by IDH mutation status. Therefore, IDH mutation status needs to be predicted in advance before surgery. In the past decade, with the development of machine learning, more and more machine learning methods, especially deep learning methods, have been applied to the development of computer-aided diagnosis systems. At present, in this field, many deep learning and radiomics based methods have been proposed for IDH prediction using multimodal Magnetic Resonance Imaging (MRI). In this study, we proposed an intra- and inter-modality fusion model with invariant- and specific- constraints to improve the performance of IDH status prediction. First, MRI-based radiomics features were fused with deep learning features in each modality (intra-modality fusion) and then the features extracted from each modality of brain MRI were fused by using an inter-modality fusion model with invariant and specific constraints. We experimented our proposed method on the dataset provided by the Affiliated Hospital of Zhengzhou University in Zhengzhou, China and demonstrated the effectiveness of the proposed method. In our study, we propose two inter-modality fusion models, and our experimental results show that our best proposed method outperformed state-of-the-art methods with an accuracy of 0.79, precision of 0.80, recall of 0.75, and F1 score of 0.78. Thus, we predicted the IDH mutation status for glioma treatment with a 2% increase in accuracy and 4% increase in precision to predict the IDH mutation status for glioma treatment.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":"5 21","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138623852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of SSD Architecture for Small Size Object Detection: A Case Study on UAV Oil Pipeline MonitoringEvaluation of SSD Architecture for Small Size Object Detection: A Case Study on UAV Oil Pipeline Monitoring 用于小尺寸物体检测的固态硬盘架构评估:用于小尺寸物体检测的固态硬盘架构评估:无人机石油管道监测案例研究固态硬盘架构评估:无人机石油管道监测案例研究无人机石油管道监控案例研究
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.384-390
Annisa Istiqomah Arrahmah, Rissa Rahmania, D. E. Saputra
Oil pipeline monitoring using Unmanned Airborne Vehicles (UAV) can be done by utilizing Deep Learning. Deep Learning can be used to automatically detect harmed or unauthorized objects near the pipeline for further action by the authority. Input video in the pipeline area taken from the UAV has unique characteristics. It has low resolution with dense composition object in the image. The detected object also has a small scale as the objects are far away from the UAV. Thus, the selection of the Deep Learning algorithm is important to get a desirable result with the following conditions. Single Shot Multi-Box (SSD) is one of the popular Deep Learning algorithms with fast calculation compared to others and suitable for real-time object detection. Previous works on this topic using low to medium altitude dataset (20–200 m). This paper provides an evaluation of SSD implementation to detect vehicles on high-altitude dataset (300 m). As much as 2482 dataset is fed into SSD architecture and trained to detect 3 class of vehicles. The result shows the mAP and mAR are 0.026360 and 0.067377, respectively. However, the low lost function value shows that the model is able to classify the object correctly. In conclusion, the SSD cannot process low density information to correctly locate the object.
利用深度学习技术可以实现利用无人机(UAV)进行石油管道监测。深度学习可用于自动检测管道附近的受损或未经授权的物体,以便当局采取进一步行动。从无人机采集的管道区域输入视频具有独特的特点。它的分辨率较低,图像中构成物体密集。由于目标距离无人机较远,被探测目标的尺度也较小。因此,深度学习算法的选择对于在以下条件下获得理想的结果非常重要。单镜头多盒(Single Shot Multi-Box, SSD)算法是目前流行的深度学习算法之一,具有计算速度快,适合于实时目标检测。本文对SSD在高海拔数据集(300 m)上检测车辆的实现进行了评估,将多达2482个数据集输入到SSD架构中,并对其进行了训练,以检测3类车辆。结果表明,mAP和mAR分别为0.026360和0.067377。然而,低损失函数值表明该模型能够正确地对目标进行分类。综上所述,SSD无法处理低密度信息,无法正确定位目标。
{"title":"Evaluation of SSD Architecture for Small Size Object Detection: A Case Study on UAV Oil Pipeline MonitoringEvaluation of SSD Architecture for Small Size Object Detection: A Case Study on UAV Oil Pipeline Monitoring","authors":"Annisa Istiqomah Arrahmah, Rissa Rahmania, D. E. Saputra","doi":"10.18178/joig.11.4.384-390","DOIUrl":"https://doi.org/10.18178/joig.11.4.384-390","url":null,"abstract":"Oil pipeline monitoring using Unmanned Airborne Vehicles (UAV) can be done by utilizing Deep Learning. Deep Learning can be used to automatically detect harmed or unauthorized objects near the pipeline for further action by the authority. Input video in the pipeline area taken from the UAV has unique characteristics. It has low resolution with dense composition object in the image. The detected object also has a small scale as the objects are far away from the UAV. Thus, the selection of the Deep Learning algorithm is important to get a desirable result with the following conditions. Single Shot Multi-Box (SSD) is one of the popular Deep Learning algorithms with fast calculation compared to others and suitable for real-time object detection. Previous works on this topic using low to medium altitude dataset (20–200 m). This paper provides an evaluation of SSD implementation to detect vehicles on high-altitude dataset (300 m). As much as 2482 dataset is fed into SSD architecture and trained to detect 3 class of vehicles. The result shows the mAP and mAR are 0.026360 and 0.067377, respectively. However, the low lost function value shows that the model is able to classify the object correctly. In conclusion, the SSD cannot process low density information to correctly locate the object.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":" 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138614664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Brain Tumor Classification Efficacy through the Application of Feature Selection and Ensemble Classifiers 应用特征选择和集合分类器提高脑肿瘤分类效率
Q3 Computer Science Pub Date : 2023-12-01 DOI: 10.18178/joig.11.4.397-404
Y. Yuhandri, A. Windarto, Muhammad Noor Hasan Siregar
Accurate brain tumor detection is crucial due to its high mortality rate. However, existing automated methods suffer from limited accuracy and high false-positive rates. In this study, we aimed to improve brain tumor classification by comparing 17 different classifiers organized into six groups: Decision Tree (DT) Model, Support Vector Machine (SVM), Naive Bayes Classifier, Logistic Regression, Generalized Linear Model (GLM) Classifier, and Neural Network. We utilized a dataset of 3,762 Magnetic Resonance Imaging (MRI) scans of brain tumors from Kaggle, with each image having dimensions of 240 × 240 pixels and labeled as tumor or non-tumor. Our approach involved three main steps: extracting visual information using 17 predictor classes, optimizing feature extraction through weight optimization, and comparing different sets of classifier models. We evaluated the models’ performance using the confusion matrix and Receiver Operating Characteristics (ROC) curves. Our results showed that optimizing feature selection and utilizing ensemble classifiers improved the accuracy of brain tumor classification. The DT Model with ensemble classifiers emerged as the best-performing classifier, achieving an accuracy of 98.11% and an AUC of 0.99. Notably, Random Tree (RT) exhibited the highest accuracy within the ensemble classifier set, with a significant increase compared to other models. Our proposed method outperformed the standard approach, demonstrating its potential for enhancing brain tumor detection accuracy. This study contributes to the field by providing a more accurate method for detecting brain tumors, potentially enabling earlier detection and improved patient outcomes. Future research should focus on further improving brain tumor diagnosis and treatment through the application of machine learning techniques.
由于脑肿瘤的高死亡率,准确的检测是至关重要的。然而,现有的自动化方法存在准确性有限和假阳性率高的问题。在这项研究中,我们旨在通过比较17种不同的分类器来改进脑肿瘤分类,这些分类器分为六组:决策树(DT)模型、支持向量机(SVM)、朴素贝叶斯分类器、逻辑回归、广义线性模型(GLM)分类器和神经网络。我们使用了来自Kaggle的3,762个脑肿瘤磁共振成像(MRI)扫描数据集,每个图像的尺寸为240 × 240像素,并标记为肿瘤或非肿瘤。我们的方法包括三个主要步骤:使用17个预测器类提取视觉信息,通过权重优化优化特征提取,以及比较不同的分类器模型集。我们使用混淆矩阵和受试者工作特征(ROC)曲线来评估模型的性能。结果表明,优化特征选择和使用集成分类器可以提高脑肿瘤分类的准确性。具有集成分类器的DT模型是表现最好的分类器,准确率达到98.11%,AUC为0.99。值得注意的是,随机树(RT)在集成分类器集中表现出最高的准确率,与其他模型相比有显著提高。我们提出的方法优于标准方法,证明了其提高脑肿瘤检测准确性的潜力。这项研究通过提供一种更准确的检测脑肿瘤的方法,有可能使早期检测和改善患者预后,从而为该领域做出贡献。未来的研究应侧重于通过应用机器学习技术进一步提高脑肿瘤的诊断和治疗。
{"title":"Improving Brain Tumor Classification Efficacy through the Application of Feature Selection and Ensemble Classifiers","authors":"Y. Yuhandri, A. Windarto, Muhammad Noor Hasan Siregar","doi":"10.18178/joig.11.4.397-404","DOIUrl":"https://doi.org/10.18178/joig.11.4.397-404","url":null,"abstract":"Accurate brain tumor detection is crucial due to its high mortality rate. However, existing automated methods suffer from limited accuracy and high false-positive rates. In this study, we aimed to improve brain tumor classification by comparing 17 different classifiers organized into six groups: Decision Tree (DT) Model, Support Vector Machine (SVM), Naive Bayes Classifier, Logistic Regression, Generalized Linear Model (GLM) Classifier, and Neural Network. We utilized a dataset of 3,762 Magnetic Resonance Imaging (MRI) scans of brain tumors from Kaggle, with each image having dimensions of 240 × 240 pixels and labeled as tumor or non-tumor. Our approach involved three main steps: extracting visual information using 17 predictor classes, optimizing feature extraction through weight optimization, and comparing different sets of classifier models. We evaluated the models’ performance using the confusion matrix and Receiver Operating Characteristics (ROC) curves. Our results showed that optimizing feature selection and utilizing ensemble classifiers improved the accuracy of brain tumor classification. The DT Model with ensemble classifiers emerged as the best-performing classifier, achieving an accuracy of 98.11% and an AUC of 0.99. Notably, Random Tree (RT) exhibited the highest accuracy within the ensemble classifier set, with a significant increase compared to other models. Our proposed method outperformed the standard approach, demonstrating its potential for enhancing brain tumor detection accuracy. This study contributes to the field by providing a more accurate method for detecting brain tumors, potentially enabling earlier detection and improved patient outcomes. Future research should focus on further improving brain tumor diagnosis and treatment through the application of machine learning techniques.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":" 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138614904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
中国图象图形学报
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1