Pub Date: 2023-06-01 | DOI: 10.18178/joig.11.2.178-184
Daniel Kipele, K. Greyson
The presence of Poisson noise in medical X-ray images degrades image quality, obscuring information that is required for accurate diagnosis. During X-ray image acquisition, weak illumination yields a limited number of photons, which produces the Poisson noise commonly known as X-ray noise. Currently available X-ray noise removal methods have not yet achieved satisfactory denoising of medical X-ray images: they tend to perform well when the image matches the algorithm's model assumptions, but in general they fail to remove the noise completely. X-ray image quality could be improved by increasing the X-ray dose beyond the maximum medically permissible value, but this could be lethal to patients, since higher X-ray energy can kill cells. In this study, a hybrid model that combines Poisson Principal Component Analysis (Poisson PCA) with the nonlocal (NL) means denoising algorithm is developed to reduce image noise. This hybrid model for X-ray noise removal and contrast enhancement improves the quality of X-ray images and can thus be used for medical diagnosis. The performance of the proposed hybrid model was evaluated on standard data and compared with standard Poisson algorithms.
Title: Poisson Noise Reduction with Nonlocal-PCA Hybrid Model in Medical X-ray Images
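The variance-stabilization and patch-averaging core of such a hybrid can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: it applies the Anscombe transform (which makes Poisson noise approximately Gaussian with unit variance), runs a naive NL-means filter, and inverts the transform; the Poisson PCA stage is omitted, and all parameter values are arbitrary.

```python
import numpy as np

def nl_means(img, f=1, t=4, h=4.0):
    """Naive nonlocal-means: each pixel becomes a weighted average of
    pixels whose surrounding (2f+1)x(2f+1) patches look similar."""
    H, W = img.shape
    pad = np.pad(img, f, mode="reflect")
    out = np.empty_like(img)
    for i in range(H):
        for j in range(W):
            p = pad[i:i + 2 * f + 1, j:j + 2 * f + 1]
            acc = wsum = 0.0
            for y in range(max(0, i - t), min(H, i + t + 1)):
                for x in range(max(0, j - t), min(W, j + t + 1)):
                    q = pad[y:y + 2 * f + 1, x:x + 2 * f + 1]
                    w = np.exp(-np.sum((p - q) ** 2) / (h * h))
                    acc += w * img[y, x]
                    wsum += w
            out[i, j] = acc / wsum
    return out

rng = np.random.default_rng(0)
clean = np.tile(np.linspace(5.0, 40.0, 32), (32, 1))  # smooth exposure map
noisy = rng.poisson(clean).astype(float)              # Poisson (photon) noise

stab = 2.0 * np.sqrt(noisy + 3.0 / 8.0)   # Anscombe: variance -> ~1
den = nl_means(stab)
denoised = (den / 2.0) ** 2 - 3.0 / 8.0   # simple algebraic inverse
```

In practice the exact unbiased inverse Anscombe transform is preferred over the algebraic inverse used here, and the patch radius `f`, search radius `t`, and filtering strength `h` would be tuned to the dose level.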
Pub Date: 2023-06-01 | DOI: 10.18178/joig.11.2.146-152
Libao Yang, S. Zenian, R. Zakaria
Fuzzy image enhancement is an important method in image processing. It comprises three steps: gray-level fuzzification, membership modification using an intensifier (INT) operator, and defuzzification to obtain new gray levels. This paper proposes an adjustable INT operator with a parameter k. First, the image’s pixels are divided into two regions (low and high) by the Otsu method, and pixel memberships are computed by fuzzification within each region. The INT operator then reduces memberships in the low region and enlarges memberships in the high region. The parameter k is determined from each pixel’s neighborhood information and adjusts the strength of the INT operator. Finally, the enhanced image is obtained by defuzzification. In the experiments, fuzzy image enhancement with the adjustable intensifier operator achieves better performance.
Title: Fuzzy Image Enhancement Based on an Adjustable Intensifier Operator
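The Otsu-split fuzzification and INT steps can be sketched as follows. This is a simplified illustration under stated assumptions: the paper derives k per pixel from neighborhood information, while this sketch uses one global k (k = 2 recovers Zadeh's classic INT operator), and the piecewise fuzzification map is one plausible reading of the low/high-region scheme.

```python
import numpy as np

def otsu_threshold(img):
    """Otsu's method: pick the gray level maximizing between-class variance."""
    hist, _ = np.histogram(img, bins=256, range=(0, 256))
    total = hist.sum()
    sum_all = np.dot(np.arange(256), hist)
    w0 = sum0 = 0.0
    best_t, best_var = 0, -1.0
    for t in range(256):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t

def int_operator(u, k=2.0):
    """Adjustable intensifier: lowers memberships below 0.5, raises those above."""
    c = 2.0 ** (k - 1.0)
    return np.where(u <= 0.5, c * u ** k, 1.0 - c * (1.0 - u) ** k)

def fuzzy_enhance(img, k=2.0):
    g = img.astype(float)
    t = float(otsu_threshold(img))
    lo, hi = g.min(), g.max()
    # fuzzification: low region -> [0, 0.5], high region -> [0.5, 1]
    u = np.where(g <= t,
                 0.5 * (g - lo) / max(t - lo, 1e-9),
                 0.5 + 0.5 * (g - t) / max(hi - t, 1e-9))
    u = int_operator(u, k)  # darken low region, brighten high region
    # defuzzification back to gray levels with the same piecewise map
    return np.where(u <= 0.5,
                    lo + 2.0 * u * (t - lo),
                    t + 2.0 * (u - 0.5) * (hi - t))

img = np.tile(np.arange(64, dtype=np.uint8) * 4, (16, 1))  # test ramp
out = fuzzy_enhance(img)
```

On a gray-level ramp this pushes the low region darker and the high region brighter, so overall contrast (standard deviation) increases.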
Pub Date: 2023-06-01 | DOI: 10.18178/joig.11.2.195-203
K. Saminathan, B. Sowmiya, Devi M Chithra
With the population increasing, improving the quality and quantity of food is essential. Paddy is a vital food crop for numerous people on several continents, and its yield is affected by many factors. Early disease diagnosis is needed to prevent plants from reaching successive stages of disease. Manual diagnosis by the naked eye is the traditional method widely adopted by farmers to identify leaf diseases. However, manual diagnosis brings problems such as the need to hire domain experts, time consumption, and inaccurate results; inconsistent results may lead to improper treatment of plants. To overcome this, researchers have proposed automatic disease diagnosis, which helps farmers diagnose diseases swiftly and accurately without an expert. This manuscript develops a model to classify four paddy leaf diseases: bacterial blight, blast, tungro, and brown spot. First, each image is preprocessed by resizing and conversion to the Red, Green, Blue (RGB) and Hue, Saturation, Value (HSV) color spaces, followed by segmentation. Global features, namely Hu moments, Haralick features, and a color histogram, are extracted and concatenated. The data is split into training and testing sets in a 70:30 ratio. Multiple classifiers are trained: Logistic Regression, Random Forest, Decision Tree, K-Nearest Neighbor (KNN), Linear Discriminant Analysis (LDA), Support Vector Machine (SVM), and Gaussian Naive Bayes. This study reports the Random Forest classifier as the best: the proposed model reached an accuracy of 92.84% after validation and 97.62% after testing on paddy disorder samples. Ten-fold cross validation is performed, and classifier performance is measured using a confusion matrix with precision, recall, F1-score, and support as parameters.
Title: Multiclass Classification of Paddy Leaf Diseases Using Random Forest Classifier
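The classification stage of such a pipeline can be sketched with scikit-learn. This is a hedged illustration, not the paper's code: random vectors with an artificial per-class offset stand in for the concatenated Hu-moment, Haralick, and color-histogram features, and the feature dimensions (7 + 13 + 512 = 532) are illustrative assumptions.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)

# Stand-in for concatenated global features (sizes are illustrative):
# 7 Hu moments + 13 Haralick + 512-bin color histogram = 532 dimensions.
n_per_class, n_dims = 60, 532
classes = ["bacterial blight", "blast", "tungro", "brown spot"]
X = rng.normal(size=(n_per_class * 4, n_dims))
y = np.repeat(np.arange(4), n_per_class)
X += y[:, None] * 1.0  # synthetic class separation for the demo

# 70:30 train/test split, as in the paper
X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_tr, y_tr)
acc = accuracy_score(y_te, clf.predict(X_te))
```

With real images, the feature vectors would come from `cv2.HuMoments`-style shape moments, gray-level co-occurrence statistics, and an HSV histogram, and the reported confusion-matrix metrics would be computed on the held-out 30%.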
Pub Date: 2023-06-01 | DOI: 10.18178/joig.11.2.127-131
A. Saif, Z. R. Mahayuddin
Google Tilt Brush is an expensive option for virtual drawing, and its underlying mechanisms need functional improvement; this research addresses several issues in that context, namely noise removal from sensor data, drift caused by double integration, and cost. Currently available smartphones cannot be used for drawing in virtual environments with Google Cardboard or Daydream without purchasing an Oculus Rift or HTC Vive (virtual reality headsets), which are too expensive for a large number of users. In addition, extrinsic hardware such as satellite localization devices and ultrasonic localization systems is not used for drawing in virtual reality. The proposed methodology implements an extended Kalman filter and a Butterworth filter to perform six-degree-of-freedom positioning from microelectromechanical systems (MEMS) sensor data. A stereo vision method based on Simultaneous Localization and Mapping (SLAM) estimates the position measurements and drift, using an Android smartphone as the hardware platform. This research was implemented with the Google VR SDK and the Unity3D engine. Experimental validation shows that the proposed method can perform painting with virtual reality hardware and smartphone-based controller software, with satisfactory accuracy and without an extrinsic controller device such as the Oculus Rift or HTC Vive.
Title: Stereo Vision Based Localization of Handheld Controller in Virtual Reality for 3D Painting Using Inertial System
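The sensor-smoothing stage can be sketched with SciPy. This is a minimal illustration under stated assumptions: the sample rate, cutoff frequency, and noise level are invented, and the extended Kalman filter and SLAM fusion are omitted; the final double integration only demonstrates why drift correction (e.g., by visual position fixes) is needed.

```python
import numpy as np
from scipy.signal import butter, filtfilt

fs = 100.0                        # assumed IMU sample rate (Hz)
t = np.arange(0.0, 5.0, 1.0 / fs)
true_acc = np.sin(2 * np.pi * 0.5 * t)           # slow hand motion
rng = np.random.default_rng(1)
meas = true_acc + 0.3 * rng.normal(size=t.size)  # MEMS sensor noise

# 4th-order low-pass Butterworth at 5 Hz (motion band), zero-phase filtering
b, a = butter(4, 5.0 / (fs / 2.0), btype="low")
smooth = filtfilt(b, a, meas)

# Naive double integration: residual noise integrates into a random walk,
# which is why an EKF with external (e.g., stereo-SLAM) fixes is required.
vel = np.cumsum(smooth) / fs
pos = np.cumsum(vel) / fs
```

Zero-phase filtering (`filtfilt`) avoids the group delay a causal filter would add, which matters when the filtered acceleration is fused with camera-based position measurements.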
Pub Date: 2023-06-01 | DOI: 10.18178/joig.11.2.115-120
R. Radtke, Alexander Jesser
At the end of December 2019, a person in the Chinese city of Wuhan was probably infected for the first time with the novel SARS-CoV-2 virus. In order to react as quickly as possible after infection, rapid diagnostic measures are of the utmost importance so that medical treatment can begin at an early stage. An image-processing algorithm is presented that uses chest X-rays to determine whether a lung infection has a viral or a bacterial cause. In contrast to more complicated evaluation methods, the focus was on a simple algorithm: the Canny algorithm is used for edge detection of infected areas of the lung tissue instead of complex deep learning processes. The main advantage is that the method can be ported to many different computer systems with little effort and does not need huge computing power. This should contribute to a faster diagnosis of SARS-CoV-2 infection, especially in medically underdeveloped areas.
Title: Rapid Analysis of Thorax Images for the Detection of Viral Infections
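The computational cheapness of an edge-based approach is easy to see from the core of a Canny-style detector. This sketch, assuming a synthetic bright region in place of a real thorax image, keeps only the Gaussian smoothing, Sobel gradient, and thresholding steps; full Canny adds non-maximum suppression and hysteresis thresholding.

```python
import numpy as np
from scipy import ndimage

def simple_edges(img, sigma=1.0, thresh=0.5):
    """Smoothing + Sobel gradient magnitude + threshold: the core of a
    Canny-style detector (non-maximum suppression and hysteresis omitted)."""
    smoothed = ndimage.gaussian_filter(img.astype(float), sigma)
    gx = ndimage.sobel(smoothed, axis=1)
    gy = ndimage.sobel(smoothed, axis=0)
    mag = np.hypot(gx, gy)
    return mag > thresh * mag.max()

# Synthetic bright "opacity" on a dark background, standing in for an X-ray
img = np.zeros((64, 64))
img[20:44, 20:44] = 1.0
edges = simple_edges(img)
```

All operations are separable convolutions and comparisons, so the method runs on modest hardware with no GPU, which is the portability argument the paper makes.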
Nowadays, humans communicate easily with others through speech, written text, and, in particular, facial expressions. In human communication it is critical to comprehend the other person’s emotion or implicit expression. Indeed, facial expression recognition is vital for analyzing the emotions of conversation partners, which can contribute to a range of applications, including mental health consulting: the technique enables psychiatrists to select appropriate questions based on their patients’ current emotional state. The purpose of this study was to develop a deep learning-based model for detecting and recognizing emotions on human faces. We divided the experiment into two parts, a Faster R-CNN model and a mini-Xception architecture, and concentrated on four distinct emotional states: angry, sad, happy, and neutral. Both models were compared during the evaluation process, and the findings indicate that the mini-Xception architecture produced better results than the Faster R-CNN. This study will be expanded in the future to include the detection of complex emotions such as sadness.
Pub Date: 2023-06-01 | DOI: 10.18178/joig.11.2.140-145
Title: Deep Learning-Based Emotion Recognition through Facial Expressions
Authors: Sarunya Kanjanawattana, Piyapong Kittichaiwatthana, Komsan Srivisut, Panchalee Praneetpholkrang
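A key reason mini-Xception is light enough for this task is its use of depthwise-separable convolutions. The arithmetic below (with illustrative channel counts, not taken from the paper) shows the parameter saving over a standard convolution.

```python
def conv_params(c_in, c_out, k):
    """Parameters of a standard k x k convolution (bias ignored)."""
    return c_in * c_out * k * k

def sep_conv_params(c_in, c_out, k):
    """Depthwise k x k (one filter per input channel) + 1 x 1 pointwise."""
    return c_in * k * k + c_in * c_out

standard = conv_params(32, 64, 3)      # 32 * 64 * 9  = 18,432 parameters
separable = sep_conv_params(32, 64, 3)  # 288 + 2,048 = 2,336 parameters
```

For this layer the separable form needs roughly an eighth of the parameters, which is why stacking such blocks yields a model small enough for real-time facial expression recognition.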
Pub Date: 2023-06-01 | DOI: 10.18178/joig.11.2.132-139
Takato Sakai, M. Seo, N. Matsushiro, Yen-Wei Chen
The Yanagihara method evaluates facial nerve palsy on the basis of visual examination by physicians. Examples of scored images are important for education and as references; however, due to patient privacy concerns, facial images of real patients cannot be used for educational purposes. In this paper, we propose a solution to this problem: generating facial images of a virtual patient with facial nerve palsy that can be shared and used by physicians. To reproduce a patient’s facial expression on a public face image, we propose a method that generates a swapped face image using an improved Cycle Generative Adversarial Network (Cycle GAN) with a skip-layer excitation module and a self-supervised discriminator. Experimental results demonstrate that the proposed model generates more coherent swapped faces that match the public face identity and the patient’s facial expression. The proposed method also improves the quality of the generated swapped face images while keeping them consistent with the source (genuine) face image.
Title: Simulation of Facial Palsy Using Cycle GAN with Skip-Layer Excitation Module and Self-Supervised Discriminator
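The skip-layer excitation idea (channel gating of a high-resolution feature map by a low-resolution one, as in FastGAN-style generators) can be sketched in NumPy. This is a hedged simplification: the module's pooled-convolution stack is collapsed into one hypothetical projection matrix `w`, and all shapes are invented.

```python
import numpy as np

def skip_layer_excitation(feat_low, feat_high, w):
    """Channel gating: a deep low-resolution feature map modulates a shallow
    high-resolution one. w is a hypothetical (C_high, C_low) projection
    standing in for the module's pool-and-convolve stack."""
    pooled = feat_low.mean(axis=(1, 2))           # global average pool -> (C_low,)
    gate = 1.0 / (1.0 + np.exp(-(w @ pooled)))    # sigmoid gates in (0, 1)
    return feat_high * gate[:, None, None]        # scale each high-res channel

rng = np.random.default_rng(0)
feat_low = rng.normal(size=(64, 8, 8))      # deep, low-resolution features
feat_high = rng.normal(size=(128, 64, 64))  # shallow, high-resolution features
w = 0.1 * rng.normal(size=(128, 64))
out = skip_layer_excitation(feat_low, feat_high, w)
```

Because the gates lie in (0, 1), the module can only attenuate channels, letting global (expression-level) information from the deep layers steer the fine detail produced at high resolution.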
Pub Date: 2023-03-01 | DOI: 10.18178/joig.11.1.15-20
Sachin Bahade, Michael Edwards, Xianghua Xie
Nuclei detection in histopathology images of cancerous tissue stained with conventional hematoxylin and eosin is a challenging task due to the complexity and diversity of cell data. Deep learning techniques have produced encouraging results in nuclei detection, where the main emphasis is on classification- and regression-based methods, and recent research has demonstrated that regression-based techniques outperform classification. In this paper, we propose a classification model based on graph convolutions to classify nuclei, and similar models to detect nuclei using a cascaded architecture. With nearly 29,000 annotated nuclei in a large dataset of cancer histology images, we evaluated Convolutional Neural Network (CNN) and Graph Convolutional Network (GCN) based approaches. Our findings demonstrate that graph convolutions perform better with a cascaded GCN architecture and are more stable than the centre-of-pixel approach. We compared our twofold quantitative evaluation results with CNN-based models such as the Spatially Constrained Convolutional Neural Network (SC-CNN) and the Centre-of-Pixel Convolutional Neural Network (CP-CNN). We used two loss functions, binary cross-entropy and focal loss, and also investigated the behaviour of the CP-CNN and GCN models to observe the effectiveness of the CNN and GCN operators.
The compared quantitative F1 score of the cascaded GCN shows an improvement of 6% over state-of-the-art methods.
Title: Cascaded Graph Convolution Approach for Nuclei Detection in Histopathology Images
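A single graph-convolution layer of the kind stacked in such models can be sketched in NumPy. This is a generic GCN layer (symmetric normalization with self-loops, as in Kipf and Welling), not the paper's specific cascade; the toy graph and feature sizes are invented.

```python
import numpy as np

def gcn_layer(adj, feats, weights):
    """One graph convolution: H' = ReLU(D^-1/2 (A + I) D^-1/2 H W)."""
    a_hat = adj + np.eye(adj.shape[0])             # add self-loops
    deg = a_hat.sum(axis=1)
    a_norm = a_hat / np.sqrt(np.outer(deg, deg))   # symmetric normalization
    return np.maximum(a_norm @ feats @ weights, 0.0)

# Toy nuclei graph: 5 candidate nuclei, edges between spatial neighbours
adj = np.array([[0, 1, 0, 0, 1],
                [1, 0, 1, 0, 0],
                [0, 1, 0, 1, 0],
                [0, 0, 1, 0, 1],
                [1, 0, 0, 1, 0]], dtype=float)
rng = np.random.default_rng(3)
feats = rng.normal(size=(5, 16))     # per-candidate appearance features
weights = rng.normal(size=(16, 8))
h = gcn_layer(adj, feats, weights)
```

Each candidate's new representation mixes in its neighbours' features, which is what lets a cascade of such layers exploit the spatial arrangement of nuclei rather than treating each pixel independently.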
Various breast cancer detection systems have been developed to help clinicians analyze screening mammograms. Breast cancer incidence has been increasing gradually, so scientists work to develop new methods to reduce the risks of this life-threatening disease. Convolutional Neural Networks (CNNs) have shown much promise in medical imaging thanks to recent developments in deep learning, but CNN-based methods have been restricted by the small size of the few public breast cancer datasets. This research develops and introduces a new framework to detect breast cancer. The framework utilizes CNNs and image processing because CNNs have been an important success in image recognition, reaching human-level performance. An efficient and fast pre-trained CNN object detector called RetinaNet, an uncomplicated one-stage detector, is used, and two-stage transfer learning is applied to improve its performance. The RetinaNet model is initially trained on a general-purpose dataset, COCO. Transfer learning is then used to adapt the model to a mammogram dataset, CBIS-DDSM. Finally, a second transfer-learning step evaluates the model on a small mammogram dataset, INbreast. The results of the proposed two-stage transfer learning (RetinaNet → CBIS-DDSM → INbreast) are better than the other state-of-the-art methods on the public INbreast dataset.
{"title":"Breast Cancer Detection Using Image Processing and Machine Learning","authors":"Z. Q. Habeeb, B. Vuksanovic, Imad Al-Zaydi","doi":"10.18178/joig.11.1.1-8","DOIUrl":"https://doi.org/10.18178/joig.11.1.1-8","url":null,"abstract":"Different breast cancer detection systems have been developed to help clinicians analyze screening mammograms. Breast cancer has been increasing gradually so scientists work to develop new methods to reduce the risks of this life-threatening disease. Convolutional Neural Networks (CNNs) have shown much promise In the field of medical imaging because of recent developments in deep learning. However, CNN’s based methods have been restricted due to the small size of the few public breast cancer datasets. This research has developed a new framework and introduced it to detect breast cancer. This framework utilizes Convolutional Neural Networks (CNNs) and image processing to achieve its goal because CNNs have been an important success in image recognition, reaching human performance. An efficient and fast CNN pre-trained object detector called RetinaNet has been used in this research. RetinaNet is an uncomplicated one-stage object detector. A two-stage transfer learning has been used with the selected detector to improve the performance. RetinaNet model is initially trained with a general-purpose dataset called COCO dataset. The transfer learning is then used to apply the RetinaNet model to another dataset of mammograms called the CBIS-DDSM dataset. Finally, the second transfer learning is used to test the RetinaNet model onto a small dataset of mammograms called the INbreast dataset. The results of the proposed two-stage transfer learning (RetinaNet → CBIS-DDSM → INbreast) are better than the other state-of-the-art methods on the public INbreast dataset. 
Furthermore, the True Positive Rate (TPR) is 0.99 ± 0.02 at 1.67 False Positives per Image (FPPI), which is better than the one-stage transfer learning with a TPR of 0.94 ± 0.02 at 1.67 FPPI.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":"108 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83824098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
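The operating point reported above (TPR of 0.99 at 1.67 FPPI) is a standard free-response detection metric: the true positive rate achieved at a score threshold yielding a given number of false positives per image. A minimal sketch of how such a point can be read off from scored, TP/FP-labeled detections (the helper function and the toy data below are hypothetical, not the paper's evaluation code):

```python
def tpr_at_fppi(detections, num_lesions, num_images, fppi_target):
    """Sweep the score threshold over scored detections and return the TPR
    at the threshold whose false positives per image is closest to fppi_target.

    detections: list of (score, is_true_positive) pairs.
    """
    dets = sorted(detections, key=lambda d: -d[0])  # highest score first
    best_diff, best_tpr = float("inf"), 0.0
    tp = fp = 0
    for _score, is_tp in dets:
        if is_tp:
            tp += 1
        else:
            fp += 1
        fppi = fp / num_images
        tpr = tp / num_lesions
        # Prefer the later (higher-recall) point when the distance ties.
        if abs(fppi - fppi_target) <= best_diff:
            best_diff, best_tpr = abs(fppi - fppi_target), tpr
    return best_tpr

# Hypothetical detections from 3 mammograms containing 4 lesions in total.
dets = [(0.95, True), (0.90, True), (0.80, False), (0.75, True),
        (0.60, False), (0.55, True), (0.40, False)]
print(tpr_at_fppi(dets, num_lesions=4, num_images=3, fppi_target=1.0))  # → 1.0
```

In a full evaluation, detections would be labeled TP/FP by matching predicted boxes against ground-truth lesion annotations (e.g., by overlap) before this threshold sweep.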
Pub Date : 2023-03-01 DOI: 10.18178/joig.11.1.61-71
L. Reyes-Ruiz, Eduardo Fragoso-Navarro, F. Garcia-Ugalde, O. Juarez-Sandoval, M. Cedillo-Hernández, M. Nakano-Miyatake
Nowadays, advances in information and communication technologies, along with easy access to electronic devices such as smartphones, have enabled agile and efficient storage, editing, and distribution of digital multimedia files. However, the lack of regulation has led to several problems associated with intellectual property authentication and copyright protection. Furthermore, the problem becomes more complex in scenarios of illegal printed exploitation, which involve printing and scanning processes. To solve these problems, several digital watermarking schemes combined with cryptographic algorithms have been proposed. In this paper, a robust watermarking strategy is defined for the administration and detection of unauthorized use of digitized cinematographic images from Mexican cultural heritage. The proposed strategy combines two types of digital watermarking, a visible-camouflaged watermark based on the spatial domain and an invisible watermark based on the frequency domain, together with particle swarm optimization. The experimental results show the high performance of the proposed algorithm against printing-scanning processes (digital-analogue attacks) and against common geometric and image-processing attacks such as JPEG compression.
{"title":"Robust Dual Digital Watermark Applied to Antique Digitized Cinema Images: Resistant to Print-Scan Attack","authors":"L. Reyes-Ruiz, Eduardo Fragoso-Navarro, F. Garcia-Ugalde, O. Juarez-Sandoval, M. Cedillo-Hernández, M. Nakano-Miyatake","doi":"10.18178/joig.11.1.61-71","DOIUrl":"https://doi.org/10.18178/joig.11.1.61-71","url":null,"abstract":"Nowadays, advances in information and communication technologies, along with easy access to electronic devices such as smartphones, have enabled agile and efficient storage, editing, and distribution of digital multimedia files. However, the lack of regulation has led to several problems associated with intellectual property authentication and copyright protection. Furthermore, the problem becomes more complex in scenarios of illegal printed exploitation, which involve printing and scanning processes. To solve these problems, several digital watermarking schemes combined with cryptographic algorithms have been proposed. In this paper, a robust watermarking strategy is defined for the administration and detection of unauthorized use of digitized cinematographic images from Mexican cultural heritage. The proposed strategy combines two types of digital watermarking, a visible-camouflaged watermark based on the spatial domain and an invisible watermark based on the frequency domain, together with particle swarm optimization. The experimental results show the high performance of the proposed algorithm against printing-scanning processes (digital-analogue attacks) and against common geometric and image-processing attacks such as JPEG compression. 
Additionally, the imperceptibility of the watermark is evaluated by PSNR and compared with other previously proposed algorithms.","PeriodicalId":36336,"journal":{"name":"中国图象图形学报","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81786159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
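PSNR, the measure used above to evaluate watermark imperceptibility, compares the watermarked image against the original: the higher the value (in dB), the less visible the embedding. A minimal NumPy sketch with illustrative arrays (not the paper's data or embedding scheme):

```python
import numpy as np

def psnr(original, watermarked, max_val=255.0):
    """Peak signal-to-noise ratio in dB between two same-shape 8-bit images."""
    diff = original.astype(np.float64) - watermarked.astype(np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")  # images are identical
    return 10.0 * np.log10(max_val ** 2 / mse)

# Toy example: flat gray image vs. a copy with a quarter of its pixels
# perturbed by one gray level, mimicking a low-amplitude watermark.
img = np.full((64, 64), 128, dtype=np.uint8)
wm = img.copy()
wm[::2, ::2] += 1
print(round(psnr(img, wm), 2))  # → 54.15
```

Values above roughly 40 dB are commonly taken to indicate an imperceptible watermark, which is why PSNR is a natural comparison metric between embedding schemes.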