首页 > 最新文献

2018 Digital Image Computing: Techniques and Applications (DICTA)最新文献

英文 中文
Railway Infrastructure Defects Recognition using Fine-grained Deep Convolutional Neural Networks 基于细粒度深度卷积神经网络的铁路基础设施缺陷识别
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615868
Huaxi Huang, Jingsong Xu, Jian Zhang, Qiang Wu, Christina Kirsch
Railway power supply infrastructure is one of the most important components of railway transportation. As the key step of railway maintenance system, power supply infrastructure defects recognition plays a vital role in the whole defects inspection sub-system. Traditional defects recognition task is performed manually, which is time-consuming and high-labor costing. Inspired by the great success of deep neural networks in dealing with different vision tasks, this paper presents an end-to-end deep network to solve the railway infrastructure defects detection problem. More importantly, this paper is the first work that adopts the idea of deep fine-grained classification to do railway defects detection. We propose a new bilinear deep network named Spatial Transformer And Bilinear Low-Rank (STABLR) model and apply it to railway infrastructure defects detection. The experimental results demonstrate that the proposed method outperforms both hand-craft features based machine learning methods and classic deep neural network methods.
铁路供电基础设施是铁路运输的重要组成部分之一。供电基础设施缺陷识别作为铁路维修系统的关键环节,在整个缺陷检测子系统中起着至关重要的作用。传统的缺陷识别任务是手工完成的,耗时长,人工成本高。受深度神经网络在处理不同视觉任务方面的巨大成功的启发,本文提出了一种端到端深度网络来解决铁路基础设施缺陷检测问题。更重要的是,本文首次采用深细粒度分类的思想进行铁路缺陷检测。提出了一种新的双线性深度网络——空间变压器和双线性低秩(STABLR)模型,并将其应用于铁路基础设施缺陷检测。实验结果表明,该方法优于基于手工特征的机器学习方法和经典的深度神经网络方法。
{"title":"Railway Infrastructure Defects Recognition using Fine-grained Deep Convolutional Neural Networks","authors":"Huaxi Huang, Jingsong Xu, Jian Zhang, Qiang Wu, Christina Kirsch","doi":"10.1109/DICTA.2018.8615868","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615868","url":null,"abstract":"Railway power supply infrastructure is one of the most important components of railway transportation. As the key step of railway maintenance system, power supply infrastructure defects recognition plays a vital role in the whole defects inspection sub-system. Traditional defects recognition task is performed manually, which is time-consuming and high-labor costing. Inspired by the great success of deep neural networks in dealing with different vision tasks, this paper presents an end-to-end deep network to solve the railway infrastructure defects detection problem. More importantly, this paper is the first work that adopts the idea of deep fine-grained classification to do railway defects detection. We propose a new bilinear deep network named Spatial Transformer And Bilinear Low-Rank (STABLR) model and apply it to railway infrastructure defects detection. The experimental results demonstrate that the proposed method outperforms both hand-craft features based machine learning methods and classic deep neural network methods.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133050751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Automatic Detection of Valves with Disaster Response Robot on Basis of Depth Camera Information 基于深度相机信息的灾害响应机器人阀门自动检测
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615796
Keishi Nishikawa, J. Ohya, T. Matsuzawa, A. Takanishi, H. Ogata, K. Hashimoto
In recent years, there has been an increasing demand for disaster response robots designed for working in disaster sites such as nuclear power plants where accidents have occurred. One of the tasks the robots need to complete at these kinds of sites is turning a valve. In order to employ robots to perform this task at real sites, it is desirable that the robots have autonomy for detecting the valves to be manipulated. In this paper, we propose a method that allows a disaster response robot to detect a valve, whose parameters such as position, orientation and size are unknown, based on information captured by a depth camera mounted on the robot. In our proposed algorithm, first the target valve is detected on the basis of an RGB image captured by the depth camera, and 3D point cloud data including the target is reconstructed by combining the detection result and the depth image. Second, the reconstructed point cloud data is processed to estimate parameters describing the target. Experiments were conducted on a simulator, and the results showed that our method could accurately estimate the parameters with a minimum error of 0.0230 m in position, 0.196 % in radius, and 0.00222 degree in orientation.
近年来,为在核电站等发生事故的灾难现场工作而设计的灾难响应机器人的需求不断增加。在这些地方,机器人需要完成的任务之一是转动阀门。为了让机器人在实际现场执行这项任务,希望机器人能够自主检测要操作的阀门。在本文中,我们提出了一种方法,该方法允许灾难响应机器人根据安装在机器人上的深度相机捕获的信息检测位置,方向和尺寸等参数未知的阀门。该算法首先基于深度相机捕获的RGB图像对目标阀进行检测,并结合检测结果和深度图像重构包含目标的三维点云数据。其次,对重建的点云数据进行处理,估计描述目标的参数;仿真实验结果表明,该方法能够准确地估计参数,位置误差最小为0.0230 m,半径误差最小为0.196 %,方位误差最小为0.00222°。
{"title":"Automatic Detection of Valves with Disaster Response Robot on Basis of Depth Camera Information","authors":"Keishi Nishikawa, J. Ohya, T. Matsuzawa, A. Takanishi, H. Ogata, K. Hashimoto","doi":"10.1109/DICTA.2018.8615796","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615796","url":null,"abstract":"In recent years, there has been an increasing demand for disaster response robots designed for working in disaster sites such as nuclear power plants where accidents have occurred. One of the tasks the robots need to complete at these kinds of sites is turning a valve. In order to employ robots to perform this task at real sites, it is desirable that the robots have autonomy for detecting the valves to be manipulated. In this paper, we propose a method that allows a disaster response robot to detect a valve, whose parameters such as position, orientation and size are unknown, based on information captured by a depth camera mounted on the robot. In our proposed algorithm, first the target valve is detected on the basis of an RGB image captured by the depth camera, and 3D point cloud data including the target is reconstructed by combining the detection result and the depth image. Second, the reconstructed point cloud data is processed to estimate parameters describing the target. Experiments were conducted on a simulator, and the results showed that our method could accurately estimate the parameters with a minimum error of 0.0230 m in position, 0.196 % in radius, and 0.00222 degree in orientation.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"127 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131762990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Inter-Subject Image Registration of Clinical Neck MRI Volumes using Discrete Periodic Spline Wavelet and Free form Deformation 基于离散周期样条小波和自由变形的临床颈部MRI体间图像配准
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615825
A. Suman, Md. Asikuzzaman, A. Webb, D. Perriman, M. Pickering
This paper presents a framework for inter-patient image registration which uses a multi-thresholds, multi-similarity measures and multi-transformations based on compactly supported spline and discrete periodic spline wavelets (DPSWs) using the Gauss-Newton gradient descent (GNGD) and gradient descent (GD) optimization methods. Our primary intellectual contribution is incorporating DPSWs in the transformation while another includes fusing out-of-range concept in a surface matching technique which is implemented by a multi-transformations and multi-similarity measures. In particular, as a true deformation cannot be achieved by single combination of transformation, similarity measure (SM) and optimization of a registration process, a moving image is required to be brought within the range of a registration. On the other hand, the surface matching technique involves an edge position difference (EPD) SM in which coarse to fine surfaces are matched using multiple thresholds with a spline-based free from deformation (FFD) method. The registration experiments were performed on 3D clinical neck magnetic resonance (MR) images, with the results showing that our proposed method provides good accuracy and robustness.
本文提出了一个基于紧支持样条和离散周期样条小波(DPSWs)的多阈值、多相似度和多变换的患者间图像配准框架,该框架采用高斯-牛顿梯度下降(GNGD)和梯度下降(GD)优化方法。我们的主要智力贡献是将dpsw纳入到转换中,而另一个智力贡献包括将超距概念融合到表面匹配技术中,该技术通过多转换和多相似度量来实现。特别是,由于单纯结合变换、相似度量(SM)和配准过程的优化无法实现真正的形变,因此需要将运动图像置于配准范围内。另一方面,表面匹配技术涉及边缘位置差(EPD) SM,其中使用基于样条的无变形(FFD)方法使用多个阈值匹配粗到细表面。在临床颈部磁共振三维图像上进行了配准实验,结果表明该方法具有较好的准确性和鲁棒性。
{"title":"Inter-Subject Image Registration of Clinical Neck MRI Volumes using Discrete Periodic Spline Wavelet and Free form Deformation","authors":"A. Suman, Md. Asikuzzaman, A. Webb, D. Perriman, M. Pickering","doi":"10.1109/DICTA.2018.8615825","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615825","url":null,"abstract":"This paper presents a framework for inter-patient image registration which uses a multi-thresholds, multi-similarity measures and multi-transformations based on compactly supported spline and discrete periodic spline wavelets (DPSWs) using the Gauss-Newton gradient descent (GNGD) and gradient descent (GD) optimization methods. Our primary intellectual contribution is incorporating DPSWs in the transformation while another includes fusing out-of-range concept in a surface matching technique which is implemented by a multi-transformations and multi-similarity measures. In particular, as a true deformation cannot be achieved by single combination of transformation, similarity measure (SM) and optimization of a registration process, a moving image is required to be brought within the range of a registration. On the other hand, the surface matching technique involves an edge position difference (EPD) SM in which coarse to fine surfaces are matched using multiple thresholds with a spline-based free from deformation (FFD) method. The registration experiments were performed on 3D clinical neck magnetic resonance (MR) images, with the results showing that our proposed method provides good accuracy and robustness.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132998156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Evaluation of Graph Topological Features in Digitized Mammogram for Microcalcification Cluster Classification 微钙化聚类分类的数字化乳房x线图拓扑特征评价
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615858
N. Alam, R. Zwiggelaar
In this paper, the scale-specific graph topological changes of Microcalcifications (MC) were investigated to classify MC cluster. A series of multi-scale MC cluster graphs were generated based on the connectivity of individual MCs. The extracted features from the graph series were integrated with the statistical and morphological characteristics of MC clusters. Subsequent feature selection showed that the features related to the denseness of MC cluster at some specific scales of the generated graphs discriminated better than all other features in classifying MC clusters while using an ensemble classifier with 10-fold cross validation. The proposed method was evaluated using two well-known digitized datasets: MIAS (Mammographic Image Analysis Society) and DDSM (The Digital Database for Screening Mammography). High classification accuracy (around 98%) and good ROC (receiver operating characteristic) results (area under the ROC curve up to 0.99) were achieved.
本文研究了微钙化(mcc)的尺度特异性图拓扑变化,对mcc簇进行了分类。基于单个MC的连通性,生成了一系列多尺度MC聚类图。从图序列中提取的特征与MC聚类的统计特征和形态学特征相结合。随后的特征选择表明,在使用10倍交叉验证的集成分类器对MC聚类进行分类时,在生成的图的某些特定尺度上,与MC聚类密度相关的特征比所有其他特征都更好。采用两个著名的数字化数据集:MIAS(乳房摄影图像分析协会)和DDSM(乳腺摄影筛查数字数据库)对所提出的方法进行了评估。获得了较高的分类准确率(约98%)和良好的ROC(受试者工作特征)结果(ROC曲线下面积可达0.99)。
{"title":"Evaluation of Graph Topological Features in Digitized Mammogram for Microcalcification Cluster Classification","authors":"N. Alam, R. Zwiggelaar","doi":"10.1109/DICTA.2018.8615858","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615858","url":null,"abstract":"In this paper, the scale-specific graph topological changes of Microcalcifications (MC) were investigated to classify MC cluster. A series of multi-scale MC cluster graphs were generated based on the connectivity of individual MCs. The extracted features from the graph series were integrated with the statistical and morphological characteristics of MC clusters. Subsequent feature selection showed that the features related to the denseness of MC cluster at some specific scales of the generated graphs discriminated better than all other features in classifying MC clusters while using an ensemble classifier with 10-fold cross validation. The proposed method was evaluated using two well-known digitized datasets: MIAS (Mammographic Image Analysis Society) and DDSM (The Digital Database for Screening Mammography). High classification accuracy (around 98%) and good ROC (receiver operating characteristic) results (area under the ROC curve up to 0.99) were achieved.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127700289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bone Age Assessment Based on Two-Stage Deep Neural Networks 基于两阶段深度神经网络的骨龄评估
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615764
Meicheng Chu, Bo Liu, F. Zhou, X. Bai, Bin Guo
Skeletal bone age assessment is a clinical practice to diagnose the maturity of children. To accurately assess the bone age, we proposed an automatic bone age assessment method in this paper based on deep convolution network. This method includes two stages: mask generation network and age assessment network. A U-Net convolution network with pretrained VGG16 as the encoder is used to extract the mask of bones. For the assessment module, the original images are fused together with the generated mask image to obtain segmented normalized hand bone images. We then built a multiple output convolution network for accurate age assessment. Finally, the bone age regression problem is transformed into the K-1 binary classification sub-problems. Our model was tested on RSNA2017 Pediatric Bone Age dataset. We were able to achieve the mean absolute error (MAE) of 5.98 months, which outperforms other common methods for bone age assessment. The proposed method could be used for developing fully automatic bone age assessment with better accuracy.
骨龄评估是诊断儿童成熟度的一项临床实践。为了准确评估骨龄,本文提出了一种基于深度卷积网络的骨龄自动评估方法。该方法包括两个阶段:掩码生成网络和年龄评估网络。利用预训练的VGG16作为编码器的U-Net卷积网络提取骨骼的掩码。评估模块将原始图像与生成的掩模图像融合在一起,得到分割的归一化手骨图像。然后,我们构建了一个多输出卷积网络,用于准确的年龄评估。最后,将骨龄回归问题转化为K-1二分类子问题。我们的模型在RSNA2017儿童骨龄数据集上进行了测试。我们能够达到5.98个月的平均绝对误差(MAE),优于其他常用的骨龄评估方法。该方法可用于开发全自动骨龄评估,具有较高的准确性。
{"title":"Bone Age Assessment Based on Two-Stage Deep Neural Networks","authors":"Meicheng Chu, Bo Liu, F. Zhou, X. Bai, Bin Guo","doi":"10.1109/DICTA.2018.8615764","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615764","url":null,"abstract":"Skeletal bone age assessment is a clinical practice to diagnose the maturity of children. To accurately assess the bone age, we proposed an automatic bone age assessment method in this paper based on deep convolution network. This method includes two stages: mask generation network and age assessment network. A U-Net convolution network with pretrained VGG16 as the encoder is used to extract the mask of bones. For the assessment module, the original images are fused together with the generated mask image to obtain segmented normalized hand bone images. We then built a multiple output convolution network for accurate age assessment. Finally, the bone age regression problem is transformed into the K-1 binary classification sub-problems. Our model was tested on RSNA2017 Pediatric Bone Age dataset. We were able to achieve the mean absolute error (MAE) of 5.98 months, which outperforms other common methods for bone age assessment. The proposed method could be used for developing fully automatic bone age assessment with better accuracy.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131025341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Fast and Energy-Efficient Time-of-Flight Distance Sensing Method for 3D Object Tracking 一种快速节能的三维目标跟踪飞行时间距离传感方法
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615790
H. Plank, G. Holweg, C. Steger, N. Druml
We present a new energy-efficient distance sensing method for 3D object tracking with Time-of-Flight sensors. The field of 3D object tracking with 3D cameras recently gained momentum due to the advent of front-facing depth cameras in smartphones. Tracking the user's head with 3D cameras will enable novel user experiences, but can lead to power consumption issues due to the active illumination. State-of-the-art continuous-wave Time-of-Flight imaging requires at least four different phase-images, while our approach can produce 3D measurements from single phase-images. This reduces the amount of emitted light to a minimum, improves latency and enables higher framerates. As our evaluation shows, after a brief initialization phase, our method can reduce the power consumption of a Time-of-Flight system by up to 68%.
提出了一种新的基于飞行时间传感器的三维目标跟踪节能距离传感方法。由于智能手机的前置深度摄像头的出现,使用3D相机进行3D物体跟踪的领域最近获得了发展势头。用3D相机跟踪用户的头部将带来新颖的用户体验,但由于主动照明,可能会导致功耗问题。最先进的连续波飞行时间成像需要至少四个不同的相位图像,而我们的方法可以从单个相位图像产生3D测量。这样可以将发射光的数量减少到最小,改善延迟并实现更高的帧率。正如我们的评估所示,经过短暂的初始化阶段,我们的方法可以将飞行时间系统的功耗降低高达68%。
{"title":"Fast and Energy-Efficient Time-of-Flight Distance Sensing Method for 3D Object Tracking","authors":"H. Plank, G. Holweg, C. Steger, N. Druml","doi":"10.1109/DICTA.2018.8615790","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615790","url":null,"abstract":"We present a new energy-efficient distance sensing method for 3D object tracking with Time-of-Flight sensors. The field of 3D object tracking with 3D cameras recently gained momentum due to the advent of front-facing depth cameras in smartphones. Tracking the user's head with 3D cameras will enable novel user experiences, but can lead to power consumption issues due to the active illumination. State-of-the-art continuous-wave Time-of-Flight imaging requires at least four different phase-images, while our approach can produce 3D measurements from single phase-images. This reduces the amount of emitted light to a minimum, improves latency and enables higher framerates. As our evaluation shows, after a brief initialization phase, our method can reduce the power consumption of a Time-of-Flight system by up to 68%.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115703607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of MRI Protocols on Alzheimer's Disease Detection MRI方案对阿尔茨海默病检测的影响
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615774
Saruar Alam, Len Hamey, K. Ho-Shon
Alzheimer's disease (AD) can be detected using magnetic resonance imaging (MRI) based features and supervised classifiers. The subcortical and ventricular volumes change for AD patients. These volumes can be extracted from MRI by tools such as FreeSurfer and the multi-atlas-based likelihood fusion (MALF) algorithm. Studies use MRI from many medical imaging centers. However, individual centers typically use distinctive MRI protocols for brain scanning. The protocol differences include different scanner models with various operating parameters. Some scanner models have different field strengths. A key factor in classifying multicentric MR subject images having different protocols is how different scanner models affect the extraction of feature, and the subsequent classification performance of a supervised classifier. We have investigated the classification performance of FreeSurfer and MALF based volume features together with Radial Basis Function Support Vector Machine and Extreme Learning Machine across different imaging protocols. We have also investigated for both FreeSurfer and MALF, which brain regions are most effective for the detection of the disease under different protocols. Our study result indicates marginal differences in classification performance across scanner models with the same or different field strengths when differentiating AD, Mild Cognitive Impairment, and Normal Controls. We have also observed differences in ranking order of the most effective brain regions.
阿尔茨海默病(AD)可以使用基于磁共振成像(MRI)的特征和监督分类器来检测。阿尔茨海默病患者的皮质下和心室容积变化。这些体积可以通过FreeSurfer和基于多图集的似然融合(MALF)算法等工具从MRI中提取。研究使用许多医学成像中心的核磁共振成像。然而,各个中心通常使用不同的MRI协议进行脑部扫描。协议差异包括具有不同操作参数的不同扫描仪模型。有些扫描器型号有不同的场强。对具有不同协议的多中心MR主题图像进行分类的一个关键因素是不同的扫描仪模型如何影响特征的提取,以及随后监督分类器的分类性能。我们研究了FreeSurfer和基于MALF的体积特征以及径向基函数支持向量机和极限学习机在不同成像协议下的分类性能。我们还对FreeSurfer和MALF进行了研究,在不同的方案下,哪个大脑区域对疾病的检测最有效。我们的研究结果表明,在区分AD、轻度认知障碍和正常对照时,具有相同或不同场强的扫描仪模型在分类性能上存在微小差异。我们还观察到在最有效的大脑区域排名顺序上的差异。
{"title":"Impact of MRI Protocols on Alzheimer's Disease Detection","authors":"Saruar Alam, Len Hamey, K. Ho-Shon","doi":"10.1109/DICTA.2018.8615774","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615774","url":null,"abstract":"Alzheimer's disease (AD) can be detected using magnetic resonance imaging (MRI) based features and supervised classifiers. The subcortical and ventricular volumes change for AD patients. These volumes can be extracted from MRI by tools such as FreeSurfer and the multi-atlas-based likelihood fusion (MALF) algorithm. Studies use MRI from many medical imaging centers. However, individual centers typically use distinctive MRI protocols for brain scanning. The protocol differences include different scanner models with various operating parameters. Some scanner models have different field strengths. A key factor in classifying multicentric MR subject images having different protocols is how different scanner models affect the extraction of feature, and the subsequent classification performance of a supervised classifier. We have investigated the classification performance of FreeSurfer and MALF based volume features together with Radial Basis Function Support Vector Machine and Extreme Learning Machine across different imaging protocols. We have also investigated for both FreeSurfer and MALF, which brain regions are most effective for the detection of the disease under different protocols. Our study result indicates marginal differences in classification performance across scanner models with the same or different field strengths when differentiating AD, Mild Cognitive Impairment, and Normal Controls. We have also observed differences in ranking order of the most effective brain regions.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114705779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Class Recognition using Noisy Training Data with a Self-Learning Approach 基于自学习方法的噪声训练数据多类识别
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615864
Amir Ghahremani, E. Bondarev, P. D. With
Exploiting ConvNets for object classification systems requires extensive labor work, since these networks require to be trained by sufficiently large and accurately labeled datasets. We propose a novel self-learning approach, which is able to generate a reliable multi-class object classification model from a low-quality dataset that is disturbed with a high level of inter-class noise samples. This approach iteratively purifies the noisy training datasets for each class and updates the classification model. The iterations continue until the model and its parameters reach sufficient quality. The self-learning approach based on ConvNets is evaluated for a maritime surveillance use case, where vessels need to be classified into eight different types. The experimental results on the evaluation dataset show that the proposed approach improves the F1 score approximately by 5%, 8% and 25% at the end of the third iteration, while the initial training datasets contain 40%, 50% and 60% inter-class noise samples (erroneously classified labels of vessels), respectively. Additionally, the purification performance is highly dependent on inter- and inter-class similarities between training samples for higher noise levels. It was also found that the mean Average Precision (mAP) does not degrade so much, whereas other performance parameters show larger variation.
利用卷积神经网络进行对象分类系统需要大量的劳动,因为这些网络需要通过足够大且准确标记的数据集进行训练。我们提出了一种新的自学习方法,该方法能够从低质量数据集生成可靠的多类对象分类模型,该数据集受到高水平的类间噪声样本的干扰。该方法迭代地净化每个类别的噪声训练数据集,并更新分类模型。迭代继续,直到模型及其参数达到足够的质量。基于ConvNets的自学习方法在海上监视用例中进行了评估,其中船舶需要分为八种不同的类型。在评估数据集上的实验结果表明,在第三次迭代结束时,当初始训练数据集包含40%、50%和60%的类间噪声样本(错误分类的血管标签)时,所提出的方法分别将F1分数提高了约5%、8%和25%。此外,净化性能高度依赖于高噪声水平训练样本之间的类间和类间相似性。平均精度(mAP)并没有太大的下降,而其他性能参数的变化较大。
{"title":"Multi-Class Recognition using Noisy Training Data with a Self-Learning Approach","authors":"Amir Ghahremani, E. Bondarev, P. D. With","doi":"10.1109/DICTA.2018.8615864","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615864","url":null,"abstract":"Exploiting ConvNets for object classification systems requires extensive labor work, since these networks require to be trained by sufficiently large and accurately labeled datasets. We propose a novel self-learning approach, which is able to generate a reliable multi-class object classification model from a low-quality dataset that is disturbed with a high level of inter-class noise samples. This approach iteratively purifies the noisy training datasets for each class and updates the classification model. The iterations continue until the model and its parameters reach sufficient quality. The self-learning approach based on ConvNets is evaluated for a maritime surveillance use case, where vessels need to be classified into eight different types. The experimental results on the evaluation dataset show that the proposed approach improves the F1 score approximately by 5%, 8% and 25% at the end of the third iteration, while the initial training datasets contain 40%, 50% and 60% inter-class noise samples (erroneously classified labels of vessels), respectively. Additionally, the purification performance is highly dependent on inter- and inter-class similarities between training samples for higher noise levels. It was also found that the mean Average Precision (mAP) does not degrade so much, whereas other performance parameters show larger variation.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121081832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Memory Optimized Deep Dense Network for Image Super-resolution 内存优化深度密集网络图像超分辨率
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615829
Jialiang Shen, Yucheng Wang, Jian Zhang
CNN methods for image super-resolution consume a large number of training-time memory, due to the feature size will not decrease as the network goes deeper. To reduce the memory consumption during training, we propose a memory optimized deep dense network for image super-resolution. We first reduce redundant features learning, by rationally designing the skip connection and dense connection in the network. Then we adopt share memory allocations to store concatenated features and Batch Normalization intermediate feature maps. The memory optimized network consumes less memory than normal dense network. We also evaluate our proposed architecture on highly competitive super-resolution benchmark datasets. Our deep dense network outperforms some existing methods, and requires relatively less computation.
CNN图像超分辨率方法消耗了大量的训练时间内存,因为特征大小不会随着网络的深入而减小。为了减少训练过程中的内存消耗,我们提出了一种内存优化的图像超分辨率深度密集网络。我们首先通过合理设计网络中的跳跃连接和密集连接来减少冗余特征学习。然后,我们采用共享内存分配来存储连接特征和批处理归一化中间特征映射。内存优化后的网络比普通密集网络占用的内存更少。我们还在高度竞争的超分辨率基准数据集上评估了我们提出的架构。我们的深度密集网络优于现有的一些方法,并且需要相对较少的计算量。
{"title":"Memory Optimized Deep Dense Network for Image Super-resolution","authors":"Jialiang Shen, Yucheng Wang, Jian Zhang","doi":"10.1109/DICTA.2018.8615829","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615829","url":null,"abstract":"CNN methods for image super-resolution consume a large number of training-time memory, due to the feature size will not decrease as the network goes deeper. To reduce the memory consumption during training, we propose a memory optimized deep dense network for image super-resolution. We first reduce redundant features learning, by rationally designing the skip connection and dense connection in the network. Then we adopt share memory allocations to store concatenated features and Batch Normalization intermediate feature maps. The memory optimized network consumes less memory than normal dense network. We also evaluate our proposed architecture on highly competitive super-resolution benchmark datasets. Our deep dense network outperforms some existing methods, and requires relatively less computation.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124429476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Information Enhancement for Travelogues via a Hybrid Clustering Model 基于混合聚类模型的游记信息增强
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615849
Lu Zhang, Jingsong Xu, Jian Zhang, Yongshun Gong
Travelogues consist of textual information shared by tourists through web forums or other social media which often lack illustrations (images). In image sharing websites like Flicker, users can post images with rich textual information: ‘title’, ‘tag’ and ‘description’. The topics of travelogues usually revolve around beautiful sceneries. Corresponding landscape images recommended to these travelogues can enhance the vividness of reading. However, it is difficult to fuse such information because the text attached to each image has diverse meanings/views. In this paper, we propose an unsupervised Hybrid Multiple Kernel K-means (HMKKM) model to link images and travelogues through multiple views. Multi-view matrices are built to reveal the correlations between several respects. For further improving the performance, we add a regularisation based on textual similarity. To evaluate the effectiveness of the proposed method, a dataset is constructed from TripAdvisor and Flicker to find the related images for each travelogue. Experiment results demonstrate the superiority of the proposed model by comparison with other baselines.
游记由游客通过网络论坛或其他社交媒体分享的文字信息组成,通常缺乏插图(图像)。在像Flicker这样的图片分享网站上,用户可以发布带有丰富文字信息的图片:“标题”、“标签”和“描述”。游记的主题通常围绕着美丽的风景。为游记推荐相应的风景图像,可以增强阅读的生动性。然而,很难融合这些信息,因为每个图像附带的文本具有不同的含义/观点。在本文中,我们提出了一种无监督混合多核k -均值(HMKKM)模型,通过多个视图链接图像和旅行记录。建立多视图矩阵来揭示几个方面之间的相关性。为了进一步提高性能,我们添加了基于文本相似度的正则化。为了评估该方法的有效性,我们从TripAdvisor和Flicker中构建了一个数据集来查找每个旅游日志的相关图像。实验结果表明,该模型与其他基线相比具有一定的优越性。
{"title":"Information Enhancement for Travelogues via a Hybrid Clustering Model","authors":"Lu Zhang, Jingsong Xu, Jian Zhang, Yongshun Gong","doi":"10.1109/DICTA.2018.8615849","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615849","url":null,"abstract":"Travelogues consist of textual information shared by tourists through web forums or other social media which often lack illustrations (images). In image sharing websites like Flicker, users can post images with rich textual information: ‘title’, ‘tag’ and ‘description’. The topics of travelogues usually revolve around beautiful sceneries. Corresponding landscape images recommended to these travelogues can enhance the vividness of reading. However, it is difficult to fuse such information because the text attached to each image has diverse meanings/views. In this paper, we propose an unsupervised Hybrid Multiple Kernel K-means (HMKKM) model to link images and travelogues through multiple views. Multi-view matrices are built to reveal the correlations between several respects. For further improving the performance, we add a regularisation based on textual similarity. To evaluate the effectiveness of the proposed method, a dataset is constructed from TripAdvisor and Flicker to find the related images for each travelogue. Experiment results demonstrate the superiority of the proposed model by comparison with other baselines.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130223627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2018 Digital Image Computing: Techniques and Applications (DICTA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1