首页 > 最新文献

2018 Digital Image Computing: Techniques and Applications (DICTA)最新文献

英文 中文
Active Contours Based on An Anisotropic Diffusion 基于各向异性扩散的活动轮廓
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615767
Shafiullah Soomro, K. Choi
Image Segmentation is one of the pivotal procedure in the field of imaging and its objective is to catch required boundaries inside an image. In this paper, we propose a novel active contour method based on anisotropic diffusion. Global regionbased active contour methods rely on global intensity information across the regions. However, these methods fail to produce desired segmentation results when an image has some background variations or noise. In this regard, we adapt Perona and Malik smoothing technique as enhancement step. This technique provides interregional smoothing, sharpens the boundaries and blurs the background of an image. Our main role is the formulation of a new SPF (signed pressure force) function, which uses global intensity information across the regions. Minimizing an energy function using partial differential framework produce results with semantically meaningful boundaries instead of capturing impassive regions. Finally, we use Gaussian kernel to eliminate problem of reinitialization in level set function. We use images taken from different modalities to validate the outcome of the proposed method. In the result section, we have evaluated that, the proposed method achieves good results qualitatively and quantitatively with high accuracy compared to other state-of-the-art models.
图像分割是成像领域的关键步骤之一,其目的是捕获图像内部所需的边界。本文提出了一种基于各向异性扩散的活动轮廓线方法。基于区域的全球活动等高线方法依赖于区域间的全球强度信息。然而,当图像有背景变化或噪声时,这些方法无法产生理想的分割结果。在这方面,我们采用Perona和Malik平滑技术作为增强步骤。这种技术提供了区域间平滑,锐化边界和模糊图像的背景。我们的主要作用是制定一个新的SPF(签名压力)函数,它使用了各个地区的全球强度信息。使用偏微分框架最小化能量函数产生具有语义上有意义的边界的结果,而不是捕获无表情区域。最后,利用高斯核消除了水平集函数的重新初始化问题。我们使用不同模式的图像来验证所提出方法的结果。在结果部分,我们已经评估了,与其他最先进的模型相比,所提出的方法在定性和定量方面取得了良好的结果,精度很高。
{"title":"Active Contours Based on An Anisotropic Diffusion","authors":"Shafiullah Soomro, K. Choi","doi":"10.1109/DICTA.2018.8615767","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615767","url":null,"abstract":"Image Segmentation is one of the pivotal procedure in the field of imaging and its objective is to catch required boundaries inside an image. In this paper, we propose a novel active contour method based on anisotropic diffusion. Global regionbased active contour methods rely on global intensity information across the regions. However, these methods fail to produce desired segmentation results when an image has some background variations or noise. In this regard, we adapt Perona and Malik smoothing technique as enhancement step. This technique provides interregional smoothing, sharpens the boundaries and blurs the background of an image. Our main role is the formulation of a new SPF (signed pressure force) function, which uses global intensity information across the regions. Minimizing an energy function using partial differential framework produce results with semantically meaningful boundaries instead of capturing impassive regions. Finally, we use Gaussian kernel to eliminate problem of reinitialization in level set function. We use images taken from different modalities to validate the outcome of the proposed method. In the result section, we have evaluated that, the proposed method achieves good results qualitatively and quantitatively with high accuracy compared to other state-of-the-art models.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132544227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Inter-Subject Image Registration of Clinical Neck MRI Volumes using Discrete Periodic Spline Wavelet and Free form Deformation 基于离散周期样条小波和自由变形的临床颈部MRI体间图像配准
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615825
A. Suman, Md. Asikuzzaman, A. Webb, D. Perriman, M. Pickering
This paper presents a framework for inter-patient image registration which uses a multi-thresholds, multi-similarity measures and multi-transformations based on compactly supported spline and discrete periodic spline wavelets (DPSWs) using the Gauss-Newton gradient descent (GNGD) and gradient descent (GD) optimization methods. Our primary intellectual contribution is incorporating DPSWs in the transformation while another includes fusing out-of-range concept in a surface matching technique which is implemented by a multi-transformations and multi-similarity measures. In particular, as a true deformation cannot be achieved by single combination of transformation, similarity measure (SM) and optimization of a registration process, a moving image is required to be brought within the range of a registration. On the other hand, the surface matching technique involves an edge position difference (EPD) SM in which coarse to fine surfaces are matched using multiple thresholds with a spline-based free from deformation (FFD) method. The registration experiments were performed on 3D clinical neck magnetic resonance (MR) images, with the results showing that our proposed method provides good accuracy and robustness.
本文提出了一个基于紧支持样条和离散周期样条小波(DPSWs)的多阈值、多相似度和多变换的患者间图像配准框架,该框架采用高斯-牛顿梯度下降(GNGD)和梯度下降(GD)优化方法。我们的主要智力贡献是将dpsw纳入到转换中,而另一个智力贡献包括将超距概念融合到表面匹配技术中,该技术通过多转换和多相似度量来实现。特别是,由于单纯结合变换、相似度量(SM)和配准过程的优化无法实现真正的形变,因此需要将运动图像置于配准范围内。另一方面,表面匹配技术涉及边缘位置差(EPD) SM,其中使用基于样条的无变形(FFD)方法使用多个阈值匹配粗到细表面。在临床颈部磁共振三维图像上进行了配准实验,结果表明该方法具有较好的准确性和鲁棒性。
{"title":"Inter-Subject Image Registration of Clinical Neck MRI Volumes using Discrete Periodic Spline Wavelet and Free form Deformation","authors":"A. Suman, Md. Asikuzzaman, A. Webb, D. Perriman, M. Pickering","doi":"10.1109/DICTA.2018.8615825","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615825","url":null,"abstract":"This paper presents a framework for inter-patient image registration which uses a multi-thresholds, multi-similarity measures and multi-transformations based on compactly supported spline and discrete periodic spline wavelets (DPSWs) using the Gauss-Newton gradient descent (GNGD) and gradient descent (GD) optimization methods. Our primary intellectual contribution is incorporating DPSWs in the transformation while another includes fusing out-of-range concept in a surface matching technique which is implemented by a multi-transformations and multi-similarity measures. In particular, as a true deformation cannot be achieved by single combination of transformation, similarity measure (SM) and optimization of a registration process, a moving image is required to be brought within the range of a registration. On the other hand, the surface matching technique involves an edge position difference (EPD) SM in which coarse to fine surfaces are matched using multiple thresholds with a spline-based free from deformation (FFD) method. The registration experiments were performed on 3D clinical neck magnetic resonance (MR) images, with the results showing that our proposed method provides good accuracy and robustness.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132998156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Railway Infrastructure Defects Recognition using Fine-grained Deep Convolutional Neural Networks 基于细粒度深度卷积神经网络的铁路基础设施缺陷识别
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615868
Huaxi Huang, Jingsong Xu, Jian Zhang, Qiang Wu, Christina Kirsch
Railway power supply infrastructure is one of the most important components of railway transportation. As the key step of railway maintenance system, power supply infrastructure defects recognition plays a vital role in the whole defects inspection sub-system. Traditional defects recognition task is performed manually, which is time-consuming and high-labor costing. Inspired by the great success of deep neural networks in dealing with different vision tasks, this paper presents an end-to-end deep network to solve the railway infrastructure defects detection problem. More importantly, this paper is the first work that adopts the idea of deep fine-grained classification to do railway defects detection. We propose a new bilinear deep network named Spatial Transformer And Bilinear Low-Rank (STABLR) model and apply it to railway infrastructure defects detection. The experimental results demonstrate that the proposed method outperforms both hand-craft features based machine learning methods and classic deep neural network methods.
铁路供电基础设施是铁路运输的重要组成部分之一。供电基础设施缺陷识别作为铁路维修系统的关键环节,在整个缺陷检测子系统中起着至关重要的作用。传统的缺陷识别任务是手工完成的,耗时长,人工成本高。受深度神经网络在处理不同视觉任务方面的巨大成功的启发,本文提出了一种端到端深度网络来解决铁路基础设施缺陷检测问题。更重要的是,本文首次采用深细粒度分类的思想进行铁路缺陷检测。提出了一种新的双线性深度网络——空间变压器和双线性低秩(STABLR)模型,并将其应用于铁路基础设施缺陷检测。实验结果表明,该方法优于基于手工特征的机器学习方法和经典的深度神经网络方法。
{"title":"Railway Infrastructure Defects Recognition using Fine-grained Deep Convolutional Neural Networks","authors":"Huaxi Huang, Jingsong Xu, Jian Zhang, Qiang Wu, Christina Kirsch","doi":"10.1109/DICTA.2018.8615868","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615868","url":null,"abstract":"Railway power supply infrastructure is one of the most important components of railway transportation. As the key step of railway maintenance system, power supply infrastructure defects recognition plays a vital role in the whole defects inspection sub-system. Traditional defects recognition task is performed manually, which is time-consuming and high-labor costing. Inspired by the great success of deep neural networks in dealing with different vision tasks, this paper presents an end-to-end deep network to solve the railway infrastructure defects detection problem. More importantly, this paper is the first work that adopts the idea of deep fine-grained classification to do railway defects detection. We propose a new bilinear deep network named Spatial Transformer And Bilinear Low-Rank (STABLR) model and apply it to railway infrastructure defects detection. The experimental results demonstrate that the proposed method outperforms both hand-craft features based machine learning methods and classic deep neural network methods.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133050751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A Scale-Free and Parameter-Free Image Edge Strength Measure 一种无尺度、无参数的图像边缘强度测量方法
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615813
Guy Smith, P. Jackway
We present a family of image Slope Measures which are scale-free measures that are highest near image edges. They are defined at each pixel as the steepest of the (up, down, or bi-directional) intensity slopes to every other pixel. We list some useful mathematical properties such as intensity and rotation invariances and show a relationship to the maximal morphological dilations and erosions by cones. We discuss generalisations by using non-Euclidean distances or non-conical structuring functions, and extensions to colour, multi-spectral and higher-dimensional images. We present detailed pseudo-code for a fast doubly-recursive multi-resolution algorithm and give typical algorithm timings and visually demonstrate the measure as applied to standard test images. Reference C code for these algorithms is available on the internet at: https://github.com/xomexx/SlopeMeasures.
我们提出了一组图像斜率测度,它们是图像边缘附近最高的无尺度测度。它们在每个像素上被定义为每一个其他像素的最陡(向上、向下或双向)强度斜率。我们列出了一些有用的数学性质,如强度和旋转不变性,并显示了锥的最大形态扩张和侵蚀的关系。我们通过使用非欧几里得距离或非圆锥结构函数来讨论泛化,并扩展到彩色,多光谱和高维图像。我们给出了一种快速双递归多分辨率算法的详细伪代码,给出了典型的算法时序,并直观地演示了该方法在标准测试图像上的应用。这些算法的参考C代码可以在互联网上找到:https://github.com/xomexx/SlopeMeasures。
{"title":"A Scale-Free and Parameter-Free Image Edge Strength Measure","authors":"Guy Smith, P. Jackway","doi":"10.1109/DICTA.2018.8615813","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615813","url":null,"abstract":"We present a family of image Slope Measures which are scale-free measures that are highest near image edges. They are defined at each pixel as the steepest of the (up, down, or bi-directional) intensity slopes to every other pixel. We list some useful mathematical properties such as intensity and rotation invariances and show a relationship to the maximal morphological dilations and erosions by cones. We discuss generalisations by using non-Euclidean distances or non-conical structuring functions, and extensions to colour, multi-spectral and higher-dimensional images. We present detailed pseudo-code for a fast doubly-recursive multi-resolution algorithm and give typical algorithm timings and visually demonstrate the measure as applied to standard test images. Reference C code for these algorithms is available on the internet at: https://github.com/xomexx/SlopeMeasures.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124448700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Information Enhancement for Travelogues via a Hybrid Clustering Model 基于混合聚类模型的游记信息增强
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615849
Lu Zhang, Jingsong Xu, Jian Zhang, Yongshun Gong
Travelogues consist of textual information shared by tourists through web forums or other social media which often lack illustrations (images). In image sharing websites like Flicker, users can post images with rich textual information: ‘title’, ‘tag’ and ‘description’. The topics of travelogues usually revolve around beautiful sceneries. Corresponding landscape images recommended to these travelogues can enhance the vividness of reading. However, it is difficult to fuse such information because the text attached to each image has diverse meanings/views. In this paper, we propose an unsupervised Hybrid Multiple Kernel K-means (HMKKM) model to link images and travelogues through multiple views. Multi-view matrices are built to reveal the correlations between several respects. For further improving the performance, we add a regularisation based on textual similarity. To evaluate the effectiveness of the proposed method, a dataset is constructed from TripAdvisor and Flicker to find the related images for each travelogue. Experiment results demonstrate the superiority of the proposed model by comparison with other baselines.
游记由游客通过网络论坛或其他社交媒体分享的文字信息组成,通常缺乏插图(图像)。在像Flicker这样的图片分享网站上,用户可以发布带有丰富文字信息的图片:“标题”、“标签”和“描述”。游记的主题通常围绕着美丽的风景。为游记推荐相应的风景图像,可以增强阅读的生动性。然而,很难融合这些信息,因为每个图像附带的文本具有不同的含义/观点。在本文中,我们提出了一种无监督混合多核k -均值(HMKKM)模型,通过多个视图链接图像和旅行记录。建立多视图矩阵来揭示几个方面之间的相关性。为了进一步提高性能,我们添加了基于文本相似度的正则化。为了评估该方法的有效性,我们从TripAdvisor和Flicker中构建了一个数据集来查找每个旅游日志的相关图像。实验结果表明,该模型与其他基线相比具有一定的优越性。
{"title":"Information Enhancement for Travelogues via a Hybrid Clustering Model","authors":"Lu Zhang, Jingsong Xu, Jian Zhang, Yongshun Gong","doi":"10.1109/DICTA.2018.8615849","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615849","url":null,"abstract":"Travelogues consist of textual information shared by tourists through web forums or other social media which often lack illustrations (images). In image sharing websites like Flicker, users can post images with rich textual information: ‘title’, ‘tag’ and ‘description’. The topics of travelogues usually revolve around beautiful sceneries. Corresponding landscape images recommended to these travelogues can enhance the vividness of reading. However, it is difficult to fuse such information because the text attached to each image has diverse meanings/views. In this paper, we propose an unsupervised Hybrid Multiple Kernel K-means (HMKKM) model to link images and travelogues through multiple views. Multi-view matrices are built to reveal the correlations between several respects. For further improving the performance, we add a regularisation based on textual similarity. To evaluate the effectiveness of the proposed method, a dataset is constructed from TripAdvisor and Flicker to find the related images for each travelogue. Experiment results demonstrate the superiority of the proposed model by comparison with other baselines.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130223627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bone Age Assessment Based on Two-Stage Deep Neural Networks 基于两阶段深度神经网络的骨龄评估
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615764
Meicheng Chu, Bo Liu, F. Zhou, X. Bai, Bin Guo
Skeletal bone age assessment is a clinical practice to diagnose the maturity of children. To accurately assess the bone age, we proposed an automatic bone age assessment method in this paper based on deep convolution network. This method includes two stages: mask generation network and age assessment network. A U-Net convolution network with pretrained VGG16 as the encoder is used to extract the mask of bones. For the assessment module, the original images are fused together with the generated mask image to obtain segmented normalized hand bone images. We then built a multiple output convolution network for accurate age assessment. Finally, the bone age regression problem is transformed into the K-1 binary classification sub-problems. Our model was tested on RSNA2017 Pediatric Bone Age dataset. We were able to achieve the mean absolute error (MAE) of 5.98 months, which outperforms other common methods for bone age assessment. The proposed method could be used for developing fully automatic bone age assessment with better accuracy.
骨龄评估是诊断儿童成熟度的一项临床实践。为了准确评估骨龄,本文提出了一种基于深度卷积网络的骨龄自动评估方法。该方法包括两个阶段:掩码生成网络和年龄评估网络。利用预训练的VGG16作为编码器的U-Net卷积网络提取骨骼的掩码。评估模块将原始图像与生成的掩模图像融合在一起,得到分割的归一化手骨图像。然后,我们构建了一个多输出卷积网络,用于准确的年龄评估。最后,将骨龄回归问题转化为K-1二分类子问题。我们的模型在RSNA2017儿童骨龄数据集上进行了测试。我们能够达到5.98个月的平均绝对误差(MAE),优于其他常用的骨龄评估方法。该方法可用于开发全自动骨龄评估,具有较高的准确性。
{"title":"Bone Age Assessment Based on Two-Stage Deep Neural Networks","authors":"Meicheng Chu, Bo Liu, F. Zhou, X. Bai, Bin Guo","doi":"10.1109/DICTA.2018.8615764","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615764","url":null,"abstract":"Skeletal bone age assessment is a clinical practice to diagnose the maturity of children. To accurately assess the bone age, we proposed an automatic bone age assessment method in this paper based on deep convolution network. This method includes two stages: mask generation network and age assessment network. A U-Net convolution network with pretrained VGG16 as the encoder is used to extract the mask of bones. For the assessment module, the original images are fused together with the generated mask image to obtain segmented normalized hand bone images. We then built a multiple output convolution network for accurate age assessment. Finally, the bone age regression problem is transformed into the K-1 binary classification sub-problems. Our model was tested on RSNA2017 Pediatric Bone Age dataset. We were able to achieve the mean absolute error (MAE) of 5.98 months, which outperforms other common methods for bone age assessment. The proposed method could be used for developing fully automatic bone age assessment with better accuracy.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131025341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Band Weighting Network for Hyperspectral Image Classification 高光谱图像分类的波段加权网络
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615876
Jing Wang, Jun Zhou
Hyperspectral remote sensing images use hundreds of bands to describe the fine spectral information of the ground area. However, they inevitably contain a large amount of redundancy as well as noisy bands. Discovering the most informative bands and modeling the relationship among the bands are effective means to process the data and improve the performance of the subsequent classification task. Attention mechanism is used in computer vision and natural language processing to guide the algorithm towards the most relevant information in the data. In this paper, we propose a band weighting network by designing and integrating an attention module in the traditional convolutional neural network for hyperspectral image classification. Our proposed band weighting network has the capability to model the relationship among the bands and weight them according to their joint contribution to classification. One prominent feature of our proposed method is that it can assign different weights to different samples. The experimental results demonstrate the effectiveness and superiority of our approach.
高光谱遥感图像使用数百波段来描述地面区域的精细光谱信息。然而,它们不可避免地包含大量的冗余和噪声带。发现信息量最大的波段并对波段之间的关系进行建模是对数据进行处理和提高后续分类任务性能的有效手段。注意机制用于计算机视觉和自然语言处理,引导算法在数据中找到最相关的信息。本文通过在传统卷积神经网络中设计并集成关注模块,提出了一种用于高光谱图像分类的波段加权网络。我们提出的波段加权网络能够对波段之间的关系进行建模,并根据它们对分类的共同贡献对它们进行加权。我们提出的方法的一个突出特点是它可以为不同的样本分配不同的权重。实验结果证明了该方法的有效性和优越性。
{"title":"Band Weighting Network for Hyperspectral Image Classification","authors":"Jing Wang, Jun Zhou","doi":"10.1109/DICTA.2018.8615876","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615876","url":null,"abstract":"Hyperspectral remote sensing images use hundreds of bands to describe the fine spectral information of the ground area. However, they inevitably contain a large amount of redundancy as well as noisy bands. Discovering the most informative bands and modeling the relationship among the bands are effective means to process the data and improve the performance of the subsequent classification task. Attention mechanism is used in computer vision and natural language processing to guide the algorithm towards the most relevant information in the data. In this paper, we propose a band weighting network by designing and integrating an attention module in the traditional convolutional neural network for hyperspectral image classification. Our proposed band weighting network has the capability to model the relationship among the bands and weight them according to their joint contribution to classification. One prominent feature of our proposed method is that it can assign different weights to different samples. The experimental results demonstrate the effectiveness and superiority of our approach.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130734028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Impact of MRI Protocols on Alzheimer's Disease Detection MRI方案对阿尔茨海默病检测的影响
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615774
Saruar Alam, Len Hamey, K. Ho-Shon
Alzheimer's disease (AD) can be detected using magnetic resonance imaging (MRI) based features and supervised classifiers. The subcortical and ventricular volumes change for AD patients. These volumes can be extracted from MRI by tools such as FreeSurfer and the multi-atlas-based likelihood fusion (MALF) algorithm. Studies use MRI from many medical imaging centers. However, individual centers typically use distinctive MRI protocols for brain scanning. The protocol differences include different scanner models with various operating parameters. Some scanner models have different field strengths. A key factor in classifying multicentric MR subject images having different protocols is how different scanner models affect the extraction of feature, and the subsequent classification performance of a supervised classifier. We have investigated the classification performance of FreeSurfer and MALF based volume features together with Radial Basis Function Support Vector Machine and Extreme Learning Machine across different imaging protocols. We have also investigated for both FreeSurfer and MALF, which brain regions are most effective for the detection of the disease under different protocols. Our study result indicates marginal differences in classification performance across scanner models with the same or different field strengths when differentiating AD, Mild Cognitive Impairment, and Normal Controls. We have also observed differences in ranking order of the most effective brain regions.
阿尔茨海默病(AD)可以使用基于磁共振成像(MRI)的特征和监督分类器来检测。阿尔茨海默病患者的皮质下和心室容积变化。这些体积可以通过FreeSurfer和基于多图集的似然融合(MALF)算法等工具从MRI中提取。研究使用许多医学成像中心的核磁共振成像。然而,各个中心通常使用不同的MRI协议进行脑部扫描。协议差异包括具有不同操作参数的不同扫描仪模型。有些扫描器型号有不同的场强。对具有不同协议的多中心MR主题图像进行分类的一个关键因素是不同的扫描仪模型如何影响特征的提取,以及随后监督分类器的分类性能。我们研究了FreeSurfer和基于MALF的体积特征以及径向基函数支持向量机和极限学习机在不同成像协议下的分类性能。我们还对FreeSurfer和MALF进行了研究,在不同的方案下,哪个大脑区域对疾病的检测最有效。我们的研究结果表明,在区分AD、轻度认知障碍和正常对照时,具有相同或不同场强的扫描仪模型在分类性能上存在微小差异。我们还观察到在最有效的大脑区域排名顺序上的差异。
{"title":"Impact of MRI Protocols on Alzheimer's Disease Detection","authors":"Saruar Alam, Len Hamey, K. Ho-Shon","doi":"10.1109/DICTA.2018.8615774","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615774","url":null,"abstract":"Alzheimer's disease (AD) can be detected using magnetic resonance imaging (MRI) based features and supervised classifiers. The subcortical and ventricular volumes change for AD patients. These volumes can be extracted from MRI by tools such as FreeSurfer and the multi-atlas-based likelihood fusion (MALF) algorithm. Studies use MRI from many medical imaging centers. However, individual centers typically use distinctive MRI protocols for brain scanning. The protocol differences include different scanner models with various operating parameters. Some scanner models have different field strengths. A key factor in classifying multicentric MR subject images having different protocols is how different scanner models affect the extraction of feature, and the subsequent classification performance of a supervised classifier. We have investigated the classification performance of FreeSurfer and MALF based volume features together with Radial Basis Function Support Vector Machine and Extreme Learning Machine across different imaging protocols. We have also investigated for both FreeSurfer and MALF, which brain regions are most effective for the detection of the disease under different protocols. Our study result indicates marginal differences in classification performance across scanner models with the same or different field strengths when differentiating AD, Mild Cognitive Impairment, and Normal Controls. We have also observed differences in ranking order of the most effective brain regions.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114705779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast and Energy-Efficient Time-of-Flight Distance Sensing Method for 3D Object Tracking 一种快速节能的三维目标跟踪飞行时间距离传感方法
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615790
H. Plank, G. Holweg, C. Steger, N. Druml
We present a new energy-efficient distance sensing method for 3D object tracking with Time-of-Flight sensors. The field of 3D object tracking with 3D cameras recently gained momentum due to the advent of front-facing depth cameras in smartphones. Tracking the user's head with 3D cameras will enable novel user experiences, but can lead to power consumption issues due to the active illumination. State-of-the-art continuous-wave Time-of-Flight imaging requires at least four different phase-images, while our approach can produce 3D measurements from single phase-images. This reduces the amount of emitted light to a minimum, improves latency and enables higher framerates. As our evaluation shows, after a brief initialization phase, our method can reduce the power consumption of a Time-of-Flight system by up to 68%.
提出了一种新的基于飞行时间传感器的三维目标跟踪节能距离传感方法。由于智能手机的前置深度摄像头的出现,使用3D相机进行3D物体跟踪的领域最近获得了发展势头。用3D相机跟踪用户的头部将带来新颖的用户体验,但由于主动照明,可能会导致功耗问题。最先进的连续波飞行时间成像需要至少四个不同的相位图像,而我们的方法可以从单个相位图像产生3D测量。这样可以将发射光的数量减少到最小,改善延迟并实现更高的帧率。正如我们的评估所示,经过短暂的初始化阶段,我们的方法可以将飞行时间系统的功耗降低高达68%。
{"title":"Fast and Energy-Efficient Time-of-Flight Distance Sensing Method for 3D Object Tracking","authors":"H. Plank, G. Holweg, C. Steger, N. Druml","doi":"10.1109/DICTA.2018.8615790","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615790","url":null,"abstract":"We present a new energy-efficient distance sensing method for 3D object tracking with Time-of-Flight sensors. The field of 3D object tracking with 3D cameras recently gained momentum due to the advent of front-facing depth cameras in smartphones. Tracking the user's head with 3D cameras will enable novel user experiences, but can lead to power consumption issues due to the active illumination. State-of-the-art continuous-wave Time-of-Flight imaging requires at least four different phase-images, while our approach can produce 3D measurements from single phase-images. This reduces the amount of emitted light to a minimum, improves latency and enables higher framerates. As our evaluation shows, after a brief initialization phase, our method can reduce the power consumption of a Time-of-Flight system by up to 68%.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115703607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Memory Optimized Deep Dense Network for Image Super-resolution 内存优化深度密集网络图像超分辨率
Pub Date : 2018-12-01 DOI: 10.1109/DICTA.2018.8615829
Jialiang Shen, Yucheng Wang, Jian Zhang
CNN methods for image super-resolution consume a large number of training-time memory, due to the feature size will not decrease as the network goes deeper. To reduce the memory consumption during training, we propose a memory optimized deep dense network for image super-resolution. We first reduce redundant features learning, by rationally designing the skip connection and dense connection in the network. Then we adopt share memory allocations to store concatenated features and Batch Normalization intermediate feature maps. The memory optimized network consumes less memory than normal dense network. We also evaluate our proposed architecture on highly competitive super-resolution benchmark datasets. Our deep dense network outperforms some existing methods, and requires relatively less computation.
CNN图像超分辨率方法消耗了大量的训练时间内存,因为特征大小不会随着网络的深入而减小。为了减少训练过程中的内存消耗,我们提出了一种内存优化的图像超分辨率深度密集网络。我们首先通过合理设计网络中的跳跃连接和密集连接来减少冗余特征学习。然后,我们采用共享内存分配来存储连接特征和批处理归一化中间特征映射。内存优化后的网络比普通密集网络占用的内存更少。我们还在高度竞争的超分辨率基准数据集上评估了我们提出的架构。我们的深度密集网络优于现有的一些方法,并且需要相对较少的计算量。
{"title":"Memory Optimized Deep Dense Network for Image Super-resolution","authors":"Jialiang Shen, Yucheng Wang, Jian Zhang","doi":"10.1109/DICTA.2018.8615829","DOIUrl":"https://doi.org/10.1109/DICTA.2018.8615829","url":null,"abstract":"CNN methods for image super-resolution consume a large number of training-time memory, due to the feature size will not decrease as the network goes deeper. To reduce the memory consumption during training, we propose a memory optimized deep dense network for image super-resolution. We first reduce redundant features learning, by rationally designing the skip connection and dense connection in the network. Then we adopt share memory allocations to store concatenated features and Batch Normalization intermediate feature maps. The memory optimized network consumes less memory than normal dense network. We also evaluate our proposed architecture on highly competitive super-resolution benchmark datasets. Our deep dense network outperforms some existing methods, and requires relatively less computation.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124429476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2018 Digital Image Computing: Techniques and Applications (DICTA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1