首页 > 最新文献

2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)最新文献

英文 中文
FUSED REASONING UNDER UNCERTAINTY FOR SOLDIER CENTRIC HUMAN-AGENT DECISION MAKING 不确定性下以士兵为中心的人-智能体决策的融合推理
Pub Date : 2018-04-01 DOI: 10.1109/SSIAI.2018.8470359
A. Raglin, Andre V Harrison, Douglas Summers-Stay
As agents (devices and software) are increasingly incorporated into every aspect of our lives, the research area of human-agent teaming has seen an increase in attention. This is particularly true considering the varied, dynamic, and fast pace operations Soldiers are currently facing and will be facing in the future. There is a common idea that, in the future, the speed of machines will far exceed a Soldiers’ ability to react or even comprehend the complex activities of their digital teammates, which is a concern. Uncertainty in this accelerated environment will present unique and unforeseen challenges that may potentially inhibit a Soldier’s ability to make decisions effectively and to efficiently decide fast enough to support the future battlefield optempo. To accelerate decision making in Army operations the military is relying on agents and enabling technologies such as complex systems that integrate intelligent sensor networks and autonomous devices. These systems-of- systems will be driven by machine learning enabled artificial intelligence algorithms and will form teams with human warfighters, where both must act as one unit to accomplish their mission. Explanations can provide key information about the data or behavior of complex systems to the human to aide human agent teaming.
随着智能体(设备和软件)越来越多地融入我们生活的方方面面,人类智能体团队的研究领域受到了越来越多的关注。考虑到士兵们目前和将来面临的多样化、动态和快节奏的作战,这一点尤其正确。人们普遍认为,在未来,机器的速度将远远超过士兵的反应能力,甚至超过他们理解数字队友复杂活动的能力,这是一个令人担忧的问题。在这种加速的环境中,不确定性将带来独特的、不可预见的挑战,可能会抑制士兵有效决策的能力,并有效地做出足够快的决策,以支持未来战场的节奏。为了加快陆军行动中的决策制定,军方正在依赖代理和使能技术,如集成智能传感器网络和自主设备的复杂系统。这些系统的系统将由支持机器学习的人工智能算法驱动,并将与人类作战人员组成团队,双方必须作为一个整体来完成任务。解释可以为人类提供有关复杂系统的数据或行为的关键信息,以帮助人类代理团队。
{"title":"FUSED REASONING UNDER UNCERTAINTY FOR SOLDIER CENTRIC HUMAN-AGENT DECISION MAKING","authors":"A. Raglin, Andre V Harrison, Douglas Summers-Stay","doi":"10.1109/SSIAI.2018.8470359","DOIUrl":"https://doi.org/10.1109/SSIAI.2018.8470359","url":null,"abstract":"As agents (devices and software) are increasingly incorporated into every aspect of our lives, the research area of human-agent teaming has seen an increase in attention. This is particularly true considering the varied, dynamic, and fast pace operations Soldiers are currently facing and will be facing in the future. There is a common idea that, in the future, the speed of machines will far exceed a Soldiers’ ability to react or even comprehend the complex activities of their digital teammates, which is a concern. Uncertainty in this accelerated environment will present unique and unforeseen challenges that may potentially inhibit a Soldier’s ability to make decisions effectively and to efficiently decide fast enough to support the future battlefield optempo. To accelerate decision making in Army operations the military is relying on agents and enabling technologies such as complex systems that integrate intelligent sensor networks and autonomous devices. These systems-of- systems will be driven by machine learning enabled artificial intelligence algorithms and will form teams with human warfighters, where both must act as one unit to accomplish their mission. Explanations can provide key information about the data or behavior of complex systems to the human to aide human agent teaming.","PeriodicalId":422209,"journal":{"name":"2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131274709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Comparison of Column Subset Selection Methods for Unsupervised Band Subset Selection in Hyperspectral Imagery 高光谱图像无监督波段子集选择的列子集选择方法比较
Pub Date : 2018-04-01 DOI: 10.1109/SSIAI.2018.8470360
Maher Aldeghlawi, M. Velez-Reyes
This paper explores the use of column subset selection (CSS) for unsupervised band subset selection (BSS) in hyperspectral imaging. CSS is the problem of selecting the most independent columns of a matrix. Many deterministic and randomized algorithms have been proposed in the literature for CSS. This paper presents a comparison between different algorithms for CSS for BSS. The cosine of the angle between the range space spanned by the selected bands and the corresponding left singular vectors is used to evaluate the quality of the selected bands to represent the image. Numerical experiments are conducted using multispectral and hyperspectral data. Results show that SVDSS outperforms other deterministic algorithms while producing comparable results to a 2-stage randomized CSS in small images and in centered data. However, the randomized algorithm significantly outperforms deterministic approaches in large images.
本文探讨了在高光谱成像中使用列子集选择(CSS)进行无监督波段子集选择(BSS)。CSS是选择矩阵中最独立的列的问题。文献中提出了许多确定性和随机化的CSS算法。本文对不同的CSS算法进行了比较。选取的波段与对应的左奇异向量所张成的距离空间夹角的余弦值用于评价选取的波段表示图像的质量。利用多光谱和高光谱数据进行了数值实验。结果表明,在小图像和中心数据中,SVDSS在产生与2阶段随机CSS相当的结果时优于其他确定性算法。然而,随机算法在大图像中明显优于确定性方法。
{"title":"A Comparison of Column Subset Selection Methods for Unsupervised Band Subset Selection in Hyperspectral Imagery","authors":"Maher Aldeghlawi, M. Velez-Reyes","doi":"10.1109/SSIAI.2018.8470360","DOIUrl":"https://doi.org/10.1109/SSIAI.2018.8470360","url":null,"abstract":"This paper explores the use of column subset selection (CSS) for unsupervised band subset selection (BSS) in hyperspectral imaging. CSS is the problem of selecting the most independent columns of a matrix. Many deterministic and randomized algorithms have been proposed in the literature for CSS. This paper presents a comparison between different algorithms for CSS for BSS. The cosine of the angle between the range space spanned by the selected bands and the corresponding left singular vectors is used to evaluate the quality of the selected bands to represent the image. Numerical experiments are conducted using multispectral and hyperspectral data. Results show that SVDSS outperforms other deterministic algorithms while producing comparable results to a 2-stage randomized CSS in small images and in centered data. However, the randomized algorithm significantly outperforms deterministic approaches in large images.","PeriodicalId":422209,"journal":{"name":"2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)","volume":"147 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132451888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
cTADA: The Design of a Crowdsourcing Tool for Online Food Image Identification and Segmentation cTADA:在线食品图像识别和分割众包工具的设计
Pub Date : 2018-04-01 DOI: 10.1109/SSIAI.2018.8470358
S. Fang, Chang Liu, Khalid Tahboub, F. Zhu, E. Delp, C. Boushey
Measuring accurate dietary intake, the process of determining what someone eats during the course of the day is considered to be an open research problem in the nutrition and health fields. We have developed image-based tools to automatically obtain accurate estimates of what foods and how much energy/nutrients a user consumes. In this work, we present a crowdsourcing tool we designed and implemented to collect large sets of relevant online food images. This tool can be used to locate food items and obtaining groundtruth segmentation masks associated with all the foods presented in an image. We present a systematic design for a crowdsourcing tool aiming specifically for the task of online food image collection and annotations with a detailed description. The crowdsoucing tool we designed is tailored to meet the needs of building a large image dataset for developing automatic dietary assessment tools in the nutrition and health fields.
准确测量饮食摄入量,确定一个人在一天中吃了什么,这一过程被认为是营养和健康领域的一个开放性研究问题。我们已经开发了基于图像的工具,可以自动准确估计用户消耗的食物和能量/营养。在这项工作中,我们展示了一个我们设计并实现的众包工具,用于收集大量相关的在线食物图像。该工具可用于定位食物项目,并获得与图像中呈现的所有食物相关的groundtruth分割掩码。我们提出了一个众包工具的系统设计,专门针对在线食物图像收集和详细描述的注释任务。我们设计的众包工具是为满足构建大型图像数据集的需求而量身定制的,用于开发营养与健康领域的自动膳食评估工具。
{"title":"cTADA: The Design of a Crowdsourcing Tool for Online Food Image Identification and Segmentation","authors":"S. Fang, Chang Liu, Khalid Tahboub, F. Zhu, E. Delp, C. Boushey","doi":"10.1109/SSIAI.2018.8470358","DOIUrl":"https://doi.org/10.1109/SSIAI.2018.8470358","url":null,"abstract":"Measuring accurate dietary intake, the process of determining what someone eats during the course of the day is considered to be an open research problem in the nutrition and health fields. We have developed image-based tools to automatically obtain accurate estimates of what foods and how much energy/nutrients a user consumes. In this work, we present a crowdsourcing tool we designed and implemented to collect large sets of relevant online food images. This tool can be used to locate food items and obtaining groundtruth segmentation masks associated with all the foods presented in an image. We present a systematic design for a crowdsourcing tool aiming specifically for the task of online food image collection and annotations with a detailed description. The crowdsoucing tool we designed is tailored to meet the needs of building a large image dataset for developing automatic dietary assessment tools in the nutrition and health fields.","PeriodicalId":422209,"journal":{"name":"2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127830733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
f-Sim: A quasi-realistic fMRI simulation toolbox using digital brain phantom and modeled noise f-Sim:一个准真实的fMRI模拟工具箱,使用数字脑幻象和模拟噪声
Pub Date : 2018-04-01 DOI: 10.1109/SSIAI.2018.8470346
H. Parmar, Xiangyu Liu, Hua Xie, B. Nutter, S. Mitra, L. R. Long, Sameer Kiran Antani
Functional Magnetic Resonance Imaging (fMRI) uses a noninvasive technique to study the functionality of the human brain by measuring the Blood Oxygenation Level Dependent (BOLD) signal and has been researched for decades. However, some potential problems still remain in achieving correct interpretation of BOLD-induced signals due to quite low signal levels, high noise levels, artifacts, lack of ground truth and a number of other inherent problems. We present here the development of a MATLAB based fMRI simulator (f-Sim) using digital phantom brain that generates quasi-realistic 4D fMRI volumes including modeled noise. Such 4D fMRI data can serve to hypothesize ground truth for experimentally acquired data under both task-evoked and resting state designs in investigation of localized or whole brain activation and functional connectivity patterns.
功能磁共振成像(fMRI)使用一种无创技术,通过测量血氧水平依赖(BOLD)信号来研究人类大脑的功能,已经研究了几十年。然而,由于相当低的信号电平、高噪声电平、伪影、缺乏接地真值和许多其他固有问题,在实现bold诱导信号的正确解释方面仍然存在一些潜在问题。我们在这里介绍了一个基于MATLAB的fMRI模拟器(f-Sim)的开发,该模拟器使用数字幻脑生成准真实的4D fMRI体积,包括建模噪声。在研究局部或全脑激活和功能连接模式时,这些4D fMRI数据可以为任务诱发和静息状态设计下实验获得的数据假设基本事实。
{"title":"f-Sim: A quasi-realistic fMRI simulation toolbox using digital brain phantom and modeled noise","authors":"H. Parmar, Xiangyu Liu, Hua Xie, B. Nutter, S. Mitra, L. R. Long, Sameer Kiran Antani","doi":"10.1109/SSIAI.2018.8470346","DOIUrl":"https://doi.org/10.1109/SSIAI.2018.8470346","url":null,"abstract":"Functional Magnetic Resonance Imaging (fMRI) uses a noninvasive technique to study the functionality of the human brain by measuring the Blood Oxygenation Level Dependent (BOLD) signal and has been researched for decades. However, some potential problems still remain in achieving correct interpretation of BOLD-induced signals due to quite low signal levels, high noise levels, artifacts, lack of ground truth and a number of other inherent problems. We present here the development of a MATLAB based fMRI simulator (f-Sim) using digital phantom brain that generates quasi-realistic 4D fMRI volumes including modeled noise. Such 4D fMRI data can serve to hypothesize ground truth for experimentally acquired data under both task-evoked and resting state designs in investigation of localized or whole brain activation and functional connectivity patterns.","PeriodicalId":422209,"journal":{"name":"2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131120615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
DDT: Decentralized event Detection and Tracking using an ensemble of vertex-reinforced walks on a graph 分布式事件检测和跟踪,使用图形上的顶点增强行走集合
Pub Date : 2018-04-01 DOI: 10.1109/SSIAI.2018.8470332
Tamal Batabyal
Automated detection of decentralized event dynamics together with the identification of irregular topology on which the event propagates is a challenging task, which has its application in areas such as geomorphology and video surveillance. The problem becomes severe when the underlying topology is time-varying and multiple events with varied scales exist on the same topology. Conventional research works separately to deal with the problems of detecting events and identifying topology. On one hand, the methodologies for event detection involving the graph-spectral response fail to perform spatiotemporal localization of events if the underlying topology is unknown. On the other hand, the algorithms which estimate the underlying graph topology assume only static nature of the events. In this work, we utilize vertex reinforcement based walks on the topology to simultaneously perform both the tasks by using a scalable and tractable algorithm. An ensemble of such walks recursively updates the event membership of each location in the topology followed by associating a spatial support of each event. Our approach shows improvement over state-of-the-art methods in terms of the spatiotemporal localization of decentralized events.
分散事件动态的自动检测和事件传播的不规则拓扑识别是一项具有挑战性的任务,在地貌学和视频监控等领域都有应用。当底层拓扑是时变的,并且同一拓扑上存在多个不同规模的事件时,问题就变得严重了。传统的研究分别处理事件检测和拓扑识别问题。一方面,如果底层拓扑未知,则涉及图谱响应的事件检测方法无法对事件进行时空定位。另一方面,估计底层图拓扑的算法只假设事件的静态性质。在这项工作中,我们利用基于顶点强化的拓扑行走,通过使用可扩展和可处理的算法同时执行这两个任务。这种遍历的集合递归地更新拓扑中每个位置的事件成员关系,然后关联每个事件的空间支持。我们的方法在分散事件的时空定位方面比最先进的方法有所改进。
{"title":"DDT: Decentralized event Detection and Tracking using an ensemble of vertex-reinforced walks on a graph","authors":"Tamal Batabyal","doi":"10.1109/SSIAI.2018.8470332","DOIUrl":"https://doi.org/10.1109/SSIAI.2018.8470332","url":null,"abstract":"Automated detection of decentralized event dynamics together with the identification of irregular topology on which the event propagates is a challenging task, which has its application in areas such as geomorphology and video surveillance. The problem becomes severe when the underlying topology is time-varying and multiple events with varied scales exist on the same topology. Conventional research works separately to deal with the problems of detecting events and identifying topology. On one hand, the methodologies for event detection involving the graph-spectral response fail to perform spatiotemporal localization of events if the underlying topology is unknown. On the other hand, the algorithms which estimate the underlying graph topology assume only static nature of the events. In this work, we utilize vertex reinforcement based walks on the topology to simultaneously perform both the tasks by using a scalable and tractable algorithm. An ensemble of such walks recursively updates the event membership of each location in the topology followed by associating a spatial support of each event. Our approach shows improvement over state-of-the-art methods in terms of the spatiotemporal localization of decentralized events.","PeriodicalId":422209,"journal":{"name":"2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)","volume":"41 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114131980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Face And Gesture Recognition For Time Sensitive Application 有效的人脸和手势识别时间敏感的应用
Pub Date : 2018-04-01 DOI: 10.1109/SSIAI.2018.8470351
Anush Ananthakumar
Face recognition systems are used in various fields such as biometric authentication, security enhancement, automobile control and user detection. This research is focused on developing a model to control a system using gestures, while simultaneously implementing continuous facial recognition to avoid unauthorized access. An effective face recognition system is developed and applied in conjunction with a gesture recognition system to control a wireless robot in real-time. The facial recognition system extracts the face using the Viola-Jones algorithm which utilizes Haar like features along with Adaboost training. This is followed by a Convolution Neural Network (CNN) based feature extractor and Support Vector Machine (SVM) to recognize the face. The gesture recognition is facilitated by using color segmentation, which involves extracting the skin tone of the detected face and using this to detect the position of hand. The gesture is obtained by tracking the hand using the Kanade-Lucas-Tomasi (KLT) algorithm. In this research, we additionally utilize a background subtraction model so as to extract the foreground and reduce the misclassifications. Such a technique highly improves the performance of the facial and gesture detector in complex and cluttered environments. The performance of the face detector was tested on different databases including the ORL, Caltech and Faces96 database. The efficacy of this system in controlling a robot in real-time has also been demonstrated in this research. It provides an accuracy of 94.44% for recognizing faces and greater than 90.8% for recognizing gestures in real-time applications. Such a system is seen to have superior performance coupled with a relatively lower computation requirement in comparison to existing techniques.
人脸识别系统应用于生物识别认证、安全增强、汽车控制和用户检测等各个领域。本研究的重点是开发一个使用手势控制系统的模型,同时实现连续的面部识别以避免未经授权的访问。开发了一种有效的人脸识别系统,并结合手势识别系统对无线机器人进行实时控制。面部识别系统使用Viola-Jones算法提取面部,该算法利用Haar类特征以及Adaboost训练。接下来是基于卷积神经网络(CNN)的特征提取器和支持向量机(SVM)来识别人脸。利用颜色分割技术提取被检测人脸的肤色,并以此来检测手的位置,从而实现手势识别。手势是通过使用Kanade-Lucas-Tomasi (KLT)算法跟踪手来获得的。在本研究中,我们还利用背景减法模型来提取前景,减少误分类。这种技术极大地提高了面部和手势检测器在复杂和杂乱环境中的性能。在ORL、Caltech和Faces96数据库上测试了人脸检测器的性能。本研究也证明了该系统在机器人实时控制中的有效性。在实时应用中,人脸识别的准确率为94.44%,手势识别的准确率超过90.8%。与现有技术相比,这种系统具有优越的性能和相对较低的计算需求。
{"title":"Efficient Face And Gesture Recognition For Time Sensitive Application","authors":"Anush Ananthakumar","doi":"10.1109/SSIAI.2018.8470351","DOIUrl":"https://doi.org/10.1109/SSIAI.2018.8470351","url":null,"abstract":"Face recognition systems are used in various fields such as biometric authentication, security enhancement, automobile control and user detection. This research is focused on developing a model to control a system using gestures, while simultaneously implementing continuous facial recognition to avoid unauthorized access. An effective face recognition system is developed and applied in conjunction with a gesture recognition system to control a wireless robot in real-time. The facial recognition system extracts the face using the Viola-Jones algorithm which utilizes Haar like features along with Adaboost training. This is followed by a Convolution Neural Network (CNN) based feature extractor and Support Vector Machine (SVM) to recognize the face. The gesture recognition is facilitated by using color segmentation, which involves extracting the skin tone of the detected face and using this to detect the position of hand. The gesture is obtained by tracking the hand using the Kanade-Lucas-Tomasi (KLT) algorithm. In this research, we additionally utilize a background subtraction model so as to extract the foreground and reduce the misclassifications. Such a technique highly improves the performance of the facial and gesture detector in complex and cluttered environments. The performance of the face detector was tested on different databases including the ORL, Caltech and Faces96 database. The efficacy of this system in controlling a robot in real-time has also been demonstrated in this research. It provides an accuracy of 94.44% for recognizing faces and greater than 90.8% for recognizing gestures in real-time applications. Such a system is seen to have superior performance coupled with a relatively lower computation requirement in comparison to existing techniques.","PeriodicalId":422209,"journal":{"name":"2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126073610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Image Compression: Sparse Coding vs. Bottleneck Autoencoders 图像压缩:稀疏编码与瓶颈自动编码器
Pub Date : 2017-10-26 DOI: 10.1109/SSIAI.2018.8470336
Y. Watkins, M. Sayeh, O. Iaroshenko, Garrett T. Kenyon
Bottleneck autoencoders have been actively researched as a solution to image compression tasks. However, we observed that bottleneck autoencoders produce subjectively low quality reconstructed images. In this work, we explore the ability of sparse coding to improve reconstructed image quality for the same degree of compression. We observe that sparse image compression produces visually superior reconstructed images and yields higher values of pixel-wise measures of reconstruction quality (PSNR and SSIM) compared to bottleneck autoencoders. In addition, we find that using alternative metrics that correlate better with human perception, such as feature perceptual loss and the classification accuracy, sparse image compression scores up to 18.06% and 2.7% higher, respectively, compared to bottleneck autoencoders. Although computationally much more intensive, we find that sparse coding is otherwise superior to bottleneck autoencoders for the same degree of compression.
瓶颈自编码器作为一种解决图像压缩问题的方法得到了积极的研究。然而,我们观察到瓶颈自编码器产生主观上低质量的重建图像。在这项工作中,我们探索了稀疏编码在相同压缩程度下提高重建图像质量的能力。我们观察到,与瓶颈自编码器相比,稀疏图像压缩产生了视觉上更好的重建图像,并且产生了更高的像素级重建质量(PSNR和SSIM)。此外,我们发现使用与人类感知更好相关的替代指标,如特征感知损失和分类精度,稀疏图像压缩得分分别比瓶颈自编码器高18.06%和2.7%。虽然计算更密集,我们发现稀疏编码在其他方面优于瓶颈自编码器在相同程度的压缩。
{"title":"Image Compression: Sparse Coding vs. Bottleneck Autoencoders","authors":"Y. Watkins, M. Sayeh, O. Iaroshenko, Garrett T. Kenyon","doi":"10.1109/SSIAI.2018.8470336","DOIUrl":"https://doi.org/10.1109/SSIAI.2018.8470336","url":null,"abstract":"Bottleneck autoencoders have been actively researched as a solution to image compression tasks. However, we observed that bottleneck autoencoders produce subjectively low quality reconstructed images. In this work, we explore the ability of sparse coding to improve reconstructed image quality for the same degree of compression. We observe that sparse image compression produces visually superior reconstructed images and yields higher values of pixel-wise measures of reconstruction quality (PSNR and SSIM) compared to bottleneck autoencoders. In addition, we find that using alternative metrics that correlate better with human perception, such as feature perceptual loss and the classification accuracy, sparse image compression scores up to 18.06% and 2.7% higher, respectively, compared to bottleneck autoencoders. Although computationally much more intensive, we find that sparse coding is otherwise superior to bottleneck autoencoders for the same degree of compression.","PeriodicalId":422209,"journal":{"name":"2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114115426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1