首页 > 最新文献

Proceedings of the 4th International Conference on Control and Computer Vision最新文献

英文 中文
Region-Guided Pixel-Level Label Generation for Weakly Supervised Semantic Segmentation 弱监督语义分割的区域引导像素级标签生成
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484275
Xinyu Fu, Xiao Yao
The lack of reliable segmentation labels is the major obstacles to weakly supervised semantic segmentation. We provide a pseudo-label generation approach based on a deep convolutional neural network, which is supervised by the image-level category labels only. However, the limitation of the region of interest in the targets influences the effectiveness and integrity of the traditional methods in obtaining pixel-level mask annotations. This paper studies the characteristics of class activation mapping in classification network, focusing on the methods to enhance the localization ability of class activation mapping. We propose a Region-guided Pixel-label Generation framework (RPG) for semantic segmentation. The proposed region guidance mechanism decreases the influence of category supervision and makes use of the known high-level semantic information to guide the network, attaining more complete pixel-level annotations via expanding the regions of interest. Experimental results of training and validation on the PASCALVOC 2012 data set prove to achieve better pixel labeling and segmentation accuracy comparing with state-of-the-art methods.
缺乏可靠的分词标签是弱监督语义分词的主要障碍。我们提供了一种基于深度卷积神经网络的伪标签生成方法,该方法仅由图像级类别标签监督。然而,目标感兴趣区域的局限性影响了传统方法获取像素级蒙版注释的有效性和完整性。本文研究了分类网络中类激活映射的特点,重点研究了增强类激活映射定位能力的方法。我们提出了一个区域导向的像素标签生成框架(RPG)用于语义分割。提出的区域引导机制减少了类别监督的影响,利用已知的高级语义信息来引导网络,通过扩展感兴趣的区域来获得更完整的像素级标注。在PASCALVOC 2012数据集上进行训练和验证的实验结果表明,与现有方法相比,该方法具有更好的像素标记和分割精度。
{"title":"Region-Guided Pixel-Level Label Generation for Weakly Supervised Semantic Segmentation","authors":"Xinyu Fu, Xiao Yao","doi":"10.1145/3484274.3484275","DOIUrl":"https://doi.org/10.1145/3484274.3484275","url":null,"abstract":"The lack of reliable segmentation labels is the major obstacles to weakly supervised semantic segmentation. We provide a pseudo-label generation approach based on a deep convolutional neural network, which is supervised by the image-level category labels only. However, the limitation of the region of interest in the targets influences the effectiveness and integrity of the traditional methods in obtaining pixel-level mask annotations. This paper studies the characteristics of class activation mapping in classification network, focusing on the methods to enhance the localization ability of class activation mapping. We propose a Region-guided Pixel-label Generation framework (RPG) for semantic segmentation. The proposed region guidance mechanism decreases the influence of category supervision and makes use of the known high-level semantic information to guide the network, attaining more complete pixel-level annotations via expanding the regions of interest. Experimental results of training and validation on the PASCALVOC 2012 data set prove to achieve better pixel labeling and segmentation accuracy comparing with state-of-the-art methods.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121456354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design Virtual Reality Simulation System for Epidemic (Covid-19) Education to Public 新型冠状病毒肺炎公众教育虚拟现实仿真系统设计
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484297
Zhanti Liang, Yongkang Xing, Kexin Guan, Zheng Da, Jianwen Fan, Gan Wu
In 2020, the COVID-19 epidemic swept the world and continued to spread. The global epidemic reflects the severe inadequacy of public education on infectious disease prevention in many countries. Therefore, it is necessary to popularize the knowledge of infectious disease prevention by designing a new epidemic education system. As a new interactive technology, Virtual Reality (VR) technology profoundly changes the human-computer interaction experience. According to the characteristics of VR, this research designed a virtual simulation system based on epidemic education research. This system is developed with Unreal 4 engine and simulates scenarios that require epidemic prevention education. The user interacts with the three-dimensional (3D) scene model through the VR controllers and visual user interface. This system allows users to effectively acquire knowledge about the epidemic and experience how medical staff deal with emergencies. This article discusses the methods of popularizing epidemic knowledge education by virtual simulation technology. It illustrates the system and knowledge popularization of the new epidemic health education in virtual simulation. The research will assist the public in acquiring epidemic knowledge effectively.
2020年,新冠肺炎疫情席卷全球并继续蔓延。这一全球性流行病反映出许多国家在预防传染病方面的公共教育严重不足。因此,有必要通过设计一种新的流行病教育体系来普及传染病预防知识。虚拟现实(VR)技术作为一种新型的交互技术,深刻地改变了人机交互体验。根据虚拟现实技术的特点,本研究设计了一个基于流行病教育研究的虚拟仿真系统。本系统采用虚幻4引擎开发,模拟需要进行防疫教育的场景。用户通过VR控制器和可视化用户界面与三维(3D)场景模型交互。该系统使用户可以有效地获取疫情知识,体验医护人员如何处理突发事件。本文探讨了利用虚拟仿真技术普及流行病知识教育的方法。阐述了新型流行病健康教育在虚拟仿真中的系统化和知识普及。这项研究将有助于公众有效地获取流行病知识。
{"title":"Design Virtual Reality Simulation System for Epidemic (Covid-19) Education to Public","authors":"Zhanti Liang, Yongkang Xing, Kexin Guan, Zheng Da, Jianwen Fan, Gan Wu","doi":"10.1145/3484274.3484297","DOIUrl":"https://doi.org/10.1145/3484274.3484297","url":null,"abstract":"In 2020, the COVID-19 epidemic swept the world and continued to spread. The global epidemic reflects the severe inadequacy of public education on infectious disease prevention in many countries. Therefore, it is necessary to popularize the knowledge of infectious disease prevention by designing a new epidemic education system. As a new interactive technology, Virtual Reality (VR) technology profoundly changes the human-computer interaction experience. According to the characteristics of VR, this research designed a virtual simulation system based on epidemic education research. This system is developed with Unreal 4 engine and simulates scenarios that require epidemic prevention education. The user interacts with the three-dimensional (3D) scene model through the VR controllers and visual user interface. This system allows users to effectively acquire knowledge about the epidemic and experience how medical staff deal with emergencies. This article discusses the methods of popularizing epidemic knowledge education by virtual simulation technology. It illustrates the system and knowledge popularization of the new epidemic health education in virtual simulation. The research will assist the public in acquiring epidemic knowledge effectively.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122822288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Active Disturbance Rejection Control for a Parabolic Trough Solar Field 抛物线槽型太阳场的自抗扰控制
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484299
Xian-hua Gao, Zhigang Su
It is challenging and imperative to achieve unbiased control for field outlet temperature of parabolic trough solar field (PTSF) as it is a critical part of solar plants but is interrupted by multiple disturbances. To tackle this problem, an active disturbance rejection control (ADRC) is designed to alleviate these disturbances. Firstly, all the disturbances, including external disturbances, model mismatch and parameter perturbation rather than just direct normal irradiation, are lumped into one disturbance. And then, the lumped disturbance is estimated and rejected by an ADRC, where the order and gain of ADRC are determined based on an operating point of a nonlinear PTSF. The performance of the ADRC controller is also compared to 2-DOF-PID, and simulation results show remarkable merits in disturbance rejection and reference tracking of the proposed controller.
抛物槽太阳能场作为太阳能电站的重要组成部分,其出口温度会受到多种干扰,实现对出口温度的无偏控制是一项具有挑战性和必要性的工作。为了解决这个问题,设计了一种自抗扰控制(ADRC)来减轻这些干扰。首先,将所有的扰动,包括外部扰动、模型失配和参数扰动,而不仅仅是直接的正常辐射,集中到一个扰动中;然后利用自抗扰器估计和抑制集总扰动,其中自抗扰器的阶数和增益由非线性PTSF的工作点确定。通过与2- dof pid控制器的性能比较,仿真结果表明该控制器在抗干扰和参考跟踪方面具有显著的优点。
{"title":"Active Disturbance Rejection Control for a Parabolic Trough Solar Field","authors":"Xian-hua Gao, Zhigang Su","doi":"10.1145/3484274.3484299","DOIUrl":"https://doi.org/10.1145/3484274.3484299","url":null,"abstract":"It is challenging and imperative to achieve unbiased control for field outlet temperature of parabolic trough solar field (PTSF) as it is a critical part of solar plants but is interrupted by multiple disturbances. To tackle this problem, an active disturbance rejection control (ADRC) is designed to alleviate these disturbances. Firstly, all the disturbances, including external disturbances, model mismatch and parameter perturbation rather than just direct normal irradiation, are lumped into one disturbance. And then, the lumped disturbance is estimated and rejected by an ADRC, where the order and gain of ADRC are determined based on an operating point of a nonlinear PTSF. The performance of the ADRC controller is also compared to 2-DOF-PID, and simulation results show remarkable merits in disturbance rejection and reference tracking of the proposed controller.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"10 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121001446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Straw Defect Detection Algorithm Based on Pruned YOLOv3 基于修剪YOLOv3的秸秆缺陷检测算法
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484285
Qi-chang Xu, Liang Zhou
To solve the problem of defect detection in straw pipeline production, this paper proposes an efficient and fast straw defect detection algorithm (IPOY) based on pruned YOLOv3. Algorithm adopts YOLOv3 model, and then trains the model with channel sparsity regularization, prunes channels with small scaling factors after sparse training, finally fine-tune the pruned network. This process was iterated several times to compress the YOLOv3 model to achieve a lighter model volume, reduce the computational cost of the model, and make the model suitable for industrial production to facilitate application migration to mobile devices. Experimental results show that the proposed algorithm can compress the volume of YOLOv3 model to the maximum extent and maintain the high precision of detection.
为了解决秸秆管道生产中的缺陷检测问题,本文提出了一种基于修剪YOLOv3的高效快速秸秆缺陷检测算法(IPOY)。算法采用YOLOv3模型,然后对模型进行信道稀疏正则化训练,在稀疏训练后对小尺度因子的信道进行剪枝,最后对剪枝后的网络进行微调。该过程经过多次迭代,对YOLOv3模型进行压缩,以实现更轻的模型体积,降低模型的计算成本,并使模型适合工业生产,便于应用向移动设备迁移。实验结果表明,该算法可以最大程度地压缩YOLOv3模型的体积,并保持较高的检测精度。
{"title":"Straw Defect Detection Algorithm Based on Pruned YOLOv3","authors":"Qi-chang Xu, Liang Zhou","doi":"10.1145/3484274.3484285","DOIUrl":"https://doi.org/10.1145/3484274.3484285","url":null,"abstract":"To solve the problem of defect detection in straw pipeline production, this paper proposes an efficient and fast straw defect detection algorithm (IPOY) based on pruned YOLOv3. Algorithm adopts YOLOv3 model, and then trains the model with channel sparsity regularization, prunes channels with small scaling factors after sparse training, finally fine-tune the pruned network. This process was iterated several times to compress the YOLOv3 model to achieve a lighter model volume, reduce the computational cost of the model, and make the model suitable for industrial production to facilitate application migration to mobile devices. Experimental results show that the proposed algorithm can compress the volume of YOLOv3 model to the maximum extent and maintain the high precision of detection.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127702978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Learning from Human Uncertainty by Choquet Integral for Optic Disc Segmentation 基于人的不确定性学习的Choquet积分视盘分割
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484276
H. Qiu, P. Su, Shanshan Jiang, Xingyu Yue, Yitian Zhao, Jiang Liu
Modern deep neural networks are able to beat human annotators in several medical image processing tasks. In practical manual annotation for medical image segmentation tasks, the labels of annotators often show inter-observer variability (IOV) which is mainly caused by annotators’ different understandings of expertise. In order to build a trustworthy segmentation system, robust models should consider how to capture uncertainty in samples and labels. Different from the conventional way of handling IOV with label fusion such as majority voting, a fuzzy integral based ensemble framework of multiple deep learning models for optic disc segmentation is proposed. Each component segmentation model is trained with respect to an annotator. Then, a powerful nonlinear aggregation function, the Choquet integral, is employed in form of a neural network to integrate the segmentation results of multiple annotators. The proposed method is validated on the public RIM-ONE dataset consisting of 169 fundus images and each image is annotated by 5 experts. Compared with conventional segmentation ensemble methods, the proposed methods achieves a higher Dice score (98.69%).
现代深度神经网络能够在一些医学图像处理任务中击败人类注释器。在医学图像分割任务的实际手工标注中,标注者的标签往往存在观察者间变异(IOV),这主要是由于标注者对专业知识的理解不同造成的。为了建立一个可靠的分割系统,鲁棒模型应该考虑如何捕捉样本和标签中的不确定性。与传统的多数投票等标签融合处理视盘分割的方法不同,提出了一种基于模糊积分的视盘分割多深度学习模型集成框架。每个组件分割模型都是根据注释器进行训练的。然后,以神经网络的形式利用强大的非线性聚集函数Choquet积分对多个标注器的分割结果进行整合。该方法在由169张眼底图像组成的RIM-ONE公共数据集上进行了验证,每张图像由5位专家进行了注释。与传统的分割集成方法相比,本文方法的Dice得分(98.69%)更高。
{"title":"Learning from Human Uncertainty by Choquet Integral for Optic Disc Segmentation","authors":"H. Qiu, P. Su, Shanshan Jiang, Xingyu Yue, Yitian Zhao, Jiang Liu","doi":"10.1145/3484274.3484276","DOIUrl":"https://doi.org/10.1145/3484274.3484276","url":null,"abstract":"Modern deep neural networks are able to beat human annotators in several medical image processing tasks. In practical manual annotation for medical image segmentation tasks, the labels of annotators often show inter-observer variability (IOV) which is mainly caused by annotators’ different understandings of expertise. In order to build a trustworthy segmentation system, robust models should consider how to capture uncertainty in samples and labels. Different from the conventional way of handling IOV with label fusion such as majority voting, a fuzzy integral based ensemble framework of multiple deep learning models for optic disc segmentation is proposed. Each component segmentation model is trained with respect to an annotator. Then, a powerful nonlinear aggregation function, the Choquet integral, is employed in form of a neural network to integrate the segmentation results of multiple annotators. The proposed method is validated on the public RIM-ONE dataset consisting of 169 fundus images and each image is annotated by 5 experts. Compared with conventional segmentation ensemble methods, the proposed methods achieves a higher Dice score (98.69%).","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131354883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Comparison between a Whole and Separated Feature Information for Acute Lymphoblastic Leukemia (ALL) Classification 整体与分离特征信息在急性淋巴细胞白血病(ALL)分类中的比较
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484289
A. Muntasa, Muhammad Yusuf
Acute lymphoblastic Leukemia (ALL) is dangerous cancer in which the infected blood cells disturb the blood and bone marrow. It attacks the body's immune and the ability of bone marrow to produce white blood cells have diminished. This research aims to classify the ALL image using the whole feature information. We proposed a method to decrease the image's size using the whole co-occurrence matrix to represent the object. The research performances have produced 90.77%, 96,67%, and 95.38% for the maximum accuracy, sensitivity, and specificity. This research has also compared to separate channels, which are red, green, and blue. Our novel method shows that the whole feature information has yielded higher accuracy, sensitivity, and specificity than the others, which are the red, green, as well as blue channels. Furthermore, this research has a novelty, i.e., to prove that the whole feature information method is better for the implementation system. Additionally, this research contributes by proposing a method about whole feature information for the implementation system.
急性淋巴细胞白血病(ALL)是一种危险的癌症,感染的血细胞扰乱血液和骨髓。它会攻击人体的免疫系统,使骨髓产生白细胞的能力减弱。本研究旨在利用整体特征信息对ALL图像进行分类。我们提出了一种利用整个共现矩阵来表示目标的减小图像尺寸的方法。研究结果分别达到90.77%、96%、67%和95.38%的最高准确度、灵敏度和特异度。这项研究还比较了红、绿、蓝三种不同的通道。我们的新方法表明,整个特征信息比其他通道(红通道、绿通道和蓝通道)具有更高的准确性、灵敏度和特异性。此外,本研究具有新颖性,即证明了整体特征信息方法更适合于实现系统。此外,本研究还为实现系统提供了一种全特征信息的方法。
{"title":"Comparison between a Whole and Separated Feature Information for Acute Lymphoblastic Leukemia (ALL) Classification","authors":"A. Muntasa, Muhammad Yusuf","doi":"10.1145/3484274.3484289","DOIUrl":"https://doi.org/10.1145/3484274.3484289","url":null,"abstract":"Acute lymphoblastic Leukemia (ALL) is dangerous cancer in which the infected blood cells disturb the blood and bone marrow. It attacks the body's immune and the ability of bone marrow to produce white blood cells have diminished. This research aims to classify the ALL image using the whole feature information. We proposed a method to decrease the image's size using the whole co-occurrence matrix to represent the object. The research performances have produced 90.77%, 96,67%, and 95.38% for the maximum accuracy, sensitivity, and specificity. This research has also compared to separate channels, which are red, green, and blue. Our novel method shows that the whole feature information has yielded higher accuracy, sensitivity, and specificity than the others, which are the red, green, as well as blue channels. Furthermore, this research has a novelty, i.e., to prove that the whole feature information method is better for the implementation system. Additionally, this research contributes by proposing a method about whole feature information for the implementation system.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"127 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122648088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-label Prediction for Visual Sentiment Analysis using Eight Different Emotions based on Psychology 基于心理学的八种不同情绪视觉情感分析多标签预测
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484296
Tetsuya Asakawa, Masaki Aono
In visual sentiment analysis, sentiment estimation from images is a challenging research problem. Previous studies focused on a few specific sentiments and their intensities and have not captured abundant psychological human feelings. In addition, multi-label sentiment estimation from images has not been sufficiently investigated. The purpose of this research is to build a visual sentiment dataset, accurately estimate the sentiments as a multi-label multi-class problem from images that simultaneously evoke multiple emotions. We built a visual sentiment dataset based on Plutchik's wheel of emotions. We describe this ‘Senti8PW’ dataset, then perform multi-label sentiment analysis using the dataset, where we propose a combined deep neural network model that enables inputs from both hand-crafted features and CNN features. We also introduce a threshold-based multi-label prediction algorithm, in which we assume that each emotion has a probability distribution. In other words, after training our deep neural network, we predict evoked emotions for an image if the intensity of the emotion is larger than the threshold of the corresponding emotion. Extensive experiments were conducted on our dataset. Our model achieves superior results compared to the state-of-the-art algorithms in terms of subsets.
在视觉情感分析中,基于图像的情感估计是一个具有挑战性的研究问题。以前的研究集中在一些特定的情绪及其强度上,并没有捕捉到丰富的人类心理感受。此外,图像的多标签情感估计还没有得到充分的研究。本研究的目的是建立一个视觉情感数据集,从同时唤起多种情感的图像中准确地估计出情感作为一个多标签多类问题。我们建立了一个基于普鲁契克情绪轮的视觉情绪数据集。我们描述了这个“Senti8PW”数据集,然后使用该数据集进行多标签情感分析,其中我们提出了一个组合的深度神经网络模型,该模型支持手工制作特征和CNN特征的输入。我们还引入了一种基于阈值的多标签预测算法,其中我们假设每种情绪都有一个概率分布。换句话说,在训练我们的深度神经网络之后,如果情绪的强度大于相应情绪的阈值,我们就可以预测图像所唤起的情绪。在我们的数据集上进行了广泛的实验。与最先进的算法相比,我们的模型在子集方面取得了更好的结果。
{"title":"Multi-label Prediction for Visual Sentiment Analysis using Eight Different Emotions based on Psychology","authors":"Tetsuya Asakawa, Masaki Aono","doi":"10.1145/3484274.3484296","DOIUrl":"https://doi.org/10.1145/3484274.3484296","url":null,"abstract":"In visual sentiment analysis, sentiment estimation from images is a challenging research problem. Previous studies focused on a few specific sentiments and their intensities and have not captured abundant psychological human feelings. In addition, multi-label sentiment estimation from images has not been sufficiently investigated. The purpose of this research is to build a visual sentiment dataset, accurately estimate the sentiments as a multi-label multi-class problem from images that simultaneously evoke multiple emotions. We built a visual sentiment dataset based on Plutchik's wheel of emotions. We describe this ‘Senti8PW’ dataset, then perform multi-label sentiment analysis using the dataset, where we propose a combined deep neural network model that enables inputs from both hand-crafted features and CNN features. We also introduce a threshold-based multi-label prediction algorithm, in which we assume that each emotion has a probability distribution. In other words, after training our deep neural network, we predict evoked emotions for an image if the intensity of the emotion is larger than the threshold of the corresponding emotion. Extensive experiments were conducted on our dataset. Our model achieves superior results compared to the state-of-the-art algorithms in terms of subsets.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128159149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault Diagnosis of UAV System Base On One-Class Support Vector Machine 基于一类支持向量机的无人机系统故障诊断
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484301
Zaifei Fu, Xin Chen, Yu-juan Guo, Jing Chen
Given the complex structure and long failure time of the flight automation control system, which affect the aircraft's operational efficiency, a fault diagnosis scheme with a one-class support vector machine(OCSVM) optimized by an ant colony optimization(ACO) is proposed. Firstly, this paper analyses the fault characteristics of flight automation systems and constructs a noise filter. Then, a residual decision algorithm based on an improved support vector machine is proposed to judge the residuals in the case of complex flight control system output coupling. Third, experimental simulation results show that the decision algorithm takes about 0.5s for fault detection at a sampling time of 0.1s, significantly reducing fault detection time and an effective fault detection rate of greater than 90%.
针对飞行自动控制系统结构复杂、故障时间长影响飞机运行效率的问题,提出了一种基于蚁群优化的一类支持向量机(OCSVM)故障诊断方案。首先,分析了飞行自动化系统的故障特征,构建了噪声滤波器。然后,提出了一种基于改进支持向量机的残差判定算法,用于判断复杂飞控系统输出耦合情况下的残差。第三,实验仿真结果表明,在采样时间为0.1s的情况下,决策算法的故障检测时间约为0.5s,显著缩短了故障检测时间,有效故障检出率大于90%。
{"title":"Fault Diagnosis of UAV System Base On One-Class Support Vector Machine","authors":"Zaifei Fu, Xin Chen, Yu-juan Guo, Jing Chen","doi":"10.1145/3484274.3484301","DOIUrl":"https://doi.org/10.1145/3484274.3484301","url":null,"abstract":"Given the complex structure and long failure time of the flight automation control system, which affect the aircraft's operational efficiency, a fault diagnosis scheme with a one-class support vector machine(OCSVM) optimized by an ant colony optimization(ACO) is proposed. Firstly, this paper analyses the fault characteristics of flight automation systems and constructs a noise filter. Then, a residual decision algorithm based on an improved support vector machine is proposed to judge the residuals in the case of complex flight control system output coupling. Third, experimental simulation results show that the decision algorithm takes about 0.5s for fault detection at a sampling time of 0.1s, significantly reducing fault detection time and an effective fault detection rate of greater than 90%.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129734797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improvement of Detection Rate for Small Objects Using Pre-processing Network 利用预处理网络提高小目标的检测率
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484283
Doohee Lee, Gi Soon Cha, Ehtesham Iqbal, H. Song, Kwang-nam Choi
Artificial intelligence (AI) has been developing in a variety of methods over the past decade. However most AI experts worried to build a deep or wide network because the accuracy of AI models depends heavily on the depth of the network. In general deep and wide networks are better at learning than those that are less deep and wide and wide. On the other hand deeper networks are more complex and have many disadvantages such as computational cost and system specification dependency. We propose a novel method to improve the average recall rate for small objects in the deep convolutional network in the paper. The proposed method added pre-processing layer before the network rather than stacking the networks deeper or wide. The presented pre-processing layer consists of two major parts: up-sampling and down-sampling of the data. The overall objective of up-sampling and down-sampling is to enhance the resolution of small objects in the input image. The pre-processing network improves the average recall rate of the base network to 3.56%. This experiment result depicts that the proposed method outperforms the small object detection performance. CCS CONCEPTS • Computing methodologies • Object detection
人工智能(AI)在过去十年中以各种方式发展。然而,大多数人工智能专家担心建立一个深度或广泛的网络,因为人工智能模型的准确性在很大程度上取决于网络的深度。一般来说,深度和广度的网络比深度和广度不够的网络更擅长学习。另一方面,深度网络更加复杂,并且存在计算成本和系统规格依赖性等缺点。本文提出了一种提高深度卷积网络中小目标平均召回率的新方法。该方法在网络前增加预处理层,而不是将网络堆叠得更深或更宽。本文提出的预处理层包括数据的上采样和下采样两大部分。上采样和下采样的总体目标是提高输入图像中小目标的分辨率。预处理网络将基础网络的平均召回率提高到3.56%。实验结果表明,该方法具有较好的小目标检测性能。CCS概念•计算方法•对象检测
{"title":"Improvement of Detection Rate for Small Objects Using Pre-processing Network","authors":"Doohee Lee, Gi Soon Cha, Ehtesham Iqbal, H. Song, Kwang-nam Choi","doi":"10.1145/3484274.3484283","DOIUrl":"https://doi.org/10.1145/3484274.3484283","url":null,"abstract":"Artificial intelligence (AI) has been developing in a variety of methods over the past decade. However most AI experts worried to build a deep or wide network because the accuracy of AI models depends heavily on the depth of the network. In general deep and wide networks are better at learning than those that are less deep and wide and wide. On the other hand deeper networks are more complex and have many disadvantages such as computational cost and system specification dependency. We propose a novel method to improve the average recall rate for small objects in the deep convolutional network in the paper. The proposed method added pre-processing layer before the network rather than stacking the networks deeper or wide. The presented pre-processing layer consists of two major parts: up-sampling and down-sampling of the data. The overall objective of up-sampling and down-sampling is to enhance the resolution of small objects in the input image. The pre-processing network improves the average recall rate of the base network to 3.56%. This experiment result depicts that the proposed method outperforms the small object detection performance. CCS CONCEPTS • Computing methodologies • Object detection","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120966421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting PM2.5 and Tracking Spatial Influence Patterns of Traffic Using Interpretable Deep Learning 利用可解释深度学习预测PM2.5和跟踪交通空间影响模式
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484302
Lianliang Chen, Z. Shan
Air pollution is a growing worldwide problem. Accurate prediction of PM2.5 concentration has a vital role to reduce the dramatic toll of air pollution on health. Due to the non-linearity and complexity of air pollution process and the influence of multiple factors, such as meteorological conditions, human activities and other chemical components, traditional pollution-related models have challenges in dealing with PM2.5 modeling. Based on atmospheric domain knowledge, we proposed a novel and interpretable deep learning model (iDeepAir) to predict hourly PM2.5 concentration by incorporating traffic data, meteorological data and air quality data. We designed feature interaction module and temporal interaction module to simulate pollution chemical reaction process and temporal accumulated process respectively, which makes the model has better understood and improves prediction accuracy of PM2.5 concentration. Compared to the best comparison model, mean absolute error (MAE) and rooted mean squared error (RMSE) were improved by 20.1% and 14.4% in 24h respectively. Furthermore, with the embedded Layerwise Relevance Propagation (LRP) algorithm, iDeepAir allows us to observe the spatial influence patterns of regional traffic emissions in a high-resolution way and evaluate the impact of traffic emissions on PM2.5 formation. Taking Shanghai as an example, we discover that although there are serious traffic emissions in some areas of Shanghai, they do not always directly aggravate air pollution, which is also affected by local buildings, meteorological conditions, and other human activities. These results show the spatial interpretability of our model and provide a quantitive decision-making basis for the government to control air pollution.
空气污染是一个日益严重的世界性问题。准确预测PM2.5浓度对于减少空气污染对健康造成的巨大损害具有至关重要的作用。由于大气污染过程的非线性和复杂性,以及气象条件、人类活动和其他化学成分等多种因素的影响,传统的污染相关模型在处理PM2.5建模时面临挑战。基于大气领域的知识,我们提出了一种新的、可解释的深度学习模型(iDeepAir),通过结合交通数据、气象数据和空气质量数据来预测每小时PM2.5浓度。我们设计了特征交互模块和时间交互模块,分别模拟污染化学反应过程和时间累积过程,使模型更好地理解PM2.5浓度,提高了预测精度。与最佳比较模型相比,平均绝对误差(MAE)和均方根误差(RMSE)在24h内分别提高了20.1%和14.4%。此外,iDeepAir通过嵌入式分层关联传播(LRP)算法,以高分辨率的方式观察区域交通排放的空间影响格局,并评估交通排放对PM2.5形成的影响。以上海为例,我们发现,虽然上海部分地区存在严重的交通排放,但并不总是直接加剧空气污染,空气污染还受到当地建筑、气象条件等人类活动的影响。这些结果显示了模型的空间可解释性,为政府控制大气污染提供了定量决策依据。
{"title":"Forecasting PM2.5 and Tracking Spatial Influence Patterns of Traffic Using Interpretable Deep Learning","authors":"Lianliang Chen, Z. Shan","doi":"10.1145/3484274.3484302","DOIUrl":"https://doi.org/10.1145/3484274.3484302","url":null,"abstract":"Air pollution is a growing worldwide problem. Accurate prediction of PM2.5 concentration has a vital role to reduce the dramatic toll of air pollution on health. Due to the non-linearity and complexity of air pollution process and the influence of multiple factors, such as meteorological conditions, human activities and other chemical components, traditional pollution-related models have challenges in dealing with PM2.5 modeling. Based on atmospheric domain knowledge, we proposed a novel and interpretable deep learning model (iDeepAir) to predict hourly PM2.5 concentration by incorporating traffic data, meteorological data and air quality data. We designed feature interaction module and temporal interaction module to simulate pollution chemical reaction process and temporal accumulated process respectively, which makes the model has better understood and improves prediction accuracy of PM2.5 concentration. Compared to the best comparison model, mean absolute error (MAE) and rooted mean squared error (RMSE) were improved by 20.1% and 14.4% in 24h respectively. Furthermore, with the embedded Layerwise Relevance Propagation (LRP) algorithm, iDeepAir allows us to observe the spatial influence patterns of regional traffic emissions in a high-resolution way and evaluate the impact of traffic emissions on PM2.5 formation. Taking Shanghai as an example, we discover that although there are serious traffic emissions in some areas of Shanghai, they do not always directly aggravate air pollution, which is also affected by local buildings, meteorological conditions, and other human activities. These results show the spatial interpretability of our model and provide a quantitive decision-making basis for the government to control air pollution.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129220701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 4th International Conference on Control and Computer Vision
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1