首页 > 最新文献

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning最新文献

英文 中文
An Interpretable Brain Network Atlas-Based Hybrid Model for Mild Cognitive Impairment Progression Prediction 基于可解释脑网络图谱的轻度认知障碍进展预测混合模型
Xianglong Guan, Li Ma, Yunyou Huang, Suqin Tang, Tinghui Li
The process of Alzheimer’s disease (AD) is irreversible, but reasonable medical intervention for preclinical AD can delay AD’s onset. Progressive mild cognitive impairment (pMCI) is the most critical stage for AD preclinical intervention. Therefore, accurate identification of pMCI will significantly improve patient benefits. Functional MRI is a neuroimaging modality that has been widely utilized to study brain activity related to AD. However, it is challenging to obtain functional MRI data, and a small amount of data will easily lead to the overfitting of the identification model. In addition, the current pMCI identification model lack interpretability leads to difficulty in acceptance by clinicians. In this work, we propose an interpretable hybrid model based on a brain network atlas to identify pMCI subjects. First, the hybrid model utilizes multi-layer perceptron to obtain categorical global features to help graph neural networks reduce overfitting. Second, the attention mechanism is introduced into the model to explain the recognition behavior of the model. The results show that our model outperforms the comparison models on multiple metrics.
阿尔茨海默病(AD)的发病过程是不可逆的,但对临床前AD进行合理的医学干预可以延缓AD的发病。进行性轻度认知障碍(pMCI)是AD临床前干预的最关键阶段。因此,准确识别pMCI将显著提高患者获益。功能MRI是一种神经成像技术,已广泛用于研究与AD相关的脑活动。然而,功能性MRI数据的获取具有挑战性,数据量少容易导致识别模型的过拟合。此外,目前的pMCI识别模型缺乏可解释性,导致临床医生难以接受。在这项工作中,我们提出了一个基于脑网络图谱的可解释混合模型来识别pMCI受试者。首先,混合模型利用多层感知器获取分类全局特征,帮助图神经网络减少过拟合。其次,在模型中引入注意机制来解释模型的识别行为。结果表明,我们的模型在多个指标上优于比较模型。
{"title":"An Interpretable Brain Network Atlas-Based Hybrid Model for Mild Cognitive Impairment Progression Prediction","authors":"Xianglong Guan, Li Ma, Yunyou Huang, Suqin Tang, Tinghui Li","doi":"10.1145/3590003.3590081","DOIUrl":"https://doi.org/10.1145/3590003.3590081","url":null,"abstract":"The process of Alzheimer’s disease (AD) is irreversible, but reasonable medical intervention for preclinical AD can delay AD’s onset. Progressive mild cognitive impairment (pMCI) is the most critical stage for AD preclinical intervention. Therefore, accurate identification of pMCI will significantly improve patient benefits. Functional MRI is a neuroimaging modality that has been widely utilized to study brain activity related to AD. However, it is challenging to obtain functional MRI data, and a small amount of data will easily lead to the overfitting of the identification model. In addition, the current pMCI identification model lack interpretability leads to difficulty in acceptance by clinicians. In this work, we propose an interpretable hybrid model based on a brain network atlas to identify pMCI subjects. First, the hybrid model utilizes multi-layer perceptron to obtain categorical global features to help graph neural networks reduce overfitting. Second, the attention mechanism is introduced into the model to explain the recognition behavior of the model. The results show that our model outperforms the comparison models on multiple metrics.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"4577 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114071189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-strategy Improved Multi-objective Harris Hawk Optimization Algorithm with Elite Opposition-based Learning 基于精英对立学习的多策略改进多目标Harris Hawk优化算法
Fulin Tian, Jiayang Wang, Fei Chu, Lin Zhou
Abstract: To make up for the deficiencies of the Harris hawk optimization algorithm (HHO) in solving multi-objective optimization problems with low algorithm accuracy, slow rate of convergence, and easily fall into the trap of local optima, a multi-strategy improved multi-objective Harris hawk optimization algorithm with elite opposition-based learning (MO-EMHHO) is proposed. First, the population is initialized by Sobol sequences to increase population diversity. Second, incorporate the elite backward learning strategy to improve population diversity and quality. Further, an external profile maintenance method based on an adaptive grid strategy is proposed to make the solution better contracted to the real Pareto frontier. Subsequently, optimize the update strategy of the original algorithm in a non-linear energy update way to improve the exploration and development of the algorithm. Finally, improving the diversity of the algorithm and the uniformity of the solution set using an adaptive variation strategy based on Gaussian random wandering. Experimental comparison of the multi-objective particle swarm algorithm (MOPSO), multi-objective gray wolf algorithm (MOGWO), and multi-objective Harris Hawk algorithm (MOHHO) on the commonly used benchmark functions shows that the MO-EMHHO outperforms the other compared algorithms in terms of optimization seeking accuracy, convergence speed and stability, and provides a new solution to the multi-objective optimization problem.
摘要针对Harris hawk优化算法(HHO)在解决多目标优化问题时算法精度低、收敛速度慢、易陷入局部最优陷阱等缺点,提出了一种基于精英对抗学习的多策略改进多目标Harris hawk优化算法(MO-EMHHO)。首先,利用Sobol序列对种群进行初始化,增加种群多样性;第二,融入精英落后学习策略,提高人口多样性和素质。在此基础上,提出了一种基于自适应网格策略的外部轮廓维护方法,使解更好地收缩到实际帕累托边界。随后,以非线性能量更新的方式对原算法的更新策略进行优化,提高算法的探索和发展。最后,采用基于高斯随机漫游的自适应变异策略提高了算法的多样性和解集的均匀性。在常用的基准函数上对多目标粒子群算法(MOPSO)、多目标灰狼算法(MOGWO)和多目标哈里斯鹰算法(MOHHO)进行了实验比较,结果表明,MO-EMHHO在寻优精度、收敛速度和稳定性方面均优于其他被比较算法,为多目标优化问题提供了一种新的解决方案。
{"title":"Multi-strategy Improved Multi-objective Harris Hawk Optimization Algorithm with Elite Opposition-based Learning","authors":"Fulin Tian, Jiayang Wang, Fei Chu, Lin Zhou","doi":"10.1145/3590003.3590030","DOIUrl":"https://doi.org/10.1145/3590003.3590030","url":null,"abstract":"Abstract: To make up for the deficiencies of the Harris hawk optimization algorithm (HHO) in solving multi-objective optimization problems with low algorithm accuracy, slow rate of convergence, and easily fall into the trap of local optima, a multi-strategy improved multi-objective Harris hawk optimization algorithm with elite opposition-based learning (MO-EMHHO) is proposed. First, the population is initialized by Sobol sequences to increase population diversity. Second, incorporate the elite backward learning strategy to improve population diversity and quality. Further, an external profile maintenance method based on an adaptive grid strategy is proposed to make the solution better contracted to the real Pareto frontier. Subsequently, optimize the update strategy of the original algorithm in a non-linear energy update way to improve the exploration and development of the algorithm. Finally, improving the diversity of the algorithm and the uniformity of the solution set using an adaptive variation strategy based on Gaussian random wandering. Experimental comparison of the multi-objective particle swarm algorithm (MOPSO), multi-objective gray wolf algorithm (MOGWO), and multi-objective Harris Hawk algorithm (MOHHO) on the commonly used benchmark functions shows that the MO-EMHHO outperforms the other compared algorithms in terms of optimization seeking accuracy, convergence speed and stability, and provides a new solution to the multi-objective optimization problem.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124968430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent perception recognition and positioning method of distribution network drainage line 配电网排水线路智能感知识别与定位方法
Shuzhou Xiao, Qiuyan Zhang, Q. Fan, Jianrong Wu, Chao Zhao
Due to the serious interference of illumination and background on the camera during the live operation of the distribution network robot, it is difficult to match, identify, and locate the feature points of the target image, such as the drainage line. This paper proposes the intelligent perception recognition and positioning method of the distribution network drainage line. First, YOLOv4 is used to identify and classify the typical parts of the distribution network and determine the two-dimensional position of the operation point. Subsequently, the Res-Unet segmentation network was improved to perform image segmentation of drainage lines and wires to avoid complex background interference. Finally, binocular vision is used to extract the center line of the wire through the image geometric moment and determine the image line of the wire and the center of the double eyes. The intersection line of the wire is the spatial three-dimensional coordinates of the wire. After the target detection, wire segmentation, and operation point positioning experiments, this method can achieve a positioning accuracy of 1 mm in the x and y directions and 3 mm in the z direction under the camera coordinate system, which provides a guarantee for accurate perception and recognition and reliable operation control of the power distribution robot operation.
配电网机器人在现场运行过程中,由于光照和背景对摄像机的严重干扰,难以匹配、识别和定位目标图像的特征点,如排水线路。本文提出了配电网排水线路的智能感知识别与定位方法。首先,利用YOLOv4对配电网的典型部件进行识别和分类,确定操作点的二维位置。随后,对Res-Unet分割网络进行改进,对排水线进行图像分割,避免复杂背景干扰。最后,利用双目视觉通过图像几何矩提取线的中心线,确定线的图像线和双眼的中心。导线的交点线是导线的空间三维坐标。经过目标检测、线段分割、操作点定位实验,该方法在摄像机坐标系下可实现x、y方向1 mm、z方向3 mm的定位精度,为配电机器人运行的准确感知识别和可靠运行控制提供了保障。
{"title":"Intelligent perception recognition and positioning method of distribution network drainage line","authors":"Shuzhou Xiao, Qiuyan Zhang, Q. Fan, Jianrong Wu, Chao Zhao","doi":"10.1145/3590003.3590088","DOIUrl":"https://doi.org/10.1145/3590003.3590088","url":null,"abstract":"Due to the serious interference of illumination and background on the camera during the live operation of the distribution network robot, it is difficult to match, identify, and locate the feature points of the target image, such as the drainage line. This paper proposes the intelligent perception recognition and positioning method of the distribution network drainage line. First, YOLOv4 is used to identify and classify the typical parts of the distribution network and determine the two-dimensional position of the operation point. Subsequently, the Res-Unet segmentation network was improved to perform image segmentation of drainage lines and wires to avoid complex background interference. Finally, binocular vision is used to extract the center line of the wire through the image geometric moment and determine the image line of the wire and the center of the double eyes. The intersection line of the wire is the spatial three-dimensional coordinates of the wire. After the target detection, wire segmentation, and operation point positioning experiments, this method can achieve a positioning accuracy of 1 mm in the x and y directions and 3 mm in the z direction under the camera coordinate system, which provides a guarantee for accurate perception and recognition and reliable operation control of the power distribution robot operation.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127102150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative Research on Embedding Methods for Video Knowledge Graph 视频知识图嵌入方法的比较研究
Zhihong Zhou, Qiang Xu, Hui Ding, Shengwei Ji
In the video recommendation scenario, knowledge graphs are usually introduced to supplement the data information between videos to achieve information expansion and solve the problems of data sparsity and user cold start. However, there are few high-quality knowledge graphs available in the field of video recommendation, and there are many schemes based on knowledge graph embedding, which have different effects on recommendation performance and bring difficulties to researchers. Based on the streaming media video website data, this paper constructs knowledge graphs of two typical scenarios (i.e., sparse distribution scenarios and dense distribution scenarios ). Moreover, six state-of-the-art knowledge graph embedding methods are analyzed based on extensive experiments from three aspects: data distribution type, data set segmentation method, and recommended quantity range. Comparing the recommendation effect of knowledge graph embedding methods. The experimental results demonstrate that: in the sparse distribution scenario , the recommendation effect using TransE is the best; in the dense distribution scenario, the recommendation effect using TransE or TranD is the best. It provides a reference for subsequent researchers on how to choose knowledge map embedding methods under specific data distribution.
在视频推荐场景中,通常会引入知识图来补充视频之间的数据信息,实现信息的扩充,解决数据稀疏和用户冷启动的问题。然而,在视频推荐领域,高质量的知识图很少,而基于知识图嵌入的方案也很多,这些方案对推荐性能的影响不一,给研究人员带来了困难。本文基于流媒体视频网站数据,构建了两种典型场景(稀疏分布场景和密集分布场景)的知识图。在大量实验的基础上,从数据分布类型、数据集分割方法和推荐数量范围三个方面分析了六种最新的知识图嵌入方法。比较知识图嵌入方法的推荐效果。实验结果表明:在稀疏分布场景下,使用TransE进行推荐效果最好;在密集分布场景下,使用TransE或TranD的推荐效果最好。为后续研究人员在特定数据分布下如何选择知识地图嵌入方法提供了参考。
{"title":"Comparative Research on Embedding Methods for Video Knowledge Graph","authors":"Zhihong Zhou, Qiang Xu, Hui Ding, Shengwei Ji","doi":"10.1145/3590003.3590049","DOIUrl":"https://doi.org/10.1145/3590003.3590049","url":null,"abstract":"In the video recommendation scenario, knowledge graphs are usually introduced to supplement the data information between videos to achieve information expansion and solve the problems of data sparsity and user cold start. However, there are few high-quality knowledge graphs available in the field of video recommendation, and there are many schemes based on knowledge graph embedding, which have different effects on recommendation performance and bring difficulties to researchers. Based on the streaming media video website data, this paper constructs knowledge graphs of two typical scenarios (i.e., sparse distribution scenarios and dense distribution scenarios ). Moreover, six state-of-the-art knowledge graph embedding methods are analyzed based on extensive experiments from three aspects: data distribution type, data set segmentation method, and recommended quantity range. Comparing the recommendation effect of knowledge graph embedding methods. The experimental results demonstrate that: in the sparse distribution scenario , the recommendation effect using TransE is the best; in the dense distribution scenario, the recommendation effect using TransE or TranD is the best. It provides a reference for subsequent researchers on how to choose knowledge map embedding methods under specific data distribution.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131648880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Epidemic Big Data Monitoring and Application of Ship Berthing Based on Knowledge Graph-Community Detection 基于知识图谱-社群检测的船舶靠泊流行病大数据监测与应用研究
Dongfang Shang, Yuesong Li, Jiashuai Xu, Kexin Bao, Ruixi Wang, Liu Qin
The COVID-19 epidemic has been raging overseas for more than three years, and inbound goods and people have become the main risk points of the domestic epidemic. As the main window for China to exchange materials and personnel with foreign countries, under the dual pressure of the global economic downturn and the China-US economic confrontation, ports’ pressure and responsibility to ensure material transportation and foreign trade are particularly heavy. However, the risk screening of ship and crew epidemic information based on manual methods is extremely time-consuming and labor-intensive, and it is difficult to take into account the efficiency and accuracy requirements of the port's own business and disease control and traceability. To this end, this study proposes an epidemic risk screening method based on knowledge graphs. This method is based on shipping big data and community discovery algorithms, analyzes the geospatial similarity of ship information, crew information and real-time epidemic policy information, and quickly establishes a structure. Map data, quickly screen high-risk ships and crew members, and access the business system to arrange nucleic acid testing tasks. When the time cost is only one thousandth of that of manual labor, the detection accuracy rate approaches and exceeds the accuracy level of manual screening, with an average precision advantage of 8.18% and an average time advantage of 1423 times. It is further found that it is more capable of performing heavy screening tasks than humans, and its AUC decline rate with the increase of the amount of measured data is only 34% of that of the manual method. The research results have been initially applied in Ningbo Port, which has greatly improved the informatization level and screening efficiency of Ningbo Port's risk screening during COVID-19 epidemic.
COVID-19疫情已在海外肆虐三年有余,入境货物和人员已成为国内疫情的主要风险点。作为我国对外物资和人员交流的主要窗口,在全球经济低迷和中美经济对峙的双重压力下,口岸保障物资运输和对外贸易的压力和责任尤为沉重。然而,基于人工方式的船舶和船员疫情信息风险筛查,耗时耗力极大,难以兼顾港口自身业务和疫情防控溯源的效率和准确性要求。为此,本研究提出了一种基于知识图谱的疫情风险筛查方法。该方法基于航运大数据和社群发现算法,分析船舶信息、船员信息和实时疫情政策信息的地理空间相似性,快速建立结构。绘制数据地图,快速筛选高风险船舶和船员,接入业务系统安排核酸检测任务。当时间成本仅为人工的千分之一时,检测准确率接近并超过人工筛查的准确率水平,平均精度优势为 8.18%,平均时间优势为 1423 倍。研究进一步发现,它比人工更能胜任繁重的筛选任务,其 AUC 随测量数据量增加而下降的比率仅为人工方法的 34%。研究成果已初步应用于宁波港,大大提高了宁波港在 COVID-19 疫情期间风险筛查的信息化水平和筛查效率。
{"title":"Research on Epidemic Big Data Monitoring and Application of Ship Berthing Based on Knowledge Graph-Community Detection","authors":"Dongfang Shang, Yuesong Li, Jiashuai Xu, Kexin Bao, Ruixi Wang, Liu Qin","doi":"10.1145/3590003.3590026","DOIUrl":"https://doi.org/10.1145/3590003.3590026","url":null,"abstract":"The COVID-19 epidemic has been raging overseas for more than three years, and inbound goods and people have become the main risk points of the domestic epidemic. As the main window for China to exchange materials and personnel with foreign countries, under the dual pressure of the global economic downturn and the China-US economic confrontation, ports’ pressure and responsibility to ensure material transportation and foreign trade are particularly heavy. However, the risk screening of ship and crew epidemic information based on manual methods is extremely time-consuming and labor-intensive, and it is difficult to take into account the efficiency and accuracy requirements of the port's own business and disease control and traceability. To this end, this study proposes an epidemic risk screening method based on knowledge graphs. This method is based on shipping big data and community discovery algorithms, analyzes the geospatial similarity of ship information, crew information and real-time epidemic policy information, and quickly establishes a structure. Map data, quickly screen high-risk ships and crew members, and access the business system to arrange nucleic acid testing tasks. When the time cost is only one thousandth of that of manual labor, the detection accuracy rate approaches and exceeds the accuracy level of manual screening, with an average precision advantage of 8.18% and an average time advantage of 1423 times. It is further found that it is more capable of performing heavy screening tasks than humans, and its AUC decline rate with the increase of the amount of measured data is only 34% of that of the manual method. The research results have been initially applied in Ningbo Port, which has greatly improved the informatization level and screening efficiency of Ningbo Port's risk screening during COVID-19 epidemic.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130136017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Analysis Software for Visual Position and Attitude Measurement Algorithm 一种视觉位置姿态测量算法分析软件
Tao-rang Xu, Jing Zhang, Bin Cai, Yafei Wang
Visual position and attitude measurement (VPAM) system has been widely used in obtaining space target information. In order to better obtain different target information and meet the requirements, it is particularly important to select a correct and effective measurement algorithm. In this paper, a performance evaluation software of VPAM algorithm is designed, which can compare and analyze the accuracy and complexity of algorithms used by different VPAM models, and help users select appropriate position models to obtain more accurate target information. Finally, the software is verified by using the dual photogrammetric model in the shipborne helicopter landing system, and the validity of the analysis software is verified by comparing the calculation results with the theoretical value of the algorithm accuracy analysis. The main contribution of this paper is that, as far as we know, it is the first time to try to evaluate the complexity and accuracy of the algorithm by building analysis software instead of theoretical analysis.
视觉位置姿态测量(VPAM)系统在获取空间目标信息方面得到了广泛的应用。为了更好地获取不同的目标信息,满足测量要求,选择正确有效的测量算法显得尤为重要。本文设计了VPAM算法的性能评价软件,可以对不同VPAM模型所使用的算法的精度和复杂度进行比较分析,帮助用户选择合适的位置模型,获得更准确的目标信息。最后,在舰载直升机着陆系统中使用双摄影测量模型对软件进行验证,并将计算结果与算法精度分析的理论值进行比较,验证分析软件的有效性。本文的主要贡献在于,据我们所知,这是第一次尝试通过构建分析软件来评估算法的复杂性和准确性,而不是理论分析。
{"title":"An Analysis Software for Visual Position and Attitude Measurement Algorithm","authors":"Tao-rang Xu, Jing Zhang, Bin Cai, Yafei Wang","doi":"10.1145/3590003.3590043","DOIUrl":"https://doi.org/10.1145/3590003.3590043","url":null,"abstract":"Visual position and attitude measurement (VPAM) system has been widely used in obtaining space target information. In order to better obtain different target information and meet the requirements, it is particularly important to select a correct and effective measurement algorithm. In this paper, a performance evaluation software of VPAM algorithm is designed, which can compare and analyze the accuracy and complexity of algorithms used by different VPAM models, and help users select appropriate position models to obtain more accurate target information. Finally, the software is verified by using the dual photogrammetric model in the shipborne helicopter landing system, and the validity of the analysis software is verified by comparing the calculation results with the theoretical value of the algorithm accuracy analysis. The main contribution of this paper is that, as far as we know, it is the first time to try to evaluate the complexity and accuracy of the algorithm by building analysis software instead of theoretical analysis.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130249393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable Deep Learning for Medical Image Segmentation With Learnable Class Activation Mapping 基于可学习类激活映射的医学图像分割的可解释深度学习
Kaiyu Wang, Sixing Yin, Yining Wang, Shufang Li
Medical image segmentation is crucial for facilitating pathology assessment, ensuring reliable diagnosis and monitoring disease progression. Deep-learning models have been extensively applied in automating medical image analysis to reduce human effort. However, the non-transparency of deep-learning models limits their clinical practicality due to the unaffordably high risk of misdiagnosis resulted from the misleading model output. In this paper, we propose a explainability metric as part of the loss function. The proposed explainability metric comes from Class Activation Map(CAM) with learnable weights such that the model can be optimized to achieve desirable balance between segmentation performance and explainability. Experiments found that the proposed model visibly heightened Dice score from to , Jaccard similarity from to and Recall from to respectively compared with U-net. In addition, results make clear that the drawn model outdistances the conventional U-net in terms of explainability performance.
医学图像分割对于促进病理评估、确保可靠诊断和监测疾病进展至关重要。深度学习模型已广泛应用于医学图像的自动化分析,以减少人工的工作量。然而,深度学习模型的不透明性限制了其临床实用性,因为误导性模型输出导致的误诊风险高得难以承受。在本文中,我们提出了一个可解释性度量作为损失函数的一部分。所提出的可解释性度量来自具有可学习权重的类激活图(Class Activation Map, CAM),从而可以优化模型以实现分割性能和可解释性之间的理想平衡。实验发现,与U-net相比,该模型显著提高了骰子得分从0到0、纸牌相似度从0到0和召回率从0到0。此外,结果表明,绘制的模型在可解释性性能方面优于传统的U-net。
{"title":"Explainable Deep Learning for Medical Image Segmentation With Learnable Class Activation Mapping","authors":"Kaiyu Wang, Sixing Yin, Yining Wang, Shufang Li","doi":"10.1145/3590003.3590040","DOIUrl":"https://doi.org/10.1145/3590003.3590040","url":null,"abstract":"Medical image segmentation is crucial for facilitating pathology assessment, ensuring reliable diagnosis and monitoring disease progression. Deep-learning models have been extensively applied in automating medical image analysis to reduce human effort. However, the non-transparency of deep-learning models limits their clinical practicality due to the unaffordably high risk of misdiagnosis resulted from the misleading model output. In this paper, we propose a explainability metric as part of the loss function. The proposed explainability metric comes from Class Activation Map(CAM) with learnable weights such that the model can be optimized to achieve desirable balance between segmentation performance and explainability. Experiments found that the proposed model visibly heightened Dice score from to , Jaccard similarity from to and Recall from to respectively compared with U-net. In addition, results make clear that the drawn model outdistances the conventional U-net in terms of explainability performance.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125002047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KRE: A Key-retained Random Erasing Method for Occluded Person Re-identification 一种保留密钥的随机擦除方法用于闭塞人员的再识别
Hongxia Wang, Yao Ma, Xiang Chen
Occluded person re-identification (ReID) is a challenging task in the field of computer vision, facing the problem that the target pedestrians in probe images are obscured by various occlusions. Random Erasing in data augmentation techniques is one of the effective methods used to deal with the occlusion problem, but it may introduce noise into the training process, which affects the training of the model. In order to solve this problem, we propose an novel data augmentation method named Key-retained Random Erasing (KRE) which preserves the critical parts in images for occluded person ReID. Based on the regular Random Erasing, we utilize the naturally generated attention map in Vision Transformers and introduce an adaptive threshold selection method to detect the key areas of the image to be augmented. The complexity of the training samples can be improved without losing the key information of the images by reserving the key areas in Random Erasing process, which can finally alleviate the occluded person ReID problem. Validating the proposed method on occluded, partial and holistic ReID datasets, extensive experimental results demonstrate that our method performs favorably against state-of-the-art methods on ViT-based models.
遮挡人再识别(ReID)是计算机视觉领域的一项具有挑战性的任务,它面临着探测图像中目标行人被各种遮挡遮挡的问题。数据增强技术中的随机擦除是处理遮挡问题的有效方法之一,但它可能会在训练过程中引入噪声,影响模型的训练。为了解决这一问题,我们提出了一种新的数据增强方法——密钥保留随机擦除(Key-retained Random erase, KRE)。在常规随机擦除的基础上,利用视觉变形中自然生成的注意图,引入自适应阈值选择方法来检测待增强图像的关键区域。通过在Random erase过程中保留关键区域,可以在不丢失图像关键信息的情况下提高训练样本的复杂度,最终缓解被遮挡人的ReID问题。在遮挡的、部分的和整体的ReID数据集上验证了所提出的方法,大量的实验结果表明,我们的方法在基于vit的模型上优于最先进的方法。
{"title":"KRE: A Key-retained Random Erasing Method for Occluded Person Re-identification","authors":"Hongxia Wang, Yao Ma, Xiang Chen","doi":"10.1145/3590003.3590089","DOIUrl":"https://doi.org/10.1145/3590003.3590089","url":null,"abstract":"Occluded person re-identification (ReID) is a challenging task in the field of computer vision, facing the problem that the target pedestrians in probe images are obscured by various occlusions. Random Erasing in data augmentation techniques is one of the effective methods used to deal with the occlusion problem, but it may introduce noise into the training process, which affects the training of the model. In order to solve this problem, we propose an novel data augmentation method named Key-retained Random Erasing (KRE) which preserves the critical parts in images for occluded person ReID. Based on the regular Random Erasing, we utilize the naturally generated attention map in Vision Transformers and introduce an adaptive threshold selection method to detect the key areas of the image to be augmented. The complexity of the training samples can be improved without losing the key information of the images by reserving the key areas in Random Erasing process, which can finally alleviate the occluded person ReID problem. Validating the proposed method on occluded, partial and holistic ReID datasets, extensive experimental results demonstrate that our method performs favorably against state-of-the-art methods on ViT-based models.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129009280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-end Parking Behavior Recognition Based on Self-attention Mechanism 基于自注意机制的端到端停车行为识别
Penghua Li, Dechen Zhu, Qiyun Mou, Yushan Tu, Jinfeng Wu
In response to the current problem of a large amount of abnormal data in parking behavior detection, this research proposes a network specialized in parking behavior identification, which identifies the background parking behavior data, classifies the data with high accuracy, reduces the cost of manually verifying the data in the background, speeds up the parking charging cycle of enterprises, and optimizes the user experience.The dynamic position embedding is introduced in the parking-transformer species, so that the self-attention within the transformer can dynamically model the structure of the input token and dynamically encode the input parking behavior sequence data to improve the accuracy of the model for parking behavior recognition.In addition, we created a self-collected parking behavior(SPB) dataset, which was acquired in a natural state and contained various behaviors, and manually classified the various behaviors within the data, and then randomly divided into a test set and a validation set for training and testing, respectively.Compared with the existing methods, indicate that parking-trasnformer hits acceptable trade-offs,namely,97.14% accuracy for SPB dataset.
针对当前停车行为检测中存在大量异常数据的问题,本研究提出了一种专门用于停车行为识别的网络,该网络对后台停车行为数据进行识别,对数据进行高精度分类,降低了后台人工验证数据的成本,加快了企业停车收费周期,优化了用户体验。在停车变压器种类中引入动态位置嵌入,使变压器内部的自关注能够对输入令牌的结构进行动态建模,并对输入停车行为序列数据进行动态编码,提高模型对停车行为识别的准确性。此外,我们创建了一个自动收集的停车行为(SPB)数据集,该数据集是在自然状态下获取的,包含各种行为,并对数据中的各种行为进行人工分类,然后随机分为测试集和验证集,分别进行训练和测试。与现有方法相比,该方法达到了可接受的折衷,即在SPB数据集上的准确率为97.14%。
{"title":"End-to-end Parking Behavior Recognition Based on Self-attention Mechanism","authors":"Penghua Li, Dechen Zhu, Qiyun Mou, Yushan Tu, Jinfeng Wu","doi":"10.1145/3590003.3590072","DOIUrl":"https://doi.org/10.1145/3590003.3590072","url":null,"abstract":"In response to the current problem of a large amount of abnormal data in parking behavior detection, this research proposes a network specialized in parking behavior identification, which identifies the background parking behavior data, classifies the data with high accuracy, reduces the cost of manually verifying the data in the background, speeds up the parking charging cycle of enterprises, and optimizes the user experience.The dynamic position embedding is introduced in the parking-transformer species, so that the self-attention within the transformer can dynamically model the structure of the input token and dynamically encode the input parking behavior sequence data to improve the accuracy of the model for parking behavior recognition.In addition, we created a self-collected parking behavior(SPB) dataset, which was acquired in a natural state and contained various behaviors, and manually classified the various behaviors within the data, and then randomly divided into a test set and a validation set for training and testing, respectively.Compared with the existing methods, indicate that parking-trasnformer hits acceptable trade-offs,namely,97.14% accuracy for SPB dataset.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129108864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Objective Reduction Evolutionary Multiobjective Algorithm using Adaptive Density-Based Clustering for Many-objective Optimization Problem 基于自适应密度聚类的目标约简进化多目标优化算法
Mingjing Wang, Long Chen, Huiling Chen
Many-objective optimization problems (MaOPs), are the most difficult problems to solve when it comes to multiobjective optimization issues (MOPs). MaOPs provide formidable challenges to current multiobjective evolutionary methods such as selection operators, computational cost, visualization of the high-dimensional trade-off front. Removal of the reductant objectives from the original objective set, known as objective reduction, is one of the most significant approaches for MaOPs, which can tackle optimization problems with more than 15 objectives is made feasible by its ability to greatly overcome the challenges of existing multi-objective evolutionary computing techniques. In this study, an objective reduction evolutionary multiobjective algorithm using adaptive density-based clustering is presented for MaOPs. The parameters in the density-based clustering can be adaptively determined by depending on the data samples constructed. Based on the clustering result, the algorithm employs an adaptive strategy for objective aggregation that preserves the structure of the original Pareto front as much as feasible. Finally, the performance of the proposed multiobjective algorithms on benchmarks is thoroughly investigated. The numerical findings and comparisons demonstrate the efficacy and superiority of the suggested multiobjective algorithms and it may be treated as a potential tool for MaOPs.
多目标优化问题(MaOPs)是多目标优化问题中最难解决的问题。MaOPs对当前的多目标进化方法提出了严峻的挑战,如选择算子、计算成本、高维权衡前沿的可视化等。从原始目标集中去除还原剂目标,即目标约简,是MaOPs最重要的方法之一,它可以解决超过15个目标的优化问题,因为它能够极大地克服现有多目标进化计算技术的挑战。本文提出了一种基于自适应密度聚类的MaOPs目标约简进化多目标算法。基于密度聚类的参数可以根据所构造的数据样本自适应确定。基于聚类结果,该算法采用自适应策略进行目标聚类,尽可能保留原Pareto前沿的结构。最后,对所提出的多目标算法在基准上的性能进行了深入的研究。数值结果和比较表明了所提出的多目标算法的有效性和优越性,它可以作为MaOPs的潜在工具。
{"title":"An Objective Reduction Evolutionary Multiobjective Algorithm using Adaptive Density-Based Clustering for Many-objective Optimization Problem","authors":"Mingjing Wang, Long Chen, Huiling Chen","doi":"10.1145/3590003.3590103","DOIUrl":"https://doi.org/10.1145/3590003.3590103","url":null,"abstract":"Many-objective optimization problems (MaOPs), are the most difficult problems to solve when it comes to multiobjective optimization issues (MOPs). MaOPs provide formidable challenges to current multiobjective evolutionary methods such as selection operators, computational cost, visualization of the high-dimensional trade-off front. Removal of the reductant objectives from the original objective set, known as objective reduction, is one of the most significant approaches for MaOPs, which can tackle optimization problems with more than 15 objectives is made feasible by its ability to greatly overcome the challenges of existing multi-objective evolutionary computing techniques. In this study, an objective reduction evolutionary multiobjective algorithm using adaptive density-based clustering is presented for MaOPs. The parameters in the density-based clustering can be adaptively determined by depending on the data samples constructed. Based on the clustering result, the algorithm employs an adaptive strategy for objective aggregation that preserves the structure of the original Pareto front as much as feasible. Finally, the performance of the proposed multiobjective algorithms on benchmarks is thoroughly investigated. The numerical findings and comparisons demonstrate the efficacy and superiority of the suggested multiobjective algorithms and it may be treated as a potential tool for MaOPs.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128325571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1