首页 > 最新文献

2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)最新文献

英文 中文
Design of A Smart Home Environment Monitoring System Based on Single-chip Microcomputer 基于单片机的智能家居环境监测系统设计
Fengze Zhong
The family living environment today has problems in safety and convenience, and the existing smart home system generally cannot guarantee high security, low energy consumption and accurate detection. A smart home safety environment detection system based on the AT89C51 microcontroller is proposed to solve it. The design uses a ZPH01 PM2.5 detector, MQ7 carbon monoxide (CO) sensor, DHT11 temperature and humidity sensor, and MQ2 smoke sensor. It can achieve the detection, display and alert of indoor temperature, humidity, carbon monoxide concentration, PM2.5 concentration and smoke concentration. At the same time, the HC-SR501 human body sensor module is used to detect the movement of the indoor area in real-time and send alerts. Also, the Principal Component Analysis (PCA) face recognition method is used to realize the recognition of humans at access control. The simulation results show that the designed system can detect the quality of the home environment in real-time and identify the personnel, significantly improving the home environment’s safety factor and quality of life.
当今的家庭生活环境在安全性和便利性方面存在问题,现有的智能家居系统普遍无法保证高安全性、低能耗和准确检测。为此,提出了一种基于AT89C51单片机的智能家居安全环境检测系统。本设计采用ZPH01 PM2.5传感器、MQ7一氧化碳传感器、DHT11温湿度传感器、MQ2烟雾传感器。可实现室内温度、湿度、一氧化碳浓度、PM2.5浓度、烟雾浓度的检测、显示和报警。同时,采用HC-SR501人体传感器模块实时检测室内区域的运动情况并发出报警。在此基础上,利用主成分分析人脸识别方法实现了门禁人员的识别。仿真结果表明,所设计的系统能够实时检测家庭环境质量并识别人员,显著提高了家庭环境的安全系数和生活质量。
{"title":"Design of A Smart Home Environment Monitoring System Based on Single-chip Microcomputer","authors":"Fengze Zhong","doi":"10.1109/prmvia58252.2023.00019","DOIUrl":"https://doi.org/10.1109/prmvia58252.2023.00019","url":null,"abstract":"The family living environment today has problems in safety and convenience, and the existing smart home system generally cannot guarantee high security, low energy consumption and accurate detection. A smart home safety environment detection system based on the AT89C51 microcontroller is proposed to solve it. The design uses a ZPH01 PM2.5 detector, MQ7 carbon monoxide (CO) sensor, DHT11 temperature and humidity sensor, and MQ2 smoke sensor. It can achieve the detection, display and alert of indoor temperature, humidity, carbon monoxide concentration, PM2.5 concentration and smoke concentration. At the same time, the HC-SR501 human body sensor module is used to detect the movement of the indoor area in real-time and send alerts. Also, the Principal Component Analysis (PCA) face recognition method is used to realize the recognition of humans at access control. The simulation results show that the designed system can detect the quality of the home environment in real-time and identify the personnel, significantly improving the home environment’s safety factor and quality of life.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130260181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study on Calculation Method for Hydrostatic Performance of Amphibious Vehicle 水陆两栖车辆静水性能计算方法研究
Qinghui Zhang, Xinxin Liu, Xin Zhao, Hongbin Xu, Zhengyu Li, Xiaolei Li
The calculation method for hydrostatic buoyant center and float status of amphibious vehicle based on three-dimensional model, mass and center of mass was studied. The condition of hydrostatic equilibrium of amphibious vehicle was introduced. The calculation method of buoyant center using the CATIA program and the computing process of hydrostatic float status were researched. The secondary development for CATIA was processed based on Python. The calculation program for hydrostatic float status of amphibious vehicle was written at last. The accurate and efficient calculation for hydrostatic performance of amphibious vehicle was realized.
研究了基于三维模型、质量和质心的水陆两栖车辆静浮力中心和浮力状态的计算方法。介绍了水陆两栖车辆的流体静力平衡条件。研究了利用CATIA程序计算浮心的方法和静浮状态的计算过程。基于Python对CATIA进行二次开发。最后编写了水陆两栖车辆静浮状态的计算程序。实现了水陆两栖车辆静水性能的准确、高效计算。
{"title":"Study on Calculation Method for Hydrostatic Performance of Amphibious Vehicle","authors":"Qinghui Zhang, Xinxin Liu, Xin Zhao, Hongbin Xu, Zhengyu Li, Xiaolei Li","doi":"10.1109/PRMVIA58252.2023.00027","DOIUrl":"https://doi.org/10.1109/PRMVIA58252.2023.00027","url":null,"abstract":"The calculation method for hydrostatic buoyant center and float status of amphibious vehicle based on three-dimensional model, mass and center of mass was studied. The condition of hydrostatic equilibrium of amphibious vehicle was introduced. The calculation method of buoyant center using the CATIA program and the computing process of hydrostatic float status were researched. The secondary development for CATIA was processed based on Python. The calculation program for hydrostatic float status of amphibious vehicle was written at last. The accurate and efficient calculation for hydrostatic performance of amphibious vehicle was realized.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129347527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convection-UNet: A Deep Convolutional Neural Network for Convection Detection based on the Geo High-speed Imager of Fengyun-4B 对流- unet:基于风云四号高速地磁成像仪的深度卷积神经网络对流检测
Yufei Wang, Baihua Xiao
Deep convection can cause a variety of severe weather conditions such as thunderstorms, strong winds, and heavy rainfall. Satellite observations provide all-weather and multi-directional observations, facilitating the timely detection of such weather systems, which is crucial to saving lives and property. However, previous methods based on channel feature extraction and threshold filtering did not make full use of information in satellite images, which led to limitations on such complex problems as strong convection detection. In this study, we propose a novel framework of a deep learning-based model Convection-UNet to detect convection. We use channel 4 to 7 of FY-4B GHI that we select according to the microphysical properties of convection as input and radar reflectivity as label. We combine the detailed training time and test time data augmentation strategies and build a deep neural network to automatically extract spatial context features and achieve end-to-end learning. Results show that the performance of our method far exceeds the previous channel extraction combined with threshold filtering methods such as BT and BTD at least 0.24 on Fi-measure. We also show that our channel selection and data augmentation strategies are of great significance to detect convection.
深层对流会导致各种恶劣的天气状况,如雷暴、强风和暴雨。卫星观测提供全天候和多方位的观测,有助于及时发现此类天气系统,这对挽救生命和财产至关重要。然而,以往基于信道特征提取和阈值滤波的方法并没有充分利用卫星图像中的信息,导致在强对流检测等复杂问题上存在局限性。在这项研究中,我们提出了一种基于深度学习的对流- unet模型的新框架来检测对流。我们使用FY-4B GHI的4 ~ 7通道,我们根据对流的微物理特性选择通道作为输入,雷达反射率作为标签。结合详细的训练时间和测试时间数据增强策略,构建深度神经网络,自动提取空间上下文特征,实现端到端学习。结果表明,我们的方法在Fi-measure上的性能远远超过了以往的信道提取与阈值滤波方法(如BT和BTD)相结合的性能至少为0.24。我们还证明了我们的信道选择和数据增强策略对对流检测具有重要意义。
{"title":"Convection-UNet: A Deep Convolutional Neural Network for Convection Detection based on the Geo High-speed Imager of Fengyun-4B","authors":"Yufei Wang, Baihua Xiao","doi":"10.1109/prmvia58252.2023.00033","DOIUrl":"https://doi.org/10.1109/prmvia58252.2023.00033","url":null,"abstract":"Deep convection can cause a variety of severe weather conditions such as thunderstorms, strong winds, and heavy rainfall. Satellite observations provide all-weather and multi-directional observations, facilitating the timely detection of such weather systems, which is crucial to saving lives and property. However, previous methods based on channel feature extraction and threshold filtering did not make full use of information in satellite images, which led to limitations on such complex problems as strong convection detection. In this study, we propose a novel framework of a deep learning-based model Convection-UNet to detect convection. We use channel 4 to 7 of FY-4B GHI that we select according to the microphysical properties of convection as input and radar reflectivity as label. We combine the detailed training time and test time data augmentation strategies and build a deep neural network to automatically extract spatial context features and achieve end-to-end learning. Results show that the performance of our method far exceeds the previous channel extraction combined with threshold filtering methods such as BT and BTD at least 0.24 on Fi-measure. We also show that our channel selection and data augmentation strategies are of great significance to detect convection.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134511157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Based on Spectral Clustering Dynamic Community Discovery Algorithm Research in Temporal Network 基于谱聚类的时态网络动态社区发现算法研究
Yu Yang, Yong Long, Linbin Gui, Jurun Ma
The study of temporal community discovery is an essential research area in social network analysis. As nodes join or leave social networks, the relationships between nodes are establishing or terminating, which affects community structure changes. Given the social networks discovery algorithm of static community lacks the indispensable historical information of network community nodes, resulting in insufficient network structure analysis and clustering information. Based on the community network evolution division events, the paper extracted the priority for analysis and proposed the SC-DCDA: Spectral Clustering Based Temporal Community Discovery Algorithm. According to experimental observation, the SC-DCDA firstly reduced the dimensionality of high-dimensional data leveraging the method of spectral mapping. Secondly, the improved Fuzzy C-means clustering algorithm was adopted to determine the correlation between nodes in temporal social networks and the communities to be discovered, and finally the community structure analysis was performed according to the evolutionary similarity matrix. The ground truth datasets combined with the typically community discovery algorithm metric Modularity Score experimental verification and performance evaluation. The experimental results show that the algorithm metric is well-suited for the temporal datasets, indicating that the proposed algorithm has achieved several better results in information interaction, clustering effect, and accuracy.
时间社区发现研究是社会网络分析的一个重要研究领域。随着节点加入或离开社交网络,节点间关系的建立或终止,影响着社区结构的变化。鉴于静态社区的社交网络发现算法缺乏必不可少的网络社区节点历史信息,导致网络结构分析和聚类信息不足。基于社区网络演化划分事件,提取优先级进行分析,提出基于谱聚类的时间社区发现算法SC-DCDA。根据实验观察,SC-DCDA首先利用光谱映射的方法对高维数据进行降维。其次,采用改进的模糊c均值聚类算法确定时间社会网络节点与待发现群落的相关性,最后根据进化相似矩阵进行群落结构分析;ground truth数据集结合典型的社区发现度量算法Modularity Score进行实验验证和性能评估。实验结果表明,该算法度量非常适合于时态数据集,表明该算法在信息交互、聚类效果和精度方面取得了较好的效果。
{"title":"Based on Spectral Clustering Dynamic Community Discovery Algorithm Research in Temporal Network","authors":"Yu Yang, Yong Long, Linbin Gui, Jurun Ma","doi":"10.1109/PRMVIA58252.2023.00029","DOIUrl":"https://doi.org/10.1109/PRMVIA58252.2023.00029","url":null,"abstract":"The study of temporal community discovery is an essential research area in social network analysis. As nodes join or leave social networks, the relationships between nodes are establishing or terminating, which affects community structure changes. Given the social networks discovery algorithm of static community lacks the indispensable historical information of network community nodes, resulting in insufficient network structure analysis and clustering information. Based on the community network evolution division events, the paper extracted the priority for analysis and proposed the SC-DCDA: Spectral Clustering Based Temporal Community Discovery Algorithm. According to experimental observation, the SC-DCDA firstly reduced the dimensionality of high-dimensional data leveraging the method of spectral mapping. Secondly, the improved Fuzzy C-means clustering algorithm was adopted to determine the correlation between nodes in temporal social networks and the communities to be discovered, and finally the community structure analysis was performed according to the evolutionary similarity matrix. The ground truth datasets combined with the typically community discovery algorithm metric Modularity Score experimental verification and performance evaluation. The experimental results show that the algorithm metric is well-suited for the temporal datasets, indicating that the proposed algorithm has achieved several better results in information interaction, clustering effect, and accuracy.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133266116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on visual SLAM algorithm based on improved point-line feature fusion 基于改进点-线特征融合的视觉SLAM算法研究
Yu Zhang, Miao Dong
SLAM (simultaneous localization and mapping), will further known as synchronous localization and mapping, is a technology that is used to tackle the issue of localization and map building while a robot travels in an unfamiliar environment. Traditional SLAM relies on point features to estimate camera pose, which makes it difficult to extract enough point features in low-texture scenes. When the camera shakes violently or rotates too fast, the robustness of a point-based SLAM system is poor. Aiming at the problem of poor robustness of the existing visual SLAM (synchronous localization and mapping technology) system, based on the ORB-SLAM3 framework, the point feature extractor is replaced with a self-supervised deep neural network, and a matching filtering algorithm based on threshold and motion statistics is proposed to eliminate point mismatch, this significantly accelerates the system’s real- time and accuracy. Likewise, linear activities are integrated into the front-end information extraction, a linear feature extraction model is established, approximation linear features are merged and processed, and the linear feature description and mismatching eradication process are simplified. Finally, the weight allocation idea is introduced into the construction of the point and line error model, and the weight of the point and line is reasonably allocated according to the richness of the scene. Experiments on absolute error trajectory on the TUM dataset emphasize that the revised algorithm increased efficiency and stability when compared to the ORB-SLAM3 system.
SLAM(同时定位和地图绘制),将进一步被称为同步定位和地图绘制,是一种用于解决机器人在陌生环境中行进时定位和地图绘制问题的技术。传统的SLAM依赖于点特征来估计相机姿态,这使得在低纹理场景中难以提取足够的点特征。当相机剧烈抖动或旋转过快时,基于点的SLAM系统鲁棒性较差。针对现有视觉SLAM(同步定位与映射技术)系统鲁棒性差的问题,基于ORB-SLAM3框架,将点特征提取器替换为自监督深度神经网络,并提出基于阈值和运动统计的匹配滤波算法消除点不匹配,显著提高了系统的实时性和准确性。同样,将线性活动整合到前端信息提取中,建立线性特征提取模型,对近似线性特征进行合并处理,简化线性特征描述和错配消除过程。最后,将权重分配思想引入到点线误差模型的构建中,根据场景的丰富程度合理分配点线的权重。在TUM数据集上的绝对误差轨迹实验表明,与ORB-SLAM3系统相比,改进后的算法提高了效率和稳定性。
{"title":"Research on visual SLAM algorithm based on improved point-line feature fusion","authors":"Yu Zhang, Miao Dong","doi":"10.1109/prmvia58252.2023.00046","DOIUrl":"https://doi.org/10.1109/prmvia58252.2023.00046","url":null,"abstract":"SLAM (simultaneous localization and mapping), will further known as synchronous localization and mapping, is a technology that is used to tackle the issue of localization and map building while a robot travels in an unfamiliar environment. Traditional SLAM relies on point features to estimate camera pose, which makes it difficult to extract enough point features in low-texture scenes. When the camera shakes violently or rotates too fast, the robustness of a point-based SLAM system is poor. Aiming at the problem of poor robustness of the existing visual SLAM (synchronous localization and mapping technology) system, based on the ORB-SLAM3 framework, the point feature extractor is replaced with a self-supervised deep neural network, and a matching filtering algorithm based on threshold and motion statistics is proposed to eliminate point mismatch, this significantly accelerates the system’s real- time and accuracy. Likewise, linear activities are integrated into the front-end information extraction, a linear feature extraction model is established, approximation linear features are merged and processed, and the linear feature description and mismatching eradication process are simplified. Finally, the weight allocation idea is introduced into the construction of the point and line error model, and the weight of the point and line is reasonably allocated according to the richness of the scene. Experiments on absolute error trajectory on the TUM dataset emphasize that the revised algorithm increased efficiency and stability when compared to the ORB-SLAM3 system.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116971010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ECG Signal Extraction Method Based on Singular Value Selection and Wavelet 基于奇异值选择和小波的心电信号提取方法
Fuyu Luo, Xue Han, Zihao Zhang, Ruigang Li, Huixi Wang, Fanrong Kong
Aiming at the noise of the received ECG signal, a extraction method of ECG signal based on singular value selection and wavelet is proposed. The singular value decomposition on the ECG signal is performed firstly, and the ECG signal component corresponding to each singular value is obtained. Then the signal component corresponding to the maximum singular value is used to calculate cross-correlation coefficients with other components. The cumulative contribution rate of singular values is combined to determine the number of singular values for ECG signal reconstruction. The wavelet threshold de-noising method is used to de-noise the final determined signal components. Finally, the de-noising ECG signal is obtained by reconstructing the signal components. The experimental results show that the method can suppress noise and extract signal effectively, and it has a good noise reduction effect compared with wavelet threshold method.
针对接收到的心电信号中存在的噪声,提出了一种基于奇异值选择和小波的心电信号提取方法。首先对心电信号进行奇异值分解,得到各奇异值对应的心电信号分量。然后利用奇异值最大值所对应的信号分量计算与其他分量的互相关系数。结合奇异值的累积贡献率,确定用于心电信号重构的奇异值个数。采用小波阈值降噪方法对最终确定的信号分量进行降噪。最后,通过重构信号分量得到去噪的心电信号。实验结果表明,该方法能够有效地抑制噪声并提取信号,与小波阈值法相比具有良好的降噪效果。
{"title":"ECG Signal Extraction Method Based on Singular Value Selection and Wavelet","authors":"Fuyu Luo, Xue Han, Zihao Zhang, Ruigang Li, Huixi Wang, Fanrong Kong","doi":"10.1109/PRMVIA58252.2023.00052","DOIUrl":"https://doi.org/10.1109/PRMVIA58252.2023.00052","url":null,"abstract":"Aiming at the noise of the received ECG signal, a extraction method of ECG signal based on singular value selection and wavelet is proposed. The singular value decomposition on the ECG signal is performed firstly, and the ECG signal component corresponding to each singular value is obtained. Then the signal component corresponding to the maximum singular value is used to calculate cross-correlation coefficients with other components. The cumulative contribution rate of singular values is combined to determine the number of singular values for ECG signal reconstruction. The wavelet threshold de-noising method is used to de-noise the final determined signal components. Finally, the de-noising ECG signal is obtained by reconstructing the signal components. The experimental results show that the method can suppress noise and extract signal effectively, and it has a good noise reduction effect compared with wavelet threshold method.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117249494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Few-Shot Semantic Segmentation Based on Dual-Branch Feature Extraction 基于双分支特征提取的少镜头语义分割
Hongjie Zhou
Few-shot semantic segmentation (FSS) requires only few labeled samples to achieve good segmentation performance and thus has received extensive attention. However, existing FFS methods usually adopt a simple convolutional structure as the backbone, which suffers from poor feature extraction ability. In order to address this issue, a novel few-shot segmentation network based on dual-branch feature extraction (DFESN) is proposed. First, an attention-enhanced ResNet is used as the local feature extraction branch. Specifically, we in-corporate channel attention operations into each building block of ResNet to model the importance among channels, which enables DFESN to learn important class information for the segmentation task. Besides, we introduce a Vision Transformer as the global feature extraction branch. This branch leverages the multi-head self-attention mechanism in Vision Transformer to model the global dependencies of support and query image features, further enhancing the feature extraction capabilities of DFESN. We conduct experiments on the PASCAL-5i dataset and demonstrate the superiority of our DFESN.
少镜头语义分割(few -shot semantic segmentation, FSS)方法只需要少量的标记样本就能达到良好的分割效果,因此受到了广泛的关注。然而,现有的FFS方法通常采用简单的卷积结构作为主干,特征提取能力较差。为了解决这一问题,提出了一种新的基于双分支特征提取(DFESN)的少镜头分割网络。首先,将注意力增强的ResNet作为局部特征提取分支。具体来说,我们将渠道关注操作整合到ResNet的每个构建块中,对渠道之间的重要性进行建模,使DFESN能够为分割任务学习重要的类信息。此外,我们还引入了Vision Transformer作为全局特征提取分支。该分支利用Vision Transformer中的多头自关注机制对支持和查询图像特征的全局依赖关系进行建模,进一步增强了DFESN的特征提取能力。我们在PASCAL-5i数据集上进行了实验,验证了我们的DFESN的优越性。
{"title":"Few-Shot Semantic Segmentation Based on Dual-Branch Feature Extraction","authors":"Hongjie Zhou","doi":"10.1109/PRMVIA58252.2023.00053","DOIUrl":"https://doi.org/10.1109/PRMVIA58252.2023.00053","url":null,"abstract":"Few-shot semantic segmentation (FSS) requires only few labeled samples to achieve good segmentation performance and thus has received extensive attention. However, existing FFS methods usually adopt a simple convolutional structure as the backbone, which suffers from poor feature extraction ability. In order to address this issue, a novel few-shot segmentation network based on dual-branch feature extraction (DFESN) is proposed. First, an attention-enhanced ResNet is used as the local feature extraction branch. Specifically, we in-corporate channel attention operations into each building block of ResNet to model the importance among channels, which enables DFESN to learn important class information for the segmentation task. Besides, we introduce a Vision Transformer as the global feature extraction branch. This branch leverages the multi-head self-attention mechanism in Vision Transformer to model the global dependencies of support and query image features, further enhancing the feature extraction capabilities of DFESN. We conduct experiments on the PASCAL-5i dataset and demonstrate the superiority of our DFESN.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124267186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification Of Imaging Features Of Diabetes Mellitus And Tuberculosis Based On YOLOv8x Model Combined With RepEca Network Structure 基于YOLOv8x模型结合RepEca网络结构识别糖尿病和肺结核的影像学特征
Wenjun Li, Linjun Jiang, Zezhou Zhu, Yanfan Li, Hua Peng, Diqing Liang, Hongzhong Yang, Weijun Liang
Tuberculosis and diabetes mellitus are highly prevalent clinical conditions worldwide, and the mortality rate of tuberculosis is high; when diabetes mellitus is combined with tuberculosis, the interaction between the two can lead to a vicious cycle, posing a serious threat to the physical and mental health and life safety of patients, especially in developing regions where medical resources are scarce. In this paper, We trained several deep learning algorithm models based on YOLOv5, YOLOv8x, Faster R- CNN and Mask R-CNN with 4 types of lesion features commonly found in 1024 images, from which we selected the algorithm with the best automatic feature recognition effect and optimized the model to further improve the recognition efficiency. Combining the complexity of lesion features and experimental results, we propose a YOLOv8x model based on RepEca network structure and ESE attention mechanism, which is more effective than the original YOLOv8x in application, with an F1 metric value of 71.19%, and can better identify lesion features in images, assisting clinicians to improve the diagnostic accuracy and treatment effect.
结核病和糖尿病是世界范围内非常普遍的临床疾病,结核病的死亡率很高;当糖尿病与肺结核合并时,两者的相互作用会导致恶性循环,对患者的身心健康和生命安全构成严重威胁,特别是在医疗资源匮乏的发展中地区。本文针对1024张图像中常见的4种病灶特征,基于YOLOv5、YOLOv8x、Faster R-CNN和Mask R-CNN训练了几种深度学习算法模型,从中选择自动特征识别效果最好的算法,并对模型进行优化,进一步提高识别效率。结合病变特征的复杂性和实验结果,我们提出了一种基于RepEca网络结构和ESE注意机制的YOLOv8x模型,该模型在应用上比原来的YOLOv8x模型更有效,F1度量值为71.19%,能够更好地识别图像中的病变特征,帮助临床医生提高诊断准确性和治疗效果。
{"title":"Identification Of Imaging Features Of Diabetes Mellitus And Tuberculosis Based On YOLOv8x Model Combined With RepEca Network Structure","authors":"Wenjun Li, Linjun Jiang, Zezhou Zhu, Yanfan Li, Hua Peng, Diqing Liang, Hongzhong Yang, Weijun Liang","doi":"10.1109/prmvia58252.2023.00032","DOIUrl":"https://doi.org/10.1109/prmvia58252.2023.00032","url":null,"abstract":"Tuberculosis and diabetes mellitus are highly prevalent clinical conditions worldwide, and the mortality rate of tuberculosis is high; when diabetes mellitus is combined with tuberculosis, the interaction between the two can lead to a vicious cycle, posing a serious threat to the physical and mental health and life safety of patients, especially in developing regions where medical resources are scarce. In this paper, We trained several deep learning algorithm models based on YOLOv5, YOLOv8x, Faster R- CNN and Mask R-CNN with 4 types of lesion features commonly found in 1024 images, from which we selected the algorithm with the best automatic feature recognition effect and optimized the model to further improve the recognition efficiency. Combining the complexity of lesion features and experimental results, we propose a YOLOv8x model based on RepEca network structure and ESE attention mechanism, which is more effective than the original YOLOv8x in application, with an F1 metric value of 71.19%, and can better identify lesion features in images, assisting clinicians to improve the diagnostic accuracy and treatment effect.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130444002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse sampling photoacoustic reconstruction with group sparse dictionary learning 基于群稀疏字典学习的稀疏采样光声重构
Zhimin Zhang, Zhaolian Wang, Chenglong Zhang, Xiaoli Yang, Xiaopeng Ma
Photoacoustic tomography often faces problems such as incomplete data and noise, which affect the quality of reconstructed images. Model-based photoacoustic image reconstruction is an ill-posed inverse problem, which usually needs to introduce the regularization term as the prior constraint. In this paper, we propose a novel model-based regularization framework for photoacoustic image reconstruction, which utilizes the group sparsity property of photoacoustic images as prior information and combines total variation regularization to effectively suppress image artifacts and recover the missing signal data during sparse sampling. Numerical simulation results show that the proposed algorithm not only improves the accuracy of photoacoustic reconstruction under sparse sampling but also improves the calculation speed.
光声层析成像经常面临数据不完整和噪声等问题,影响重建图像的质量。基于模型的光声图像重构是一个病态逆问题,通常需要引入正则化项作为先验约束。本文提出了一种基于模型的光声图像重构正则化框架,该框架利用光声图像的群稀疏性作为先验信息,结合全变分正则化,有效地抑制了图像伪影,恢复了稀疏采样过程中缺失的信号数据。数值模拟结果表明,该算法不仅提高了稀疏采样下光声重构的精度,而且提高了计算速度。
{"title":"Sparse sampling photoacoustic reconstruction with group sparse dictionary learning","authors":"Zhimin Zhang, Zhaolian Wang, Chenglong Zhang, Xiaoli Yang, Xiaopeng Ma","doi":"10.1109/PRMVIA58252.2023.00049","DOIUrl":"https://doi.org/10.1109/PRMVIA58252.2023.00049","url":null,"abstract":"Photoacoustic tomography often faces problems such as incomplete data and noise, which affect the quality of reconstructed images. Model-based photoacoustic image reconstruction is an ill-posed inverse problem, which usually needs to introduce the regularization term as the prior constraint. In this paper, we propose a novel model-based regularization framework for photoacoustic image reconstruction, which utilizes the group sparsity property of photoacoustic images as prior information and combines total variation regularization to effectively suppress image artifacts and recover the missing signal data during sparse sampling. Numerical simulation results show that the proposed algorithm not only improves the accuracy of photoacoustic reconstruction under sparse sampling but also improves the calculation speed.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130700691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SimpleFusion: 3D Object Detection by Fusing RGB Images and Point Clouds SimpleFusion:通过融合RGB图像和点云来检测3D物体
Yongchang Zhang, Yue Guo, Hanbing Niu, Bo Zhang, Yun Cao, Wenhao He
Achieving robust 3D object detection by fusing images and point clouds remains challenging. In this paper, we propose a novel 3D object detector (SimpleFusion) that enables simple and efficient multi-sensor fusion. Our main motivation is to boost feature extraction from a single modality and fuse them into a unified space. Specifically, we build a new visual 3D object detector in the camera stream that leverages point cloud supervision for more accurate depth prediction; in the lidar stream, we introduce a robust 3D object detector that utilizes multi-view and multi-scale features to overcome the sparsity of point clouds. Finally, we propose a dynamic fusion module to focus on more confident features and achieve accurate 3D object detection based on dynamic weights. Our method has been evaluated on the nuScenes dataset, and the experimental results indicate that it outperforms other state-of-the-art methods by a significant margin.
通过融合图像和点云实现鲁棒的3D目标检测仍然具有挑战性。在本文中,我们提出了一种新的3D目标检测器(SimpleFusion),它可以实现简单高效的多传感器融合。我们的主要动机是从单一模态中提取特征,并将它们融合到一个统一的空间中。具体来说,我们在相机流中构建了一个新的视觉3D物体检测器,该检测器利用点云监督进行更准确的深度预测;在激光雷达流中,我们引入了一种鲁棒的3D目标检测器,该检测器利用多视图和多尺度特征来克服点云的稀疏性。最后,我们提出了一个动态融合模块,专注于更自信的特征,实现基于动态权重的精确三维目标检测。我们的方法已经在nuScenes数据集上进行了评估,实验结果表明,它的性能明显优于其他最先进的方法。
{"title":"SimpleFusion: 3D Object Detection by Fusing RGB Images and Point Clouds","authors":"Yongchang Zhang, Yue Guo, Hanbing Niu, Bo Zhang, Yun Cao, Wenhao He","doi":"10.1109/prmvia58252.2023.00014","DOIUrl":"https://doi.org/10.1109/prmvia58252.2023.00014","url":null,"abstract":"Achieving robust 3D object detection by fusing images and point clouds remains challenging. In this paper, we propose a novel 3D object detector (SimpleFusion) that enables simple and efficient multi-sensor fusion. Our main motivation is to boost feature extraction from a single modality and fuse them into a unified space. Specifically, we build a new visual 3D object detector in the camera stream that leverages point cloud supervision for more accurate depth prediction; in the lidar stream, we introduce a robust 3D object detector that utilizes multi-view and multi-scale features to overcome the sparsity of point clouds. Finally, we propose a dynamic fusion module to focus on more confident features and achieve accurate 3D object detection based on dynamic weights. Our method has been evaluated on the nuScenes dataset, and the experimental results indicate that it outperforms other state-of-the-art methods by a significant margin.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128379902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1