Information Technology and Control最新文献

英文中文

3D Point Cloud Classification Method Based on Multiple Attention Mechanism and Dynamic Graph Convolution 基于多重注意机制和动态图卷积的三维点云分类方法

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33035

Yu Zhang, Zilong Wang, Yongjian Zhu

In order to solve the problem of uneven density and the low classification accuracy of 3D point cloud, a 3D point cloud classification method fuses multi-attention machine is proposed. It is principally based on the traditional point cloud dynamic graph convolution classification network, into multiple attention mechanisms, including self-attention, spatial attention and channel attention mechanisms. The self-attention mechanism can reduce the dependence on irrelevant points while aligning point clouds, and input the processed point cloud into the classification network. Then the missing geometric information in the classification network is compensated by the integration of spatial and channel attention mechanisms. The experimental results on the public data set ModelNet40 indicate that compared with the DGCNN classification network, the improved network model improves the classification accuracy of the data set by 0.5 % and the average accuracy by 0.9 %. Meantime, the classification accuracy outstrips other contrast classification algorithms.

为了解决三维点云密度不均匀和分类精度低的问题，提出了一种融合多注意力机的三维点云分类方法。它主要是在传统点云动态图卷积分类网络的基础上，分成多重注意机制，包括自注意机制、空间注意机制和通道注意机制。自注意机制可以在对齐点云时减少对不相关点的依赖，并将处理后的点云输入到分类网络中。然后通过空间注意机制和通道注意机制的整合来补偿分类网络中缺失的几何信息。在公共数据集ModelNet40上的实验结果表明，与DGCNN分类网络相比，改进后的网络模型对数据集的分类准确率提高了0.5%，平均准确率提高了0.9%。同时，分类精度优于其他对比分类算法。

引用次数: 0

A New Sentiment and Fuzzy Aware Product Recommendation System Using Weighted Aquila Optimization and GRNN in e-Commerce 基于加权Aquila优化和GRNN的电子商务情感模糊感知产品推荐系统

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33042

L. Antony Rosewelt, D. Naveen Raju, E. Sujatha

Customer reviews are playing an important role in e-commerce for increasing sales by knowing the customer’s purchase pattern and expectations. The reviews that are collected after completing their purchase reflect the quality and services in e-commerce. The user’s reviews are characterized and categorized through sentiment and semantic analysis. Moreover, the sentiment and semantic classification processes are also performed to predict the user’s purchase patterns and liked products. However, the available classification is not able to predict the user’s purchase patterns. In this paper, we propose a new Product Recommendation System (PRS) to predict the appropriate product for users based on their purchase behavior and pattern. The proposed recommendation system incorporates the standard data preprocessing tasks like tokenization process, Parts of Speech (PoS) tagging process, and parsing, a new sentiment and semantic score calculation procedure, and a new feature optimization technique called the Weighted Aquila Optimization Method (WAOM). Moreover, the sentiment and semantic classification processes are performed by applying a General Regression Neural Network with the incorporation of fuzzy temporal features (FTGRNN) and obtaining better classification results. The newly developed PRS is evaluated by conducting experiments in this work and also proved as superior than other systems available in this direction in terms of prediction accuracy, precision, recall, serendipity and nDCG.

顾客评论在电子商务中发挥着重要的作用，通过了解顾客的购买模式和期望来增加销售额。完成购买后收集的评论反映了电子商务的质量和服务。通过情感和语义分析对用户评论进行特征和分类。此外，还进行了情感和语义分类过程来预测用户的购买模式和喜欢的产品。然而，现有的分类并不能预测用户的购买模式。本文提出了一种新的产品推荐系统(PRS)，根据用户的购买行为和模式来预测合适的产品。所提出的推荐系统结合了标准的数据预处理任务，如标记化过程、词性标注过程和解析，一种新的情感和语义评分计算过程，以及一种新的特征优化技术，称为加权Aquila优化方法(WAOM)。此外，采用结合模糊时间特征的广义回归神经网络(FTGRNN)进行情感和语义分类，获得了较好的分类效果。本工作通过实验对新开发的PRS进行了评价，并证明其在预测准确度、精密度、召回率、偶然性和nDCG方面都优于该方向的其他系统。

{"title":"A New Sentiment and Fuzzy Aware Product Recommendation System Using Weighted Aquila Optimization and GRNN in e-Commerce","authors":"L. Antony Rosewelt, D. Naveen Raju, E. Sujatha","doi":"10.5755/j01.itc.52.3.33042","DOIUrl":"https://doi.org/10.5755/j01.itc.52.3.33042","url":null,"abstract":"Customer reviews are playing an important role in e-commerce for increasing sales by knowing the customer’s purchase pattern and expectations. The reviews that are collected after completing their purchase reflect the quality and services in e-commerce. The user’s reviews are characterized and categorized through sentiment and semantic analysis. Moreover, the sentiment and semantic classification processes are also performed to predict the user’s purchase patterns and liked products. However, the available classification is not able to predict the user’s purchase patterns. In this paper, we propose a new Product Recommendation System (PRS) to predict the appropriate product for users based on their purchase behavior and pattern. The proposed recommendation system incorporates the standard data preprocessing tasks like tokenization process, Parts of Speech (PoS) tagging process, and parsing, a new sentiment and semantic score calculation procedure, and a new feature optimization technique called the Weighted Aquila Optimization Method (WAOM). Moreover, the sentiment and semantic classification processes are performed by applying a General Regression Neural Network with the incorporation of fuzzy temporal features (FTGRNN) and obtaining better classification results. The newly developed PRS is evaluated by conducting experiments in this work and also proved as superior than other systems available in this direction in terms of prediction accuracy, precision, recall, serendipity and nDCG.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134887049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Kinodynamic RRT* Based UAV Optimal State Motion Planning with Collision Risk Awareness 基于RRT*的碰撞风险感知无人机最优状态运动规划

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33583

Haolin Yin, Baoquan Li, Hai Zhu, Lintao Shi

In this paper, an autonomous navigation strategy is proposed for unmanned aerial vehicles (UAVs) based on consideration of dynamic sampling and field of view (FOV). Compare to search-based motion planning, sampling-based kinodynamic planning schemes can often find feasible trajectories in complex environments. Specifically, a global trajectory is first generated with physical information, and an expansion algorithm is constructed regarding to kinodynamic rapidly-exploring random tree* (KRRT*). Then, a KRRT* expansion strategy is designed to find local collision-free trajectories. In trajectory optimization, bending radius, collision risk function, and yaw angle penalty term are defined by taking into account onboard sensor FOV and potentialrisk. Then, smooth and dynamic feasible terms are penalized based on initial trajectory generation. Trajectories are refined by time reallocation, and weights are solved by optimization. Effectiveness of the proposed strategy is demonstrated by both simulation and experiment.

提出了一种考虑动态采样和视场的无人机自主导航策略。与基于搜索的运动规划相比，基于采样的运动规划方案往往能够在复杂的环境中找到可行的运动轨迹。具体而言，首先利用物理信息生成全局轨迹，并构造了一种基于动力学快速探索随机树* (KRRT*)的展开算法。然后，设计了KRRT*扩展策略来寻找局部无碰撞轨迹。在轨迹优化中，考虑了机载传感器的视场和潜在风险，定义了弯曲半径、碰撞风险函数和偏航角惩罚项。然后，在初始轨迹生成的基础上，对平滑可行项和动态可行项进行惩罚。通过时间重新分配细化轨迹，通过优化求解权值。仿真和实验验证了该策略的有效性。

引用次数: 0

Face Positioned Driver Drowsiness Detection Using Multistage Adaptive 3D Convolutional Neural Network 基于多级自适应三维卷积神经网络的人脸定位驾驶员困倦检测

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33719

N. Adhithyaa, A. Tamilarasi, D. Sivabalaselvamani, L. Rahunathan

Accidents due to driver drowsiness are observed to be increasing at an alarming rate across all countries and it becomes necessary to identify driver drowsiness to reduce accident rates. Researchers handled many machine learning and deep learning techniques especially many CNN variants created for drowsiness detection, but it is dangerous to use in real time, as the design fails due to high computational complexity, low evaluation accuracies and low reliability. In this article, we introduce a multistage adaptive 3D-CNN model with multi-expressive features for Driver Drowsiness Detection (DDD) with special attention to system complexity and performance. The proposed architecture is divided into five cascaded stages: (1) A three level Convolutional Neural Network (CNN) for driver face positioning (2) 3D-CNN based Spatio-Temporal (ST) Learning to extract 3D features from face positioned stacked samples. (3) State Understanding (SU) to train 3D-CNN based drowsiness models (4) Feature fusion using ST and SU stages (5) Drowsiness Detection stage. The Proposed system extract ST values from the face positioned images and then merges it with SU results from each state understanding sub models to create conditional driver facial features for final Drowsiness Detection (DD) model. Final DD Model is trained offline and implemented in online, results show the developed model performs well when compared to others and additionally capable of handling Indian conditions. This method is applied (Trained and Evaluated) using two different datasets, Kongu Engineering College Driver Drowsiness Detection (KEC-DDD) own dataset and National Tsing Hua University Driver Drowsiness Detection (NTHU-DDD) Benchmark Dataset. The proposed system trained with KEC-DDD dataset produces accuracy of 77.45% and 75.91% using evaluation set of KEC-DDD and NTHU-DDD dataset and capable to detect driver drowsiness from 256×256 resolution images at 39.6 fps at an average of 400 execution seconds.

在所有国家，由于驾驶员嗜睡造成的事故都以惊人的速度增加，因此有必要确定驾驶员嗜睡以降低事故率。研究人员处理了许多机器学习和深度学习技术，特别是许多为困倦检测而创建的CNN变体，但在实时使用时是危险的，因为高计算复杂性，低评估精度和低可靠性导致设计失败。在本文中，我们介绍了一种具有多表达特征的多级自适应3D-CNN模型，用于驾驶员困倦检测(DDD)，特别注意系统的复杂性和性能。该架构分为5个级联阶段:(1)基于三层卷积神经网络(CNN)的驾驶员面部定位;(2)基于3D-CNN的时空(ST)学习，从人脸定位的堆叠样本中提取3D特征。(3)状态理解(State Understanding, SU)训练基于3D-CNN的困倦模型(4)ST和SU阶段的特征融合(5)困倦检测阶段。该系统从人脸定位图像中提取ST值，然后将其与每个状态理解子模型的SU结果合并，为最终的困倦检测(DD)模型创建条件驾驶员面部特征。最终的DD模型是离线训练并在线实施的，结果表明，与其他模型相比，开发的模型表现良好，并且能够处理印度的情况。该方法使用两个不同的数据集进行应用(训练和评估)，孔谷工程学院驾驶员嗜睡检测(KEC-DDD)自己的数据集和国立清华大学驾驶员嗜睡检测(NTHU-DDD)基准数据集。该系统使用KEC-DDD数据集和NTHU-DDD数据集进行训练，准确率分别为77.45%和75.91%，能够以39.6 fps的速度从256×256分辨率图像中检测驾驶员困倦，平均执行时间为400秒。

{"title":"Face Positioned Driver Drowsiness Detection Using Multistage Adaptive 3D Convolutional Neural Network","authors":"N. Adhithyaa, A. Tamilarasi, D. Sivabalaselvamani, L. Rahunathan","doi":"10.5755/j01.itc.52.3.33719","DOIUrl":"https://doi.org/10.5755/j01.itc.52.3.33719","url":null,"abstract":"Accidents due to driver drowsiness are observed to be increasing at an alarming rate across all countries and it becomes necessary to identify driver drowsiness to reduce accident rates. Researchers handled many machine learning and deep learning techniques especially many CNN variants created for drowsiness detection, but it is dangerous to use in real time, as the design fails due to high computational complexity, low evaluation accuracies and low reliability. In this article, we introduce a multistage adaptive 3D-CNN model with multi-expressive features for Driver Drowsiness Detection (DDD) with special attention to system complexity and performance. The proposed architecture is divided into five cascaded stages: (1) A three level Convolutional Neural Network (CNN) for driver face positioning (2) 3D-CNN based Spatio-Temporal (ST) Learning to extract 3D features from face positioned stacked samples. (3) State Understanding (SU) to train 3D-CNN based drowsiness models (4) Feature fusion using ST and SU stages (5) Drowsiness Detection stage. The Proposed system extract ST values from the face positioned images and then merges it with SU results from each state understanding sub models to create conditional driver facial features for final Drowsiness Detection (DD) model. Final DD Model is trained offline and implemented in online, results show the developed model performs well when compared to others and additionally capable of handling Indian conditions. This method is applied (Trained and Evaluated) using two different datasets, Kongu Engineering College Driver Drowsiness Detection (KEC-DDD) own dataset and National Tsing Hua University Driver Drowsiness Detection (NTHU-DDD) Benchmark Dataset. The proposed system trained with KEC-DDD dataset produces accuracy of 77.45% and 75.91% using evaluation set of KEC-DDD and NTHU-DDD dataset and capable to detect driver drowsiness from 256×256 resolution images at 39.6 fps at an average of 400 execution seconds.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134886923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Liver Lesion Detection Using Semantic Segmentation and Chaotic Cuckoo Search Algorithm 基于语义分割和混沌杜鹃搜索算法的肝脏病变检测

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.34032

R. Murugesan, K. Devaki

The classic feature extraction techniques used in recent research on computer-aided diagnosis (CAD) of liver cancer have several disadvantages, including duplicated features and substantial computational expenses. Modern deep learning methods solve these issues by implicitly detecting complex structures in massive quantities of healthcare image data. This study suggests a unique bio-inspired deep-learning way for improving liver cancer prediction outcomes. Initially, a novel semantic segmentation technique known as UNet++ is proposed to extract liver lesions from computed tomography (CT) images. Second, a hybrid approach that combines the Chaotic Cuckoo Search algorithm and AlexNet is indicated as a feature extractor and classifier for liver lesions. LiTS, a freely accessible database that contains abdominal CT images, was employed for liver tumor diagnosis and investigation. The segmentation results were evaluated using the Dice similarity coefficient and Correlation coefficient. The classification results were assessed using Accuracy, Precision, Recall, F1 Score, and Specificity. Concerning the performance metrics such as accuracy, precision, and recall, the recommended method performs better than existing algorithms producing the highest values such as 99.2%, 98.6%, and 98.8%, respectively.

近年来肝癌计算机辅助诊断(CAD)研究中使用的经典特征提取技术存在特征重复、计算量大等缺点。现代深度学习方法通过隐式检测大量医疗图像数据中的复杂结构来解决这些问题。这项研究提出了一种独特的生物启发的深度学习方法来提高肝癌的预测结果。首先，提出了一种新的语义分割技术unet++，用于从计算机断层扫描(CT)图像中提取肝脏病变。其次，提出了一种结合混沌杜鹃搜索算法和AlexNet的混合方法作为肝脏病变的特征提取和分类器。LiTS是一个免费访问的数据库，包含腹部CT图像，用于肝脏肿瘤的诊断和调查。使用Dice相似系数和相关系数对分割结果进行评价。分类结果采用准确性、精密度、召回率、F1评分和特异性进行评估。在准确度、精密度和召回率等性能指标上，推荐方法的性能优于现有算法，分别达到99.2%、98.6%和98.8%的最高值。

{"title":"Liver Lesion Detection Using Semantic Segmentation and Chaotic Cuckoo Search Algorithm","authors":"R. Murugesan, K. Devaki","doi":"10.5755/j01.itc.52.3.34032","DOIUrl":"https://doi.org/10.5755/j01.itc.52.3.34032","url":null,"abstract":"The classic feature extraction techniques used in recent research on computer-aided diagnosis (CAD) of liver cancer have several disadvantages, including duplicated features and substantial computational expenses. Modern deep learning methods solve these issues by implicitly detecting complex structures in massive quantities of healthcare image data. This study suggests a unique bio-inspired deep-learning way for improving liver cancer prediction outcomes. Initially, a novel semantic segmentation technique known as UNet++ is proposed to extract liver lesions from computed tomography (CT) images. Second, a hybrid approach that combines the Chaotic Cuckoo Search algorithm and AlexNet is indicated as a feature extractor and classifier for liver lesions. LiTS, a freely accessible database that contains abdominal CT images, was employed for liver tumor diagnosis and investigation. The segmentation results were evaluated using the Dice similarity coefficient and Correlation coefficient. The classification results were assessed using Accuracy, Precision, Recall, F1 Score, and Specificity. Concerning the performance metrics such as accuracy, precision, and recall, the recommended method performs better than existing algorithms producing the highest values such as 99.2%, 98.6%, and 98.8%, respectively.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134886682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimized Deep Learning Model Using Modified Whale’s Optimization Algorithm for EEG Signal Classification 基于改进Whale优化算法的脑电信号分类深度学习模型优化

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33320

K. Venu, P. Natesan

Brain-Computer Interface (BCI) is a technology in which Electroencephalogram (EEG) signals are utilized to create a link between a person’s mental state and a computer-based signal processing system that decodes the signals without needing muscle movement. The mental process of picturing the movement of a body component without actually moving that body part is known as Motor Imagery (MI). MI BCI is a Motor Imagery-based Brain-Computer Interface that allows patients with motor impairments to interact with their environment by operating robotic prostheses, wheelchairs, and other equipment. Feature extraction and classification are essential parts of the EEG signal processing for MI BCI. In this work, Whales Optimization Algorithm with an Improved Mutualism Phase is proposed to find the optimal Convolutional Neural Network architecture for the classification of motor imagery tasks with high accuracy and less computational complexity. The Neurosky and BCI IV 2a datasets were used to evaluate the proposed methodology. Experiments demonstrate that the suggested technique outperforms other competing methods regarding classification accuracy values at 94.1% and 87.7% for the Neurosky and BCI datasets, respectively.

脑机接口(BCI)是一种利用脑电图(EEG)信号在人的精神状态和基于计算机的信号处理系统之间建立联系的技术，该系统无需肌肉运动即可解码信号。在不实际移动身体部位的情况下，想象身体某个部位运动的心理过程被称为运动想象(MI)。MI BCI是一种基于运动图像的脑机接口，它允许运动障碍患者通过操作机器人假肢、轮椅和其他设备与他们的环境进行互动。特征提取和分类是脑电信号处理的重要组成部分。本文提出了一种改进的互共生阶段的whale优化算法，以寻找最优的卷积神经网络架构，用于高精度和低计算复杂度的运动图像任务分类。使用Neurosky和BCI IV 2a数据集来评估所提出的方法。实验表明，在Neurosky和BCI数据集上，该方法的分类准确率分别为94.1%和87.7%，优于其他竞争方法。

{"title":"Optimized Deep Learning Model Using Modified Whale’s Optimization Algorithm for EEG Signal Classification","authors":"K. Venu, P. Natesan","doi":"10.5755/j01.itc.52.3.33320","DOIUrl":"https://doi.org/10.5755/j01.itc.52.3.33320","url":null,"abstract":"Brain-Computer Interface (BCI) is a technology in which Electroencephalogram (EEG) signals are utilized to create a link between a person’s mental state and a computer-based signal processing system that decodes the signals without needing muscle movement. The mental process of picturing the movement of a body component without actually moving that body part is known as Motor Imagery (MI). MI BCI is a Motor Imagery-based Brain-Computer Interface that allows patients with motor impairments to interact with their environment by operating robotic prostheses, wheelchairs, and other equipment. Feature extraction and classification are essential parts of the EEG signal processing for MI BCI. In this work, Whales Optimization Algorithm with an Improved Mutualism Phase is proposed to find the optimal Convolutional Neural Network architecture for the classification of motor imagery tasks with high accuracy and less computational complexity. The Neurosky and BCI IV 2a datasets were used to evaluate the proposed methodology. Experiments demonstrate that the suggested technique outperforms other competing methods regarding classification accuracy values at 94.1% and 87.7% for the Neurosky and BCI datasets, respectively.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134961019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Survey on Regression-Based Crowd Counting Techniques 基于回归的人群计数技术综述

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33701

Yu Hao, Huimin Du, Meiwen Mao, Ying Liu, Jiulun Fan

Traditional detect and count strategy can’t well handle the extremely crowded footage in computer vision-based counting task. In recent years, deep learning approaches have been widely explored to tackle this challenge. By regressing visual features to density map, the total crowd number can be predicted while avoids the detection of their actual positions. Efforts of improving performance distribute at various phases of the detecting pipeline, such as feature extraction and eliminating deviation of regressed density map etc. In this article, we conduct a thorough review on the most representative and state-of-the-art techniques. The efforts are systematically categorized into three topics: the evolving of front-end network, the handling of unbalanced density map prediction, and the selection of loss function. After the evaluation of most significant techniques, innovations of the state-of-the-art are inspected in detail to analyze specific reasons to achieve high performances. As conclusion, possible directions of enhancement are discussed to provide insights of future research.

传统的检测和计数策略不能很好地处理基于计算机视觉的计数任务中极为拥挤的镜头。近年来，深度学习方法已被广泛探索以应对这一挑战。通过将视觉特征回归到密度图中，可以预测人群的总人数，同时避免检测人群的实际位置。提高性能的努力分布在检测管道的各个阶段，如特征提取和消除回归密度图的偏差等。在本文中，我们对最具代表性和最先进的技术进行了全面的回顾。系统地分为三个方面:前端网络的演化、不平衡密度图预测的处理和损失函数的选择。在对最重要的技术进行评估后，详细检查了最先进的创新，以分析实现高性能的具体原因。最后，讨论了可能的增强方向，为今后的研究提供参考。

引用次数: 0

Saliency Detection Algorithm for Foggy Images Based on Deep Learning 基于深度学习的雾天图像显著性检测算法

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.32258

Leihong Zhang, Zhaoyuan Ji, Runchu Xu, Dawei Zhang

The detection of salient objects in foggy scenes is an important research component in many practical applications such as action recognition, target tracking and pedestrian re-identification. To facilitate saliency detection in foggy scenes, this paper explores two issues. The construction of dataset for foggy weather conditions and implementation scheme for foggy weather saliency detection. Firstly, a foggy sky image synthesis method is designed based on the atmospheric scattering model, and a saliency detection dataset applicable to foggy sky is constructed. Secondly, we compare the current classification networks and adopt resnet50, which has the highest classification accuracy, as the backbone network of the classification module, and classify the foggy sky images into three levels, namely fogless, light fog and dense fog, according to different concentrations. Then, Residual Refinement Network (R2Net) was selected to train and test the classified images. Horizontal and vertical flipping and image cropping were used to enhance the training set to relieve over-fitting. The accuracy of the network model was improved by using Adam as the optimizer. Experimental results show that for the detection of fogless images, our method is almost on par with state-of-the-art, and performs well for both light and dense fog images. Our method has good adaptability, accuracy and robustness.

雾天场景中显著目标的检测是动作识别、目标跟踪和行人再识别等许多实际应用中重要的研究内容。为了便于雾天场景的显著性检测，本文探讨了两个问题。雾天气条件数据集的构建及雾天气显著性检测的实现方案。首先，设计了一种基于大气散射模型的雾天图像合成方法，构建了适用于雾天的显著性检测数据集;其次，对比现有的分类网络，采用分类精度最高的resnet50作为分类模块的骨干网络，将雾天图像根据浓度的不同分为无雾、轻雾和浓雾三个级别。然后，选择残差细化网络(R2Net)对分类图像进行训练和测试。通过水平和垂直翻转以及图像裁剪来增强训练集以缓解过拟合。采用Adam作为优化器，提高了网络模型的精度。实验结果表明，对于无雾图像的检测，我们的方法几乎达到了最先进的水平，并且对轻雾和浓雾图像都有很好的检测效果。该方法具有良好的自适应性、准确性和鲁棒性。

{"title":"Saliency Detection Algorithm for Foggy Images Based on Deep Learning","authors":"Leihong Zhang, Zhaoyuan Ji, Runchu Xu, Dawei Zhang","doi":"10.5755/j01.itc.52.3.32258","DOIUrl":"https://doi.org/10.5755/j01.itc.52.3.32258","url":null,"abstract":"The detection of salient objects in foggy scenes is an important research component in many practical applications such as action recognition, target tracking and pedestrian re-identification. To facilitate saliency detection in foggy scenes, this paper explores two issues. The construction of dataset for foggy weather conditions and implementation scheme for foggy weather saliency detection. Firstly, a foggy sky image synthesis method is designed based on the atmospheric scattering model, and a saliency detection dataset applicable to foggy sky is constructed. Secondly, we compare the current classification networks and adopt resnet50, which has the highest classification accuracy, as the backbone network of the classification module, and classify the foggy sky images into three levels, namely fogless, light fog and dense fog, according to different concentrations. Then, Residual Refinement Network (R2Net) was selected to train and test the classified images. Horizontal and vertical flipping and image cropping were used to enhance the training set to relieve over-fitting. The accuracy of the network model was improved by using Adam as the optimizer. Experimental results show that for the detection of fogless images, our method is almost on par with state-of-the-art, and performs well for both light and dense fog images. Our method has good adaptability, accuracy and robustness.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134960200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Human Motion Pattern Recognition Based on Nano-sensor and Deep Learning 基于纳米传感器和深度学习的人体运动模式识别

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33155

Sha Ji, Chengde Lin

A human motion pattern recognition algorithm based on Nano-sensor and deep learning is studied to recognize human motion patterns in real time and with high accuracy. First, human motion data are collected by micro electro mechanical system, and the noise in such data is filtered by smoothing filtering method to obtain high-quality motion data. Second, key time-domain features are extracted from high-quality motion data. Finally, after fusing and processing the key time-domain features, it is input into the deep long and short-term memory (LSTM) neural network to build a deep LSTM human motion pattern recognition model and complete human motion pattern recognition. The results show that the proposed algorithm can realize the recognition of various motion patterns with high accuracy of data acquisition, the average recognition accuracy is 94.8%, the average recall reaches 89.7%, and the F1 score of the algorithm are high, and the recognition time consuming is short, which can realize accurate and efficient human motion pattern recognition and provide guarantee for effective monitoring of the target human motion health.

为了实时、高精度地识别人体运动模式，研究了一种基于纳米传感器和深度学习的人体运动模式识别算法。首先，由微机电系统采集人体运动数据，对数据中的噪声进行平滑滤波，得到高质量的运动数据。其次，从高质量运动数据中提取关键时域特征;最后，将关键时域特征融合处理后，输入到深度长短期记忆(LSTM)神经网络中，构建深度LSTM人体运动模式识别模型，完成人体运动模式识别。结果表明，所提算法能够以较高的数据采集准确率实现对多种运动模式的识别，平均识别准确率为94.8%，平均查全率达到89.7%，且算法F1得分较高，识别耗时短，能够实现准确高效的人体运动模式识别，为有效监测目标人体运动健康状况提供保障。

引用次数: 0

SSTP: Stock Sector Trend Prediction with Temporal-Spatial Network 基于时空网络的股票板块趋势预测

4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS

Information Technology and Control

Pub Date : 2023-09-26 DOI: 10.5755/j01.itc.52.3.33360

Shuo Yin, Youwei Gao, Shuai Nie, Junbao Li

In financial big data field, most existing work of stock prediction has focused on the prediction of a single stock trend. However, it is challenging to predict a stock price series due to its drastic volatility. While the stock sector is a group of stocks belonging to the same sector, and the stock sector index is the weighted sum of the prices of all the stocks in the sector. Therefore the trend of stock sector is more stable and more feasible to predict than that of a single stock. In this paper, we propose a new method named Stock Sector Trend Prediction (SSTP) to solve the problem of predicting stock sector trend. In SSTP method, we adopt the Relative Price Strength (RPS) to describe the trend of the stock sector, which is the relative rank of stock sector trend. In order to learn the intrinsic probability distribution of the stock sector index series, we construct the multi-scale RPS time series and build multiple independent fully-connected stock sector relation graphs based on the real relationship among stock sectors. Then, we propose a Temporal-spatial Network (TSN) to extract the temporal features from the multi-scale RPS series and the spatial features from the stock sector relation graphs. Finally, the TSN predicts and ranks the trends of the stock sector trend with the temporal-spatial features. The experimental results on the real-world dataset validate the effectiveness of the proposed SSTP method for the stock sector trend prediction.

在金融大数据领域，现有的股票预测工作大多集中在对单一股票走势的预测上。然而，由于股票价格系列的剧烈波动，预测它是具有挑战性的。而股票行业是属于同一行业的一组股票，股票行业指数是该行业所有股票价格的加权和。因此，股票板块的走势比单一股票的走势更稳定，更易于预测。本文提出了一种新的股票行业趋势预测方法(SSTP)来解决股票行业趋势预测问题。在SSTP方法中，我们采用相对价格强度(Relative Price Strength, RPS)来描述股票行业的趋势，它是股票行业趋势的相对等级。为了了解股票行业指数序列的内在概率分布，我们构造了多尺度RPS时间序列，并基于真实的股票行业之间的关系，构建了多个独立的全连通股票行业关系图。然后，我们提出了一个时空网络(TSN)来提取多尺度RPS序列的时间特征和股票行业关系图的空间特征。最后，TSN对股票行业趋势的时空特征进行预测和排序。在实际数据集上的实验结果验证了所提出的SSTP方法对股票行业趋势预测的有效性。

{"title":"SSTP: Stock Sector Trend Prediction with Temporal-Spatial Network","authors":"Shuo Yin, Youwei Gao, Shuai Nie, Junbao Li","doi":"10.5755/j01.itc.52.3.33360","DOIUrl":"https://doi.org/10.5755/j01.itc.52.3.33360","url":null,"abstract":"In financial big data field, most existing work of stock prediction has focused on the prediction of a single stock trend. However, it is challenging to predict a stock price series due to its drastic volatility. While the stock sector is a group of stocks belonging to the same sector, and the stock sector index is the weighted sum of the prices of all the stocks in the sector. Therefore the trend of stock sector is more stable and more feasible to predict than that of a single stock. In this paper, we propose a new method named Stock Sector Trend Prediction (SSTP) to solve the problem of predicting stock sector trend. In SSTP method, we adopt the Relative Price Strength (RPS) to describe the trend of the stock sector, which is the relative rank of stock sector trend. In order to learn the intrinsic probability distribution of the stock sector index series, we construct the multi-scale RPS time series and build multiple independent fully-connected stock sector relation graphs based on the real relationship among stock sectors. Then, we propose a Temporal-spatial Network (TSN) to extract the temporal features from the multi-scale RPS series and the spatial features from the stock sector relation graphs. Finally, the TSN predicts and ranks the trends of the stock sector trend with the temporal-spatial features. The experimental results on the real-world dataset validate the effectiveness of the proposed SSTP method for the stock sector trend prediction.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134886919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Information Technology and Control

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀