首页 > 最新文献

2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)最新文献

英文 中文
Double Deep Q-Learning based Backhaul Spectrum Allocation in Integrated Access and Backhaul Network 基于双深度q学习的综合接入回传网络回程频谱分配
Jeonghun Park, Heetae Jin, Jaehan Joo, Geonho Choi, Suk Chan Kim
In the fifth-generation (5G) network, mmWave has been utilized to cope with a demand for an extremely high data rate. However, the harsh propagation characteristic of mmWave signal limits networks' coverage, thus requiring network densification. Under this circumstance, 3GPP has introduced Integrated Access and Backhaul (IAB) architecture for cost-effective network deployment&operation. Contrary to traditional network architecture using wired backhaul links, IAB uses wireless backhaul links to forward data traffic. This feature improves spectrum utilization and cost efficiency. However, due to the dynamic, time-varying environment of the IAB network, finding a proper resource allocation strategy is a challenging issue. In this paper, we formulate the backhaul spectrum allocation problem maximizing user sum capacity. Then propose a double deep Q-Iearning-based backhaul spectrum allocation strategy. The simulation result shows that the proposed reinforcement learning-based spectrum allocation can achieve 20% higher user sum capacity than static rule-based spectrum allocation.
在第五代(5G)网络中,毫米波已被用于应对对极高数据速率的需求。然而,毫米波信号的恶劣传播特性限制了网络的覆盖范围,因此需要网络致密化。在这种情况下,3GPP引入了综合接入和回程(IAB)架构,以实现经济高效的网络部署和运营。与使用有线回程链路的传统网络架构相反,IAB使用无线回程链路转发数据流量。该特性提高了频谱利用率和成本效率。然而,由于IAB网络的动态、时变环境,寻找合适的资源分配策略是一个具有挑战性的问题。本文提出了最大用户和容量的回程频谱分配问题。然后提出了一种基于双深度q学习的回程频谱分配策略。仿真结果表明,基于强化学习的频谱分配比基于静态规则的频谱分配能提高20%的用户和容量。
{"title":"Double Deep Q-Learning based Backhaul Spectrum Allocation in Integrated Access and Backhaul Network","authors":"Jeonghun Park, Heetae Jin, Jaehan Joo, Geonho Choi, Suk Chan Kim","doi":"10.1109/ICAIIC57133.2023.10067029","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067029","url":null,"abstract":"In the fifth-generation (5G) network, mmWave has been utilized to cope with a demand for an extremely high data rate. However, the harsh propagation characteristic of mmWave signal limits networks' coverage, thus requiring network densification. Under this circumstance, 3GPP has introduced Integrated Access and Backhaul (IAB) architecture for cost-effective network deployment&operation. Contrary to traditional network architecture using wired backhaul links, IAB uses wireless backhaul links to forward data traffic. This feature improves spectrum utilization and cost efficiency. However, due to the dynamic, time-varying environment of the IAB network, finding a proper resource allocation strategy is a challenging issue. In this paper, we formulate the backhaul spectrum allocation problem maximizing user sum capacity. Then propose a double deep Q-Iearning-based backhaul spectrum allocation strategy. The simulation result shows that the proposed reinforcement learning-based spectrum allocation can achieve 20% higher user sum capacity than static rule-based spectrum allocation.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127538583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Enhanced-feature pyramid network for semantic segmentation 用于语义分割的增强特征金字塔网络
Van Toan Quyen, Jong Hyuk Lee, Min Young Kim
Semantic segmentation is a complicated topic when they require strictly the object boundary accuracy. For autonomous driving applications, they have to face a long range of objective sizes in the street scenes, so a single field of views is not suitable to extract input features. Feature pyramid network (FPN) is an effective method for computer vision tasks such as object detection and semantic segmentation. The architecture of this approach composes of a bottom-up pathway and a top-down pathway. Based on the structure, we can obtain rich spatial information from the largest layer and extract rich segmentation information from lower-scale features. The traditional FPN efficiently captures different objective sizes by using multiple receptive fields and then predicts the outputs from the concatenated features. The final feature combination is not optimistic when they burden the hardware with huge computation and reduce the semantic information. In this paper, we propose multiple predictions for semantic segmentation. Instead of combining four-feature scales together, the proposed method processes separately three lower scales as the contextual contributor and the largest features as the coarser-information branch. Each contextual feature is concatenated with the coarse branch to generate an individual prediction. By deploying this architecture, a single prediction effectively segments specific objective sizes. Finally, score maps are fused together in order to gather the prominent weights from the different predictions. A series of experiments is implemented to validate the efficiency on various open data sets. We have achieved good results 76.4% $m$IoU at 52 FPS on Cityscapes and 43.6% $m$IoU on Mapillary Vistas.
语义分割是一个非常复杂的问题,它对目标边界精度要求很高。对于自动驾驶应用来说,他们必须面对街景中的长范围物镜尺寸,因此单一视场不适合提取输入特征。特征金字塔网络(FPN)是一种有效的计算机视觉目标检测和语义分割方法。该方法的体系结构由自底向上路径和自顶向下路径组成。基于该结构,我们可以从最大层中获得丰富的空间信息,并从较低尺度的特征中提取丰富的分割信息。传统的FPN通过使用多个接收域来捕获不同的目标大小,然后从连接的特征中预测输出。当它们给硬件带来巨大的计算负担和减少语义信息时,最终的特征组合是不乐观的。在本文中,我们提出了语义分割的多个预测。该方法不是将四个特征尺度组合在一起,而是分别处理三个较低的尺度作为上下文贡献者,并将最大的特征作为粗信息分支。每个上下文特征与粗分支相连接,以生成单独的预测。通过部署这种架构,单个预测就可以有效地分割特定的目标大小。最后,将分数图融合在一起,以便从不同的预测中收集突出的权重。通过一系列实验验证了该算法在各种开放数据集上的有效性。我们取得了良好的成果,在城市景观方面取得了76.4%的百万美元IoU,在Mapillary远景方面取得了43.6%的百万美元IoU。
{"title":"Enhanced-feature pyramid network for semantic segmentation","authors":"Van Toan Quyen, Jong Hyuk Lee, Min Young Kim","doi":"10.1109/ICAIIC57133.2023.10067062","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067062","url":null,"abstract":"Semantic segmentation is a complicated topic when they require strictly the object boundary accuracy. For autonomous driving applications, they have to face a long range of objective sizes in the street scenes, so a single field of views is not suitable to extract input features. Feature pyramid network (FPN) is an effective method for computer vision tasks such as object detection and semantic segmentation. The architecture of this approach composes of a bottom-up pathway and a top-down pathway. Based on the structure, we can obtain rich spatial information from the largest layer and extract rich segmentation information from lower-scale features. The traditional FPN efficiently captures different objective sizes by using multiple receptive fields and then predicts the outputs from the concatenated features. The final feature combination is not optimistic when they burden the hardware with huge computation and reduce the semantic information. In this paper, we propose multiple predictions for semantic segmentation. Instead of combining four-feature scales together, the proposed method processes separately three lower scales as the contextual contributor and the largest features as the coarser-information branch. Each contextual feature is concatenated with the coarse branch to generate an individual prediction. By deploying this architecture, a single prediction effectively segments specific objective sizes. Finally, score maps are fused together in order to gather the prominent weights from the different predictions. A series of experiments is implemented to validate the efficiency on various open data sets. We have achieved good results 76.4% $m$IoU at 52 FPS on Cityscapes and 43.6% $m$IoU on Mapillary Vistas.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127022946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RNN-based Interference Suppression Method for CS radar: Simulation and Experimental Evaluations 基于rnn的CS雷达干扰抑制方法:仿真与实验评估
Ryoto Koizumi, Xiaoyan Wang, M. Umehira, S. Takeda, Ran Sun
In recent years, high-resolution 77GHz onboard automotive radar has been extensively investigated for automated driving due to its high performance and low cost characteristics. As onboard CS (Chirp Sequence) radars' deployment density increases, inter-radar interference occurs which will increase target miss-detection and false-detection probabilities significantly. To address this critical and challenging problem, wideband interference suppression method using deep learning was proposed, in which the feasibility for performance improvement is validated based on simulations. In this study, we perform both simulation and experimental evaluations on RNN (recurrent neural network) based interference suppression method, in order to address the tradeoff between the model training time and interference suppression performance and validate its real-world applicability.
近年来,高分辨率77GHz车载雷达由于其高性能和低成本的特点,在自动驾驶领域得到了广泛的研究。随着机载CS (Chirp Sequence)雷达部署密度的增加,雷达间的干扰会显著增加目标失检和误检的概率。为了解决这一关键且具有挑战性的问题,提出了基于深度学习的宽带干扰抑制方法,并通过仿真验证了该方法性能改进的可行性。在本研究中,我们对基于RNN(递归神经网络)的干扰抑制方法进行了仿真和实验评估,以解决模型训练时间和干扰抑制性能之间的权衡,并验证其在现实世界中的适用性。
{"title":"RNN-based Interference Suppression Method for CS radar: Simulation and Experimental Evaluations","authors":"Ryoto Koizumi, Xiaoyan Wang, M. Umehira, S. Takeda, Ran Sun","doi":"10.1109/ICAIIC57133.2023.10067132","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067132","url":null,"abstract":"In recent years, high-resolution 77GHz onboard automotive radar has been extensively investigated for automated driving due to its high performance and low cost characteristics. As onboard CS (Chirp Sequence) radars' deployment density increases, inter-radar interference occurs which will increase target miss-detection and false-detection probabilities significantly. To address this critical and challenging problem, wideband interference suppression method using deep learning was proposed, in which the feasibility for performance improvement is validated based on simulations. In this study, we perform both simulation and experimental evaluations on RNN (recurrent neural network) based interference suppression method, in order to address the tradeoff between the model training time and interference suppression performance and validate its real-world applicability.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116570511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Reinforcement Learning Assisted Relative Distance based MAC in Vehicular Networks 车辆网络中基于相对距离的强化学习MAC
Yafeng Deng, Young-June Choi
Many efforts have been done to increase the performance of vehicle-to-vehicle (V2V) services, such as basic safety message (BSM) and collision avoidance warning. However, high dynamics, such as topology and channel condition, still pose big challenges to resource allocation tasks in vehicular networks. A previous work, relative distance based MAC [1], is proposed to address merging collision. The dynamics can not be fully addressed because thresholds are used. Therefore, we intuitively adapt a dueling deep Q-network [2] to tune the threshold based on the aforementioned work to further address merging collision. The simulation results demonstrate the improvement of the proposed algorithm.
为了提高车对车(V2V)服务的性能,人们已经做了很多努力,比如基本安全信息(BSM)和避碰警告。然而,拓扑和信道条件等高动态特性仍然给车载网络的资源分配任务带来了很大的挑战。先前的工作,基于相对距离的MAC[1],提出了解决合并碰撞。由于使用了阈值,因此无法完全解决动态问题。因此,我们在上述工作的基础上,直观地调整了一个决斗深度Q-network[2]来调整阈值,以进一步解决合并碰撞问题。仿真结果表明了该算法的改进。
{"title":"A Reinforcement Learning Assisted Relative Distance based MAC in Vehicular Networks","authors":"Yafeng Deng, Young-June Choi","doi":"10.1109/ICAIIC57133.2023.10067126","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067126","url":null,"abstract":"Many efforts have been done to increase the performance of vehicle-to-vehicle (V2V) services, such as basic safety message (BSM) and collision avoidance warning. However, high dynamics, such as topology and channel condition, still pose big challenges to resource allocation tasks in vehicular networks. A previous work, relative distance based MAC [1], is proposed to address merging collision. The dynamics can not be fully addressed because thresholds are used. Therefore, we intuitively adapt a dueling deep Q-network [2] to tune the threshold based on the aforementioned work to further address merging collision. The simulation results demonstrate the improvement of the proposed algorithm.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131388761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time object detection using a domain-based transfer learning method for resource-constrained edge devices 基于域迁移学习的资源受限边缘设备实时目标检测
Dongkyu Kim, Seokjun Lee, Nak-Myoung Sung, Chungjae Choe
This paper presents a domain-based transfer learning method for deep learning-based object detection models where the method enables real-time computation in resource-constrained edge devices. Object detection is an essential task for intelligent platforms (e.g., drones, robots, and autonomous vehicles). However, edge devices could not afford to run huge object detection models due to insufficient resources. Although a compressed deep learning model increases inference speed, the accuracy of the model could be significantly deteriorate. In this paper, we propose an accurate object detection method while achieving real-time computation on edge devices. Our method aims to reduce marginal detection outputs of models according to application domains (e.g., city, park, factory, etc). We classify crucial objects (i.e., pedestrian, car, bench, etc) for a specific domain and adopt a transfer learning in which the learning is solely towards the selected objects. Such approach improves detection accuracy even for a compressed deep learning model like tiny versions of a YOLO (you only look once) framework. From the experiments, we validate that the method empowers the YOLOv7-tiny can provide the comparable detection accuracy with a YOLOv7 model despite of 83% less parameters than that of the original model. Besides, we confirm that our method achieves 389% faster inference on resource-constrained edge devices (i.e., NVIDIA Jetsons) than the YOLOv7.
本文提出了一种基于深度学习的对象检测模型的基于域的迁移学习方法,该方法能够在资源受限的边缘设备上实现实时计算。物体检测是智能平台(如无人机、机器人和自动驾驶汽车)的一项基本任务。然而,由于资源不足,边缘设备无法运行庞大的目标检测模型。虽然压缩的深度学习模型提高了推理速度,但模型的准确性可能会显著下降。在本文中,我们提出了一种精确的目标检测方法,同时在边缘设备上实现实时计算。我们的方法旨在根据应用领域(如城市、公园、工厂等)减少模型的边际检测输出。我们对特定领域的关键对象(即行人,汽车,长凳等)进行分类,并采用迁移学习,其中学习仅针对选定的对象。这种方法提高了检测的准确性,即使是压缩的深度学习模型,比如微小版本的YOLO(你只看一次)框架。实验结果表明,该方法使YOLOv7-tiny在参数比YOLOv7模型少83%的情况下,仍能提供与YOLOv7模型相当的检测精度。此外,我们确认我们的方法在资源受限的边缘设备(即NVIDIA Jetsons)上的推理速度比YOLOv7快389%。
{"title":"Real-time object detection using a domain-based transfer learning method for resource-constrained edge devices","authors":"Dongkyu Kim, Seokjun Lee, Nak-Myoung Sung, Chungjae Choe","doi":"10.1109/ICAIIC57133.2023.10067064","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067064","url":null,"abstract":"This paper presents a domain-based transfer learning method for deep learning-based object detection models where the method enables real-time computation in resource-constrained edge devices. Object detection is an essential task for intelligent platforms (e.g., drones, robots, and autonomous vehicles). However, edge devices could not afford to run huge object detection models due to insufficient resources. Although a compressed deep learning model increases inference speed, the accuracy of the model could be significantly deteriorate. In this paper, we propose an accurate object detection method while achieving real-time computation on edge devices. Our method aims to reduce marginal detection outputs of models according to application domains (e.g., city, park, factory, etc). We classify crucial objects (i.e., pedestrian, car, bench, etc) for a specific domain and adopt a transfer learning in which the learning is solely towards the selected objects. Such approach improves detection accuracy even for a compressed deep learning model like tiny versions of a YOLO (you only look once) framework. From the experiments, we validate that the method empowers the YOLOv7-tiny can provide the comparable detection accuracy with a YOLOv7 model despite of 83% less parameters than that of the original model. Besides, we confirm that our method achieves 389% faster inference on resource-constrained edge devices (i.e., NVIDIA Jetsons) than the YOLOv7.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132589840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Indoor Space Flow Analysis Based on Deep Learning 基于深度学习的室内空间流分析
Chang Woo Choi, Hyo-eun Kang, Yoonyoung Hong, Yong Su Kim, Guem Bo Kim, Aji Teguh Prihatno, Jang Hyun Ji, Seungdo Hong, Ho Won Kim
It is essential to perform flow analysis in all spaces where people live. For example, designing the shape of the wing by analyzing the flow flowing through the wing of an airplane, or finding an appropriate air conditioner installation location by analyzing the flow according to the location of the air conditioner in the indoor space. In this study, we propose a deep learning model that performs real-time flow analysis assuming an indoor space that is relatively smaller than outdoor space. Computational Fluid Dynamics (CFD), a traditional method used for flow analysis, is not suitable for this task because it takes a long time to derive simulation results. Thus, the application of deep learning to flow analysis is considered in the present study because deep learning technology for physics, i.e., fluid mechanics and thermodynamics, can be applied to real spaces. We have constructed a deep learning model based on the TransUnet model that can learn data relationships and capture spatial information. Unlike the existing TransUnet model, our model contains a dense layer to reflect operating and spatial information. train and test data were collected using the ANSYS FLUENT commercial program. On 11 test data cases, the average R2 score between the actual and predicted value was 0.884, and the RMSE was 0.047, which are significant results. We used the image of the entire space as well as a cross-section to see how similar the predicted values were to the actual ones, Although a slight error occurred inside the space, It was confirmed that the flow tendency was accurately learned under the given operating conditions. Flow analysis through simulation based on existing numerical analysis methods requires a minimum of 8 hours for processing. However, our proposed deep learning model significantly reduces the time cost of flow analysis as it requires less than 3 seconds.
在人们居住的所有空间中进行流量分析是必不可少的。例如,通过分析飞机机翼的气流来设计机翼的形状,或者根据空调在室内空间中的位置,通过分析气流来找到合适的空调安装位置。在本研究中,我们提出了一个深度学习模型,该模型可以在室内空间相对小于室外空间的情况下进行实时流量分析。计算流体力学(CFD)是一种传统的流动分析方法,但由于计算流体力学需要较长时间才能得出模拟结果,因此不适合这项任务。因此,本研究考虑将深度学习应用于流动分析,因为物理,即流体力学和热力学的深度学习技术可以应用于实际空间。我们在TransUnet模型的基础上构建了一个深度学习模型,可以学习数据关系和捕获空间信息。与现有的TransUnet模型不同,我们的模型包含一个密集的层来反映操作和空间信息。利用ANSYS FLUENT商业软件采集训练和试验数据。在11个测试数据用例中,实际值与预测值的平均R2得分为0.884,RMSE为0.047,均为显著性结果。我们使用了整个空间的图像和一个横截面来观察预测值与实际值的相似程度,虽然在空间内部有轻微的误差,但证实了在给定的操作条件下,准确地学习了流动趋势。基于现有数值分析方法进行流场仿真分析至少需要8小时的处理时间。然而,我们提出的深度学习模型显著降低了流分析的时间成本,因为它只需要不到3秒的时间。
{"title":"Indoor Space Flow Analysis Based on Deep Learning","authors":"Chang Woo Choi, Hyo-eun Kang, Yoonyoung Hong, Yong Su Kim, Guem Bo Kim, Aji Teguh Prihatno, Jang Hyun Ji, Seungdo Hong, Ho Won Kim","doi":"10.1109/ICAIIC57133.2023.10067105","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067105","url":null,"abstract":"It is essential to perform flow analysis in all spaces where people live. For example, designing the shape of the wing by analyzing the flow flowing through the wing of an airplane, or finding an appropriate air conditioner installation location by analyzing the flow according to the location of the air conditioner in the indoor space. In this study, we propose a deep learning model that performs real-time flow analysis assuming an indoor space that is relatively smaller than outdoor space. Computational Fluid Dynamics (CFD), a traditional method used for flow analysis, is not suitable for this task because it takes a long time to derive simulation results. Thus, the application of deep learning to flow analysis is considered in the present study because deep learning technology for physics, i.e., fluid mechanics and thermodynamics, can be applied to real spaces. We have constructed a deep learning model based on the TransUnet model that can learn data relationships and capture spatial information. Unlike the existing TransUnet model, our model contains a dense layer to reflect operating and spatial information. train and test data were collected using the ANSYS FLUENT commercial program. On 11 test data cases, the average R2 score between the actual and predicted value was 0.884, and the RMSE was 0.047, which are significant results. We used the image of the entire space as well as a cross-section to see how similar the predicted values were to the actual ones, Although a slight error occurred inside the space, It was confirmed that the flow tendency was accurately learned under the given operating conditions. Flow analysis through simulation based on existing numerical analysis methods requires a minimum of 8 hours for processing. However, our proposed deep learning model significantly reduces the time cost of flow analysis as it requires less than 3 seconds.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128204180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Masked Face Images Based Gender Classification using Hybrid Bat Algorithm Optimized Bagging 基于混合蝙蝠算法的蒙面图像性别分类优化
Sabrina Adinda Sari, Wikky Fawwaz Al Maki
The face is one of the biometrics utilized to learn information from a person, such as gender. Gender classification study is expanding daily as a result of how important it is and how many other sectors, like forensics, security, business, and others, employ it. However, in order to protect themselves and stop the spread of Covid-19 during this epidemic, everyone must wear a face mask. Because many crucial facial features that help determine a person's gender are obscured by masks, using one creates an issue for the gender classification system. To obtain optimal performance outcomes, suitable hyperparameters are also required. As a result, the objective of this study is to develop a gender categorization system based on mask-covered faces utilizing a novel technique that combines several features in the Gray Level Co-occurrence Matrix (GLCM), which is then fed into the Bagging classifier.A Hybrid Bat Algorithm (HBA) is used to optimize the bagging hyperparameters. With 97% accuracy, precision, recall, and f1-score values, the suggested model is demonstrated to have greater performance than before the hyperparameters were tuned using HBA.
人脸是用来了解一个人的性别等信息的生物特征之一。由于性别分类研究的重要性以及许多其他部门(如法医、安全、商业和其他部门)使用它,性别分类研究每天都在扩大。然而,为了在疫情期间保护自己并阻止Covid-19的传播,每个人都必须戴口罩。因为许多有助于确定一个人性别的关键面部特征被面具掩盖了,使用面具会给性别分类系统带来问题。为了获得最佳的性能结果,还需要合适的超参数。因此,本研究的目的是利用一种结合灰度共生矩阵(GLCM)中的几个特征的新技术,开发一种基于蒙面人脸的性别分类系统,然后将其输入Bagging分类器。采用HBA (Hybrid Bat Algorithm)算法对装袋超参数进行优化。具有97%的准确度、精密度、召回率和f1得分值,所建议的模型被证明比使用HBA调优超参数之前具有更高的性能。
{"title":"Masked Face Images Based Gender Classification using Hybrid Bat Algorithm Optimized Bagging","authors":"Sabrina Adinda Sari, Wikky Fawwaz Al Maki","doi":"10.1109/ICAIIC57133.2023.10067008","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067008","url":null,"abstract":"The face is one of the biometrics utilized to learn information from a person, such as gender. Gender classification study is expanding daily as a result of how important it is and how many other sectors, like forensics, security, business, and others, employ it. However, in order to protect themselves and stop the spread of Covid-19 during this epidemic, everyone must wear a face mask. Because many crucial facial features that help determine a person's gender are obscured by masks, using one creates an issue for the gender classification system. To obtain optimal performance outcomes, suitable hyperparameters are also required. As a result, the objective of this study is to develop a gender categorization system based on mask-covered faces utilizing a novel technique that combines several features in the Gray Level Co-occurrence Matrix (GLCM), which is then fed into the Bagging classifier.A Hybrid Bat Algorithm (HBA) is used to optimize the bagging hyperparameters. With 97% accuracy, precision, recall, and f1-score values, the suggested model is demonstrated to have greater performance than before the hyperparameters were tuned using HBA.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133107622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
XGBoost Calibration Considering Feature Importance for Noninvasive HbA1c Estimation Using PPG Signals 考虑特征重要性的XGBoost校准使用PPG信号进行无创糖化血红蛋白估计
Mrinmoy Sarker Turja, Tae-Ho Kwon, Hyoungkeun Kim, Ki-Doo Kim
Diabetes has recently become a more serious disease. Almost every family has at least one diabetic. Patients have to regularly monitor their blood glucose levels, and using an invasive device on the other hand can be really painful and less reliable. This is because blood glucose levels fluctuate more with food intake. On the contrary, HbA1c level does not fluctuate as much as that of blood glucose. Therefore, in this study, XGBoost calibration considering only important features for Monte-Carlo simulation based noninvasive HbA1c estimation with PPG signals was proposed. After considering the important 13 of the 45 features, the model achieved a Pearson's r value of 98.90%.
糖尿病最近已成为一种更严重的疾病。几乎每个家庭都至少有一个糖尿病患者。患者必须定期监测他们的血糖水平,另一方面,使用侵入性设备可能真的很痛苦,而且不太可靠。这是因为血糖水平随食物摄入波动更大。相反,HbA1c水平的波动不像血糖那么大。因此,在本研究中,提出了仅考虑重要特征的XGBoost校准方法,用于基于蒙特卡罗模拟的基于PPG信号的无创HbA1c估计。在考虑了45个特征中的13个重要特征后,该模型的Pearson’s r值达到了98.90%。
{"title":"XGBoost Calibration Considering Feature Importance for Noninvasive HbA1c Estimation Using PPG Signals","authors":"Mrinmoy Sarker Turja, Tae-Ho Kwon, Hyoungkeun Kim, Ki-Doo Kim","doi":"10.1109/ICAIIC57133.2023.10067013","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067013","url":null,"abstract":"Diabetes has recently become a more serious disease. Almost every family has at least one diabetic. Patients have to regularly monitor their blood glucose levels, and using an invasive device on the other hand can be really painful and less reliable. This is because blood glucose levels fluctuate more with food intake. On the contrary, HbA1c level does not fluctuate as much as that of blood glucose. Therefore, in this study, XGBoost calibration considering only important features for Monte-Carlo simulation based noninvasive HbA1c estimation with PPG signals was proposed. After considering the important 13 of the 45 features, the model achieved a Pearson's r value of 98.90%.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124425211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI in Classroom: Group Score Prediction System 课堂上的AI:小组成绩预测系统
Yeoun Chan Kim, Pankaj Agarwal
Knowledge tracing and learning path optimization are active research fields in education with AI technologies. The purpose of knowledge tracing is to model student's knowledge state of a concept and to predict the percentage of correctly answer a next question. Using the technology of modeling a student's knowledge state, learning path optimization technologies recommend personalized learning path for efficient learning. These two research fields are implemented on learning management systems for individual learning. In this research paper, method of using knowledge tracing and learning path optimization in group learning environment is suggested. Group score prediction model predicts number of students who will answer their next question correctly by utilizing one-dimensional convolution neural network and fully connected layers. The model is adopted in a group score prediction system where instructors utilize the model's output to create a question set corresponding to their strategy and students' responses are used to re-train and evaluate the model.
知识追踪和学习路径优化是人工智能教育领域的研究热点。知识追踪的目的是模拟学生对一个概念的知识状态,并预测正确回答下一个问题的百分比。学习路径优化技术通过对学生的知识状态进行建模,为学生推荐个性化的学习路径,实现高效的学习。这两个研究领域都是在个人学习的学习管理系统上实现的。本文提出了在群体学习环境中运用知识跟踪和学习路径优化的方法。小组分数预测模型利用一维卷积神经网络和全连接层来预测正确回答下一题的学生人数。该模型被用于一个小组分数预测系统,教师利用模型的输出来创建一个与他们的策略相对应的问题集,并使用学生的回答来重新训练和评估模型。
{"title":"AI in Classroom: Group Score Prediction System","authors":"Yeoun Chan Kim, Pankaj Agarwal","doi":"10.1109/ICAIIC57133.2023.10067066","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10067066","url":null,"abstract":"Knowledge tracing and learning path optimization are active research fields in education with AI technologies. The purpose of knowledge tracing is to model student's knowledge state of a concept and to predict the percentage of correctly answer a next question. Using the technology of modeling a student's knowledge state, learning path optimization technologies recommend personalized learning path for efficient learning. These two research fields are implemented on learning management systems for individual learning. In this research paper, method of using knowledge tracing and learning path optimization in group learning environment is suggested. Group score prediction model predicts number of students who will answer their next question correctly by utilizing one-dimensional convolution neural network and fully connected layers. The model is adopted in a group score prediction system where instructors utilize the model's output to create a question set corresponding to their strategy and students' responses are used to re-train and evaluate the model.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121450683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting Human Activity with LSTM Face Detection on Server Surveillance System 服务器监控系统中LSTM人脸检测预测人类活动
Alexander Nurenie, Y. Heryadi, Lukas, W. Suparta, Yulyani Arifin
Surveillance server technology was growth with new technology, effective, extra new features, human friendly, and human deals with big amount data, can't view and collect the data in the short time, and took time to analyze, playback video/picture to determine machine, human, vehicle or environment issues or performance, Surveillance Server Systems now which has the ability to face recognition, face detection, human detection, motion detection, license plate recognition, The authors perform this study that still new this research has never been done before to determine the efficacy of the LSTM in predicting human behavior (Long Short Term Memory) Face Detection on Server surveillance system, by taking log view data with a total of 91501 Face detection data downloaded from 10/18/2022~11/9/2022, the data will be processed using Python programming and training so that it can be used to predict the future regarding human activities that vary utilizing time series prediction LSTM include the number of daily activities, the highest and lowest numbers of days, and the maximum and minimum numbers of days. from the results of this study it was found to help to find out the days with the lowest number of humans and the days with the highest number of human activities, so that the owner can predict with sequence of the data the service would be provided when human activity is high in certain area or certain day, it can also can find out the maximum or minimum amount human counting day by day, and compare able some different date and location, the author will continue to do more in-depth research the others data related with prediction with deep learning server surveillance machine system interaction with human, vehicle behavior in the future studies.
监控服务器技术是随着新技术的发展而发展起来的,有效的,额外的新功能,人性化,人类处理大数据,不能在短时间内查看和收集数据,并且需要时间来分析,播放视频/图片以确定机器,人,车辆或环境的问题或性能,监控服务器系统现在具有人脸识别,人脸检测,人体检测,运动检测,车牌识别,为了确定LSTM在服务器监控系统上预测人类行为(长短期记忆)人脸检测方面的有效性,作者进行了一项新的研究,该研究采用了从2022年10月18日至2022年11月9日下载的91501个人脸检测数据的日志视图数据。数据将使用Python编程和训练进行处理,以便它可以用于预测人类活动的未来,利用时间序列预测LSTM包括日常活动的数量,最高和最低天数,以及最大和最小天数。从这项研究的结果被发现有助于找到最低的天的人类和天数最多的人类活动,以便业主能够预测的序列数据所提供的服务将是当人类活动是在特定的区域或特定的一天,它也可以找到人类最大或最小数量计算,并比较不同的日期和地点,在未来的研究中,作者将继续对其他与预测相关的数据进行更深入的研究,这些数据与深度学习服务器监控机系统与人、车辆的交互行为有关。
{"title":"Predicting Human Activity with LSTM Face Detection on Server Surveillance System","authors":"Alexander Nurenie, Y. Heryadi, Lukas, W. Suparta, Yulyani Arifin","doi":"10.1109/ICAIIC57133.2023.10066981","DOIUrl":"https://doi.org/10.1109/ICAIIC57133.2023.10066981","url":null,"abstract":"Surveillance server technology was growth with new technology, effective, extra new features, human friendly, and human deals with big amount data, can't view and collect the data in the short time, and took time to analyze, playback video/picture to determine machine, human, vehicle or environment issues or performance, Surveillance Server Systems now which has the ability to face recognition, face detection, human detection, motion detection, license plate recognition, The authors perform this study that still new this research has never been done before to determine the efficacy of the LSTM in predicting human behavior (Long Short Term Memory) Face Detection on Server surveillance system, by taking log view data with a total of 91501 Face detection data downloaded from 10/18/2022~11/9/2022, the data will be processed using Python programming and training so that it can be used to predict the future regarding human activities that vary utilizing time series prediction LSTM include the number of daily activities, the highest and lowest numbers of days, and the maximum and minimum numbers of days. from the results of this study it was found to help to find out the days with the lowest number of humans and the days with the highest number of human activities, so that the owner can predict with sequence of the data the service would be provided when human activity is high in certain area or certain day, it can also can find out the maximum or minimum amount human counting day by day, and compare able some different date and location, the author will continue to do more in-depth research the others data related with prediction with deep learning server surveillance machine system interaction with human, vehicle behavior in the future studies.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128621247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1