EAI Endorsed Transactions on Industrial Networks and Intelligent Systems最新文献

ViMedNER: A Medical Named Entity Recognition Dataset for Vietnamese ViMedNER：越南语医疗命名实体识别数据集

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-07-11 DOI: 10.4108/eetinis.v11i3.5221

Pham Van Duong, T. Trinh, Minh-Tien Nguyen, Huy-The Vu, Minh Chuan Pham, Tran Manh Tuan, Le Hoang Son

Named entity recognition (NER) is one of the most important tasks in natural language processing, which identifies entity boundaries and classifies them into pre-defined categories. In literature, NER systems have been developed for various languages but limited works have been conducted for Vietnamese. This mainly comes from the limitation of available and high-quality annotated data, especially for specific domains such as medicine and healthcare. In this paper, we introduce a new medical NER dataset, named ViMedNER, for recognizing Vietnamese medical entities. Unlike existing works designed for common or too-specific entities, we focus on entity types that can be used in common diagnostic and treatment scenarios, including disease names, the symptoms of the diseases, the cause of the diseases, the diagnostic, and the treatment. These entities facilitate the diagnosis and treatment of doctors for common diseases. Our dataset is collected from four well-known Vietnamese websites that are professional in terms of drag selling and disease diagnostics and annotated by domain experts with high agreement scores. To create benchmark results, strong NER baselines based on pre-trained language models including PhoBERT, XLM-R, ViDeBERTa, ViPubMedDeBERTa, and ViHealthBERT are implemented and evaluated on the dataset. Experiment results show that the performance of XLM-R is consistently better than that of the other pre-trained language models. Furthermore, additional experiments are conducted to explore the behavior of the baselines and the characteristics of our dataset.

命名实体识别（NER）是自然语言处理中最重要的任务之一，它能识别实体边界并将其归入预定义的类别。在文献中，NER 系统已针对多种语言进行了开发，但针对越南语的工作还很有限。这主要是由于可用的高质量注释数据有限，尤其是在医学和医疗保健等特定领域。在本文中，我们介绍了一个新的医疗 NER 数据集，名为 ViMedNER，用于识别越南语医疗实体。与针对常见或过于特殊的实体设计的现有作品不同，我们专注于可用于常见诊断和治疗场景的实体类型，包括疾病名称、疾病症状、病因、诊断和治疗。这些实体有助于医生对常见疾病进行诊断和治疗。我们的数据集收集自四个知名的越南网站，这些网站在拖动销售和疾病诊断方面都很专业，并由领域专家注释，具有较高的一致性得分。为了创建基准结果，我们基于预先训练的语言模型（包括 PhoBERT、XLM-R、ViDeBERTa、ViPubMedDeBERTa 和 ViHealthBERT）实现了强大的 NER 基线，并在数据集上进行了评估。实验结果表明，XLM-R 的性能始终优于其他预训练语言模型。此外，我们还进行了其他实验，以探索基线的行为和我们数据集的特点。

{"title":"ViMedNER: A Medical Named Entity Recognition Dataset for Vietnamese","authors":"Pham Van Duong, T. Trinh, Minh-Tien Nguyen, Huy-The Vu, Minh Chuan Pham, Tran Manh Tuan, Le Hoang Son","doi":"10.4108/eetinis.v11i3.5221","DOIUrl":"https://doi.org/10.4108/eetinis.v11i3.5221","url":null,"abstract":"Named entity recognition (NER) is one of the most important tasks in natural language processing, which identifies entity boundaries and classifies them into pre-defined categories. In literature, NER systems have been developed for various languages but limited works have been conducted for Vietnamese. This mainly comes from the limitation of available and high-quality annotated data, especially for specific domains such as medicine and healthcare. In this paper, we introduce a new medical NER dataset, named ViMedNER, for recognizing Vietnamese medical entities. Unlike existing works designed for common or too-specific entities, we focus on entity types that can be used in common diagnostic and treatment scenarios, including disease names, the symptoms of the diseases, the cause of the diseases, the diagnostic, and the treatment. These entities facilitate the diagnosis and treatment of doctors for common diseases. Our dataset is collected from four well-known Vietnamese websites that are professional in terms of drag selling and disease diagnostics and annotated by domain experts with high agreement scores. To create benchmark results, strong NER baselines based on pre-trained language models including PhoBERT, XLM-R, ViDeBERTa, ViPubMedDeBERTa, and ViHealthBERT are implemented and evaluated on the dataset. Experiment results show that the performance of XLM-R is consistently better than that of the other pre-trained language models. Furthermore, additional experiments are conducted to explore the behavior of the baselines and the characteristics of our dataset.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"140 49","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141656122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Distributed Spatially Non-Stationary Channel Estimation for Extremely-Large Antenna Systems 超大型天线系统的分布式空间非静态信道估计

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-05-06 DOI: 10.4108/eetinis.v11i3.5992

Yanqing Xu, Shuai Wang, Ruihong Jiang, Zhou Wang

This paper aims to develop a distributed channel estimation (CE) algorithm for spatially non-stationary (SNS) channels in extremely large aperture array systems, addressing the issues of high communication cost and computational complexity associated with traditional centralized algorithms. However, SNS channels differ from conventional spatially stationary channels, presenting new challenges such as varying sparsity patterns for different antennas. To overcome these challenges, we propose a novel distributed CE algorithm accompanied by a simple yet effective hard thresholding scheme. The proposed algorithm is not only suitable for uniform antenna arrays but also for irregularly deployed antennas. Simulation results demonstrate the advantages of the proposed algorithm in terms of estimation accuracy, communication cost, and computational complexity.

本文旨在为超大孔径阵列系统中的空间非静止（SNS）信道开发一种分布式信道估计（CE）算法，以解决与传统集中式算法相关的高通信成本和计算复杂性问题。然而，SNS 信道不同于传统的空间静止信道，它带来了新的挑战，例如不同天线的稀疏性模式各不相同。为了克服这些挑战，我们提出了一种新型分布式 CE 算法，并辅以简单有效的硬阈值方案。所提出的算法不仅适用于均匀天线阵列，也适用于不规则部署的天线。仿真结果证明了所提算法在估计精度、通信成本和计算复杂度方面的优势。

引用次数: 0

On the Performance of the Relay Selection in Multi-hop Cluster-based Wireless Networks with Multiple Eavesdroppers Under Equally Correlated Rayleigh Fading 论具有多个窃听者的多跳集群式无线网络在等相关瑞利衰减条件下的中继选择性能

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-05-02 DOI: 10.4108/eetinis.v11i3.4728

Pham Minh Nam, Phong Ngo Dinh, Nguyen Luong Nhat, Tu Lam-Thanh, T. Le-Tien

The performance of multi-hop cluster-based wireless networks under multiple eavesdroppers is investigated in the present work. More precisely, we derive the outage probability (OP) of the considered networks under two relay selection schemes: the channel-gain-based scheme and the random scheme. Although equally correlated Rayleigh fading is taken into consideration, the derived mathematical framework remains tractable. Specifically, we represent the exact expression of the OP under the channel-based scheme in series form, while the OP under the random scheme is computed in a closed-form expression. Additionally, we propose a novel power allocation for each transmitter that strictly satisfies the given intercept probability. Numerical results based on the Monte Carlo method are provided to verify the correctness of the derived framework. These results are also used to identify the influences of various parameters, such as the number of clusters, the number of relays per cluster, and the transmit power.

本研究探讨了基于集群的多跳无线网络在多窃听器情况下的性能。更确切地说，我们推导了所考虑的网络在两种中继选择方案（基于信道增益的方案和随机方案）下的中断概率（OP）。虽然考虑到了等相关的瑞利衰落，但推导出的数学框架仍然是可行的。具体来说，我们用串联形式表示了基于信道的方案下 OP 的精确表达式，而随机方案下的 OP 则用闭式表达式计算。此外，我们还为每个发射机提出了严格满足给定截获概率的新型功率分配方案。我们提供了基于蒙特卡罗方法的数值结果，以验证推导框架的正确性。这些结果还用于确定各种参数的影响，如群集数、每个群集的中继数和发射功率。

{"title":"On the Performance of the Relay Selection in Multi-hop Cluster-based Wireless Networks with Multiple Eavesdroppers Under Equally Correlated Rayleigh Fading","authors":"Pham Minh Nam, Phong Ngo Dinh, Nguyen Luong Nhat, Tu Lam-Thanh, T. Le-Tien","doi":"10.4108/eetinis.v11i3.4728","DOIUrl":"https://doi.org/10.4108/eetinis.v11i3.4728","url":null,"abstract":"The performance of multi-hop cluster-based wireless networks under multiple eavesdroppers is investigated in the present work. More precisely, we derive the outage probability (OP) of the considered networks under two relay selection schemes: the channel-gain-based scheme and the random scheme. Although equally correlated Rayleigh fading is taken into consideration, the derived mathematical framework remains tractable. Specifically, we represent the exact expression of the OP under the channel-based scheme in series form, while the OP under the random scheme is computed in a closed-form expression. Additionally, we propose a novel power allocation for each transmitter that strictly satisfies the given intercept probability. Numerical results based on the Monte Carlo method are provided to verify the correctness of the derived framework. These results are also used to identify the influences of various parameters, such as the number of clusters, the number of relays per cluster, and the transmit power.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"6 11","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141020856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Real-time Single-Channel EOG removal based on Empirical Mode Decomposition 基于经验模式分解的实时单通道 EOG 移除

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-04-08 DOI: 10.4108/eetinis.v11i2.4593

Kien Nguyen Trong, Nhat Nguyen Luong, Hanh Tan, Duy Tran Trung, Huong Ha Thi Thanh, Duy Pham The, Binh Nguyen Thanh

In recent years, single-channel physiological recordings have gained popularity in portable health devices and research settings due to their convenience. However, the presence of electrooculogram (EOG) artifacts can significantly degrade the quality of the recorded data, impacting the accuracy of essential signal features. Consequently, artifact removal from physiological signals is a crucial step in signal processing pipelines. Current techniques often employ Independent Component Analysis (ICA) to efficiently separate signal and artifact sources in multichannel recordings. However, limitations arise when dealing with single or a few channel measurements in minimal instrumentation or portable devices, restricting the utility of ICA. To address this challenge, this paper introduces an innovative artifact removal algorithm utilizing enhanced empirical mode decomposition to extract the intrinsic mode functions (IMFs). Subsequently, the algorithm targets the removal of segments related to EOG by isolating them within these IMFs. The proposed method is compared with existing single-channel EEG artifact removal algorithms, demonstrating superior performance. The findings demonstrate the effectiveness of our approach in isolating artifact components, resulting in a reconstructed signal characterized by a strong correlation and a power spectrum closely resembling the ground-truth EEG signal. This outperforms the existing methods in terms of artifact removal. Additionally, the proposed algorithm exhibits significantly reduced execution time, enabling real-time online analysis.

近年来，单通道生理记录因其便捷性在便携式健康设备和研究环境中越来越受欢迎。然而，电图（EOG）伪像的存在会大大降低记录数据的质量，影响基本信号特征的准确性。因此，去除生理信号中的伪影是信号处理管道中的关键步骤。目前的技术通常采用独立分量分析（ICA）来有效分离多通道记录中的信号源和伪像源。然而，在处理最小仪器或便携式设备中的单通道或少数通道测量时会出现限制，从而限制了 ICA 的实用性。为了应对这一挑战，本文介绍了一种创新的人工痕迹去除算法，利用增强的经验模式分解来提取本征模式函数（IMF）。随后，该算法通过在这些 IMFs 中隔离与 EOG 相关的片段，从而去除这些片段。我们将所提出的方法与现有的单通道脑电图伪像去除算法进行了比较，结果显示该方法性能优越。研究结果表明，我们的方法能有效地分离伪像成分，重建的信号具有很强的相关性，功率谱与地面真实脑电信号非常相似。这在去除伪像方面优于现有方法。此外，所提出的算法大大缩短了执行时间，实现了实时在线分析。

{"title":"Real-time Single-Channel EOG removal based on Empirical Mode Decomposition","authors":"Kien Nguyen Trong, Nhat Nguyen Luong, Hanh Tan, Duy Tran Trung, Huong Ha Thi Thanh, Duy Pham The, Binh Nguyen Thanh","doi":"10.4108/eetinis.v11i2.4593","DOIUrl":"https://doi.org/10.4108/eetinis.v11i2.4593","url":null,"abstract":"In recent years, single-channel physiological recordings have gained popularity in portable health devices and research settings due to their convenience. However, the presence of electrooculogram (EOG) artifacts can significantly degrade the quality of the recorded data, impacting the accuracy of essential signal features. Consequently, artifact removal from physiological signals is a crucial step in signal processing pipelines. Current techniques often employ Independent Component Analysis (ICA) to efficiently separate signal and artifact sources in multichannel recordings. However, limitations arise when dealing with single or a few channel measurements in minimal instrumentation or portable devices, restricting the utility of ICA. To address this challenge, this paper introduces an innovative artifact removal algorithm utilizing enhanced empirical mode decomposition to extract the intrinsic mode functions (IMFs). Subsequently, the algorithm targets the removal of segments related to EOG by isolating them within these IMFs. The proposed method is compared with existing single-channel EEG artifact removal algorithms, demonstrating superior performance. The findings demonstrate the effectiveness of our approach in isolating artifact components, resulting in a reconstructed signal characterized by a strong correlation and a power spectrum closely resembling the ground-truth EEG signal. This outperforms the existing methods in terms of artifact removal. Additionally, the proposed algorithm exhibits significantly reduced execution time, enabling real-time online analysis.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"112 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140731868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Improving Performance of the Typical User in the Indoor Cooperative NOMA Millimeter Wave Networks with Presence of Walls 提高有墙室内合作 NOMA 毫米波网络中典型用户的性能

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-04-08 DOI: 10.4108/eetinis.v11i2.5156

S. Lam, Xuan Nam Tran

INTRODUCTION: The beyond 5G millimeter wave cellular network system is expecting to provide the high quality of service in indoor areas. OBJECTIVES: Due to the high density of obstacles, the cooperative communication technique is employed to improve the user's desired signal power by finding more than one appropriate station to serve that user. METHODS: While the conventional system utilizes additional equipment such as Reconfigurable Intelligent Surfaces (RIS) and relays to enable the cooperative features, the paper introduces a new network paradigm that utilizes the second nearest Base Station (BS) of the typical user as the Decode and Forward (DF) relay. Thus, depends on the success of decoding the message from the user' serving BS of the second nearest BS, the typical user can work with and without assistance from the relay whose operation follows the discipline of the power-domain NOMA technique. In the case of with relay assistance, the Maximum Ratio Combining technique is utilized by the typical user to combine the desired signals. RESULTS: To examine the performance of the proposed system, the Nakagami-m and the newly developed path loss model, which considers the density of walls and their properties, are adopted to derive the coverage probability of the user with and without relay assistance. The closed-form expressions of this performance metric are derived by Gauss quadrature and Welch-Satterthwaite approximation.

简介：超越 5G 的毫米波蜂窝网络系统有望在室内区域提供高质量服务。目标：由于障碍物密度高，因此需要采用合作通信技术，通过寻找多个合适的站点为用户提供服务，从而提高用户所需的信号功率。方法：传统系统利用可重构智能表面（RIS）和中继器等额外设备来实现合作功能，而本文则引入了一种新的网络范例，利用离典型用户最近的第二个基站（BS）作为解码和转发（DF）中继器。因此，典型用户可以在有中继协助或无中继协助的情况下工作，这取决于用户服务的第二最近基站能否成功解码信息。在有中继协助的情况下，典型用户利用最大比率组合技术来组合所需的信号。结果：为了检验拟议系统的性能，我们采用了 Nakagami-m 模型和新开发的路径损耗模型（该模型考虑了墙体密度及其特性）来推导有中继协助和无中继协助时用户的覆盖概率。通过高斯正交和韦尔奇-萨特斯韦特近似法得出了这一性能指标的闭式表达式。

{"title":"Improving Performance of the Typical User in the Indoor Cooperative NOMA Millimeter Wave Networks with Presence of Walls","authors":"S. Lam, Xuan Nam Tran","doi":"10.4108/eetinis.v11i2.5156","DOIUrl":"https://doi.org/10.4108/eetinis.v11i2.5156","url":null,"abstract":"INTRODUCTION: The beyond 5G millimeter wave cellular network system is expecting to provide the high quality of service in indoor areas. OBJECTIVES: Due to the high density of obstacles, the cooperative communication technique is employed to improve the user's desired signal power by finding more than one appropriate station to serve that user. METHODS: While the conventional system utilizes additional equipment such as Reconfigurable Intelligent Surfaces (RIS) and relays to enable the cooperative features, the paper introduces a new network paradigm that utilizes the second nearest Base Station (BS) of the typical user as the Decode and Forward (DF) relay. Thus, depends on the success of decoding the message from the user' serving BS of the second nearest BS, the typical user can work with and without assistance from the relay whose operation follows the discipline of the power-domain NOMA technique. In the case of with relay assistance, the Maximum Ratio Combining technique is utilized by the typical user to combine the desired signals. RESULTS: To examine the performance of the proposed system, the Nakagami-m and the newly developed path loss model, which considers the density of walls and their properties, are adopted to derive the coverage probability of the user with and without relay assistance. The closed-form expressions of this performance metric are derived by Gauss quadrature and Welch-Satterthwaite approximation.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"85 S60","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140731650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Facial mask-wearing prediction and adaptive gender classification using convolutional neural networks 利用卷积神经网络进行戴面具预测和自适应性别分类

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-03-13 DOI: 10.4108/eetinis.v11i2.4318

Mohamed Oulad-Kaddour, Hamid Haddadou, D. Palacios-Alonso, C. Conde, E. Cabello

The world has lived an exceptional time period caused by the Coronavirus pandemic. To limit Covid-19 propagation, governments required people to wear a facial mask outside. In facial data analysis, mask-wearing on the human face creates predominant occlusion hiding the important oral region and causing more challenges for human face recognition and categorisation. The appropriation of existing solutions by taking into consideration the masked context is indispensable for researchers. In this paper, we propose an approach for mask-wearing prediction and adaptive facial human-gender classification. The proposed approach is based on convolutional neural networks (CNNs). Both mask-wearing and gender information are crucial for various possible applications. Experimentation shows that mask-wearing is very well detectable by using CNNs and justifies its use as a prepossessing step. It also shows that retraining with masked faces is indispensable to keep up gender classification performances. In addition, experimentation proclaims that in a controlled face-pose with acceptable image quality' context, the gender attribute remains well detectable. Finally, we show empirically that the adaptive proposed approach improves global performance for gender prediction in a mixed context.

冠状病毒大流行给世界带来了一段特殊的时期。为了限制 Covid-19 的传播，各国政府要求人们外出时佩戴口罩。在人脸数据分析中，戴面具的人脸会遮挡重要的口腔区域，给人脸识别和分类带来更多挑战。考虑到戴面具的情况，对现有解决方案进行改进是研究人员不可或缺的工作。在本文中，我们提出了一种用于戴面具预测和自适应人脸性别分类的方法。该方法基于卷积神经网络（CNN）。戴面具和性别信息对于各种可能的应用都至关重要。实验表明，使用卷积神经网络可以很好地检测戴面具的情况，这也证明了将其作为前置步骤的合理性。实验还表明，要保持性别分类的性能，使用戴面具的人脸进行再训练是必不可少的。此外，实验证明，在可控的人脸姿态和可接受的图像质量背景下，性别属性仍然可以很好地检测出来。最后，我们通过实证证明，所提出的自适应方法提高了混合背景下性别预测的整体性能。

{"title":"Facial mask-wearing prediction and adaptive gender classification using convolutional neural networks","authors":"Mohamed Oulad-Kaddour, Hamid Haddadou, D. Palacios-Alonso, C. Conde, E. Cabello","doi":"10.4108/eetinis.v11i2.4318","DOIUrl":"https://doi.org/10.4108/eetinis.v11i2.4318","url":null,"abstract":"The world has lived an exceptional time period caused by the Coronavirus pandemic. To limit Covid-19 propagation, governments required people to wear a facial mask outside. In facial data analysis, mask-wearing on the human face creates predominant occlusion hiding the important oral region and causing more challenges for human face recognition and categorisation. The appropriation of existing solutions by taking into consideration the masked context is indispensable for researchers. In this paper, we propose an approach for mask-wearing prediction and adaptive facial human-gender classification. The proposed approach is based on convolutional neural networks (CNNs). Both mask-wearing and gender information are crucial for various possible applications. Experimentation shows that mask-wearing is very well detectable by using CNNs and justifies its use as a prepossessing step. It also shows that retraining with masked faces is indispensable to keep up gender classification performances. In addition, experimentation proclaims that in a controlled face-pose with acceptable image quality' context, the gender attribute remains well detectable. Finally, we show empirically that the adaptive proposed approach improves global performance for gender prediction in a mixed context.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"53 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140247678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Vehicle Type Classification with Small Dataset and Transfer Learning Techniques 利用小型数据集和迁移学习技术进行车辆类型分类

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-03-07 DOI: 10.4108/eetinis.v11i2.4678

Quang-Tu Pham, Dinh-Dat Pham, Khanh-Ly Can, Hieu Dao To, Hoang-Dieu Vu

This study delves into the application of deep learning training techniques using a restricted dataset, encompassing around 400 vehicle images sourced from Kaggle. Faced with the challenges of limited data, the impracticality of training models from scratch becomes apparent, advocating instead for the utilization of pre-trained models with pre-trained weights. The investigation considers three prominent models—EfficientNetB0, ResNetB0, and MobileNetV2—with EfficientNetB0 emerging as the most proficient choice. Employing the gradually unfreeze layer technique over a specified number of epochs, EfficientNetB0 exhibits remarkable accuracy, reaching 99.5% on the training dataset and 97% on the validation dataset. In contrast, training models from scratch results in notably lower accuracy. In this context, knowledge distillation proves pivotal, overcoming this limitation and significantly improving accuracy from 29.5% in training and 20.5% in validation to 54% and 45%, respectively. This study uniquely contributes by exploring transfer learning with gradually unfreeze layers and elucidates the potential of knowledge distillation. It highlights their effectiveness in robustly enhancing model performance under data scarcity, thus addressing challenges associated with training deep learning models on limited datasets. The findings underscore the practical significance of these techniques in achieving superior results when confronted with data constraints in real-world scenarios

本研究深入探讨了深度学习训练技术在有限数据集上的应用，该数据集包含来自 Kaggle 的约 400 张车辆图像。面对有限数据带来的挑战，从头开始训练模型的不切实际性变得显而易见，因此我们主张使用预先训练好的模型和预先训练好的权重。本研究考虑了三种著名的模型--EfficientNetB0、ResNetB0 和 MobileNetV2，其中 EfficientNetB0 是最熟练的选择。EfficientNetB0 采用了在指定的epoch次数内逐步解冻层的技术，表现出了卓越的准确性，在训练数据集上达到了 99.5%，在验证数据集上达到了 97%。相比之下，从头开始训练模型的准确率明显较低。在这种情况下，知识提炼被证明是至关重要的，它克服了这一局限性，并显著提高了准确率，从训练数据集的 29.5% 和验证数据集的 20.5% 分别提高到 54% 和 45%。本研究通过探索逐步解冻层的迁移学习，阐明了知识蒸馏的潜力，从而做出了独特的贡献。研究强调了它们在数据稀缺情况下稳健提高模型性能的有效性，从而解决了在有限数据集上训练深度学习模型所面临的挑战。研究结果强调了这些技术在现实世界场景中面临数据限制时取得优异结果的实际意义。

{"title":"Vehicle Type Classification with Small Dataset and Transfer Learning Techniques","authors":"Quang-Tu Pham, Dinh-Dat Pham, Khanh-Ly Can, Hieu Dao To, Hoang-Dieu Vu","doi":"10.4108/eetinis.v11i2.4678","DOIUrl":"https://doi.org/10.4108/eetinis.v11i2.4678","url":null,"abstract":"This study delves into the application of deep learning training techniques using a restricted dataset, encompassing around 400 vehicle images sourced from Kaggle. Faced with the challenges of limited data, the impracticality of training models from scratch becomes apparent, advocating instead for the utilization of pre-trained models with pre-trained weights. The investigation considers three prominent models—EfficientNetB0, ResNetB0, and MobileNetV2—with EfficientNetB0 emerging as the most proficient choice. Employing the gradually unfreeze layer technique over a specified number of epochs, EfficientNetB0 exhibits remarkable accuracy, reaching 99.5% on the training dataset and 97% on the validation dataset. In contrast, training models from scratch results in notably lower accuracy. In this context, knowledge distillation proves pivotal, overcoming this limitation and significantly improving accuracy from 29.5% in training and 20.5% in validation to 54% and 45%, respectively. This study uniquely contributes by exploring transfer learning with gradually unfreeze layers and elucidates the potential of knowledge distillation. It highlights their effectiveness in robustly enhancing model performance under data scarcity, thus addressing challenges associated with training deep learning models on limited datasets. The findings underscore the practical significance of these techniques in achieving superior results when confronted with data constraints in real-world scenarios","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"80 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140077575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cyberbullying Text Identification based on Deep Learning and Transformer-based Language Models 基于深度学习和变换器语言模型的网络欺凌文本识别

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-02-22 DOI: 10.4108/eetinis.v11i1.4703

Khalid Saifullah, Muhammad Ibrahim Khan, Suhaima Jamal, Iqbal H. Sarker

In the contemporary digital age, social media platforms like Facebook, Twitter, and YouTube serve as vital channels for individuals to express ideas and connect with others. Despite fostering increased connectivity, these platforms have inadvertently given rise to negative behaviors, particularly cyberbullying. While extensive research has been conducted on high-resource languages such as English, there is a notable scarcity of resources for low-resource languages like Bengali, Arabic, Tamil, etc., particularly in terms of language modeling. This study addresses this gap by developing a cyberbullying text identification system called BullyFilterNeT tailored for social media texts, considering Bengali as a test case. The intelligent BullyFilterNeT system devised overcomes Out-of-Vocabulary (OOV) challenges associated with non-contextual embeddings and addresses the limitations of context-aware feature representations. To facilitate a comprehensive understanding, three non-contextual embedding models GloVe, FastText, and Word2Vec are developed for feature extraction in Bengali. These embedding models are utilized in the classification models, employing three statistical models (SVM, SGD, Libsvm), and four deep learning models (CNN, VDCNN, LSTM, GRU). Additionally, the study employs six transformer-based language models: mBERT, bELECTRA, IndicBERT, XML-RoBERTa, DistilBERT, and BanglaBERT, respectively to overcome the limitations of earlier models. Remarkably, BanglaBERT-based BullyFilterNeT achieves the highest accuracy of 88.04% in our test set, underscoring its effectiveness in cyberbullying text identification in the Bengali language.

在当代数字时代，Facebook、Twitter 和 YouTube 等社交媒体平台成为个人表达想法和与他人联系的重要渠道。尽管这些平台促进了更多的联系，但也在无意中引发了负面行为，尤其是网络欺凌。虽然针对英语等高资源语言开展了大量研究，但针对孟加拉语、阿拉伯语、泰米尔语等低资源语言的资源却明显匮乏，尤其是在语言建模方面。本研究以孟加拉语为测试案例，开发了一个专为社交媒体文本定制的名为 BullyFilterNeT 的网络欺凌文本识别系统，从而填补了这一空白。所设计的智能 BullyFilterNeT 系统克服了与非上下文嵌入相关的词汇缺失（OOV）难题，并解决了上下文感知特征表征的局限性。为了便于全面理解，开发了三种非上下文嵌入模型 GloVe、FastText 和 Word2Vec，用于孟加拉语的特征提取。这些嵌入模型被用于分类模型中，采用了三种统计模型（SVM、SGD、Libsvm）和四种深度学习模型（CNN、VDCNN、LSTM、GRU）。此外，研究还采用了六种基于转换器的语言模型：mBERT、bELECTRA、IndicBERT、XML-RoBERTa、DistilBERT 和 BanglaBERT，以克服早期模型的局限性。值得注意的是，基于 BanglaBERT 的 BullyFilterNeT 在我们的测试集中达到了 88.04% 的最高准确率，凸显了其在孟加拉语网络欺凌文本识别中的有效性。

{"title":"Cyberbullying Text Identification based on Deep Learning and Transformer-based Language Models","authors":"Khalid Saifullah, Muhammad Ibrahim Khan, Suhaima Jamal, Iqbal H. Sarker","doi":"10.4108/eetinis.v11i1.4703","DOIUrl":"https://doi.org/10.4108/eetinis.v11i1.4703","url":null,"abstract":"In the contemporary digital age, social media platforms like Facebook, Twitter, and YouTube serve as vital channels for individuals to express ideas and connect with others. Despite fostering increased connectivity, these platforms have inadvertently given rise to negative behaviors, particularly cyberbullying. While extensive research has been conducted on high-resource languages such as English, there is a notable scarcity of resources for low-resource languages like Bengali, Arabic, Tamil, etc., particularly in terms of language modeling. This study addresses this gap by developing a cyberbullying text identification system called BullyFilterNeT tailored for social media texts, considering Bengali as a test case. The intelligent BullyFilterNeT system devised overcomes Out-of-Vocabulary (OOV) challenges associated with non-contextual embeddings and addresses the limitations of context-aware feature representations. To facilitate a comprehensive understanding, three non-contextual embedding models GloVe, FastText, and Word2Vec are developed for feature extraction in Bengali. These embedding models are utilized in the classification models, employing three statistical models (SVM, SGD, Libsvm), and four deep learning models (CNN, VDCNN, LSTM, GRU). Additionally, the study employs six transformer-based language models: mBERT, bELECTRA, IndicBERT, XML-RoBERTa, DistilBERT, and BanglaBERT, respectively to overcome the limitations of earlier models. Remarkably, BanglaBERT-based BullyFilterNeT achieves the highest accuracy of 88.04% in our test set, underscoring its effectiveness in cyberbullying text identification in the Bengali language.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"14 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140440847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Performance Analysis for Reconfigurable Intelligent Surface-aided Communication Systems with Energy Harvesting under Imperfect Nakagami-m Channel Information 不完美中上信道信息下具有能量收集功能的可重构智能表面辅助通信系统的性能分析

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-02-13 DOI: 10.4108/eetinis.v11i1.4369

Hữu Quý Trần, Ho Van Khuong

Reconfigurable intelligent surface (RIS) can serve as a passive relay to maintain communication between a transceiver in severe scenarios of no direct link between them. In addition, harvesting energy from radio frequency (RF) signals can meliorate significantly energy efficiency. In this research, we propose RIS-aided communication systems with energy harvesting (RISwEH) which combine both RIS and RF energy harvesting to improve energy efficiency as well as communication reliability. To evaluate realistically and quickly the performance of the RISwEH, we propose the explicit formulas of the system throughput and the outage probability under the realistic scenario of Nakagami-m fading and imperfect channel state information (CSI). Moreover, we propose an optimization algorithm relied upon a Golden section search to attain the optimum value of the time splitting factor of energy harvester to obtain the best system performance. Various results corroborate the theoretical derivations, confirm the efficacy of the proposed optimization algorithm, and illustrate the influence of innumerable system settings on the system performance. Particularly, the imperfect CSI deteriorates considerably the system performance. Nonetheless, the performance of the RISwEH can be enhanced by accreting the quantity of the elements of the RIS as well as with the lower fading severity. Furthermore, the time splitting factor also impacts dramatically the outage performance of the RISwEH and its optimal value mitigates significantly the outage probability.

可重构智能表面（RIS）可作为无源中继器，在收发器之间没有直接联系的严重情况下保持通信。此外，从射频（RF）信号中收集能量可以显著提高能效。在这项研究中，我们提出了具有能量收集功能的射频识别系统辅助通信系统（RISwEH），该系统结合了射频识别系统和射频能量收集功能，可提高能量效率和通信可靠性。为了真实、快速地评估 RISwEH 的性能，我们提出了在 Nakagami-m fading 和不完善信道状态信息（CSI）的实际情况下系统吞吐量和中断概率的明确公式。此外，我们还提出了一种基于黄金分割搜索的优化算法，以达到能量收集器时间分割因子的最佳值，从而获得最佳的系统性能。各种结果证实了理论推导，证实了所提优化算法的有效性，并说明了无数系统设置对系统性能的影响。特别是，不完善的 CSI 会大大降低系统性能。然而，RISwEH 的性能可以通过增加 RIS 的元素数量以及降低衰减严重程度来提高。此外，时间分割因子也会对 RISwEH 的中断性能产生重大影响，其最佳值可显著降低中断概率。

{"title":"Performance Analysis for Reconfigurable Intelligent Surface-aided Communication Systems with Energy Harvesting under Imperfect Nakagami-m Channel Information","authors":"Hữu Quý Trần, Ho Van Khuong","doi":"10.4108/eetinis.v11i1.4369","DOIUrl":"https://doi.org/10.4108/eetinis.v11i1.4369","url":null,"abstract":"Reconfigurable intelligent surface (RIS) can serve as a passive relay to maintain communication between a transceiver in severe scenarios of no direct link between them. In addition, harvesting energy from radio frequency (RF) signals can meliorate significantly energy efficiency. In this research, we propose RIS-aided communication systems with energy harvesting (RISwEH) which combine both RIS and RF energy harvesting to improve energy efficiency as well as communication reliability. To evaluate realistically and quickly the performance of the RISwEH, we propose the explicit formulas of the system throughput and the outage probability under the realistic scenario of Nakagami-m fading and imperfect channel state information (CSI). Moreover, we propose an optimization algorithm relied upon a Golden section search to attain the optimum value of the time splitting factor of energy harvester to obtain the best system performance. Various results corroborate the theoretical derivations, confirm the efficacy of the proposed optimization algorithm, and illustrate the influence of innumerable system settings on the system performance. Particularly, the imperfect CSI deteriorates considerably the system performance. Nonetheless, the performance of the RISwEH can be enhanced by accreting the quantity of the elements of the RIS as well as with the lower fading severity. Furthermore, the time splitting factor also impacts dramatically the outage performance of the RISwEH and its optimal value mitigates significantly the outage probability.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"85 22","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139781230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Performance Analysis for Reconfigurable Intelligent Surface-aided Communication Systems with Energy Harvesting under Imperfect Nakagami-m Channel Information 不完美中上信道信息下具有能量收集功能的可重构智能表面辅助通信系统的性能分析

Q2 Engineering

EAI Endorsed Transactions on Industrial Networks and Intelligent Systems

Pub Date : 2024-02-13 DOI: 10.4108/eetinis.v11i1.4369

Hữu Quý Trần, Ho Van Khuong

Reconfigurable intelligent surface (RIS) can serve as a passive relay to maintain communication between a transceiver in severe scenarios of no direct link between them. In addition, harvesting energy from radio frequency (RF) signals can meliorate significantly energy efficiency. In this research, we propose RIS-aided communication systems with energy harvesting (RISwEH) which combine both RIS and RF energy harvesting to improve energy efficiency as well as communication reliability. To evaluate realistically and quickly the performance of the RISwEH, we propose the explicit formulas of the system throughput and the outage probability under the realistic scenario of Nakagami-m fading and imperfect channel state information (CSI). Moreover, we propose an optimization algorithm relied upon a Golden section search to attain the optimum value of the time splitting factor of energy harvester to obtain the best system performance. Various results corroborate the theoretical derivations, confirm the efficacy of the proposed optimization algorithm, and illustrate the influence of innumerable system settings on the system performance. Particularly, the imperfect CSI deteriorates considerably the system performance. Nonetheless, the performance of the RISwEH can be enhanced by accreting the quantity of the elements of the RIS as well as with the lower fading severity. Furthermore, the time splitting factor also impacts dramatically the outage performance of the RISwEH and its optimal value mitigates significantly the outage probability.

可重构智能表面（RIS）可作为无源中继器，在收发器之间没有直接联系的严重情况下保持通信。此外，从射频（RF）信号中收集能量可以显著提高能效。在这项研究中，我们提出了具有能量收集功能的射频识别系统辅助通信系统（RISwEH），该系统结合了射频识别系统和射频能量收集功能，可提高能量效率和通信可靠性。为了真实、快速地评估 RISwEH 的性能，我们提出了在 Nakagami-m fading 和不完善信道状态信息（CSI）的实际情况下系统吞吐量和中断概率的明确公式。此外，我们还提出了一种基于黄金分割搜索的优化算法，以达到能量收集器时间分割因子的最佳值，从而获得最佳的系统性能。各种结果证实了理论推导，证实了所提优化算法的有效性，并说明了无数系统设置对系统性能的影响。特别是，不完善的 CSI 会大大降低系统性能。然而，RISwEH 的性能可以通过增加 RIS 的元素数量以及降低衰减严重程度来提高。此外，时间分割因子也会对 RISwEH 的中断性能产生重大影响，其最佳值可显著降低中断概率。

{"title":"Performance Analysis for Reconfigurable Intelligent Surface-aided Communication Systems with Energy Harvesting under Imperfect Nakagami-m Channel Information","authors":"Hữu Quý Trần, Ho Van Khuong","doi":"10.4108/eetinis.v11i1.4369","DOIUrl":"https://doi.org/10.4108/eetinis.v11i1.4369","url":null,"abstract":"Reconfigurable intelligent surface (RIS) can serve as a passive relay to maintain communication between a transceiver in severe scenarios of no direct link between them. In addition, harvesting energy from radio frequency (RF) signals can meliorate significantly energy efficiency. In this research, we propose RIS-aided communication systems with energy harvesting (RISwEH) which combine both RIS and RF energy harvesting to improve energy efficiency as well as communication reliability. To evaluate realistically and quickly the performance of the RISwEH, we propose the explicit formulas of the system throughput and the outage probability under the realistic scenario of Nakagami-m fading and imperfect channel state information (CSI). Moreover, we propose an optimization algorithm relied upon a Golden section search to attain the optimum value of the time splitting factor of energy harvester to obtain the best system performance. Various results corroborate the theoretical derivations, confirm the efficacy of the proposed optimization algorithm, and illustrate the influence of innumerable system settings on the system performance. Particularly, the imperfect CSI deteriorates considerably the system performance. Nonetheless, the performance of the RISwEH can be enhanced by accreting the quantity of the elements of the RIS as well as with the lower fading severity. Furthermore, the time splitting factor also impacts dramatically the outage performance of the RISwEH and its optimal value mitigates significantly the outage probability.","PeriodicalId":33474,"journal":{"name":"EAI Endorsed Transactions on Industrial Networks and Intelligent Systems","volume":"775 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139841123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0