首页 > 最新文献

PeerJ Computer Science最新文献

英文 中文
Research on the relationship and prediction model between nighttime lighting data, pm2.5 data, and urban GDP. 夜间照明数据、pm2.5数据与城市GDP的关系及预测模型研究
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3185
Sen Chen, Junke Li

With the discovery of electricity and the widespread adoption of lighting technology, the extensive application of electricity has greatly increased productivity, making night-time factory production possible. At the same time, the rapid expansion of factories has led to a significant increase in particulate matter 2.5 (PM2.5) in the air. However, economic development heavily relies on lighting and factory production. To address this issue, researchers have focused on predicting urban gross domestic product (GDP) through night-time lights and PM2.5, but current studies often focus on the impact of a single factor on GDP, leaving room for improvement in model accuracy. In response to this problem, this article proposes the Relationship and Prediction Model between Night Light Data, PM2.5, and Urban GDP (R&P-NLPG model). Firstly, night light data, PM2.5 data, and GDP data are collected and preprocessed. Secondly, correlation analysis is conducted to analyze the correlation between data features. Then, data fusion methods are used to integrate features between night-time data and PM2.5 data, forming the third data features. Next, a neural network is constructed to establish a functional relationship between features and GDP. Finally, the trained neural network model is used to predict GDP. The experimental results demonstrate that the predictive capability of the R&P-NLPG model outperforms GDP prediction models constructed with single-feature input and existing multi-feature input.

随着电力的发现和照明技术的广泛采用,电力的广泛应用大大提高了生产力,使夜间工厂生产成为可能。与此同时,工厂的迅速扩张导致空气中颗粒物2.5 (PM2.5)的显著增加。然而,经济发展严重依赖于照明和工厂生产。为了解决这个问题,研究人员一直致力于通过夜间灯光和PM2.5来预测城市国内生产总值(GDP),但目前的研究往往侧重于单一因素对GDP的影响,这给模型的准确性留下了改进的空间。针对这一问题,本文提出了夜间灯光数据、PM2.5与城市GDP的关系及预测模型(R&P-NLPG模型)。首先对夜间灯光数据、PM2.5数据、GDP数据进行采集和预处理。其次,进行相关性分析,分析数据特征之间的相关性。然后,利用数据融合方法将夜间数据与PM2.5数据之间的特征进行融合,形成第三个数据特征。其次,构建神经网络,建立特征与GDP之间的函数关系。最后,利用训练好的神经网络模型对GDP进行预测。实验结果表明,R&P-NLPG模型的预测能力优于单特征输入和现有多特征输入构建的GDP预测模型。
{"title":"Research on the relationship and prediction model between nighttime lighting data, pm2.5 data, and urban GDP.","authors":"Sen Chen, Junke Li","doi":"10.7717/peerj-cs.3185","DOIUrl":"10.7717/peerj-cs.3185","url":null,"abstract":"<p><p>With the discovery of electricity and the widespread adoption of lighting technology, the extensive application of electricity has greatly increased productivity, making night-time factory production possible. At the same time, the rapid expansion of factories has led to a significant increase in particulate matter 2.5 (PM2.5) in the air. However, economic development heavily relies on lighting and factory production. To address this issue, researchers have focused on predicting urban gross domestic product (GDP) through night-time lights and PM2.5, but current studies often focus on the impact of a single factor on GDP, leaving room for improvement in model accuracy. In response to this problem, this article proposes the Relationship and Prediction Model between Night Light Data, PM2.5, and Urban GDP (R&P-NLPG model). Firstly, night light data, PM2.5 data, and GDP data are collected and preprocessed. Secondly, correlation analysis is conducted to analyze the correlation between data features. Then, data fusion methods are used to integrate features between night-time data and PM2.5 data, forming the third data features. Next, a neural network is constructed to establish a functional relationship between features and GDP. Finally, the trained neural network model is used to predict GDP. The experimental results demonstrate that the predictive capability of the R&P-NLPG model outperforms GDP prediction models constructed with single-feature input and existing multi-feature input.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3185"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453825/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing privacy-preserving brain tumor classification with adaptive reputation-aware federated learning and homomorphic encryption. 利用自适应声誉感知联合学习和同态加密增强保护隐私的脑肿瘤分类。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3165
Swetha Ghanta, Prasanthi Boyapati, Sujit Biswas, Ashok K Pradhan, Saraju P Mohanty

Brain tumor diagnosis using magnetic resonance imaging (MRI) scans is critical for improving patient survival rates. However, automating the analysis of these scans faces significant challenges, including data privacy concerns and the scarcity of large, diverse datasets. A potential solution is federated learning (FL), which enables cooperative model training among multiple organizations without requiring the sharing of raw data; however, it faces various challenges. To address these, we propose Federated Adaptive Reputation-aware aggregation with CKKS (Cheon-Kim-Kim-Song) Homomorphic encryption (FedARCH), a novel FL framework designed for a cross-silo scenario, where client weights are aggregated based on reputation scores derived from performance evaluations. Our framework incorporates a weighted aggregation method using these reputation scores to enhance the robustness of the global model. To address sudden changes in client performance, a smoothing factor is introduced, while a decay factor ensures that recent updates have a greater influence on the global model. These factors work together for dynamic performance management. Additionally, we address potential privacy risks from model inversion attacks by implementing a simplified and computationally efficient CKKS homomorphic encryption, which allows secure operations on encrypted data. With FedARCH, encrypted model weights of each client are multiplied by a plaintext reputation score for weighted aggregation. Since we are multiplying ciphertexts by plaintexts, instead of ciphertexts, the need for relinearization is eliminated, efficiently reducing the computational overhead. FedARCH achieved an accuracy of 99.39%, highlighting its potential in distinguishing between brain tumor classes. Several experiments were conducted by adding noise to the clients' data and varying the number of noisy clients. An accuracy of 94% was maintained even with 50% of noisy clients at a high noise level, while the standard FL approach accuracy dropped to 33%. Our results and the security analysis demonstrate the effectiveness of FedARCH in improving model accuracy, its robustness to noisy data, and its ability to ensure data privacy, making it a viable approach for medical image analysis in federated settings.

使用磁共振成像(MRI)扫描诊断脑肿瘤对提高患者存活率至关重要。然而,这些扫描的自动化分析面临着重大挑战,包括数据隐私问题和大型、多样化数据集的稀缺性。一个潜在的解决方案是联邦学习(FL),它支持多个组织之间的合作模型训练,而不需要共享原始数据;然而,它面临着各种挑战。为了解决这些问题,我们提出了具有CKKS (Cheon-Kim-Kim-Song)同态加密(FedARCH)的联邦自适应声誉感知聚合,这是一种为跨竖井场景设计的新型FL框架,其中客户端权重根据从性能评估中获得的声誉分数进行聚合。我们的框架结合了加权聚合方法,使用这些声誉分数来增强全局模型的鲁棒性。为了解决客户端性能的突然变化,引入了平滑因子,而衰减因子确保最近的更新对全局模型有更大的影响。这些因素共同作用于动态性能管理。此外,我们通过实现简化且计算效率高的CKKS同态加密来解决模型反转攻击带来的潜在隐私风险,该加密允许对加密数据进行安全操作。使用FedARCH,每个客户端的加密模型权重乘以用于加权聚合的明文信誉评分。由于我们将密文乘以明文,而不是密文,因此消除了对线性化的需求,有效地减少了计算开销。FedARCH的准确率达到99.39%,突出了其在区分脑肿瘤类别方面的潜力。通过在客户端数据中加入噪声和改变噪声客户端的数量,进行了多次实验。即使在高噪声水平下,50%的嘈杂客户端仍保持94%的准确率,而标准FL方法的准确率下降到33%。我们的结果和安全性分析证明了FedARCH在提高模型精度、对噪声数据的鲁棒性以及确保数据隐私方面的有效性,使其成为联邦环境下医学图像分析的可行方法。
{"title":"Enhancing privacy-preserving brain tumor classification with adaptive reputation-aware federated learning and homomorphic encryption.","authors":"Swetha Ghanta, Prasanthi Boyapati, Sujit Biswas, Ashok K Pradhan, Saraju P Mohanty","doi":"10.7717/peerj-cs.3165","DOIUrl":"10.7717/peerj-cs.3165","url":null,"abstract":"<p><p>Brain tumor diagnosis using magnetic resonance imaging (MRI) scans is critical for improving patient survival rates. However, automating the analysis of these scans faces significant challenges, including data privacy concerns and the scarcity of large, diverse datasets. A potential solution is federated learning (FL), which enables cooperative model training among multiple organizations without requiring the sharing of raw data; however, it faces various challenges. To address these, we propose Federated Adaptive Reputation-aware aggregation with CKKS (Cheon-Kim-Kim-Song) Homomorphic encryption (FedARCH), a novel FL framework designed for a cross-silo scenario, where client weights are aggregated based on reputation scores derived from performance evaluations. Our framework incorporates a weighted aggregation method using these reputation scores to enhance the robustness of the global model. To address sudden changes in client performance, a smoothing factor is introduced, while a decay factor ensures that recent updates have a greater influence on the global model. These factors work together for dynamic performance management. Additionally, we address potential privacy risks from model inversion attacks by implementing a simplified and computationally efficient CKKS homomorphic encryption, which allows secure operations on encrypted data. With FedARCH, encrypted model weights of each client are multiplied by a plaintext reputation score for weighted aggregation. Since we are multiplying ciphertexts by plaintexts, instead of ciphertexts, the need for relinearization is eliminated, efficiently reducing the computational overhead. FedARCH achieved an accuracy of 99.39%, highlighting its potential in distinguishing between brain tumor classes. Several experiments were conducted by adding noise to the clients' data and varying the number of noisy clients. An accuracy of 94% was maintained even with 50% of noisy clients at a high noise level, while the standard FL approach accuracy dropped to 33%. Our results and the security analysis demonstrate the effectiveness of FedARCH in improving model accuracy, its robustness to noisy data, and its ability to ensure data privacy, making it a viable approach for medical image analysis in federated settings.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3165"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453705/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive approach for waste management with GAN-augmented classification. 基于gan增强分类的废物管理综合方法。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3156
Yashashree Mahale, Nida Khan, Kunal Kulkarni, Shilpa Gite, Biswajeet Pradhan, Abdullah Alamri, Chang-Wook Lee, Nandhini K, Mrinal Bachute

Image processing and computer vision highly rely on data augmentation in machine learning models to increase the diversity and variability within training datasets for better performance. One of the most promising and widely used applications of data augmentation is in classifying waste object images. This research focuses on augmenting waste object images with generative adversarial networks (GANS). Here deep convolutional GAN (DCGAN), an extension of GAN is utilized, which uses convolutional and convolutional-transpose layers for better image generation. This approach helps generate realism and variability in images. Furthermore, object detection and classification techniques are used. By utilizing ensemble learning techniques with DenseNet121, ConvNext, and Resnet101, the network can accurately identify and classify waste objects in images, thereby contributing to improved waste management practices and environmental sustainability. With ensemble learning, a notable accuracy of 99.80% was achieved. Thus, by investigating the effectiveness of these models in conjunction with data augmentation techniques, this novel approach of GAN-based augmentation cooperated with ensemble models aims to provide valuable insights into optimizing waste object identification processes for real-world applications. Future work will focus on better data augmentation methods with other types of GANS architectures and introducing multimodal sources of data to further increase the performance of the classification and detection models.

图像处理和计算机视觉高度依赖于机器学习模型中的数据增强,以增加训练数据集的多样性和可变性,从而获得更好的性能。数据增强最有前途和应用最广泛的应用之一是对废弃物体图像进行分类。本研究的重点是利用生成对抗网络(GANS)增强垃圾物体图像。在深度卷积GAN (DCGAN)中,利用了GAN的扩展,它使用卷积层和卷积转置层来更好地生成图像。这种方法有助于在图像中产生真实感和可变性。此外,还使用了目标检测和分类技术。通过使用DenseNet121、ConvNext和Resnet101的集成学习技术,该网络可以准确地识别和分类图像中的废物物体,从而有助于改善废物管理实践和环境可持续性。通过集成学习,准确率达到了99.80%。因此,通过研究这些模型与数据增强技术相结合的有效性,这种基于gan的增强方法与集成模型相结合,旨在为优化实际应用中的废物物体识别过程提供有价值的见解。未来的工作将集中在更好的数据增强方法与其他类型的gan架构,并引入多模态数据源,以进一步提高分类和检测模型的性能。
{"title":"A comprehensive approach for waste management with GAN-augmented classification.","authors":"Yashashree Mahale, Nida Khan, Kunal Kulkarni, Shilpa Gite, Biswajeet Pradhan, Abdullah Alamri, Chang-Wook Lee, Nandhini K, Mrinal Bachute","doi":"10.7717/peerj-cs.3156","DOIUrl":"10.7717/peerj-cs.3156","url":null,"abstract":"<p><p>Image processing and computer vision highly rely on data augmentation in machine learning models to increase the diversity and variability within training datasets for better performance. One of the most promising and widely used applications of data augmentation is in classifying waste object images. This research focuses on augmenting waste object images with generative adversarial networks (GANS). Here deep convolutional GAN (DCGAN), an extension of GAN is utilized, which uses convolutional and convolutional-transpose layers for better image generation. This approach helps generate realism and variability in images. Furthermore, object detection and classification techniques are used. By utilizing ensemble learning techniques with <i>DenseNet121, ConvNext, and Resnet101</i>, the network can accurately identify and classify waste objects in images, thereby contributing to improved waste management practices and environmental sustainability. With ensemble learning, a notable accuracy of 99.80% was achieved. Thus, by investigating the effectiveness of these models in conjunction with data augmentation techniques, this novel approach of GAN-based augmentation cooperated with ensemble models aims to provide valuable insights into optimizing waste object identification processes for real-world applications. Future work will focus on better data augmentation methods with other types of GANS architectures and introducing multimodal sources of data to further increase the performance of the classification and detection models.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3156"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453751/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hyperparameter optimization of XGBoost and hybrid CnnSVM for cyber threat detection using modified Harris hawks algorithm. 基于改进Harris hawks算法的XGBoost和混合CnnSVM网络威胁检测超参数优化。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3169
Haitham Elwahsh, Ali Bakhiet, Tarek Khalifa, Julian Hoxha, Maazen Alsabaan, Mohamed I Ibrahem, Mahmoud Elwahsh, Engy El-Shafeiy

The escalating complexity of cyber threats in smart microgrids necessitates advanced detection frameworks to counter sophisticated attacks. Existing methods often underutilize optimization techniques like Harris hawks optimization (HHO) and struggle with class imbalance in cybersecurity datasets. This study proposes a novel framework integrating HHO with extreme gradient boosting (XGBoost) and a hybrid convolutional neural network with support vector machine (Cnn-SVM) to enhance cyber threat detection. Using the distributed denial of service (DDoS) botnet attack and KDD CUP99 datasets, the proposed models leverage HHO for hyperparameter optimization, achieving accuracies of 99.97% and 99.99%, respectively, alongside improved area under curve (AUC) metrics. These results highlight the framework's ability to capture complex nonlinearities and address class imbalance through RandomOverSampler. The findings demonstrate the potential of HHO-optimized models to advance automated threat detection, offering robust and scalable solutions for securing critical infrastructures.

智能微电网中日益复杂的网络威胁需要先进的检测框架来应对复杂的攻击。现有的方法往往没有充分利用哈里斯鹰优化(HHO)等优化技术,并且难以解决网络安全数据集的类不平衡问题。本文提出了一种将HHO与极限梯度增强(XGBoost)相结合,并将卷积神经网络与支持向量机(Cnn-SVM)相结合的新框架来增强网络威胁检测。利用分布式拒绝服务(DDoS)僵尸网络攻击和KDD CUP99数据集,所提出的模型利用HHO进行超参数优化,分别实现了99.97%和99.99%的准确率,同时改善了曲线下面积(AUC)指标。这些结果突出了该框架通过randomoverampler捕获复杂非线性和解决类不平衡的能力。研究结果证明了hho优化模型在推进自动化威胁检测方面的潜力,为保护关键基础设施提供了强大且可扩展的解决方案。
{"title":"Hyperparameter optimization of XGBoost and hybrid CnnSVM for cyber threat detection using modified Harris hawks algorithm.","authors":"Haitham Elwahsh, Ali Bakhiet, Tarek Khalifa, Julian Hoxha, Maazen Alsabaan, Mohamed I Ibrahem, Mahmoud Elwahsh, Engy El-Shafeiy","doi":"10.7717/peerj-cs.3169","DOIUrl":"10.7717/peerj-cs.3169","url":null,"abstract":"<p><p>The escalating complexity of cyber threats in smart microgrids necessitates advanced detection frameworks to counter sophisticated attacks. Existing methods often underutilize optimization techniques like Harris hawks optimization (HHO) and struggle with class imbalance in cybersecurity datasets. This study proposes a novel framework integrating HHO with extreme gradient boosting (XGBoost) and a hybrid convolutional neural network with support vector machine (Cnn-SVM) to enhance cyber threat detection. Using the distributed denial of service (DDoS) botnet attack and KDD CUP99 datasets, the proposed models leverage HHO for hyperparameter optimization, achieving accuracies of 99.97% and 99.99%, respectively, alongside improved area under curve (AUC) metrics. These results highlight the framework's ability to capture complex nonlinearities and address class imbalance through RandomOverSampler. The findings demonstrate the potential of HHO-optimized models to advance automated threat detection, offering robust and scalable solutions for securing critical infrastructures.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3169"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantification of left ventricular mass in multiple views of echocardiograms using model-agnostic meta learning in a few-shot setting. 使用模型不可知的元学习在几次拍摄的超声心动图的多个视图中量化左心室质量。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3161
Yeong Hyeon Kim, Donghoon Kim, Jin Young Youm, Jiyoon Won, Seola Kim, Woohyun Park, Yisak Kim, Dongheon Lee

Background: Reliable measurement of left ventricular mass (LVM) in echocardiography is essential for early detection of left ventricular dysfunction, coronary artery disease, and arrhythmia risk, yet growing patient volumes have created critical shortage of experts in echocardiography. Recent deep learning approaches reduce inter-operator variability but require large, fully labeled datasets for each standard view-an impractical demand in many clinical settings.

Methods: To overcome these limitations, we propose a heatmap-based point-estimation segmentation model trained via model-agnostic meta-learning (MAML) for few-shot LVM quantification across multiple echocardiographic views. Our framework adapts rapidly to new views by learning a shared representation and view-specific head performing K inner-loop updates, and then meta-updating in the outer loop. We used the EchoNet-LVH dataset for the PLAX view, the TMED-2 dataset for the PSAX view and the CAMUS dataset for both the apical 2-chamber and apical 4-chamber views under 1-, 5-, and 10-shot scenarios.

Results: As a result, the proposed MAML methods demonstrated comparable performance using mean distance error, mean angle error, successful distance error and spatial angular similarity in a few-shot setting compared to models trained with larger labeled datasets for each view of the echocardiogram.

背景:超声心动图中左心室质量(LVM)的可靠测量对于早期发现左心室功能障碍、冠状动脉疾病和心律失常风险至关重要,然而患者数量的增加导致了超声心动图专家的严重短缺。最近的深度学习方法减少了操作者之间的可变性,但每个标准视图都需要大量的、完全标记的数据集,这在许多临床环境中是不切实际的需求。方法:为了克服这些局限性,我们提出了一种基于热图的点估计分割模型,该模型通过模型不可知元学习(MAML)进行训练,用于跨多个超声心动图视图的少量LVM量化。我们的框架通过学习共享表示和特定于视图的头部来快速适应新视图,执行K次内部循环更新,然后在外部循环中进行元更新。我们将EchoNet-LVH数据集用于PLAX视图,TMED-2数据集用于PSAX视图,CAMUS数据集用于1、5和10次射击场景下的尖顶2室和尖顶4室视图。结果:与每个超声心动图视图使用较大标记数据集训练的模型相比,所提出的MAML方法在少数镜头设置下使用平均距离误差、平均角度误差、成功距离误差和空间角度相似性显示出可比的性能。
{"title":"Quantification of left ventricular mass in multiple views of echocardiograms using model-agnostic meta learning in a few-shot setting.","authors":"Yeong Hyeon Kim, Donghoon Kim, Jin Young Youm, Jiyoon Won, Seola Kim, Woohyun Park, Yisak Kim, Dongheon Lee","doi":"10.7717/peerj-cs.3161","DOIUrl":"10.7717/peerj-cs.3161","url":null,"abstract":"<p><strong>Background: </strong>Reliable measurement of left ventricular mass (LVM) in echocardiography is essential for early detection of left ventricular dysfunction, coronary artery disease, and arrhythmia risk, yet growing patient volumes have created critical shortage of experts in echocardiography. Recent deep learning approaches reduce inter-operator variability but require large, fully labeled datasets for each standard view-an impractical demand in many clinical settings.</p><p><strong>Methods: </strong>To overcome these limitations, we propose a heatmap-based point-estimation segmentation model trained <i>via</i> model-agnostic meta-learning (MAML) for few-shot LVM quantification across multiple echocardiographic views. Our framework adapts rapidly to new views by learning a shared representation and view-specific head performing K inner-loop updates, and then meta-updating in the outer loop. We used the EchoNet-LVH dataset for the PLAX view, the TMED-2 dataset for the PSAX view and the CAMUS dataset for both the apical 2-chamber and apical 4-chamber views under 1-, 5-, and 10-shot scenarios.</p><p><strong>Results: </strong>As a result, the proposed MAML methods demonstrated comparable performance using mean distance error, mean angle error, successful distance error and spatial angular similarity in a few-shot setting compared to models trained with larger labeled datasets for each view of the echocardiogram.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3161"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453733/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning based cardiac disorder classification and user authentication for smart healthcare system using ECG signals. 基于深度学习的心电信号智能医疗系统心脏疾病分类与用户认证。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3082
Tong Ding, Chenhe Liu, Jiasheng Zhang, Yibo Zhang, Cheng Ding

Abnormal cardiac activity can lead to severe health complications, emphasizing the importance of timely diagnosis. It is essential to save lives if diseases are diagnosed in a reasonable timeframe. The intelligent telehealth system has the potential to transform the healthcare industry by continuously monitoring cardiac diseases remotely and non-invasively. A cloud-based telehealth system utilizing an Internet of Things (IoT)-enabled electrocardiogram (ECG) monitor gathers and analyzes ECG signals to predict cardiac complications and notify physicians in crises, facilitating prompt and precise diagnosis of cardiovascular disorders. Abnormal cardiac activity can lead to severe health complications, making early detection crucial for effective treatment. This study provides an efficient method based on deep learning convolutional neural network (CNN) and long short-term memory (LSTM) approaches to categorize and detect cardiovascular problems utilizing ECG data to increase classifications (referring to distinguishing between different ECG signal categories) and precision. Additionally, a threshold-based classifier is developed for the telehealth system's security and privacy to enable user identification (for selecting the correct user from a group) using ECG data. A data preprocessing and augmentation technique was applied to improve the data quality and quantity. The proposed LSTM model attained 99.5% accuracy in the classification of cardiac diseases and 98.6% accuracy in user authentication utilizing ECG signals. These results exhibit enhanced performance compared to conventional machine learning and convolutional neural network models.

心脏活动异常可导致严重的健康并发症,强调及时诊断的重要性。如果在合理的时间范围内诊断出疾病,对挽救生命至关重要。智能远程医疗系统通过远程和非侵入性地持续监测心脏疾病,有可能改变医疗保健行业。基于云的远程医疗系统利用支持物联网(IoT)的心电图(ECG)监测器收集和分析ECG信号,以预测心脏并发症,并在危机中通知医生,促进心血管疾病的及时准确诊断。心脏活动异常可导致严重的健康并发症,因此早期发现对有效治疗至关重要。本研究提供了一种基于深度学习卷积神经网络(CNN)和长短期记忆(LSTM)方法的有效方法,利用心电数据对心血管问题进行分类和检测,以提高分类(指区分不同的心电信号类别)和精度。此外,为远程医疗系统的安全性和隐私性开发了基于阈值的分类器,以便使用ECG数据进行用户识别(用于从组中选择正确的用户)。采用数据预处理和增强技术,提高了数据的质量和数量。所提出的LSTM模型在利用心电信号进行心脏疾病分类时准确率达到99.5%,在用户身份验证时准确率达到98.6%。与传统的机器学习和卷积神经网络模型相比,这些结果显示出更高的性能。
{"title":"Deep learning based cardiac disorder classification and user authentication for smart healthcare system using ECG signals.","authors":"Tong Ding, Chenhe Liu, Jiasheng Zhang, Yibo Zhang, Cheng Ding","doi":"10.7717/peerj-cs.3082","DOIUrl":"10.7717/peerj-cs.3082","url":null,"abstract":"<p><p>Abnormal cardiac activity can lead to severe health complications, emphasizing the importance of timely diagnosis. It is essential to save lives if diseases are diagnosed in a reasonable timeframe. The intelligent telehealth system has the potential to transform the healthcare industry by continuously monitoring cardiac diseases remotely and non-invasively. A cloud-based telehealth system utilizing an Internet of Things (IoT)-enabled electrocardiogram (ECG) monitor gathers and analyzes ECG signals to predict cardiac complications and notify physicians in crises, facilitating prompt and precise diagnosis of cardiovascular disorders. Abnormal cardiac activity can lead to severe health complications, making early detection crucial for effective treatment. This study provides an efficient method based on deep learning convolutional neural network (CNN) and long short-term memory (LSTM) approaches to categorize and detect cardiovascular problems utilizing ECG data to increase classifications (referring to distinguishing between different ECG signal categories) and precision. Additionally, a threshold-based classifier is developed for the telehealth system's security and privacy to enable user identification (for selecting the correct user from a group) using ECG data. A data preprocessing and augmentation technique was applied to improve the data quality and quantity. The proposed LSTM model attained 99.5% accuracy in the classification of cardiac diseases and 98.6% accuracy in user authentication utilizing ECG signals. These results exhibit enhanced performance compared to conventional machine learning and convolutional neural network models.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3082"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453811/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A literature review of research on question generation in education. 教育中问题生成研究的文献综述。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3203
Xiaohui Dong, Xinyu Zhang, Zhengluo Li, Quanxin Hou, Jixiang Xue, Xiaoyi Li

As a key natural language processing (NLP) task, question generation (QG) is crucial for boosting educational quality and fostering personalized learning. This article offers an in-depth review of the research advancements and future directions in QG in education (QGEd). We start by tracing the evolution of QG and QGEd. Next, we explore the current state of QGEd research through three dimensions: its three core objectives, commonly used datasets, and question quality evaluation methods. This article also underscores its unique contributions to QGEd, including a systematic analysis of the research landscape and an identification of pivotal challenges and opportunities. Lastly, we highlight future research directions, emphasizing the need for deeper exploration in QGEd regarding multimodal data processing, controllability of fine-grained cognitive and difficulty levels, specialized educational dataset construction, automatic evaluation technology development, and system architecture design. Overall, this review aims to provide a comprehensive overview of the field, offering valuable insights for researchers and practitioners in educational technology.

作为自然语言处理(NLP)的一项关键任务,问题生成(QG)对于提高教育质量和促进个性化学习至关重要。本文对QGEd在教育中的研究进展和未来发展方向进行了综述。我们首先追溯QG和QGEd的演变。接下来,我们将从三个方面探讨QGEd研究的现状:三个核心目标、常用数据集和问题质量评估方法。本文还强调了其对QGEd的独特贡献,包括对研究前景的系统分析以及对关键挑战和机遇的识别。最后,展望了QGEd未来的研究方向,强调需要在多模态数据处理、细粒度认知水平和难度水平的可控性、专业教育数据集构建、自动评估技术开发和系统架构设计等方面进行更深入的探索。总体而言,本文旨在对该领域进行全面概述,为教育技术的研究人员和实践者提供有价值的见解。
{"title":"A literature review of research on question generation in education.","authors":"Xiaohui Dong, Xinyu Zhang, Zhengluo Li, Quanxin Hou, Jixiang Xue, Xiaoyi Li","doi":"10.7717/peerj-cs.3203","DOIUrl":"10.7717/peerj-cs.3203","url":null,"abstract":"<p><p>As a key natural language processing (NLP) task, question generation (QG) is crucial for boosting educational quality and fostering personalized learning. This article offers an in-depth review of the research advancements and future directions in QG in education (QGEd). We start by tracing the evolution of QG and QGEd. Next, we explore the current state of QGEd research through three dimensions: its three core objectives, commonly used datasets, and question quality evaluation methods. This article also underscores its unique contributions to QGEd, including a systematic analysis of the research landscape and an identification of pivotal challenges and opportunities. Lastly, we highlight future research directions, emphasizing the need for deeper exploration in QGEd regarding multimodal data processing, controllability of fine-grained cognitive and difficulty levels, specialized educational dataset construction, automatic evaluation technology development, and system architecture design. Overall, this review aims to provide a comprehensive overview of the field, offering valuable insights for researchers and practitioners in educational technology.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3203"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453861/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parametric art creation platform design based on visual delivery and multimedia data fusion. 基于视觉传递和多媒体数据融合的参数化美术创作平台设计。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3175
Qing Yun

In the era of informational ascendancy, the discourse of artistic communication has transcended the confines of conventional physical domains and geographical boundaries, extending its purview ubiquitously across the global expanse. Consequently, the predominant mode of artistic interaction has evolved towards swift and extensive engagement through virtual platforms. However, this paradigm shift has given rise to the imperative task of meticulous categorization and labeling of an extensive corpus of artistic works, demanding substantial temporal and human resources. This article introduces an innovative bimodal time series classification model (BTSCM) network for the purpose of categorizing and labeling artworks on virtual platforms. Rooted in the foundational principles of visual communication and leveraging multimedia fusion technology, the proposed model proves instrumental in discerning categories within the realm of video content. The BTSCM framework initiates the classification of video data into constituent image and sound elements, employing the conceptual framework of visual communication. Subsequently, feature extraction for both forms of information is achieved through the application of Inflated 3D ConvNet and Mel frequency cepstrum coefficient (MFCC). The synthesis of these extracted features is orchestrated through a fusion of fully convolutional network (FCN), deep Q-network (DQN), and long short-term memory (LSTM), collectively manifesting as the BTSCM network model. This amalgamated network, shaped by the union of fully convolutional network (FCN), DQN, and LSTM, adeptly conducts information processing, culminating in the realization of high-precision video classification. Experimental findings substantiate the efficacy of the BTSCM framework, as evidenced by outstanding classification results across diverse video classification datasets. The classification recognition rate on the self-established art platform exceeds 90%, surpassing benchmarks set by multiple multimodal fusion recognition networks. These commendable outcomes underscore the BTSCM framework's potential significance, providing a theoretical and methodological foundation for the prospective scrutiny and annotation of content within art creation platforms.

在信息优势时代,艺术传播的话语已经超越了传统物理领域和地理边界的限制,将其范围扩展到全球范围。因此,艺术互动的主要模式已经演变为通过虚拟平台快速和广泛的参与。然而,这种范式的转变带来了对大量艺术作品进行细致分类和标记的迫切任务,需要大量的时间和人力资源。本文介绍了一种创新的双峰时间序列分类模型(BTSCM)网络,用于对虚拟平台上的艺术品进行分类和标记。基于视觉传播的基本原则,并利用多媒体融合技术,所提出的模型被证明有助于在视频内容领域内识别类别。BTSCM框架采用视觉传达的概念框架,将视频数据分类为构成图像和声音元素。随后,利用膨胀三维卷积神经网络和梅尔倒频谱系数(MFCC)对两种形式的信息进行特征提取。通过全卷积网络(FCN)、深度q -网络(DQN)和长短期记忆(LSTM)的融合,对这些提取的特征进行综合,共同表现为BTSCM网络模型。该融合网络由全卷积网络(FCN)、DQN和LSTM联合形成,熟练地进行信息处理,最终实现了高精度的视频分类。实验结果证实了BTSCM框架的有效性,在不同的视频分类数据集上取得了出色的分类结果。在自建的艺术平台上,分类识别率超过90%,超过多个多模态融合识别网络设定的基准。这些值得称道的成果强调了BTSCM框架的潜在意义,为艺术创作平台内内容的前瞻性审查和注释提供了理论和方法基础。
{"title":"Parametric art creation platform design based on visual delivery and multimedia data fusion.","authors":"Qing Yun","doi":"10.7717/peerj-cs.3175","DOIUrl":"10.7717/peerj-cs.3175","url":null,"abstract":"<p><p>In the era of informational ascendancy, the discourse of artistic communication has transcended the confines of conventional physical domains and geographical boundaries, extending its purview ubiquitously across the global expanse. Consequently, the predominant mode of artistic interaction has evolved towards swift and extensive engagement through virtual platforms. However, this paradigm shift has given rise to the imperative task of meticulous categorization and labeling of an extensive <i>corpus</i> of artistic works, demanding substantial temporal and human resources. This article introduces an innovative bimodal time series classification model (BTSCM) network for the purpose of categorizing and labeling artworks on virtual platforms. Rooted in the foundational principles of visual communication and leveraging multimedia fusion technology, the proposed model proves instrumental in discerning categories within the realm of video content. The BTSCM framework initiates the classification of video data into constituent image and sound elements, employing the conceptual framework of visual communication. Subsequently, feature extraction for both forms of information is achieved through the application of Inflated 3D ConvNet and Mel frequency cepstrum coefficient (MFCC). The synthesis of these extracted features is orchestrated through a fusion of fully convolutional network (FCN), deep Q-network (DQN), and long short-term memory (LSTM), collectively manifesting as the BTSCM network model. This amalgamated network, shaped by the union of fully convolutional network (FCN), DQN, and LSTM, adeptly conducts information processing, culminating in the realization of high-precision video classification. Experimental findings substantiate the efficacy of the BTSCM framework, as evidenced by outstanding classification results across diverse video classification datasets. The classification recognition rate on the self-established art platform exceeds 90%, surpassing benchmarks set by multiple multimodal fusion recognition networks. These commendable outcomes underscore the BTSCM framework's potential significance, providing a theoretical and methodological foundation for the prospective scrutiny and annotation of content within art creation platforms.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3175"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453805/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature selection for emotion recognition in speech: a comparative study of filter and wrapper methods. 语音情感识别的特征选择:滤波和包装方法的比较研究。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3180
Alaa Altheneyan, Aseel Alhadlaq

Feature selection is essential for enhancing the performance and reducing the complexity of speech emotion recognition models. This article evaluates various feature selection methods, including correlation-based (CB), mutual information (MI), and recursive feature elimination (RFE), against baseline approaches using three different feature sets: (1) all available features (Mel-frequency cepstral coefficients (MFCC), root mean square energy (RMS), zero crossing rate (ZCR), chromagram, spectral centroid frequency (SCF), Tonnetz, Mel spectrogram, and spectral bandwidth), totaling 170 features; (2) a five-feature subset (MFCC, RMS, ZCR, Chromagram, and Mel spectrogram), totaling 163 features; and (3) a six-feature subset (MFCC, RMS, ZCR, SCF, Tonnetz, and Mel spectrogram), totaling 157 features. Methods are compared based on precision, recall, F1-score, accuracy, and the number of features selected. Results show that using all features yields an accuracy of 61.42%, but often includes irrelevant data. MI with 120 features achieves the highest performance, with precision, recall, F1-score, and accuracy at 65%, 65%, 65%, and 64.71%, respectively. CB methods with moderate thresholds also perform well, balancing simplicity and accuracy. RFE methods improve consistently with more features, stabilizing around 120 features.

特征选择对于提高语音情感识别模型的性能和降低其复杂性至关重要。本文评估了各种特征选择方法,包括基于相关的(CB),互信息(MI)和递归特征消除(RFE),针对基线方法使用三种不同的特征集:(1)所有可用的特征(Mel频率倒谱系数(MFCC),均方根能量(RMS),零交叉率(ZCR),色谱图,频谱质心频率(SCF), Tonnetz, Mel谱图和频谱带宽),共计170个特征;(2) 5个特征子集(MFCC、RMS、ZCR、Chromagram和Mel谱图),共163个特征;(3) 6个特征子集(MFCC、RMS、ZCR、SCF、Tonnetz和Mel谱图),共有157个特征。基于精度、召回率、f1分、准确度和所选特征的数量对方法进行比较。结果表明,使用所有特征的准确率为61.42%,但通常包含不相关的数据。具有120个特征的MI达到了最高的性能,准确率、召回率、f1得分和准确率分别为65%、65%、65%和64.71%。具有中等阈值的CB方法也表现良好,平衡了简单性和准确性。随着功能的增加,RFE方法不断改进,稳定在120个功能左右。
{"title":"Feature selection for emotion recognition in speech: a comparative study of filter and wrapper methods.","authors":"Alaa Altheneyan, Aseel Alhadlaq","doi":"10.7717/peerj-cs.3180","DOIUrl":"10.7717/peerj-cs.3180","url":null,"abstract":"<p><p>Feature selection is essential for enhancing the performance and reducing the complexity of speech emotion recognition models. This article evaluates various feature selection methods, including correlation-based (CB), mutual information (MI), and recursive feature elimination (RFE), against baseline approaches using three different feature sets: (1) all available features (Mel-frequency cepstral coefficients (MFCC), root mean square energy (RMS), zero crossing rate (ZCR), chromagram, spectral centroid frequency (SCF), Tonnetz, Mel spectrogram, and spectral bandwidth), totaling 170 features; (2) a five-feature subset (MFCC, RMS, ZCR, Chromagram, and Mel spectrogram), totaling 163 features; and (3) a six-feature subset (MFCC, RMS, ZCR, SCF, Tonnetz, and Mel spectrogram), totaling 157 features. Methods are compared based on precision, recall, F1-score, accuracy, and the number of features selected. Results show that using all features yields an accuracy of 61.42%, but often includes irrelevant data. MI with 120 features achieves the highest performance, with precision, recall, F1-score, and accuracy at 65%, 65%, 65%, and 64.71%, respectively. CB methods with moderate thresholds also perform well, balancing simplicity and accuracy. RFE methods improve consistently with more features, stabilizing around 120 features.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3180"},"PeriodicalIF":2.5,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453713/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing human activity recognition with machine learning: insights from smartphone accelerometer and magnetometer data. 用机器学习增强人类活动识别:来自智能手机加速计和磁力计数据的见解。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-15 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3137
Luis Augusto Silva Zendron, Paulo Jorge Coelho, Christophe Soares, Ivo Pereira, Ivan Miguel Pires

The domain of Human Activity Recognition (HAR) has undergone a remarkable evolution, driven by advancements in sensor technology, artificial intelligence (AI), and machine learning algorithms. The aim of this article consists of taking as a basis the previously obtained results to implement other techniques to analyze the same dataset and improve the results previously obtained in the different studies, such as neural networks with different configurations, random forest, support vector machine, CN2 rule inducer, Naive Bayes, and AdaBoost. The methodology consists of data collection from smartphone sensors, data cleaning and normalization, feature extraction techniques, and the implementation of various machine learning models. The study analyzed machine learning models for recognizing human activities using data from smartphone sensors. The results showed that the neural network and random forest models were highly effective across multiple metrics. The models achieved an area under the curve (AUC) of 98.42%, a classification accuracy of 90.14%, an F1-score of 90.13%, a precision of 90.18%, and a recall of 90.14%. With significantly reduced computational cost, our approach outperforms earlier models using the same dataset and achieves results comparable to those of contemporary deep learning-based approaches. Unlike prior studies, our work utilizes non-normalized data and integrates magnetometer signals to enhance performance, all while employing lightweight models within a reproducible visual workflow. This approach is novel, efficient, and deployable on mobile devices in real-time. This approach makes it an ideal fit for real-time mobile applications.

在传感器技术、人工智能(AI)和机器学习算法进步的推动下,人类活动识别(HAR)领域经历了显著的发展。本文的目的是在之前得到的结果的基础上,实现其他技术来分析相同的数据集,并改进之前在不同研究中得到的结果,如不同配置的神经网络、随机森林、支持向量机、CN2规则诱导器、朴素贝叶斯、AdaBoost等。该方法包括从智能手机传感器收集数据、数据清洗和规范化、特征提取技术以及各种机器学习模型的实现。该研究分析了利用智能手机传感器数据识别人类活动的机器学习模型。结果表明,神经网络和随机森林模型在多个指标上都是非常有效的。模型的曲线下面积(AUC)为98.42%,分类准确率为90.14%,f1评分为90.13%,准确率为90.18%,召回率为90.14%。通过显著降低计算成本,我们的方法优于使用相同数据集的早期模型,并获得与当代基于深度学习的方法相当的结果。与之前的研究不同,我们的工作利用非标准化数据并集成磁力计信号来提高性能,同时在可重复的可视化工作流程中采用轻量级模型。这种方法新颖、高效,并且可以在移动设备上实时部署。这种方法使其非常适合实时移动应用程序。
{"title":"Enhancing human activity recognition with machine learning: insights from smartphone accelerometer and magnetometer data.","authors":"Luis Augusto Silva Zendron, Paulo Jorge Coelho, Christophe Soares, Ivo Pereira, Ivan Miguel Pires","doi":"10.7717/peerj-cs.3137","DOIUrl":"10.7717/peerj-cs.3137","url":null,"abstract":"<p><p>The domain of Human Activity Recognition (HAR) has undergone a remarkable evolution, driven by advancements in sensor technology, artificial intelligence (AI), and machine learning algorithms. The aim of this article consists of taking as a basis the previously obtained results to implement other techniques to analyze the same dataset and improve the results previously obtained in the different studies, such as neural networks with different configurations, random forest, support vector machine, CN2 rule inducer, Naive Bayes, and AdaBoost. The methodology consists of data collection from smartphone sensors, data cleaning and normalization, feature extraction techniques, and the implementation of various machine learning models. The study analyzed machine learning models for recognizing human activities using data from smartphone sensors. The results showed that the neural network and random forest models were highly effective across multiple metrics. The models achieved an area under the curve (AUC) of 98.42%, a classification accuracy of 90.14%, an F1-score of 90.13%, a precision of 90.18%, and a recall of 90.14%. With significantly reduced computational cost, our approach outperforms earlier models using the same dataset and achieves results comparable to those of contemporary deep learning-based approaches. Unlike prior studies, our work utilizes non-normalized data and integrates magnetometer signals to enhance performance, all while employing lightweight models within a reproducible visual workflow. This approach is novel, efficient, and deployable on mobile devices in real-time. This approach makes it an ideal fit for real-time mobile applications.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3137"},"PeriodicalIF":2.5,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453735/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PeerJ Computer Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1