首页 > 最新文献

PeerJ Computer Science最新文献

英文 中文
Enhancing reliable and energy-efficient UAV communications with RIS and deep reinforcement learning. 利用RIS和深度强化学习增强可靠和节能的无人机通信。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-29 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3031
Wasim Ahmad, Umar Islam, Abdulkadhem A Abdulkadhem, Babar Shah, Fernando Moreira, Ali Abbas

The rapid growth in wireless communication demands has led to a surge in research on technologies capable of enhancing communication reliability, coverage, and energy efficiency. Among these, uncrewed aerial vehicles (UAV) and reconfigurable intelligent surfaces (RIS) have emerged as promising solutions. Prior research on using deep reinforcement learning (DRL) to integrate RIS with UAV concentrated on enhancing signal quality and coverage, but it ignored the challenges caused by electromagnetic interference (EMI). This article introduces a novel framework addressing the challenges posed by EMI from Gallium nitride (GaN) power amplifiers in RIS-assisted UAV communication systems. By integrating DRL with quadrature phase shift keying (QPSK) modulation, the proposed system dynamically optimizes UAV deployment and RIS configurations in real-time, mitigating EMI effects, improving signal-to-interference-plus-noise ratio (SINR), and enhancing energy efficiency. The framework demonstrates superior performance, with an SINR improvement of up to 6.5 dB in interference-prone environments, while achieving a 38% increase in energy efficiency compared to baseline models. Additionally, the system significantly reduces EMI impact, with a mitigation rate of over 70%, and extends coverage area by 35%. The integration of QPSK and DRL allows for real-time decision-making that balances communication quality and energy consumption. These results show the system's potential to outperform traditional methods, particularly in dynamic and challenging environments such as urban, disaster recovery, and remote settings.

无线通信需求的快速增长导致了对能够提高通信可靠性、覆盖范围和能源效率的技术的研究激增。其中,无人驾驶飞行器(UAV)和可重构智能表面(RIS)已成为有前途的解决方案。利用深度强化学习(DRL)将RIS与无人机集成的研究主要集中在提高信号质量和覆盖范围上,但忽略了电磁干扰(EMI)带来的挑战。本文介绍了一种新的框架,解决了氮化镓(GaN)功率放大器在ris辅助无人机通信系统中产生的EMI带来的挑战。通过将DRL与正交相移键控(QPSK)调制相结合,该系统实时动态优化了无人机部署和RIS配置,减轻了电磁干扰影响,提高了信噪比(SINR),提高了能效。该框架表现出卓越的性能,在容易干扰的环境中,信噪比提高了6.5 dB,同时与基线模型相比,能源效率提高了38%。此外,该系统显著降低了电磁干扰的影响,缓解率超过70%,覆盖面积扩大了35%。QPSK和DRL的集成允许平衡通信质量和能耗的实时决策。这些结果表明,该系统具有超越传统方法的潜力,特别是在城市、灾难恢复和远程环境等动态和具有挑战性的环境中。
{"title":"Enhancing reliable and energy-efficient UAV communications with RIS and deep reinforcement learning.","authors":"Wasim Ahmad, Umar Islam, Abdulkadhem A Abdulkadhem, Babar Shah, Fernando Moreira, Ali Abbas","doi":"10.7717/peerj-cs.3031","DOIUrl":"10.7717/peerj-cs.3031","url":null,"abstract":"<p><p>The rapid growth in wireless communication demands has led to a surge in research on technologies capable of enhancing communication reliability, coverage, and energy efficiency. Among these, uncrewed aerial vehicles (UAV) and reconfigurable intelligent surfaces (RIS) have emerged as promising solutions. Prior research on using deep reinforcement learning (DRL) to integrate RIS with UAV concentrated on enhancing signal quality and coverage, but it ignored the challenges caused by electromagnetic interference (EMI). This article introduces a novel framework addressing the challenges posed by EMI from Gallium nitride (GaN) power amplifiers in RIS-assisted UAV communication systems. By integrating DRL with quadrature phase shift keying (QPSK) modulation, the proposed system dynamically optimizes UAV deployment and RIS configurations in real-time, mitigating EMI effects, improving signal-to-interference-plus-noise ratio (SINR), and enhancing energy efficiency. The framework demonstrates superior performance, with an SINR improvement of up to 6.5 dB in interference-prone environments, while achieving a 38% increase in energy efficiency compared to baseline models. Additionally, the system significantly reduces EMI impact, with a mitigation rate of over 70%, and extends coverage area by 35%. The integration of QPSK and DRL allows for real-time decision-making that balances communication quality and energy consumption. These results show the system's potential to outperform traditional methods, particularly in dynamic and challenging environments such as urban, disaster recovery, and remote settings.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3031"},"PeriodicalIF":2.5,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453847/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced information cross-attention fusion for drug-target binding affinity prediction. 增强信息交叉关注融合,预测药物靶点结合亲和力。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-28 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3117
Ailu Fei, Yihan Wang, Tiantian Ruan, Yekang Zhang, Min Yao, Li Wang

Background: The rapid development of artificial intelligence has permeated many fields, with its application in drug discovery becoming increasingly mature. Machine learning, particularly deep learning, has significantly improved the efficiency of drug discovery. In the core task of predicting drug-target affinity (DTA), deep learning enhances predictive performance by automatically extracting complex features from compounds and proteins.

Methods: Traditional approaches often rely heavily on sequence and two-dimensional structural information, overlooking critical three-dimensional and physicochemical properties. To address this, we propose a novel model-Cross Attention Fusion based on Information Enhancement for Drug-Target Affinity Prediction (CAFIE-DTA)-which incorporates protein 3D curvature and electrostatic potential information. The model approximates protein surface curvature using Delaunay triangulation, calculates total electrostatic potential via Adaptive Poisson-Boltzmann Solver (APBS) software, and employs cross multi-head attention to fuse physicochemical and sequence information of proteins. Simultaneously, it integrates graph-based and physicochemical features of compounds using the same attention mechanism. The resulting protein and compound vectors are concatenated for affinity prediction.

Results: Cross-validation and comparative evaluations on the benchmark Davis and KIBA datasets demonstrate that CAFIE-DTA outperforms existing methods. On the Davis dataset, it achieved improvements of 0.003 in confidence interval (CI) and 0.022 in R2. On the KIBA dataset, it improved MSE by 0.008, CI by 0.005, and R2 by 0.017. Compared to traditional models relying on 2D structures and sequence data, CAFIE-DTA shows superior performance in DTA prediction. The source code is available at: https://github.com/NTU-MedAI/CAFIE-DTA.

背景:人工智能的快速发展已经渗透到许多领域,其在药物发现方面的应用日益成熟。机器学习,特别是深度学习,极大地提高了药物发现的效率。在预测药物靶标亲和力(DTA)的核心任务中,深度学习通过自动从化合物和蛋白质中提取复杂特征来提高预测性能。方法:传统方法往往严重依赖于序列和二维结构信息,忽略了关键的三维和物理化学性质。为了解决这个问题,我们提出了一种新的模型-基于药物靶标亲和力预测信息增强的交叉注意融合(CAFIE-DTA)-该模型结合了蛋白质三维曲率和静电势信息。该模型采用Delaunay三角剖分法逼近蛋白质表面曲率,采用APBS软件计算总静电势,并采用交叉多头关注融合蛋白质的理化信息和序列信息。同时,它利用相同的注意机制,整合了化合物的图形特征和物理化学特征。将得到的蛋白质和复合载体连接起来进行亲和力预测。结果:对基准Davis和KIBA数据集的交叉验证和比较评估表明,CAFIE-DTA优于现有方法。在Davis数据集上,它在置信区间(CI)和R2上分别取得了0.003和0.022的改进。在KIBA数据集上,MSE提高了0.008,CI提高了0.005,R2提高了0.017。与传统的依赖于二维结构和序列数据的模型相比,CAFIE-DTA在DTA预测方面表现出更优越的性能。源代码可从https://github.com/NTU-MedAI/CAFIE-DTA获得。
{"title":"Enhanced information cross-attention fusion for drug-target binding affinity prediction.","authors":"Ailu Fei, Yihan Wang, Tiantian Ruan, Yekang Zhang, Min Yao, Li Wang","doi":"10.7717/peerj-cs.3117","DOIUrl":"10.7717/peerj-cs.3117","url":null,"abstract":"<p><strong>Background: </strong>The rapid development of artificial intelligence has permeated many fields, with its application in drug discovery becoming increasingly mature. Machine learning, particularly deep learning, has significantly improved the efficiency of drug discovery. In the core task of predicting drug-target affinity (DTA), deep learning enhances predictive performance by automatically extracting complex features from compounds and proteins.</p><p><strong>Methods: </strong>Traditional approaches often rely heavily on sequence and two-dimensional structural information, overlooking critical three-dimensional and physicochemical properties. To address this, we propose a novel model-Cross Attention Fusion based on Information Enhancement for Drug-Target Affinity Prediction (CAFIE-DTA)-which incorporates protein 3D curvature and electrostatic potential information. The model approximates protein surface curvature using Delaunay triangulation, calculates total electrostatic potential <i>via</i> Adaptive Poisson-Boltzmann Solver (APBS) software, and employs cross multi-head attention to fuse physicochemical and sequence information of proteins. Simultaneously, it integrates graph-based and physicochemical features of compounds using the same attention mechanism. The resulting protein and compound vectors are concatenated for affinity prediction.</p><p><strong>Results: </strong>Cross-validation and comparative evaluations on the benchmark Davis and KIBA datasets demonstrate that CAFIE-DTA outperforms existing methods. On the Davis dataset, it achieved improvements of 0.003 in confidence interval (CI) and 0.022 in R<sup>2</sup>. On the KIBA dataset, it improved MSE by 0.008, CI by 0.005, and R<sup>2</sup> by 0.017. Compared to traditional models relying on 2D structures and sequence data, CAFIE-DTA shows superior performance in DTA prediction. The source code is available at: https://github.com/NTU-MedAI/CAFIE-DTA.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3117"},"PeriodicalIF":2.5,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453779/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk assessment based on a new decision-making approach with fermatean fuzzy sets. 基于fermatean模糊集的风险评估新决策方法。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-28 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2990
Hilal Biderci, Ali F Guneri

Background: This study presents a new approach to decision-making based on the selection of decision-makers according to evaluated criteria in multi-criteria decision-making (MCDM) methods. Therefore, sub-decision-maker groups (SDMGs) are created for each evaluated criterion. The SDMG approach, which is created according to the criteria, offers a more flexible and dynamic structure than the existing approaches. This approach aims to use the expertise and knowledge of decision-makers more effectively. The decision-making approach presented in this study offers an innovative model and adds a new dimension to decision-making processes. This decision-making approach is applied to the plastic injection moulding machine risk assessment, as it involves different criteria. In addition to classical risk parameters such as probability, severity, frequency, and detectability, new parameters such as human error, machine error, and existing safety measures are also used in the risk assessment.

Methods: The integration of the analytic hierarchy process (AHP) and the technique for order preference by similarity to ideal solution (TOPSIS) methods into the interval valued fermatean fuzzy set (IVFFS) environment makes an important contribution to a more comprehensive consideration of risks and uncertainties in the risk assessment process. The IVFF-AHP method is used to weight the risk parameters and determine the hazard scores, and the TOPSIS method is used to rank the hazards. A holistic and systematic approach to risk assessment has been achieved by integrating these two methods. Modelling of these methods is carried out using MATLAB_R2024a software.

Results: According to the evaluated criteria, it was concluded that the determination of the decision makers separately is applicable to the decision-making process. Identifying the existing safety measures parameter as the most important risk parameter emphasizes the central role of this factor in risk assessment. In addition, machine error and human error parameters are also found to be important in risk assessment. These parameters, which are used for the first time in the literature, offer a broader perspective than traditional methods and provide significant advantages in risk assessment. According to the evaluations, electricity, asphyxiating and toxic gases, and hot water use are determined as the most risky hazards. The sensitivity and comparative analysis performed in the study confirm that the proposed methodology produces consistent and reasonable results.

背景:本研究提出了一种基于多准则决策(MCDM)方法中基于评价标准的决策者选择的新决策方法。因此,为每个评估标准创建子决策者组(sdmg)。SDMG方法是根据标准创建的,它提供了比现有方法更灵活和动态的结构。这种方法旨在更有效地利用决策者的专业知识和知识。本研究提出的决策方法提供了一个创新的模型,并为决策过程增加了一个新的维度。该决策方法适用于注塑机风险评估,因为它涉及不同的标准。除了概率、严重程度、频率和可检测性等经典风险参数外,还使用了人为错误、机器错误和现有安全措施等新参数进行风险评估。方法:将层次分析法(AHP)和TOPSIS方法结合到区间值fermatean fuzzy set (IVFFS)环境中,为风险评估过程中更全面地考虑风险和不确定性做出了重要贡献。采用IVFF-AHP法对风险参数进行加权,确定危害分值,采用TOPSIS法对危害进行排序。通过综合这两种方法,实现了一种全面和系统的风险评估方法。利用MATLAB_R2024a软件对这些方法进行建模。结果:根据评价标准,得出单独确定决策者适用于决策过程。将现有安全措施参数确定为最重要的风险参数,强调了这一因素在风险评估中的核心作用。此外,机器误差和人为误差参数在风险评估中也很重要。这些参数在文献中首次使用,比传统方法提供了更广阔的视角,在风险评估中具有显著优势。根据评估,电力、窒息性和有毒气体以及热水的使用被确定为最危险的危害。研究中进行的敏感性和比较分析证实,所提出的方法产生一致和合理的结果。
{"title":"Risk assessment based on a new decision-making approach with fermatean fuzzy sets.","authors":"Hilal Biderci, Ali F Guneri","doi":"10.7717/peerj-cs.2990","DOIUrl":"10.7717/peerj-cs.2990","url":null,"abstract":"<p><strong>Background: </strong>This study presents a new approach to decision-making based on the selection of decision-makers according to evaluated criteria in multi-criteria decision-making (MCDM) methods. Therefore, sub-decision-maker groups (SDMGs) are created for each evaluated criterion. The SDMG approach, which is created according to the criteria, offers a more flexible and dynamic structure than the existing approaches. This approach aims to use the expertise and knowledge of decision-makers more effectively. The decision-making approach presented in this study offers an innovative model and adds a new dimension to decision-making processes. This decision-making approach is applied to the plastic injection moulding machine risk assessment, as it involves different criteria. In addition to classical risk parameters such as probability, severity, frequency, and detectability, new parameters such as human error, machine error, and existing safety measures are also used in the risk assessment.</p><p><strong>Methods: </strong>The integration of the analytic hierarchy process (AHP) and the technique for order preference by similarity to ideal solution (TOPSIS) methods into the interval valued fermatean fuzzy set (IVFFS) environment makes an important contribution to a more comprehensive consideration of risks and uncertainties in the risk assessment process. The IVFF-AHP method is used to weight the risk parameters and determine the hazard scores, and the TOPSIS method is used to rank the hazards. A holistic and systematic approach to risk assessment has been achieved by integrating these two methods. Modelling of these methods is carried out using MATLAB_R2024a software.</p><p><strong>Results: </strong>According to the evaluated criteria, it was concluded that the determination of the decision makers separately is applicable to the decision-making process. Identifying the existing safety measures parameter as the most important risk parameter emphasizes the central role of this factor in risk assessment. In addition, machine error and human error parameters are also found to be important in risk assessment. These parameters, which are used for the first time in the literature, offer a broader perspective than traditional methods and provide significant advantages in risk assessment. According to the evaluations, electricity, asphyxiating and toxic gases, and hot water use are determined as the most risky hazards. The sensitivity and comparative analysis performed in the study confirm that the proposed methodology produces consistent and reasonable results.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2990"},"PeriodicalIF":2.5,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453700/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing hand-based and controller-based interactions in virtual reality learning: effects on presence and interaction performance. 比较虚拟现实学习中基于手和基于控制器的交互:对在场和交互性能的影响。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-28 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3168
Murat Saran

Virtual reality (VR) holds significant promise for enhancing science education by providing immersive and interactive learning experiences. However, the optimal interaction modality within educational VR environments remains an open question. This study investigates the impact of hand-based vs. controller-based interaction on sixth-grade students' sense of presence and interaction performance in a VR science laboratory simulation. Fifty-four sixth-grade students were randomly assigned to either a hand-based interaction group or a controller-based interaction group. Participants completed three interactive science experiments (solar system, electrical circuits, and force/energy) within a virtual laboratory environment designed to mimic their school's physical lab. Presence was assessed using a validated Turkish adaptation of the Presence Questionnaire (PQ), while interaction performance was evaluated using a structured observation form completed by a school teacher. Independent samples t-tests and Mann-Whitney U tests were used to compare the presence and performance scores between the groups. Supplementary analyses explored the effects of gender and prior VR experience. Contrary to expectations, no significant differences were found in either presence (t(49.4) = -0.01, p = 0.992) or interaction performance (t(52) = -1.30, p = 0.199) between the hand-based and controller-based interaction groups. Both interaction modalities yielded comparable levels of self-reported presence and observed performance. However, an unexpected finding emerged regarding performance. A supplementary analysis revealed a significant main effect of gender on performance scores (F(1, 50) = 4.844, p = 0.032), independent of interaction type. Specifically, males demonstrated significantly higher performance than females. This study suggests that, for sixth-grade students engaging in these specific VR science simulations, hand-based and controller-based interactions are equally effective in terms of fostering presence and supporting interaction performance. These findings have practical implications for the design and implementation of VR learning environments, particularly in resource-constrained settings where the reduced maintenance and hygiene concerns associated with hand-based interaction may be advantageous.

虚拟现实(VR)通过提供身临其境的互动学习体验,为加强科学教育带来了巨大的希望。然而,教育VR环境中的最佳交互方式仍然是一个悬而未决的问题。在虚拟现实科学实验室模拟实验中,研究了基于手和基于控制器的交互对六年级学生的存在感和交互表现的影响。54名六年级学生被随机分配到基于手的互动组和基于控制器的互动组。参与者在模拟学校物理实验室的虚拟实验室环境中完成了三个互动科学实验(太阳系、电路和力/能量)。使用经过验证的土耳其版存在问卷(PQ)评估存在,而使用由学校教师完成的结构化观察表评估互动表现。使用独立样本t检验和Mann-Whitney U检验比较各组之间的存在和表现得分。补充分析探讨了性别和先前VR体验的影响。与预期相反,在基于手的交互组和基于控制器的交互组之间,无论是存在(t(49.4) = -0.01, p = 0.992)还是交互性能(t(52) = -1.30, p = 0.199)都没有发现显著差异。两种互动方式产生了相当水平的自我报告存在和观察表现。然而,在性能方面出现了一个意想不到的发现。补充分析显示,性别对成绩得分有显著的主效应(F(1,50) = 4.844, p = 0.032),与互动类型无关。具体来说,男性的表现明显高于女性。本研究表明,对于参与这些特定的VR科学模拟的六年级学生来说,基于手和基于控制器的互动在培养存在感和支持互动表现方面同样有效。这些发现对VR学习环境的设计和实施具有实际意义,特别是在资源受限的环境中,减少与手部交互相关的维护和卫生问题可能是有利的。
{"title":"Comparing hand-based and controller-based interactions in virtual reality learning: effects on presence and interaction performance.","authors":"Murat Saran","doi":"10.7717/peerj-cs.3168","DOIUrl":"10.7717/peerj-cs.3168","url":null,"abstract":"<p><p>Virtual reality (VR) holds significant promise for enhancing science education by providing immersive and interactive learning experiences. However, the optimal interaction modality within educational VR environments remains an open question. This study investigates the impact of hand-based <i>vs</i>. controller-based interaction on sixth-grade students' sense of presence and interaction performance in a VR science laboratory simulation. Fifty-four sixth-grade students were randomly assigned to either a hand-based interaction group or a controller-based interaction group. Participants completed three interactive science experiments (solar system, electrical circuits, and force/energy) within a virtual laboratory environment designed to mimic their school's physical lab. Presence was assessed using a validated Turkish adaptation of the Presence Questionnaire (PQ), while interaction performance was evaluated using a structured observation form completed by a school teacher. Independent samples t-tests and Mann-Whitney U tests were used to compare the presence and performance scores between the groups. Supplementary analyses explored the effects of gender and prior VR experience. Contrary to expectations, no significant differences were found in either presence (t(49.4) = -0.01, <i>p</i> = 0.992) or interaction performance (t(52) = -1.30, <i>p</i> = 0.199) between the hand-based and controller-based interaction groups. Both interaction modalities yielded comparable levels of self-reported presence and observed performance. However, an unexpected finding emerged regarding performance. A supplementary analysis revealed a significant main effect of gender on performance scores (F(1, 50) = 4.844, <i>p</i> = 0.032), independent of interaction type. Specifically, males demonstrated significantly higher performance than females. This study suggests that, for sixth-grade students engaging in these specific VR science simulations, hand-based and controller-based interactions are equally effective in terms of fostering presence and supporting interaction performance. These findings have practical implications for the design and implementation of VR learning environments, particularly in resource-constrained settings where the reduced maintenance and hygiene concerns associated with hand-based interaction may be advantageous.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3168"},"PeriodicalIF":2.5,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453734/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mam-Incept-Net: a novel inception model for precise interpretation of mammography images. mam - inception - net:用于精确解释乳房x线摄影图像的新颖初始模型。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-28 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3149
Amira Tandirovic Gursel, Yasin Kaya

Early diagnosis of breast cancer through periodic screening is a vital ally in the fight for survival. Mammography, recognized as one of the most widely used and cost-effective tools for detecting early signs of asymmetry, calcification, masses, and architectural distortion in breast tissue, plays a significant role in nearly all screening scenarios. However, the interpretation and scoring of mammograms is a complex multi-parameter process that frequently leads to false-positive and false-negative results. This article introduces a new deep-learning-based model that classifies mammograms according to the Breast Imaging Reporting and Data System (BI-RADS) assessment categories. The model is trained on a private dataset, intentionally excluding no BI-RADS categories. A novel deep neural network architecture is employed to more accurately classify breasts, including their boundaries, as regions of interest (ROIs). The ConvNeXt architecture serves as a feature extractor for lower-level features, which are then combined with the layers of a randomly initialized naive inception module to capture higher-level features. Diagnosis is achieved through three experimental tests, yielding accuracy rates ranging from 82.08% to 86.27%. These promising accuracy levels, in comparison to previous studies, can be attributed to a more comprehensive approach to addressing BI-RADS scoring challenges. In addition to pursuing further enhancements in accuracy, future research should consider integrating prior radiology reports to create a more realistic end-to-end computer-aided detection system.

通过定期筛查对乳腺癌进行早期诊断是争取生存的重要盟友。乳房x光检查被认为是检测乳腺组织不对称、钙化、肿块和结构扭曲等早期症状的最广泛使用和最具成本效益的工具之一,在几乎所有筛查方案中都发挥着重要作用。然而,乳房x光片的解释和评分是一个复杂的多参数过程,经常导致假阳性和假阴性结果。本文介绍了一种新的基于深度学习的模型,该模型根据乳腺成像报告和数据系统(BI-RADS)评估类别对乳房x线照片进行分类。该模型是在一个私有数据集上训练的,故意不排除任何BI-RADS类别。一种新的深度神经网络架构被用来更准确地对乳房进行分类,包括它们的边界,作为感兴趣的区域(roi)。ConvNeXt架构用作低级特征的特征提取器,然后将低级特征与随机初始化的初始化模块的各层相结合,以捕获高级特征。通过三次实验测试实现了诊断,准确率在82.08% ~ 86.27%之间。与以前的研究相比,这些有希望的准确性水平可归因于解决BI-RADS评分挑战的更全面的方法。除了进一步提高准确性外,未来的研究应考虑整合先前的放射学报告,以创建一个更现实的端到端计算机辅助检测系统。
{"title":"Mam-Incept-Net: a novel inception model for precise interpretation of mammography images.","authors":"Amira Tandirovic Gursel, Yasin Kaya","doi":"10.7717/peerj-cs.3149","DOIUrl":"10.7717/peerj-cs.3149","url":null,"abstract":"<p><p>Early diagnosis of breast cancer through periodic screening is a vital ally in the fight for survival. Mammography, recognized as one of the most widely used and cost-effective tools for detecting early signs of asymmetry, calcification, masses, and architectural distortion in breast tissue, plays a significant role in nearly all screening scenarios. However, the interpretation and scoring of mammograms is a complex multi-parameter process that frequently leads to false-positive and false-negative results. This article introduces a new deep-learning-based model that classifies mammograms according to the Breast Imaging Reporting and Data System (BI-RADS) assessment categories. The model is trained on a private dataset, intentionally excluding no BI-RADS categories. A novel deep neural network architecture is employed to more accurately classify breasts, including their boundaries, as regions of interest (ROIs). The ConvNeXt architecture serves as a feature extractor for lower-level features, which are then combined with the layers of a randomly initialized naive inception module to capture higher-level features. Diagnosis is achieved through three experimental tests, yielding accuracy rates ranging from 82.08% to 86.27%. These promising accuracy levels, in comparison to previous studies, can be attributed to a more comprehensive approach to addressing BI-RADS scoring challenges. In addition to pursuing further enhancements in accuracy, future research should consider integrating prior radiology reports to create a more realistic end-to-end computer-aided detection system.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3149"},"PeriodicalIF":2.5,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453803/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving course evaluation processes in higher education institutions: a modular system approach. 改进高等院校课程评价过程:模块化系统方法。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-28 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3110
İlker Kocaoğlu, Erinç Karataş

Course and instructor evaluations (CIE) are essential tools for assessing educational quality in higher education. However, traditional CIE systems often suffer from inconsistencies between structured responses and open-ended feedback, leading to unreliable insights and increased administrative workload. This study suggests a modular system to address these challenges, leveraging sentiment analysis and inconsistency detection to enhance the reliability and efficiency of CIE processes.

Background: Improving the reliability of CIE data is crucial for informed decision-making in higher education. Existing methods fail to address discrepancies between numerical scores and textual feedback, resulting in misleading evaluations. This study proposes a system to identify and exclude inconsistent data, providing more reliable insights.

Methods: Using the Design Science Research Methodology (DSRM), a system architecture was developed with five modules: data collection, preprocessing, sentiment analysis, inconsistency detection, and reporting. A dataset of 13,651 anonymized Turkish CIE records was used to train and evaluate machine learning algorithms, including support vector machines, naive Bayes, random forest, decision trees, K-nearest neighbors, and OpenAI's GPT-4 Turbo Preview model. Sentiment analysis results from open-ended responses were compared with structured responses to identify inconsistencies.

Results: The GPT-4 Turbo Preview model outperformed traditional algorithms, achieving 85% accuracy, 88% precision, and 95% recall. Analysis of a prototype system applied to 431 CIEs identified a 37% inconsistency rate. By excluding inconsistent data, the system generated reliable reports with actionable insights for course and instructor performance. The purpose of this study is to design and evaluate a new system using the Design Science Research (DSR) approach to enhance the accuracy and reliability of course evaluation processes employed in higher education institutions. The modular system effectively addresses inconsistencies in CIE processes, offering a scalable and adaptable solution for higher education institutions. By integrating advanced machine learning techniques, the system enhances the accuracy and reliability of evaluation reports, supporting data-driven decision-making. Future work will focus on refining sentiment analysis for neutral comments and broadening the system's applicability to diverse educational contexts. This innovative approach represents a significant advancement in leveraging technology to improve educational quality.

课程与教师评价(CIE)是评估高等教育教学质量的重要工具。然而,传统的CIE系统经常遭受结构化响应和开放式反馈之间的不一致,导致不可靠的见解和增加的管理工作量。本研究提出了一个模块化系统来解决这些挑战,利用情感分析和不一致检测来提高CIE流程的可靠性和效率。背景:提高CIE数据的可靠性对高等教育中的明智决策至关重要。现有的方法不能解决数字分数和文本反馈之间的差异,导致误导性的评价。本研究提出了一个系统来识别和排除不一致的数据,提供更可靠的见解。方法:采用设计科学研究方法(DSRM),构建了包含数据收集、预处理、情感分析、不一致检测和报告五个模块的系统架构。使用13651个匿名土耳其CIE记录的数据集来训练和评估机器学习算法,包括支持向量机、朴素贝叶斯、随机森林、决策树、k近邻和OpenAI的GPT-4 Turbo Preview模型。将开放式回答的情绪分析结果与结构化回答进行比较,以确定不一致之处。结果:GPT-4 Turbo Preview模型优于传统算法,准确率达到85%,精密度达到88%,召回率达到95%。对应用于431个CIEs的原型系统的分析确定了37%的不一致性率。通过排除不一致的数据,系统生成了可靠的报告,并对课程和教师的表现有可操作的见解。摘要本研究旨在运用设计科学研究(DSR)的方法,设计并评估一个新的课程评估系统,以提高高等教育机构课程评估过程的准确性和可靠性。模块化系统有效地解决了CIE流程中的不一致性,为高等教育机构提供了可扩展和适应性强的解决方案。通过集成先进的机器学习技术,该系统提高了评估报告的准确性和可靠性,支持数据驱动的决策。未来的工作将集中于改进中立评论的情感分析,并扩大系统对不同教育背景的适用性。这种创新的方法代表了利用技术提高教育质量的重大进步。
{"title":"Improving course evaluation processes in higher education institutions: a modular system approach.","authors":"İlker Kocaoğlu, Erinç Karataş","doi":"10.7717/peerj-cs.3110","DOIUrl":"10.7717/peerj-cs.3110","url":null,"abstract":"<p><p>Course and instructor evaluations (CIE) are essential tools for assessing educational quality in higher education. However, traditional CIE systems often suffer from inconsistencies between structured responses and open-ended feedback, leading to unreliable insights and increased administrative workload. This study suggests a modular system to address these challenges, leveraging sentiment analysis and inconsistency detection to enhance the reliability and efficiency of CIE processes.</p><p><strong>Background: </strong>Improving the reliability of CIE data is crucial for informed decision-making in higher education. Existing methods fail to address discrepancies between numerical scores and textual feedback, resulting in misleading evaluations. This study proposes a system to identify and exclude inconsistent data, providing more reliable insights.</p><p><strong>Methods: </strong>Using the Design Science Research Methodology (DSRM), a system architecture was developed with five modules: data collection, preprocessing, sentiment analysis, inconsistency detection, and reporting. A dataset of 13,651 anonymized Turkish CIE records was used to train and evaluate machine learning algorithms, including support vector machines, naive Bayes, random forest, decision trees, K-nearest neighbors, and OpenAI's GPT-4 Turbo Preview model. Sentiment analysis results from open-ended responses were compared with structured responses to identify inconsistencies.</p><p><strong>Results: </strong>The GPT-4 Turbo Preview model outperformed traditional algorithms, achieving 85% accuracy, 88% precision, and 95% recall. Analysis of a prototype system applied to 431 CIEs identified a 37% inconsistency rate. By excluding inconsistent data, the system generated reliable reports with actionable insights for course and instructor performance. The purpose of this study is to design and evaluate a new system using the Design Science Research (DSR) approach to enhance the accuracy and reliability of course evaluation processes employed in higher education institutions. The modular system effectively addresses inconsistencies in CIE processes, offering a scalable and adaptable solution for higher education institutions. By integrating advanced machine learning techniques, the system enhances the accuracy and reliability of evaluation reports, supporting data-driven decision-making. Future work will focus on refining sentiment analysis for neutral comments and broadening the system's applicability to diverse educational contexts. This innovative approach represents a significant advancement in leveraging technology to improve educational quality.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3110"},"PeriodicalIF":2.5,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453702/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Next generation sequencing under attack: investigating insider threats and organizational behaviour. 遭受攻击的下一代测序:调查内部威胁和组织行为。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-27 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3008
Nasreen Anjum, Hani Alshahrani, Darakhshan Syed, Asadullah Shaikh, Mahreen Ul Hassan

Next generation sequencing (NGS) has become a cornerstone of modern genomics, enabling high-throughput analysis of DNA and RNA with wide applications across medicine, research, and biotechnology. However, the growing adoption of NGS technologies has introduced significant cyber-biosecurity risks, particularly those arising from insider threats and organizational shortcomings. While technical vulnerabilities have received attention, the human and behavioral dimensions of cybersecurity in NGS environments remain underexplored. This study investigates the role of human factors and organizational behavior in shaping cyber-biosecurity risks in NGS workflows. A mixed-method approach was employed, combining survey data from 120 participants across four countries with statistical analyses including chi-square tests, cross-tabulations, and cluster analysis. The study assessed cybersecurity training availability, employee engagement, training effectiveness, and awareness of insider threats. Findings reveal substantial gaps in training frequency and participation, with 36% of respondents reporting no access to NGS-specific cybersecurity training. Only a minority of participants felt confident in detecting cyber threats, and 32.5% had never applied cybersecurity knowledge in practice. Chi-square results indicate significant associations between training frequency and threat recognition, training relevance, and knowledge application. Cluster analysis further categorized organizations into "robust," "moderate," and "emergent" cybersecurity maturity profiles. The study offers an evidence-based framework to enhance cyber-biosecurity in NGS settings by addressing human-centric risks. It recommends role-specific training, frequent policy updates, and improved organizational communication to mitigate insider threats. These insights support the development of targeted interventions and policies to strengthen the cybersecurity culture in genomics organizations.

下一代测序(NGS)已成为现代基因组学的基石,使DNA和RNA的高通量分析在医学、研究和生物技术领域得到广泛应用。然而,越来越多地采用NGS技术带来了重大的网络生物安全风险,特别是由内部威胁和组织缺陷引起的风险。虽然技术漏洞受到了关注,但NGS环境中网络安全的人类和行为维度仍未得到充分探索。本研究探讨了人为因素和组织行为在NGS工作流程中形成网络生物安全风险的作用。采用混合方法,将来自四个国家的120名参与者的调查数据与包括卡方检验、交叉表和聚类分析在内的统计分析相结合。该研究评估了网络安全培训的可用性、员工敬业度、培训有效性和对内部威胁的认识。调查结果显示,培训频率和参与程度存在巨大差距,36%的受访者表示没有接受过ngs专门的网络安全培训。只有少数参与者对检测网络威胁有信心,32.5%的人从未在实践中应用过网络安全知识。卡方结果表明,训练频率与威胁识别、训练相关性和知识应用之间存在显著关联。聚类分析进一步将组织划分为“健壮的”、“适度的”和“紧急的”网络安全成熟度。该研究提供了一个基于证据的框架,通过解决以人为中心的风险来加强NGS环境中的网络生物安全。它建议针对特定角色的培训、频繁的策略更新和改进的组织沟通,以减轻内部威胁。这些见解支持有针对性的干预措施和政策的发展,以加强基因组学组织的网络安全文化。
{"title":"Next generation sequencing under attack: investigating insider threats and organizational behaviour.","authors":"Nasreen Anjum, Hani Alshahrani, Darakhshan Syed, Asadullah Shaikh, Mahreen Ul Hassan","doi":"10.7717/peerj-cs.3008","DOIUrl":"10.7717/peerj-cs.3008","url":null,"abstract":"<p><p>Next generation sequencing (NGS) has become a cornerstone of modern genomics, enabling high-throughput analysis of DNA and RNA with wide applications across medicine, research, and biotechnology. However, the growing adoption of NGS technologies has introduced significant cyber-biosecurity risks, particularly those arising from insider threats and organizational shortcomings. While technical vulnerabilities have received attention, the human and behavioral dimensions of cybersecurity in NGS environments remain underexplored. This study investigates the role of human factors and organizational behavior in shaping cyber-biosecurity risks in NGS workflows. A mixed-method approach was employed, combining survey data from 120 participants across four countries with statistical analyses including chi-square tests, cross-tabulations, and cluster analysis. The study assessed cybersecurity training availability, employee engagement, training effectiveness, and awareness of insider threats. Findings reveal substantial gaps in training frequency and participation, with 36% of respondents reporting no access to NGS-specific cybersecurity training. Only a minority of participants felt confident in detecting cyber threats, and 32.5% had never applied cybersecurity knowledge in practice. Chi-square results indicate significant associations between training frequency and threat recognition, training relevance, and knowledge application. Cluster analysis further categorized organizations into \"robust,\" \"moderate,\" and \"emergent\" cybersecurity maturity profiles. The study offers an evidence-based framework to enhance cyber-biosecurity in NGS settings by addressing human-centric risks. It recommends role-specific training, frequent policy updates, and improved organizational communication to mitigate insider threats. These insights support the development of targeted interventions and policies to strengthen the cybersecurity culture in genomics organizations.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3008"},"PeriodicalIF":2.5,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453824/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence-driven insights into Arab media's sustainable development goals coverage. 人工智能驱动的洞察阿拉伯媒体的可持续发展目标报道。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-26 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3071
Mohammed Alsuhaibani, Kamel Gaanoun, Ali Mustafa Qamar

This study examines how Arab media have engaged with the United Nations Sustainable Development Goals (SDGs) over the past decade and evaluates the alignment between media coverage and official government priorities. The research addresses the lack of large-scale, Arabic-focused analyses in SDG discourse, which is often dominated by English-language studies. We collected and processed a unique dataset of over 1.2 million Arabic news articles from ten countries between 2010 and 2024. Using a combination of data augmentation, deep learning (specifically, Transformer-based models), and large language models (LLMs), we trained classifiers to detect references to the SDGs and categorize articles by specific SDGs. The results reveal regional patterns in SDG coverage, with North African countries focusing more on governance-related goals, while Gulf countries emphasize economic and environmental themes. Our findings reveal a general alignment between media discourse and official SDG priorities, with notable exceptions. This study is the first to combine artificial intelligence (AI) methods and Arabic media at this scale for SDG analysis, offering new tools and insights for policymakers, media professionals, and development stakeholders.

本研究考察了阿拉伯媒体在过去十年中如何参与联合国可持续发展目标(sdg),并评估了媒体报道与官方政府优先事项之间的一致性。该研究解决了可持续发展目标话语中缺乏大规模、以阿拉伯语为重点的分析的问题,这些分析通常以英语研究为主。我们收集并处理了一个独特的数据集,其中包括2010年至2024年间来自10个国家的120多万篇阿拉伯语新闻文章。结合使用数据增强、深度学习(特别是基于transformer的模型)和大型语言模型(llm),我们训练分类器来检测对可持续发展目标的引用,并根据特定的可持续发展目标对文章进行分类。结果揭示了可持续发展目标覆盖范围的区域模式,北非国家更关注与治理相关的目标,而海湾国家则强调经济和环境主题。我们的研究结果揭示了媒体话语与官方可持续发展目标优先事项之间的总体一致性,但也有明显的例外。这项研究首次将人工智能(AI)方法与如此规模的阿拉伯媒体结合起来进行可持续发展目标分析,为政策制定者、媒体专业人士和发展利益相关者提供了新的工具和见解。
{"title":"Artificial intelligence-driven insights into Arab media's sustainable development goals coverage.","authors":"Mohammed Alsuhaibani, Kamel Gaanoun, Ali Mustafa Qamar","doi":"10.7717/peerj-cs.3071","DOIUrl":"10.7717/peerj-cs.3071","url":null,"abstract":"<p><p>This study examines how Arab media have engaged with the United Nations Sustainable Development Goals (SDGs) over the past decade and evaluates the alignment between media coverage and official government priorities. The research addresses the lack of large-scale, Arabic-focused analyses in SDG discourse, which is often dominated by English-language studies. We collected and processed a unique dataset of over 1.2 million Arabic news articles from ten countries between 2010 and 2024. Using a combination of data augmentation, deep learning (specifically, Transformer-based models), and large language models (LLMs), we trained classifiers to detect references to the SDGs and categorize articles by specific SDGs. The results reveal regional patterns in SDG coverage, with North African countries focusing more on governance-related goals, while Gulf countries emphasize economic and environmental themes. Our findings reveal a general alignment between media discourse and official SDG priorities, with notable exceptions. This study is the first to combine artificial intelligence (AI) methods and Arabic media at this scale for SDG analysis, offering new tools and insights for policymakers, media professionals, and development stakeholders.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3071"},"PeriodicalIF":2.5,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453826/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MLPruner: pruning convolutional neural networks with automatic mask learning. MLPruner:基于自动掩模学习的卷积神经网络剪枝。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-25 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3132
Sihan Chen, Ying Zhao

In recent years, filter pruning has been recognized as an indispensable technique for mitigating the significant computational complexity and parameter burden associated with deep convolutional neural networks (CNNs). To date, existing methods are based on heuristically designed pruning metrics or implementing weight regulations to penalize filter parameters during the training process. Nevertheless, human-crafted pruning criteria tend not to identify the most critical filters, and the introduction of weight constraints can inadvertently interfere with weight training. To rectify these obstacles, this article introduces a novel mask learning method for autonomous filter pruning, negating requirements for weight penalties. Specifically, we attribute a learnable mask to each filter. During forward propagation, the mask is transformed to a binary value of 1 or 0, serving as indicators for the necessity of corresponding filter pruning. In contrast, throughout backward propagation, we use straight-through estimator (STE) to estimate the gradient of masks, accommodating the non-differentiable characteristic of the rounding function. We verify that these learned masks aptly reflect the significance of corresponding filters. Concurrently, throughout the mask learning process, the training of neural network parameters remains uninfluenced, therefore protecting the normal training process of weights. The efficacy of our proposed filter pruning method based on mask learning, termed MLPruner, is substantiated through its application to prevalent CNNs across numerous representative benchmarks.

近年来,滤波剪枝被认为是减轻深度卷积神经网络(cnn)显著的计算复杂度和参数负担的一种不可或缺的技术。迄今为止,现有的方法是基于启发式设计的修剪指标或在训练过程中实现权重规则来惩罚过滤器参数。然而,人工修剪标准往往不能识别最关键的过滤器,并且引入权重约束可能会无意中干扰重量训练。为了纠正这些障碍,本文引入了一种新的掩模学习方法,用于自动滤波器修剪,否定了权重惩罚的要求。具体来说,我们为每个过滤器赋予一个可学习的掩码。在正向传播过程中,掩码被转换为1或0的二值,作为是否需要进行相应滤波器修剪的指标。相反,在整个反向传播过程中,我们使用直通估计器(STE)来估计掩模的梯度,以适应舍入函数的不可微特性。我们验证了这些学习到的掩码恰当地反映了相应滤波器的重要性。同时,在整个掩模学习过程中,神经网络参数的训练不受影响,从而保护了权值的正常训练过程。我们提出的基于掩模学习的过滤器修剪方法(称为MLPruner)的有效性通过其在众多代表性基准中的流行cnn的应用得到证实。
{"title":"MLPruner: pruning convolutional neural networks with automatic mask learning.","authors":"Sihan Chen, Ying Zhao","doi":"10.7717/peerj-cs.3132","DOIUrl":"10.7717/peerj-cs.3132","url":null,"abstract":"<p><p>In recent years, filter pruning has been recognized as an indispensable technique for mitigating the significant computational complexity and parameter burden associated with deep convolutional neural networks (CNNs). To date, existing methods are based on heuristically designed pruning metrics or implementing weight regulations to penalize filter parameters during the training process. Nevertheless, human-crafted pruning criteria tend not to identify the most critical filters, and the introduction of weight constraints can inadvertently interfere with weight training. To rectify these obstacles, this article introduces a novel mask learning method for autonomous filter pruning, negating requirements for weight penalties. Specifically, we attribute a learnable mask to each filter. During forward propagation, the mask is transformed to a binary value of 1 or 0, serving as indicators for the necessity of corresponding filter pruning. In contrast, throughout backward propagation, we use straight-through estimator (STE) to estimate the gradient of masks, accommodating the non-differentiable characteristic of the rounding function. We verify that these learned masks aptly reflect the significance of corresponding filters. Concurrently, throughout the mask learning process, the training of neural network parameters remains uninfluenced, therefore protecting the normal training process of weights. The efficacy of our proposed filter pruning method based on mask learning, termed MLPruner, is substantiated through its application to prevalent CNNs across numerous representative benchmarks.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3132"},"PeriodicalIF":2.5,"publicationDate":"2025-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453823/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plagiarism detection across languages: a comprehensive study of Arabic and English-to-Arabic long documents. 跨语言的抄袭检测:阿拉伯语和英语-阿拉伯语长文件的综合研究。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-08-25 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3128
Ahmad Abdelaal, Abdallah Elsaadany, Abdelrhman Ahmed Medhat, Aysha Al Shamsi, Noha Gamal ElDin Saad Ali

Plagiarism detection in Arabic texts remains a significant challenge due to the complex morphological structure, rich linguistic diversity, and scarcity of high-quality labeled datasets. This study proposes a robust framework for Arabic plagiarism detection by integrating Siamese neural networks (SNN) with state-of-the-art transformer architectures, specifically AraT5 and Longformer. The system employs a hybrid workflow, combining transformer-based encoders and a classification objective to implicitly learn textual similarity. To address the inherent imbalance in Arabic plagiarism datasets, both weighted cross-entropy loss and Dice loss functions were utilized to optimize model training. Extensive experiments were conducted using the ExAraCorpusPAN2015 dataset, demonstrating the effectiveness of the proposed architecture. Results indicate that AraT5 with weighted cross-entropy loss outperformed other configurations, achieving an F1-score of 0.9058. Additionally, comparative analysis with existing methodologies highlights the superiority of our approach in handling nuanced semantic and structural variations within Arabic texts. This study underscores the importance of transformer-based architectures and class-specific loss functions in enhancing plagiarism detection accuracy in under-resourced languages like Arabic.

由于复杂的形态结构、丰富的语言多样性和缺乏高质量的标记数据集,阿拉伯语文本的抄袭检测仍然是一个重大挑战。本研究通过将Siamese神经网络(SNN)与最先进的变压器架构(特别是AraT5和Longformer)集成,提出了一个强大的阿拉伯语抄袭检测框架。该系统采用混合工作流程,结合基于变压器的编码器和分类目标来隐式学习文本相似度。为了解决阿拉伯语抄袭数据集固有的不平衡问题,利用加权交叉熵损失和Dice损失函数对模型训练进行优化。使用ExAraCorpusPAN2015数据集进行了大量实验,证明了所提出架构的有效性。结果表明,具有加权交叉熵损失的AraT5优于其他配置,其f1得分为0.9058。此外,与现有方法的比较分析突出了我们的方法在处理阿拉伯语文本中细微的语义和结构变化方面的优势。该研究强调了基于变压器的体系结构和类特定损失函数在提高资源不足语言(如阿拉伯语)的剽窃检测准确性方面的重要性。
{"title":"Plagiarism detection across languages: a comprehensive study of Arabic and English-to-Arabic long documents.","authors":"Ahmad Abdelaal, Abdallah Elsaadany, Abdelrhman Ahmed Medhat, Aysha Al Shamsi, Noha Gamal ElDin Saad Ali","doi":"10.7717/peerj-cs.3128","DOIUrl":"https://doi.org/10.7717/peerj-cs.3128","url":null,"abstract":"<p><p>Plagiarism detection in Arabic texts remains a significant challenge due to the complex morphological structure, rich linguistic diversity, and scarcity of high-quality labeled datasets. This study proposes a robust framework for Arabic plagiarism detection by integrating Siamese neural networks (SNN) with state-of-the-art transformer architectures, specifically AraT5 and Longformer. The system employs a hybrid workflow, combining transformer-based encoders and a classification objective to implicitly learn textual similarity. To address the inherent imbalance in Arabic plagiarism datasets, both weighted cross-entropy loss and Dice loss functions were utilized to optimize model training. Extensive experiments were conducted using the ExAraCorpusPAN2015 dataset, demonstrating the effectiveness of the proposed architecture. Results indicate that AraT5 with weighted cross-entropy loss outperformed other configurations, achieving an F1-score of 0.9058. Additionally, comparative analysis with existing methodologies highlights the superiority of our approach in handling nuanced semantic and structural variations within Arabic texts. This study underscores the importance of transformer-based architectures and class-specific loss functions in enhancing plagiarism detection accuracy in under-resourced languages like Arabic.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3128"},"PeriodicalIF":2.5,"publicationDate":"2025-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453725/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PeerJ Computer Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1