首页 > 最新文献

International Journal of Software Engineering and Knowledge Engineering最新文献

英文 中文
On the measures of success in replication of controlled experiments with STRIDE 关于STRIDE控制实验复制成功的措施
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-27 DOI: 10.1142/s0218194023500651
Winnie Mbaka, Katja Tuma
{"title":"On the measures of success in replication of controlled experiments with STRIDE","authors":"Winnie Mbaka, Katja Tuma","doi":"10.1142/s0218194023500651","DOIUrl":"https://doi.org/10.1142/s0218194023500651","url":null,"abstract":"","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136312495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SHAMROQ: A Software Engineering Methodology to Extract Deontic Expressions from the Code of Federal Regulations - A Single-Case, Embedded Case Study 从联邦法规代码中提取道义表达的软件工程方法-一个单一案例,嵌入式案例研究
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-27 DOI: 10.1142/s021819402341005x
Patrick D. Cook, Susan A. Mengel, Siva Parameswaran
This research provides a comprehensive analysis of deontic expressions within the Code of Federal Regulations (CFR) Title 48, Federal Acquisition Regulations System, specifically focusing on obligations, permissions, prohibitions, and dispensations. Utilizing SHAMROQ, a systematic and rigorous methodology, the authors extract, classify, and analyze these expressions, quantify their prevalence, and identify common linguistic patterns within the legal text. The results show that obligations (71.3%) form most deontic expressions in CFR 48, indicating the heavily prescriptive nature of the document. Permissions also form a significant part (21.9%), suggesting the liberties and allowances are embedded within the regulatory framework. In contrast, prohibitions (5.4%) and dispensations (1.4%) are less frequent, indicating that the document leans more towards defining what is required or allowed rather than what is explicitly forbidden or exempted. This research also highlights the challenges encountered during the extraction process, providing insights into the complexities of parsing legal texts and the intricacies of deontic language. These challenges range from the technical difficulties of parsing a complex hierarchical document to the conceptual challenges of defining precise rulesets for regulations and provisions. In summary, the results deepen the understanding of regulatory compliance in software engineering and contribute to the development of more effective and efficient automated extraction tools.
本研究对《联邦法规》第48篇《联邦采购法规体系》中的义务表达进行了全面分析,特别关注义务、许可、禁止和豁免。利用SHAMROQ这一系统而严谨的方法论,作者提取、分类和分析了这些表达,量化了它们的流行程度,并确定了法律文本中常见的语言模式。结果表明,在CFR 48中,义务(71.3%)构成了大多数道义性表达,这表明文件具有很强的规定性。许可也构成了重要的一部分(21.9%),这表明自由和许可是嵌入在监管框架内的。相比之下,禁止(5.4%)和豁免(1.4%)的频率较低,这表明文件更倾向于定义什么是需要或允许的,而不是明确禁止或豁免的。这项研究还强调了在提取过程中遇到的挑战,提供了对分析法律文本的复杂性和道义语言的复杂性的见解。这些挑战包括从解析复杂分层文档的技术困难到为法规和规定定义精确规则集的概念挑战。总之,结果加深了对软件工程中法规遵从性的理解,并有助于开发更有效和高效的自动化提取工具。
{"title":"SHAMROQ: A Software Engineering Methodology to Extract Deontic Expressions from the Code of Federal Regulations - A Single-Case, Embedded Case Study","authors":"Patrick D. Cook, Susan A. Mengel, Siva Parameswaran","doi":"10.1142/s021819402341005x","DOIUrl":"https://doi.org/10.1142/s021819402341005x","url":null,"abstract":"This research provides a comprehensive analysis of deontic expressions within the Code of Federal Regulations (CFR) Title 48, Federal Acquisition Regulations System, specifically focusing on obligations, permissions, prohibitions, and dispensations. Utilizing SHAMROQ, a systematic and rigorous methodology, the authors extract, classify, and analyze these expressions, quantify their prevalence, and identify common linguistic patterns within the legal text. The results show that obligations (71.3%) form most deontic expressions in CFR 48, indicating the heavily prescriptive nature of the document. Permissions also form a significant part (21.9%), suggesting the liberties and allowances are embedded within the regulatory framework. In contrast, prohibitions (5.4%) and dispensations (1.4%) are less frequent, indicating that the document leans more towards defining what is required or allowed rather than what is explicitly forbidden or exempted. This research also highlights the challenges encountered during the extraction process, providing insights into the complexities of parsing legal texts and the intricacies of deontic language. These challenges range from the technical difficulties of parsing a complex hierarchical document to the conceptual challenges of defining precise rulesets for regulations and provisions. In summary, the results deepen the understanding of regulatory compliance in software engineering and contribute to the development of more effective and efficient automated extraction tools.","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136233161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GTFP: Network Fault Prediction Based on Graph and Time Series 基于图和时间序列的网络故障预测
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-18 DOI: 10.1142/s0218194023500560
Zhongliang Li, Junjun Ding, Zongming Ma
With the explosion of 5G network scale, the network structure becomes increasingly complex. During the operation of the network devices, the probability of anomalies or faults increases accordingly. Network faults may lead to the disappearance of important information and cause unpredictable losses. The prediction of network faults can enhance the quality of network services and reduce economic loss. In this paper, we propose the concept of 4D features and use the BERT algorithm to extract semantic features, the graph neural network algorithm to extract network topology information, and the Temporal Convolutional Network (TCN) algorithm to extract time series. Based on this, we propose Fault Prediction based on GraphSage and TCN (GTFP), an end-to-end solution of network fault alarm prediction, which is based on GraphSage and TCN (GTCN), a hybrid algorithm of a graph neural network and the TCN model. Our solution takes the historical alarm data as input. First, we filter out the alarm noises irrelevant to the faults through data cleaning. Then, we employ feature engineering to extract the valid alarm features, including the statistical features of the network alarm information, the semantic features of the alarm texts, the sequential features of the alarms and the network topology features of the nodes where the alarms are located. Finally, we use GTCN to predict future fault alarms based on the extracted features. Experiments on the alarm data of a real service system show that GTFP performs better than the state-of-the-art algorithms of fault alarm prediction.
随着5G网络规模的爆炸式增长,网络结构日趋复杂。在网络设备的运行过程中,出现异常或故障的概率也会随之增加。网络故障可能导致重要信息丢失,造成不可预测的损失。网络故障预测可以提高网络服务质量,减少经济损失。本文提出了四维特征的概念,并采用BERT算法提取语义特征,图神经网络算法提取网络拓扑信息,时间卷积网络(TCN)算法提取时间序列。在此基础上,提出了基于GraphSage和TCN的故障预测(GTFP),这是一种基于GraphSage和TCN (GTCN)的网络故障告警预测的端到端解决方案,是一种图神经网络和TCN模型的混合算法。我们的解决方案以历史告警数据为输入。首先,通过数据清洗过滤掉与故障无关的报警噪声。然后,利用特征工程提取有效的告警特征,包括网络告警信息的统计特征、告警文本的语义特征、告警的序列特征以及告警所在节点的网络拓扑特征。最后,基于提取的特征,利用GTCN对未来的故障告警进行预测。在实际业务系统报警数据上的实验表明,GTFP算法比现有的故障报警预测算法具有更好的预测效果。
{"title":"GTFP: Network Fault Prediction Based on Graph and Time Series","authors":"Zhongliang Li, Junjun Ding, Zongming Ma","doi":"10.1142/s0218194023500560","DOIUrl":"https://doi.org/10.1142/s0218194023500560","url":null,"abstract":"With the explosion of 5G network scale, the network structure becomes increasingly complex. During the operation of the network devices, the probability of anomalies or faults increases accordingly. Network faults may lead to the disappearance of important information and cause unpredictable losses. The prediction of network faults can enhance the quality of network services and reduce economic loss. In this paper, we propose the concept of 4D features and use the BERT algorithm to extract semantic features, the graph neural network algorithm to extract network topology information, and the Temporal Convolutional Network (TCN) algorithm to extract time series. Based on this, we propose Fault Prediction based on GraphSage and TCN (GTFP), an end-to-end solution of network fault alarm prediction, which is based on GraphSage and TCN (GTCN), a hybrid algorithm of a graph neural network and the TCN model. Our solution takes the historical alarm data as input. First, we filter out the alarm noises irrelevant to the faults through data cleaning. Then, we employ feature engineering to extract the valid alarm features, including the statistical features of the network alarm information, the semantic features of the alarm texts, the sequential features of the alarms and the network topology features of the nodes where the alarms are located. Finally, we use GTCN to predict future fault alarms based on the extracted features. Experiments on the alarm data of a real service system show that GTFP performs better than the state-of-the-art algorithms of fault alarm prediction.","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135823311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Dynamic Drilling Sampling Method and Evaluation Model for Big Streaming Data 大流数据动态钻井采样方法及评价模型
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-18 DOI: 10.1142/s0218194023410036
Zhaohui Zhang, Pei Zhang, Peng Zhang, Fujuan Xu, Chaochao Hu, Pengwei Wang
The big data sampling method for real-time and high-speed streaming data is prone to lose the value and information of a large amount of discrete data, and it is not easy to make an efficient and accurate evaluation of the value characteristics of streaming data. The SDSLA sampling method based on mineral drilling exploration can evaluate the valuable information of streaming data containing many discrete data in real-time, but when the range of discrete data is irregular, it has low sampling accuracy for discrete data. Based on the SDSLA algorithm, we propose a dynamic drilling sampling method SDDS, which takes well as the analysis unit, dynamically changes the size and position of the well, and accurately locates the position and range of discrete data. A new model SDVEM is further proposed for data valuation, which evaluates the sample set from discrete, centralized, and overall dimensions. Experiments show that compared with the SDSLA algorithm, the sample sampled by the SDDS algorithm has higher evaluation accuracy, and the probability distribution of the sample is closer to the original streaming data, with the AOCV indicator being nearly 10% higher. In addition, the SDDS algorithm can achieve over 90% accuracy, recall, and F1 score for training and testing neural networks with small sampling rates, all of which are higher than the SDSLA algorithm. In summary, the SDDS algorithm not only accurately evaluates the value characteristics of streaming data but also facilitates the training of neural network models, which has important research significance in big data estimation.
实时、高速流数据的大数据采样方法容易丢失大量离散数据的价值和信息,不易对流数据的价值特征进行高效、准确的评价。基于矿产钻探勘探的SDSLA采样方法可以实时评价包含许多离散数据的流数据的有价值信息,但当离散数据范围不规则时,对离散数据的采样精度较低。基于SDSLA算法,提出了一种动态钻井采样方法SDDS,该方法以井为分析单元,动态改变井的尺寸和位置,准确定位离散数据的位置和范围。进一步提出了一种新的数据评估模型SDVEM,该模型从离散、集中和整体三个维度对样本集进行评估。实验表明,与SDSLA算法相比,SDDS算法采样的样本具有更高的评估精度,样本的概率分布更接近原始流数据,AOCV指标提高了近10%。此外,对于小采样率的神经网络训练和测试,SDDS算法的准确率、召回率和F1分数均高于SDSLA算法。综上所述,SDDS算法不仅能准确地评估流数据的值特征,而且便于神经网络模型的训练,在大数据估计中具有重要的研究意义。
{"title":"A Dynamic Drilling Sampling Method and Evaluation Model for Big Streaming Data","authors":"Zhaohui Zhang, Pei Zhang, Peng Zhang, Fujuan Xu, Chaochao Hu, Pengwei Wang","doi":"10.1142/s0218194023410036","DOIUrl":"https://doi.org/10.1142/s0218194023410036","url":null,"abstract":"The big data sampling method for real-time and high-speed streaming data is prone to lose the value and information of a large amount of discrete data, and it is not easy to make an efficient and accurate evaluation of the value characteristics of streaming data. The SDSLA sampling method based on mineral drilling exploration can evaluate the valuable information of streaming data containing many discrete data in real-time, but when the range of discrete data is irregular, it has low sampling accuracy for discrete data. Based on the SDSLA algorithm, we propose a dynamic drilling sampling method SDDS, which takes well as the analysis unit, dynamically changes the size and position of the well, and accurately locates the position and range of discrete data. A new model SDVEM is further proposed for data valuation, which evaluates the sample set from discrete, centralized, and overall dimensions. Experiments show that compared with the SDSLA algorithm, the sample sampled by the SDDS algorithm has higher evaluation accuracy, and the probability distribution of the sample is closer to the original streaming data, with the AOCV indicator being nearly 10% higher. In addition, the SDDS algorithm can achieve over 90% accuracy, recall, and F1 score for training and testing neural networks with small sampling rates, all of which are higher than the SDSLA algorithm. In summary, the SDDS algorithm not only accurately evaluates the value characteristics of streaming data but also facilitates the training of neural network models, which has important research significance in big data estimation.","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135823300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Demystifying Practices, Challenges and Expected Features of Using GitHub Copilot 揭秘使用GitHub Copilot的实践、挑战和预期功能
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-17 DOI: 10.1142/s0218194023410048
Beiqi Zhang, Peng Liang, Xiyu Zhou, Aakash Ahmad, Muhammad Waseem
With the advances in machine learning, there is a growing interest in AI-enabled tools for autocompleting source code. GitHub Copilot, also referred to as the “AI Pair Programmer”, has been trained on billions of lines of open source GitHub code, and is one of such tools that has been increasingly used since its launch in June 2021. However, little effort has been devoted to understanding the practices, challenges, and expected features of using Copilot in programming for auto-completed source code from the point of view of practitioners. To this end, we conducted an empirical study by collecting and analyzing the data from Stack Overflow (SO) and GitHub Discussions. More specifically, we searched and manually collected 303 SO posts and 927 GitHub discussions related to the usage of Copilot. We identified the programming languages, Integrated Development Environments (IDEs), technologies used with Copilot, functions implemented, benefits, limitations, and challenges when using Copilot. The results show that when practitioners use Copilot: (1) The major programming languages used with Copilot are JavaScript and Python, (2) the main IDE used with Copilot is Visual Studio Code, (3) the most common used technology with Copilot is Node.js, (4) the leading function implemented by Copilot is data processing, (5) the main purpose of users using Copilot is to help generate code, (6) the significant benefit of using Copilot is useful code generation, (7) the main limitation encountered by practitioners when using Copilot is difficulty of integration, and (8) the most common expected feature is that Copilot can be integrated with more IDEs. Our results suggest that using Copilot is like a double-edged sword, which requires developers to carefully consider various aspects when deciding whether or not to use it. Our study provides empirically grounded foundations that could inform software developers and practitioners, as well as provide a basis for future investigations on the role of Copilot as an AI pair programmer in software development.
随着机器学习的进步,人们对自动完成源代码的人工智能工具越来越感兴趣。GitHub Copilot,也被称为“人工智能配对程序员”,已经接受了数十亿行GitHub开源代码的培训,自2021年6月推出以来,这类工具的使用越来越多。然而,从实践者的角度来看,很少有人致力于理解在自动完成源代码编程中使用Copilot的实践、挑战和预期特性。为此,我们通过收集和分析Stack Overflow (SO)和GitHub discussion的数据进行了实证研究。更具体地说,我们搜索并手动收集了303个SO帖子和927个与Copilot使用相关的GitHub讨论。我们确定了编程语言、集成开发环境(ide)、与Copilot一起使用的技术、实现的功能、使用Copilot时的好处、限制和挑战。结果表明,当从业者使用Copilot时:(1) Copilot使用的主要编程语言是JavaScript和Python, (2) Copilot使用的主要IDE是Visual Studio Code, (3) Copilot最常用的技术是Node.js, (4) Copilot实现的主要功能是数据处理,(5)用户使用Copilot的主要目的是帮助生成代码,(6)使用Copilot的显著好处是有用的代码生成。(7)从业者在使用Copilot时遇到的主要限制是集成困难;(8)最常见的期望是Copilot可以与更多的ide集成。我们的研究结果表明,使用Copilot就像一把双刃剑,需要开发人员在决定是否使用它时仔细考虑各个方面。我们的研究提供了经验基础,可以为软件开发人员和从业者提供信息,并为未来调查Copilot作为软件开发中AI结对程序员的作用提供基础。
{"title":"Demystifying Practices, Challenges and Expected Features of Using GitHub Copilot","authors":"Beiqi Zhang, Peng Liang, Xiyu Zhou, Aakash Ahmad, Muhammad Waseem","doi":"10.1142/s0218194023410048","DOIUrl":"https://doi.org/10.1142/s0218194023410048","url":null,"abstract":"With the advances in machine learning, there is a growing interest in AI-enabled tools for autocompleting source code. GitHub Copilot, also referred to as the “AI Pair Programmer”, has been trained on billions of lines of open source GitHub code, and is one of such tools that has been increasingly used since its launch in June 2021. However, little effort has been devoted to understanding the practices, challenges, and expected features of using Copilot in programming for auto-completed source code from the point of view of practitioners. To this end, we conducted an empirical study by collecting and analyzing the data from Stack Overflow (SO) and GitHub Discussions. More specifically, we searched and manually collected 303 SO posts and 927 GitHub discussions related to the usage of Copilot. We identified the programming languages, Integrated Development Environments (IDEs), technologies used with Copilot, functions implemented, benefits, limitations, and challenges when using Copilot. The results show that when practitioners use Copilot: (1) The major programming languages used with Copilot are JavaScript and Python, (2) the main IDE used with Copilot is Visual Studio Code, (3) the most common used technology with Copilot is Node.js, (4) the leading function implemented by Copilot is data processing, (5) the main purpose of users using Copilot is to help generate code, (6) the significant benefit of using Copilot is useful code generation, (7) the main limitation encountered by practitioners when using Copilot is difficulty of integration, and (8) the most common expected feature is that Copilot can be integrated with more IDEs. Our results suggest that using Copilot is like a double-edged sword, which requires developers to carefully consider various aspects when deciding whether or not to use it. Our study provides empirically grounded foundations that could inform software developers and practitioners, as well as provide a basis for future investigations on the role of Copilot as an AI pair programmer in software development.","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135944434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Dual Decision-making Continuous Reinforcement Learning Method Based on Sim2Real 基于Sim2Real的对偶决策连续强化学习方法
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-13 DOI: 10.1142/s0218194023500626
Wenwen Xiao, Xinzhi Wang, Xiangfeng Luo, Shaorong Xie
{"title":"A Dual Decision-making Continuous Reinforcement Learning Method Based on Sim2Real","authors":"Wenwen Xiao, Xinzhi Wang, Xiangfeng Luo, Shaorong Xie","doi":"10.1142/s0218194023500626","DOIUrl":"https://doi.org/10.1142/s0218194023500626","url":null,"abstract":"","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135918187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OdegVul: An Approach for Statement-Level Defect Prediction OdegVul:一种语句级缺陷预测方法
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-13 DOI: 10.1142/s0218194023500614
Guoqiang Yin, Wei Wang
{"title":"OdegVul: An Approach for Statement-Level Defect Prediction","authors":"Guoqiang Yin, Wei Wang","doi":"10.1142/s0218194023500614","DOIUrl":"https://doi.org/10.1142/s0218194023500614","url":null,"abstract":"","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135918937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OC-Detector: Detecting Smart Contract Vulnerabilities Based on Clustering Opcode Instructions OC-Detector:基于聚类操作码指令的智能合约漏洞检测
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-13 DOI: 10.1142/s0218194023410061
Xiguo Gu, Liwei Zheng, Huiwen Yang, Shifan Liu, Zhanqi Cui
{"title":"OC-Detector: Detecting Smart Contract Vulnerabilities Based on Clustering Opcode Instructions","authors":"Xiguo Gu, Liwei Zheng, Huiwen Yang, Shifan Liu, Zhanqi Cui","doi":"10.1142/s0218194023410061","DOIUrl":"https://doi.org/10.1142/s0218194023410061","url":null,"abstract":"","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135918635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Review and Application of Knowledge Graph in Crisis Management 知识图谱在危机管理中的回顾与应用
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-13 DOI: 10.1142/s0218194023300038
Xinzhi Wang, Mengyue Li, Weiwang Chen, Yige Yao, Zhennan Li, Yi Liu, Hui Zhang
{"title":"Review and Application of Knowledge Graph in Crisis Management","authors":"Xinzhi Wang, Mengyue Li, Weiwang Chen, Yige Yao, Zhennan Li, Yi Liu, Hui Zhang","doi":"10.1142/s0218194023300038","DOIUrl":"https://doi.org/10.1142/s0218194023300038","url":null,"abstract":"","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135918638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Code Summarization with Graph Embedding and Pre-trained Model 用图嵌入和预训练模型增强代码摘要
4区 计算机科学 Q3 Computer Science Pub Date : 2023-10-12 DOI: 10.1142/s0218194023410024
Lixuan Li, Jie Li, Yihui Xu, Hao Zhu, Xiaofang Zhang
Code summarization is a task that aims at automatically producing descriptions of source code. Recently many deep-learning-based approaches have been proposed to generate accurate code summaries, among which pre-trained models (PTMs) for programming languages have achieved promising results. It is well known that source code written in programming languages is highly structured and unambiguous. Though previous work pre-trained the model with well-design tasks to learn universal representation from a large scale of data, they have not considered structure information during the fine-tuning stage. To make full use of both the pre-trained programming language model and the structure information of source code, we utilize Flow-Augmented Abstract Syntax Tree (FA-AST) of source code for structure information and propose GraphPLBART — Graph-augmented Programming Language and Bi-directional Auto-Regressive Transformer, which can effectively introduce structure information to a well PTM through a cross attention layer. Compared with the best-performing baselines, GraphPLBART still improves by 3.2%, 7.1%, and 1.2% in terms of BLEU, METEOR, and ROUGE-L, respectively, on Java dataset, and also improves by 4.0%, 6.3%, and 2.1% on Python dataset. Further experiment shows that the structure information from FA-AST has significant benefits for the performance of GraphPLBART. In addition, our meticulous manual evaluation experiment further reinforces the superiority of our proposed approach. This demonstrates its remarkable abstract quality and solidifies its position as a promising solution in the field of code summarization.
代码摘要是一项旨在自动生成源代码描述的任务。近年来,人们提出了许多基于深度学习的方法来生成准确的代码摘要,其中针对编程语言的预训练模型(ptm)取得了可喜的成果。众所周知,用编程语言编写的源代码是高度结构化和明确的。虽然以前的工作用精心设计的任务对模型进行预训练,以从大规模数据中学习普遍表示,但他们在微调阶段没有考虑结构信息。为了充分利用预训练好的编程语言模型和源代码的结构信息,利用源代码的流增强抽象语法树(FA-AST)获取结构信息,提出GraphPLBART -图增强编程语言和双向自回归转换器,通过交叉注意层将结构信息有效地引入油井PTM。与性能最好的基线相比,GraphPLBART在Java数据集上对BLEU、METEOR和ROUGE-L分别提高了3.2%、7.1%和1.2%,在Python数据集上也分别提高了4.0%、6.3%和2.1%。进一步的实验表明,来自FA-AST的结构信息对GraphPLBART的性能有显著的好处。此外,我们细致的人工评估实验进一步强化了我们提出的方法的优越性。这证明了它卓越的抽象品质,并巩固了它作为代码摘要领域中有前途的解决方案的地位。
{"title":"Enhancing Code Summarization with Graph Embedding and Pre-trained Model","authors":"Lixuan Li, Jie Li, Yihui Xu, Hao Zhu, Xiaofang Zhang","doi":"10.1142/s0218194023410024","DOIUrl":"https://doi.org/10.1142/s0218194023410024","url":null,"abstract":"Code summarization is a task that aims at automatically producing descriptions of source code. Recently many deep-learning-based approaches have been proposed to generate accurate code summaries, among which pre-trained models (PTMs) for programming languages have achieved promising results. It is well known that source code written in programming languages is highly structured and unambiguous. Though previous work pre-trained the model with well-design tasks to learn universal representation from a large scale of data, they have not considered structure information during the fine-tuning stage. To make full use of both the pre-trained programming language model and the structure information of source code, we utilize Flow-Augmented Abstract Syntax Tree (FA-AST) of source code for structure information and propose GraphPLBART — Graph-augmented Programming Language and Bi-directional Auto-Regressive Transformer, which can effectively introduce structure information to a well PTM through a cross attention layer. Compared with the best-performing baselines, GraphPLBART still improves by 3.2%, 7.1%, and 1.2% in terms of BLEU, METEOR, and ROUGE-L, respectively, on Java dataset, and also improves by 4.0%, 6.3%, and 2.1% on Python dataset. Further experiment shows that the structure information from FA-AST has significant benefits for the performance of GraphPLBART. In addition, our meticulous manual evaluation experiment further reinforces the superiority of our proposed approach. This demonstrates its remarkable abstract quality and solidifies its position as a promising solution in the field of code summarization.","PeriodicalId":50288,"journal":{"name":"International Journal of Software Engineering and Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135922850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Software Engineering and Knowledge Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1