首页 > 最新文献

Int. J. Softw. Sci. Comput. Intell.最新文献

英文 中文
A Survey of Different Approaches for the Class Imbalance Problem in Software Defect Prediction 软件缺陷预测中类不平衡问题的几种方法综述
Pub Date : 2022-01-01 DOI: 10.4018/ijssci.301268
Abdullah Dar, Sheikh Umar Farooq
The imbalanced nature of the software datasets leads to the biased learning of prediction model toward the observations of the majority class (non-defective class). The prediction model can produce poor results for the minority class observations. Such misappropriations can prove costly especially in software development where minority class (defective) is the one that has the highest interest from the learning point of view. Various approaches have been used for dealing with class imbalance problem of software defect prediction but no one dominates and hence developing a generalized software defect prediction model for imbalanced datasets remains problematic. This paper surveys existing approaches for handling class imbalance problem of software defect datasets. In this survey, most relevant software defect prediction studies and identified the two main approaches that have been used for handling imbalance issue of software defect datasets. Furthermore, we also provide some comparison of findings in state-of-the-art literature and the guidelines for carrying out future research.
软件数据集的不平衡性导致预测模型对大多数类(非缺陷类)的观测结果有偏见学习。该预测模型对少数类观测结果的预测结果较差。这样的盗用可能被证明是代价高昂的,特别是在软件开发中,从学习的角度来看,少数派(有缺陷的)是最感兴趣的。针对软件缺陷预测中的类不平衡问题,人们已经采用了多种方法,但没有一种方法占主导地位,因此,针对不平衡数据集开发一种通用的软件缺陷预测模型仍然是一个问题。本文综述了现有的软件缺陷数据集类不平衡问题的处理方法。在这个调查中,大多数相关的软件缺陷预测研究并确定了用于处理软件缺陷数据集不平衡问题的两种主要方法。此外,我们还提供了一些最新文献的研究结果的比较和开展未来研究的指导方针。
{"title":"A Survey of Different Approaches for the Class Imbalance Problem in Software Defect Prediction","authors":"Abdullah Dar, Sheikh Umar Farooq","doi":"10.4018/ijssci.301268","DOIUrl":"https://doi.org/10.4018/ijssci.301268","url":null,"abstract":"The imbalanced nature of the software datasets leads to the biased learning of prediction model toward the observations of the majority class (non-defective class). The prediction model can produce poor results for the minority class observations. Such misappropriations can prove costly especially in software development where minority class (defective) is the one that has the highest interest from the learning point of view. Various approaches have been used for dealing with class imbalance problem of software defect prediction but no one dominates and hence developing a generalized software defect prediction model for imbalanced datasets remains problematic. This paper surveys existing approaches for handling class imbalance problem of software defect datasets. In this survey, most relevant software defect prediction studies and identified the two main approaches that have been used for handling imbalance issue of software defect datasets. Furthermore, we also provide some comparison of findings in state-of-the-art literature and the guidelines for carrying out future research.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116449957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Similarity Retrieval Based on Image Background Analysis 基于图像背景分析的相似度检索
Pub Date : 2022-01-01 DOI: 10.4018/ijssci.309426
Chang Zhu, Wenchao Jiang, Weilin Zhou, Hong Xiao
Aiming at the problem of traditional portrait background similarity retrieval methods being low accuracy and time-consuming, a similarity retrieval method based on image background analysis is presented. The proposed method uses a combination of portrait segmentation and retrieval models. Firstly, the portrait segmentation model is used to remove the portraits in the images to eliminate the interference of portraits on background features; secondly, the image retrieval model is used to retrieve images with similar background features; LSH is added to improve the retrieval efficiency; finally, the retrieval results are used to further determine whether the background is similar. The experiment is implemented based on real data from a company. The results showed that the average precision, average map, and recall of this method reached 85%, 90%, and 50%, respectively. The average accuracy and recall are 10% better than the overall image retrieval model.
针对传统人像背景相似度检索方法准确率低、耗时长的问题,提出了一种基于图像背景分析的人像背景相似度检索方法。该方法采用图像分割和图像检索相结合的方法。首先,利用人像分割模型去除图像中的人像,消除人像对背景特征的干扰;其次,利用图像检索模型对具有相似背景特征的图像进行检索;加入LSH,提高检索效率;最后,利用检索结果进一步判断背景是否相似。实验以某公司的实际数据为基础进行。结果表明,该方法的平均精密度达到85%,平均图谱达到90%,召回率达到50%。平均准确率和召回率比整体图像检索模型提高10%。
{"title":"Similarity Retrieval Based on Image Background Analysis","authors":"Chang Zhu, Wenchao Jiang, Weilin Zhou, Hong Xiao","doi":"10.4018/ijssci.309426","DOIUrl":"https://doi.org/10.4018/ijssci.309426","url":null,"abstract":"Aiming at the problem of traditional portrait background similarity retrieval methods being low accuracy and time-consuming, a similarity retrieval method based on image background analysis is presented. The proposed method uses a combination of portrait segmentation and retrieval models. Firstly, the portrait segmentation model is used to remove the portraits in the images to eliminate the interference of portraits on background features; secondly, the image retrieval model is used to retrieve images with similar background features; LSH is added to improve the retrieval efficiency; finally, the retrieval results are used to further determine whether the background is similar. The experiment is implemented based on real data from a company. The results showed that the average precision, average map, and recall of this method reached 85%, 90%, and 50%, respectively. The average accuracy and recall are 10% better than the overall image retrieval model.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130344808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Progressive Study and Investigation of Machine Learning Techniques to Enhance the Efficiency and Effectiveness of Industry 4.0 机器学习技术的渐进式研究和调查,以提高工业4.0的效率和有效性
Pub Date : 2022-01-01 DOI: 10.4018/ijssci.300365
Kaljot Sharma, D. Anand, K. Mishra, S. Harit
The goal of this article is to assess the most recent work on Industry 4.0 as well as the present state of science on Industry 4.0 through papers produced between January 2017 and March 2020.A systematic review process with a 5-step approach to article selection was employed, which included the following steps: 1) Selection of database 2) Research of keyword 3) Collection of articles 4) Inclusion/Exclusion criteria 5) Examining Selected Articles. It is noticed that much of the research is philosophical or case-based in character. The prospective study direction described in this paper may be useful to researchers interested in the field of industry 4.0 for research. The paper's future study directions must undoubtedly be beneficial to researchers.
本文的目标是通过2017年1月至2020年3月期间发表的论文,评估工业4.0的最新工作以及工业4.0的科学现状。采用系统评价流程,文章选择分为5步,包括以下步骤:1)数据库选择2)关键词研究3)文章收集4)纳入/排除标准5)选定文章检查。值得注意的是,许多研究都是哲学性的或基于案例的。本文所描述的前瞻性研究方向可能对对工业4.0领域感兴趣的研究人员有所帮助。本文未来的研究方向无疑对研究者有益。
{"title":"Progressive Study and Investigation of Machine Learning Techniques to Enhance the Efficiency and Effectiveness of Industry 4.0","authors":"Kaljot Sharma, D. Anand, K. Mishra, S. Harit","doi":"10.4018/ijssci.300365","DOIUrl":"https://doi.org/10.4018/ijssci.300365","url":null,"abstract":"The goal of this article is to assess the most recent work on Industry 4.0 as well as the present state of science on Industry 4.0 through papers produced between January 2017 and March 2020.A systematic review process with a 5-step approach to article selection was employed, which included the following steps: 1) Selection of database 2) Research of keyword 3) Collection of articles 4) Inclusion/Exclusion criteria 5) Examining Selected Articles. It is noticed that much of the research is philosophical or case-based in character. The prospective study direction described in this paper may be useful to researchers interested in the field of industry 4.0 for research. The paper's future study directions must undoubtedly be beneficial to researchers.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123354083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optimization of the Wake-Up Scheduling Using a Hybrid of Memetic and Tabu Search Algorithms for 3D-Wireless Sensor Networks 基于模因和禁忌混合搜索算法的3d无线传感器网络唤醒调度优化
Pub Date : 2022-01-01 DOI: 10.4018/ijssci.300359
V. Chawra, Govind P. Gupta
Computation of an optimal coverage and connectivity aware wake-up schedule of sensor nodes is a fundamental research issue in a 3D-Wireless Sensor Networks. Most of the existing metaheuristic-based wake-up scheduling schemes do not make sure optimal solution and occasionally smacked in local minima. This paper propose a hybrid metaheuristic-based wake-up scheduling scheme (Memtic-Tabu-based-WS) where best feature of memtic algorithm and Tabu Search algorithm is combined. The proposed scheme has considered four parameters such as energy consumption, coverage, connectivity, and optimal size of schedule list. Performance comparison of the proposed Memtic-Tabu-based-WS scheme is performed in different network scenario and compared with three well-known state-of-art schemes in terms of coverage ratio, active sensor nodes and fitness value. The result analysis validate the superiority of the proposed scheme over the existing schemes with better coverage ratio and derivation of the optimal wake-up schedule.
计算传感器节点的最优覆盖和连通性感知唤醒计划是三维无线传感器网络的一个基本研究问题。现有的基于元启发式的唤醒调度方案大多不能保证最优解,有时还会出现局部极小值。本文提出了一种基于元启发式的混合唤醒调度方案(memtic -Tabu-based ws),该方案结合了memtic算法和禁忌搜索算法的最佳特性。该方案考虑了能耗、覆盖范围、连通性和调度列表的最优大小等四个参数。对本文提出的基于memtic -禁忌的ws方案在不同网络场景下的性能进行了比较,并在覆盖率、主动传感器节点和适应度值等方面与目前三种知名方案进行了比较。结果分析验证了该方案优于现有方案,具有更好的覆盖率和最优唤醒调度的推导。
{"title":"Optimization of the Wake-Up Scheduling Using a Hybrid of Memetic and Tabu Search Algorithms for 3D-Wireless Sensor Networks","authors":"V. Chawra, Govind P. Gupta","doi":"10.4018/ijssci.300359","DOIUrl":"https://doi.org/10.4018/ijssci.300359","url":null,"abstract":"Computation of an optimal coverage and connectivity aware wake-up schedule of sensor nodes is a fundamental research issue in a 3D-Wireless Sensor Networks. Most of the existing metaheuristic-based wake-up scheduling schemes do not make sure optimal solution and occasionally smacked in local minima. This paper propose a hybrid metaheuristic-based wake-up scheduling scheme (Memtic-Tabu-based-WS) where best feature of memtic algorithm and Tabu Search algorithm is combined. The proposed scheme has considered four parameters such as energy consumption, coverage, connectivity, and optimal size of schedule list. Performance comparison of the proposed Memtic-Tabu-based-WS scheme is performed in different network scenario and compared with three well-known state-of-art schemes in terms of coverage ratio, active sensor nodes and fitness value. The result analysis validate the superiority of the proposed scheme over the existing schemes with better coverage ratio and derivation of the optimal wake-up schedule.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"47 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124974051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Fully Remote Software Development Due to COVID Factor: Results of Industry Research (2020) 基于COVID因素的全远程软件开发:行业研究结果(2020年)
Pub Date : 2021-07-01 DOI: 10.4018/IJSSCI.2021070105
D. Pashchenko
The internal transformation and using the fully remote software development under the influence of the pandemic has not only changed the industry, but heralded the construction of a new reality. This article presents the results of a study that covered the experience of transformation of 26 project teams from the world's leading IT corporations, software vendors, and high-tech companies with strong internal development practices: Alphabet, Amazon, BSC Group, Custis, Deutsche Bank, Evernote, Exness, Positive Technologies, PromSvyazBank, Sberbank, VTB, Yandex. Experts determined the results of rapid adaptation to changes, considered the medium-term impact of the pandemic factor on work processes, and made forecasts for 2021. The results of the study are accompanied by brief comments and recommendations of the author, the main idea of which is the need to quickly understand a new trend in software development, hiring specialists, and organizing teams associated with the refusal of high-tech IT companies to return to teamwork in shared offices.
疫情影响下的内部转型和使用全远程软件开发,不仅改变了行业,也预示着新的现实建设。本文介绍了一项研究的结果,该研究涵盖了来自世界领先的IT公司、软件供应商和具有强大内部开发实践的高科技公司的26个项目团队的转型经验:Alphabet、亚马逊、BSC集团、Custis、德意志银行、Evernote、Exness、Positive Technologies、PromSvyazBank、Sberbank、VTB、Yandex。专家们确定了快速适应变化的结果,审议了大流行因素对工作进程的中期影响,并对2021年作出了预测。研究结果伴随着作者的简短评论和建议,其主要思想是需要快速理解软件开发的新趋势,雇用专家,并组织与高科技IT公司拒绝在共享办公室中回归团队合作相关的团队。
{"title":"Fully Remote Software Development Due to COVID Factor: Results of Industry Research (2020)","authors":"D. Pashchenko","doi":"10.4018/IJSSCI.2021070105","DOIUrl":"https://doi.org/10.4018/IJSSCI.2021070105","url":null,"abstract":"The internal transformation and using the fully remote software development under the influence of the pandemic has not only changed the industry, but heralded the construction of a new reality. This article presents the results of a study that covered the experience of transformation of 26 project teams from the world's leading IT corporations, software vendors, and high-tech companies with strong internal development practices: Alphabet, Amazon, BSC Group, Custis, Deutsche Bank, Evernote, Exness, Positive Technologies, PromSvyazBank, Sberbank, VTB, Yandex. Experts determined the results of rapid adaptation to changes, considered the medium-term impact of the pandemic factor on work processes, and made forecasts for 2021. The results of the study are accompanied by brief comments and recommendations of the author, the main idea of which is the need to quickly understand a new trend in software development, hiring specialists, and organizing teams associated with the refusal of high-tech IT companies to return to teamwork in shared offices.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130003618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Electricity Consumption Data Analysis Using Various Outlier Detection Methods 利用各种离群值检测方法分析用电量数据
Pub Date : 2021-07-01 DOI: 10.4018/IJSSCI.2021070102
Sidi Mohammed Kaddour, M. Lehsaini
Nowadays, detecting abnormal power consumption behavior of householders has become a big concern in the smart energy field; overcoming this limitation will help in identifying efficient solutions to reduce power consumption. This paper proposes a new methodology for detecting abnormal energy consumption in residential buildings based on hourly readings of energy consumption and peak energy consumption. The proposition is implemented using three unsupervised outlier detection methods (isolation forest, one-class SVM, and k-means). The authors propose this solution to help residents in reducing operating costs by detecting consumption failures that cannot be detected easily. On the other hand, energy providers will have the access to detailed data about anomalies, faulty appliances, and houses with poor power control strategy in general, which will help in pinpointing overconsumption problems, thus enhancing human awareness and reducing energy consumption.
当前,用户异常用电行为的检测已成为智能能源领域关注的热点;克服这一限制将有助于找到有效的解决方案来降低功耗。本文提出了一种基于每小时能耗和峰值能耗读数的住宅建筑能耗异常检测方法。该命题使用三种无监督离群值检测方法(隔离森林,一类支持向量机和k-means)实现。作者提出了这个解决方案,通过检测难以检测的消费故障,帮助居民降低运营成本。另一方面,能源供应商将获得有关异常、故障电器和一般电力控制策略较差的房屋的详细数据,这将有助于查明过度消费问题,从而提高人们的意识并减少能源消耗。
{"title":"Electricity Consumption Data Analysis Using Various Outlier Detection Methods","authors":"Sidi Mohammed Kaddour, M. Lehsaini","doi":"10.4018/IJSSCI.2021070102","DOIUrl":"https://doi.org/10.4018/IJSSCI.2021070102","url":null,"abstract":"Nowadays, detecting abnormal power consumption behavior of householders has become a big concern in the smart energy field; overcoming this limitation will help in identifying efficient solutions to reduce power consumption. This paper proposes a new methodology for detecting abnormal energy consumption in residential buildings based on hourly readings of energy consumption and peak energy consumption. The proposition is implemented using three unsupervised outlier detection methods (isolation forest, one-class SVM, and k-means). The authors propose this solution to help residents in reducing operating costs by detecting consumption failures that cannot be detected easily. On the other hand, energy providers will have the access to detailed data about anomalies, faulty appliances, and houses with poor power control strategy in general, which will help in pinpointing overconsumption problems, thus enhancing human awareness and reducing energy consumption.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130924503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Recurrent Neural Network (RNN) to Analyse Mental Behaviour in Social Media 递归神经网络(RNN)分析社交媒体中的心理行为
Pub Date : 2021-07-01 DOI: 10.4018/IJSSCI.2021070101
Hadj Ahmed Bouarara
A recent British study of people between the ages of 14 and 35 has shown that social media has a negative impact on mental health. The purpose of the paper is to detect people with mental disorders' behaviour in social media in order to help Twitter users in overcoming their mental health problems such as anxiety, phobia, depression, paranoia. The authors have adapted the recurrent neural network (RNN) in order to prevent the situations of threats, suicide, loneliness, or any other form of psychological problem through the analysis of tweets. The obtained results were validated by different experimental measures such as f-measure, recall, precision, entropy, accuracy. The RNN gives best results with 85% of accuracy compared to other techniques in literature such as social cockroaches, decision tree, and naïve Bayes.
英国最近一项针对14至35岁人群的研究表明,社交媒体对心理健康有负面影响。这篇论文的目的是检测有精神障碍的人在社交媒体上的行为,以帮助Twitter用户克服他们的心理健康问题,如焦虑、恐惧症、抑郁、偏执。为了防止威胁、自杀、孤独或任何其他形式的心理问题,作者对循环神经网络(RNN)进行了改造。通过f-测度、召回率、精密度、熵、准确度等实验指标对所得结果进行了验证。与文献中的其他技术(如社会蟑螂、决策树和naïve贝叶斯)相比,RNN给出了最好的结果,准确率为85%。
{"title":"Recurrent Neural Network (RNN) to Analyse Mental Behaviour in Social Media","authors":"Hadj Ahmed Bouarara","doi":"10.4018/IJSSCI.2021070101","DOIUrl":"https://doi.org/10.4018/IJSSCI.2021070101","url":null,"abstract":"A recent British study of people between the ages of 14 and 35 has shown that social media has a negative impact on mental health. The purpose of the paper is to detect people with mental disorders' behaviour in social media in order to help Twitter users in overcoming their mental health problems such as anxiety, phobia, depression, paranoia. The authors have adapted the recurrent neural network (RNN) in order to prevent the situations of threats, suicide, loneliness, or any other form of psychological problem through the analysis of tweets. The obtained results were validated by different experimental measures such as f-measure, recall, precision, entropy, accuracy. The RNN gives best results with 85% of accuracy compared to other techniques in literature such as social cockroaches, decision tree, and naïve Bayes.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124904818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Comparative Study Between the MySQL Relational Database and the MongoDB NoSQL Database MySQL关系数据库与MongoDB NoSQL数据库的比较研究
Pub Date : 2021-07-01 DOI: 10.4018/IJSSCI.2021070104
Houcine Matallah, Ghalem Belalem, K. Bouamrane
NoSQL databases are new architectures developed to remedy the various weaknesses that have affected relational databases in highly distributed systems such as cloud computing, social networks, electronic commerce. Several companies loyal to traditional relational SQL databases for several decades seek to switch to the new “NoSQL” databases to meet the new requirements related to the change of scale in data volumetry, the load increases, the diversity of types of data handled, and geographic distribution. This paper develops a comparative study in which the authors will evaluate the performance of two databases very widespread in the field: MySQL as a relational database and MongoDB as a NoSQL database. To accomplish this confrontation, this research uses the Yahoo! Cloud Serving Benchmark (YCSB). This contribution is to provide some answers to choose the appropriate database management system for the type of data used and the type of processing performed on that data.
NoSQL数据库是一种新的体系结构,用于弥补影响高度分布式系统(如云计算、社交网络、电子商务)中的关系数据库的各种弱点。几十年来,一些忠于传统关系SQL数据库的公司寻求转向新的“NoSQL”数据库,以满足与数据容量规模变化、负载增加、处理的数据类型的多样性和地理分布相关的新需求。本文开展了一项比较研究,作者将评估在该领域非常广泛的两个数据库的性能:MySQL作为关系数据库和MongoDB作为NoSQL数据库。为了实现这种对抗,本研究使用Yahoo!云服务基准(YCSB)。本文的贡献是提供一些答案,以便为所使用的数据类型和对该数据执行的处理类型选择适当的数据库管理系统。
{"title":"Comparative Study Between the MySQL Relational Database and the MongoDB NoSQL Database","authors":"Houcine Matallah, Ghalem Belalem, K. Bouamrane","doi":"10.4018/IJSSCI.2021070104","DOIUrl":"https://doi.org/10.4018/IJSSCI.2021070104","url":null,"abstract":"NoSQL databases are new architectures developed to remedy the various weaknesses that have affected relational databases in highly distributed systems such as cloud computing, social networks, electronic commerce. Several companies loyal to traditional relational SQL databases for several decades seek to switch to the new “NoSQL” databases to meet the new requirements related to the change of scale in data volumetry, the load increases, the diversity of types of data handled, and geographic distribution. This paper develops a comparative study in which the authors will evaluate the performance of two databases very widespread in the field: MySQL as a relational database and MongoDB as a NoSQL database. To accomplish this confrontation, this research uses the Yahoo! Cloud Serving Benchmark (YCSB). This contribution is to provide some answers to choose the appropriate database management system for the type of data used and the type of processing performed on that data.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133453277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
An Ensemble of Hybrid Search-Based Algorithms for Software Effort Prediction 基于混合搜索的软件工作量预测算法集成
Pub Date : 2021-07-01 DOI: 10.4018/IJSSCI.2021070103
Wasiur Rhmann
Software organizations rely on the estimation of efforts required for the development of software to negotiate customers and plan the schedule of the project. Proper estimation of efforts reduces the chances of project failures. Historical data of projects have been used to predict the effort required for software development. In recent years, various ensemble of machine learning techniques have been used to predict software effort. In the present work, a novel ensemble technique of hybrid search-based algorithms (EHSBA) is used for software effort estimation. Four HSBAs—fuzzy and random sets-based modeling (FRSBM-R), symbolic fuzzy learning based on genetic programming (GFS-GP-R), symbolic fuzzy learning based on genetic programming grammar operators and simulated annealing (GFS_GSP_R), and least mean squares linear regression (LinearLMS_R)—are used to create an ensemble (EHSBA). The EHSBA is compared with machine learning-based ensemble bagging, vote, and stacking on datasets obtained from PROMISE repository. Obtained results reported that EHSBA outperformed all other techniques.
软件组织依赖于对软件开发所需工作量的估计来与客户协商并计划项目的进度。适当的工作量评估可以减少项目失败的机会。项目的历史数据已经被用来预测软件开发所需的工作量。近年来,各种机器学习技术的集成已被用于预测软件的工作量。本文提出了一种基于混合搜索算法(EHSBA)的集成技术,用于软件工作量估算。四种hsbas -基于模糊和随机集的建模(FRSBM-R),基于遗传规划的符号模糊学习(GFS-GP-R),基于遗传规划语法算子和模拟退火的符号模糊学习(GFS_GSP_R),以及最小均方线性回归(LinearLMS_R) -用于创建集成(EHSBA)。将EHSBA与基于机器学习的集成装袋、投票和堆叠在PROMISE存储库中获得的数据集上进行了比较。获得的结果报告EHSBA优于所有其他技术。
{"title":"An Ensemble of Hybrid Search-Based Algorithms for Software Effort Prediction","authors":"Wasiur Rhmann","doi":"10.4018/IJSSCI.2021070103","DOIUrl":"https://doi.org/10.4018/IJSSCI.2021070103","url":null,"abstract":"Software organizations rely on the estimation of efforts required for the development of software to negotiate customers and plan the schedule of the project. Proper estimation of efforts reduces the chances of project failures. Historical data of projects have been used to predict the effort required for software development. In recent years, various ensemble of machine learning techniques have been used to predict software effort. In the present work, a novel ensemble technique of hybrid search-based algorithms (EHSBA) is used for software effort estimation. Four HSBAs—fuzzy and random sets-based modeling (FRSBM-R), symbolic fuzzy learning based on genetic programming (GFS-GP-R), symbolic fuzzy learning based on genetic programming grammar operators and simulated annealing (GFS_GSP_R), and least mean squares linear regression (LinearLMS_R)—are used to create an ensemble (EHSBA). The EHSBA is compared with machine learning-based ensemble bagging, vote, and stacking on datasets obtained from PROMISE repository. Obtained results reported that EHSBA outperformed all other techniques.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114389644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Two-Phase Load Balancing Algorithm for Cloud Environment 云环境下的两阶段负载均衡算法
Pub Date : 2021-01-01 DOI: 10.4018/ijssci.2021010103
Archana Singh, R. Kumar
Load balancing is the phenomenon of distributing workload over various computing resources efficiently. It offers enterprises to efficiently manage different application or workload demands by allocating available resources among different servers, computers, and networks. These services can be accessed and utilized either for home use or for business purposes. Due to the excessive load on the cloud, sometimes it is not feasible to offer all these services to different users efficiently. To solve this excessive load issue, an efficient load balancing technique is used to offer satisfactory services to users as per their expectations also leading to efficient utilization of resources and applications on the cloud platform. This paper presents an enhanced load balancing algorithm named as a two-phase load balancing algorithm. It uses a two-phase checking load balancing approach where the first phase is to divide all virtual machines into two different tables based on their state, that is, available or busy while in the second phase, it equally distributes the loads. The various parameters used to measure the performance of the proposed algorithm are cost, data center processing time, and response time. Cloud analyst simulation tool is used to simulate the algorithm. Simulation results demonstrate superiority of the algorithm with existing ones.
负载平衡是指在各种计算资源上有效分配工作负载的现象。它通过在不同的服务器、计算机和网络之间分配可用资源,使企业能够有效地管理不同的应用程序或工作负载需求。这些服务既可以用于家庭使用,也可以用于商业目的。由于云上的负载过大,有时无法有效地为不同的用户提供所有这些服务。为了解决这一负载过大的问题,采用高效的负载均衡技术,根据用户的期望为用户提供满意的服务,从而有效地利用云平台上的资源和应用程序。本文提出了一种改进的负载均衡算法,即两阶段负载均衡算法。它使用两阶段检查负载平衡方法,其中第一阶段是根据虚拟机的状态(即可用或繁忙)将所有虚拟机划分到两个不同的表中,而在第二阶段,它平均分配负载。用于衡量所提出算法性能的各种参数是成本、数据中心处理时间和响应时间。采用云分析仿真工具对算法进行仿真。仿真结果表明了该算法与现有算法相比的优越性。
{"title":"A Two-Phase Load Balancing Algorithm for Cloud Environment","authors":"Archana Singh, R. Kumar","doi":"10.4018/ijssci.2021010103","DOIUrl":"https://doi.org/10.4018/ijssci.2021010103","url":null,"abstract":"Load balancing is the phenomenon of distributing workload over various computing resources efficiently. It offers enterprises to efficiently manage different application or workload demands by allocating available resources among different servers, computers, and networks. These services can be accessed and utilized either for home use or for business purposes. Due to the excessive load on the cloud, sometimes it is not feasible to offer all these services to different users efficiently. To solve this excessive load issue, an efficient load balancing technique is used to offer satisfactory services to users as per their expectations also leading to efficient utilization of resources and applications on the cloud platform. This paper presents an enhanced load balancing algorithm named as a two-phase load balancing algorithm. It uses a two-phase checking load balancing approach where the first phase is to divide all virtual machines into two different tables based on their state, that is, available or busy while in the second phase, it equally distributes the loads. The various parameters used to measure the performance of the proposed algorithm are cost, data center processing time, and response time. Cloud analyst simulation tool is used to simulate the algorithm. Simulation results demonstrate superiority of the algorithm with existing ones.","PeriodicalId":432255,"journal":{"name":"Int. J. Softw. Sci. Comput. Intell.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129231558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Int. J. Softw. Sci. Comput. Intell.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1