14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings: latest papers

Detecting similarities and differences in images using the PFF and LGG approaches
N. Bourbakis
This paper presents two methods for comparing images and evaluating the visibility of artifacts due to hidden information, changes or noise. The first method is based on pixel flow functions (PFF), which detect changes in images by projecting the pixel values vertically, horizontally and diagonally. These projections create "functions" related to the average values of pixels summed horizontally, vertically and diagonally. These functions represent image signatures, and comparing the signatures of two images reveals the differences between them. The second method is based on a heuristic graph model, known as the local-global graph (LGG), for evaluating the visibility of modifications in digital images. The LGG is based on segmenting the images and comparing the segments while thresholding the differences in their attributes. The methods have been implemented in C++ and their performance is presented.
DOI: https://doi.org/10.1109/TAI.2002.1180825
Published: 2002-11-04
Citations: 3
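The projection idea in the PFF abstract above can be illustrated with a minimal sketch. It is not the author's C++ implementation: the function names `signature` and `signature_diff`, the list-of-rows image representation, and the use of plain absolute differences to compare signatures are all assumptions made for illustration.

```python
from typing import Dict, List

Image = List[List[float]]  # grayscale image as rows of pixel values


def mean(values: List[float]) -> float:
    return sum(values) / len(values)


def signature(img: Image) -> Dict[str, List[float]]:
    """Average pixel values along rows, columns and both diagonal directions."""
    h, w = len(img), len(img[0])
    rows = [mean(r) for r in img]
    cols = [mean([img[y][x] for y in range(h)]) for x in range(w)]
    # Diagonals are indexed by x - y (main direction) and x + y (anti direction).
    diag: Dict[int, List[float]] = {}
    anti: Dict[int, List[float]] = {}
    for y in range(h):
        for x in range(w):
            diag.setdefault(x - y, []).append(img[y][x])
            anti.setdefault(x + y, []).append(img[y][x])
    return {
        "rows": rows,
        "cols": cols,
        "diag": [mean(diag[k]) for k in sorted(diag)],
        "anti": [mean(anti[k]) for k in sorted(anti)],
    }


def signature_diff(a: Image, b: Image) -> Dict[str, List[float]]:
    """Element-wise absolute difference between two image signatures."""
    sa, sb = signature(a), signature(b)
    return {k: [abs(p - q) for p, q in zip(sa[k], sb[k])] for k in sa}


original = [[10, 10, 10], [10, 10, 10], [10, 10, 10]]
modified = [[10, 10, 10], [10, 19, 10], [10, 10, 10]]  # centre pixel changed
diff = signature_diff(original, modified)
```

In this toy case the changed centre pixel shows up as a non-zero entry in the middle of each of the four projections, which is how intersecting projections can localise a change.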
Function approximation using robust wavelet neural networks
Sheng-Tun Li, Shu-Ching Chen
Wavelet neural networks (WNN) have recently attracted great interest because of their advantages over radial basis function networks (RBFN): they are universal approximators, yet achieve faster convergence and are capable of dealing with the so-called "curse of dimensionality". In addition, WNN are generalized RBFN. However, the generalization performance of WNN trained by the least-squares approach deteriorates when outliers are present. In this paper, we propose a robust wavelet neural network based on the theory of robust regression for dealing with outliers in the framework of function approximation. By adaptively adjusting the number of training data involved during training, the efficiency loss in the presence of Gaussian noise is accommodated. Simulation results are presented to validate the generalization ability and efficiency of the proposed network.
DOI: https://doi.org/10.1109/TAI.2002.1180842
Published: 2002-11-04
Citations: 46
Local search algorithm to improve the local search
M. Tounsi, P. David
In this paper, we present a new cooperative framework based on successively using two local search algorithms to solve constraint satisfaction and optimization problems. Our technique is based on integrating local search algorithms as a mechanism to diversify the search, instead of using built-in diversification mechanisms. Thus we avoid tuning multiple parameters to escape from a local optimum. This technique improves on the existing methods: it is generic, especially when the given problem can be expressed as a constraint satisfaction problem. We present the way the local search algorithm can be used to diversify the search in order to solve real examination timetabling problems. We describe how the local search algorithm can be used to assist any other specific local search algorithm to escape from local optimality.
DOI: https://doi.org/10.1109/TAI.2002.1180836
Published: 2002-11-04
Citations: 2
DSatz: a directional SAT solver for planning
M. Iwen, A. Mali
SAT-based planners have been characterized as disjunctive planners that maintain a compact representation of the search space of action sequences. Several ideas from refinement planners (conjunctive planners) have been used to improve the performance of SAT-based planners or to get a better understanding of planning as SAT. One important lesson from refinement planning is that backward search, being goal directed, can be more efficient than forward search. Another lesson is that bidirectional search is generally not efficient, because the forward and backward searches can miss each other. Though the effect of the direction of plan refinement (forward, backward, bidirectional, etc.) on the efficiency of plan synthesis has been deeply investigated in refinement planning, the effect of directional solving of SAT encodings has not been investigated in depth. We solved several propositional encodings of benchmark planning problems with a modified form (DSatz) of the systematic SAT solver Satz. DSatz offers 21 options for solving a SAT encoding of a planning problem, where the options are about assigning truth values to action and/or fluent variables in the forward direction, the backward direction, or both, in an intermittent or non-intermittent style. Our investigation shows that backward search on plan encodings (assigning values to fluent variables first, starting with the goal) is very inferior. We also show that bidirectional solving options and forward solving options turn out to be far more efficient than other solving options. Our empirical results show that the efficient systematic solver Satz, which exploits variable dependencies, can be significantly enhanced with the use of our variable ordering heuristics, which are also computationally very cheap to apply. Our main results are that directionality does matter in solving SAT encodings of planning problems and that certain directional solving options are superior to others.
DOI: https://doi.org/10.1109/TAI.2002.1180805
Published: 2002-11-04
Citations: 2
A neural-network approach to modeling and analysis
Chen-Yuan Chen, Cheng-Wu Chen, W. Chiang, Jing-Dong Hwang
A backpropagation network can always be used in modeling. This study is concerned with the stability problem of a neural network (NN) system which consists of a few subsystems represented by NN models. In this paper, the dynamics of each NN model are converted into a linear inclusion representation. Subsequently, based on these representations, stability conditions in terms of Lyapunov's direct method are derived to guarantee the asymptotic stability of NN systems.
DOI: https://doi.org/10.1109/TAI.2002.1180843
Published: 2002-11-04
Citations: 0
Automated diagnosis of non-native English speaker's natural language
Richard Fox, Mari Bowden
Typical grammar checking software uses some form of natural language parsing to determine whether errors exist in the text. If a sentence is found ungrammatical, the grammar checker usually seeks a single grammatical error as an explanation. For non-native speakers of English, it is possible that a given sentence contains multiple errors, and grammar checkers may not adequately explain these mistakes. This paper presents GRADES, a diagnostic program that detects and explains grammatical mistakes made by non-native English speakers. GRADES performs its diagnostic task, not through parsing, but through the application of classification and pattern matching rules. This makes the diagnostic process more efficient than other grammar checkers. GRADES is envisioned as a tool to help non-native English speakers learn to correct their English mistakes, but is also a demonstration that grammar checking need not rely on parsing techniques.
DOI: https://doi.org/10.1109/TAI.2002.1180818
Published: 2002-11-04
Citations: 7
Actions with duration and constraints: the ramification problem in temporal databases
N. Papadakis, D. Plexousakis
The ramification problem is a hard and ever-present problem in systems exhibiting dynamic behavior. The area of temporal databases in particular still lacks satisfactory solutions to the ramification problem. In this paper we address the ramification problem based on causal relationships that take time into account. We study the problem for both instantaneous actions and actions with duration. The proposed solution advances previous work by considering actions with effects occurring in any of the possible future situations resulting from an action's execution.
DOI: https://doi.org/10.1109/TAI.2002.1180791
Published: 2002-11-04
Citations: 28
Mining association rules in text databases using multipass with inverted hashing and pruning
John D. Holt, S. M. Chung
In this paper, we propose a new algorithm named multipass with inverted hashing and pruning (MIHP) for mining association rules between words in text databases. The characteristics of text databases are quite different from those of retail transaction databases, and existing mining algorithms cannot handle text databases efficiently because of the large number of itemsets (i.e., words) that need to be counted. Two well-known mining algorithms, the Apriori algorithm and the direct hashing and pruning (DHP) algorithm, are evaluated in the context of mining text databases, and are compared with the proposed MIHP algorithm. It has been shown that the MIHP algorithm performs better for large text databases.
DOI: https://doi.org/10.1109/TAI.2002.1180787
Published: 2002-11-04
Citations: 10
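For orientation, the Apriori baseline named in the abstract above can be sketched in a few lines. This is a textbook level-wise Apriori over word sets, not the MIHP algorithm the paper proposes; the function name `apriori`, the toy documents, and the absolute `min_support` threshold are assumptions made for illustration.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Set


def apriori(docs: List[Set[str]], min_support: int) -> Dict[FrozenSet[str], int]:
    """Return every word set that occurs in at least min_support documents."""
    frequent: Dict[FrozenSet[str], int] = {}
    # Pass 1: candidate 1-itemsets are all distinct words.
    candidates = {frozenset([w]) for d in docs for w in d}
    k = 1
    while candidates:
        # Count the documents that contain each candidate itemset.
        counts = {c: sum(1 for d in docs if c <= d) for c in candidates}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


docs = [
    {"neural", "network", "wavelet"},
    {"neural", "network", "stability"},
    {"decision", "tree", "network"},
    {"neural", "wavelet"},
]
result = apriori(docs, min_support=2)
```

Every pass rescans all documents and counts every surviving candidate, which hints at why the abstract describes the number of word itemsets as the bottleneck for text databases.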
Software quality classification modeling using the SPRINT decision tree algorithm
T. Khoshgoftaar, Naeem Seliya
Predicting the quality of system modules prior to software testing and operations can benefit the software development team. Such a timely reliability estimation can be used to direct cost-effective quality improvement efforts to the high-risk modules. Tree-based software quality classification models based on software metrics are used to predict whether a software module is fault-prone or not fault-prone. They are white box quality estimation models with good accuracy, and are simple and easy to interpret. This paper presents an in-depth study of calibrating classification trees for software quality estimation using the SPRINT decision tree algorithm. Many classification algorithms have memory limitations, including the requirement that data sets be memory resident. SPRINT removes all of these limitations and provides a fast and scalable analysis. It is an extension of a commonly used decision tree algorithm, CART, and provides a unique tree-pruning technique based on the minimum description length (MDL) principle. Combining the MDL pruning technique and the modified classification algorithm, SPRINT yields classification trees with useful prediction accuracy. The case study used comprises software metrics and fault data collected over four releases from a very large telecommunications system. It is observed that classification trees built by SPRINT are more balanced and demonstrate better stability in comparison to those built by CART.
DOI: https://doi.org/10.1109/TAI.2002.1180826
Published: 2002-11-04
Citations: 84
Protein secondary structure prediction with Bayesian learning method
Peng Wang, Du Zhang
This paper describes a Bayesian learning based approach to protein secondary structure prediction. Four secondary structure types are considered: α-helix, β-strand, β-turn and coil. A six-letter exchange group is utilized to represent a protein sequence. Training cases are expressed as sequence quaternions. A tool called Predictor, which implements the proposed approach, is developed in Java. To evaluate the tool, we select from the Protein Data Bank, based on the principle of one protein per family according to the structure families of SCOP, six hundred and twenty-three known proteins without pairwise sequence homology. Several training/test data splits have been tried. The results show that our proposed approach can produce prediction accuracy comparable to that of the traditional prediction methods. Predictor has user-friendly and easy-to-use GUIs, and is of practical value to molecular biology researchers.
DOI: https://doi.org/10.1109/TAI.2002.1180812
Published: 2002-11-04
Citations: 4