首页 > 最新文献

CTIT technical reports series最新文献

英文 中文
D-optimal plans for variable selection in data bases 数据库中变量选择的最优方案
Pub Date : 2009-08-05 DOI: 10.17877/DE290R-8705
J. Schiffner, C. Weihs
This paper is based on an article of Pumplun et al. (2005a) that investigates the use of Design of Experiments in data bases in order to select variables that are relevant for classification in situations where a sufficient number of measurements of the explanatory variables is available, but measuring the class label is hard, e. g. expensive or time-consuming. Pumplun et al. searched for D-optimal designs in existing data sets by means of a genetic algorithm and assessed variable importance based on the found plans. If the design matrix is standardized these D-optimal plans are almost orthogonal and the explanatory variables are nearly uncorrelated. Thus Pumplun et al. expected that their importance for discrimination can be judged independently of each other. In a simulation study Pumplun et al. applied this approach in combination with five classification methods to eight data sets and the obtained error rates were compared with those resulting from variable selection on the basis of the complete data sets. Based on the D-optimal plans in some cases considerably lower error rates were achieved. Although Pumplun et al. (2005a) obtained some promising results, it was not clear for different reasons if D-optimality actually is beneficial for variable selection. For example, D-efficiency and orthogonality of the resulting plans were not investigated and a comparison with variable selection based on random samples of observations of the same size as the D-optimal plans was missing. In this paper we extend the simulation study of Pumplun et al. (2005a) in order to verify their results and as basis for further research in this field. Moreover, in Pumplun et al. D-optimal plans are only used for data preprocessing, that is variable selection. The classification models are estimated on the whole data set in order to assess the effects of D-optimality on variable selection separately. Since the number of measurements of the class label in fact is limited one would normally employ the same observations that were used for variable selection for learning, too. For this reason in our simulation study the appropriateness of D-optimal plans for training classification methods is additionally investigated. It turned out that in general in terms of the error rate there is no difference between variable selection on the basis of D-optimal plans and variable selection on random samples. However, for training of linear classification methods D-optimal plans seem to be beneficial.
本文基于Pumplun et al.(2005)的一篇文章,该文章调查了在数据库中使用实验设计,以便在解释变量的测量数量足够的情况下选择与分类相关的变量,但测量类别标签很难,例如昂贵或耗时。Pumplun等人通过遗传算法在现有数据集中搜索d -最优设计,并根据找到的方案评估变量重要性。如果设计矩阵是标准化的,这些d -最优方案几乎是正交的,解释变量几乎不相关。因此,Pumplun等人期望可以独立地判断它们对歧视的重要性。在模拟研究中,Pumplun等人将该方法与五种分类方法结合应用于八个数据集,并将得到的错误率与基于完整数据集的变量选择结果进行了比较。基于d -最优计划,在某些情况下可以实现相当低的错误率。尽管Pumplun et al. (2005a)获得了一些有希望的结果,但由于不同的原因,d -最优性是否真的有利于变量选择尚不清楚。例如,没有研究结果方案的d效率和正交性,也没有对基于与d最优方案相同大小的随机观察样本的变量选择进行比较。本文对Pumplun et al. (2005a)的仿真研究进行了扩展,以验证其结果,为该领域的进一步研究奠定基础。此外,在Pumplun等。d -最优方案仅用于数据预处理,即变量选择。为了单独评估d -最优性对变量选择的影响,在整个数据集上对分类模型进行了估计。由于类标签的测量次数实际上是有限的,因此通常也会采用用于学习变量选择的相同观察结果。由于这个原因,在我们的模拟研究中,d -最优计划对训练分类方法的适用性进行了额外的研究。结果表明,总体而言,基于d -最优方案的变量选择与基于随机样本的变量选择在错误率上没有差别。然而,对于线性分类方法的训练,d -最优计划似乎是有益的。
{"title":"D-optimal plans for variable selection in data bases","authors":"J. Schiffner, C. Weihs","doi":"10.17877/DE290R-8705","DOIUrl":"https://doi.org/10.17877/DE290R-8705","url":null,"abstract":"This paper is based on an article of Pumplun et al. (2005a) that investigates the use of Design of Experiments in data bases in order to select variables that are relevant for classification in situations where a sufficient number of measurements of the explanatory variables is available, but measuring the class label is hard, e. g. expensive or time-consuming. Pumplun et al. searched for D-optimal designs in existing data sets by means of a genetic algorithm and assessed variable importance based on the found plans. If the design matrix is standardized these D-optimal plans are almost orthogonal and the explanatory variables are nearly uncorrelated. Thus Pumplun et al. expected that their importance for discrimination can be judged independently of each other. In a simulation study Pumplun et al. applied this approach in combination with five classification methods to eight data sets and the obtained error rates were compared with those resulting from variable selection on the basis of the complete data sets. Based on the D-optimal plans in some cases considerably lower error rates were achieved. Although Pumplun et al. (2005a) obtained some promising results, it was not clear for different reasons if D-optimality actually is beneficial for variable selection. For example, D-efficiency and orthogonality of the resulting plans were not investigated and a comparison with variable selection based on random samples of observations of the same size as the D-optimal plans was missing. In this paper we extend the simulation study of Pumplun et al. (2005a) in order to verify their results and as basis for further research in this field. Moreover, in Pumplun et al. D-optimal plans are only used for data preprocessing, that is variable selection. The classification models are estimated on the whole data set in order to assess the effects of D-optimality on variable selection separately. Since the number of measurements of the class label in fact is limited one would normally employ the same observations that were used for variable selection for learning, too. For this reason in our simulation study the appropriateness of D-optimal plans for training classification methods is additionally investigated. It turned out that in general in terms of the error rate there is no difference between variable selection on the basis of D-optimal plans and variable selection on random samples. However, for training of linear classification methods D-optimal plans seem to be beneficial.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"64 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74715638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Optimal designs for an interference model 干涉模型的优化设计
Pub Date : 2009-08-05 DOI: 10.17877/DE290R-504
J. Kunert, S. Mersmann
Kunert and Martin (2000) determined optimal and efficient block designs in a model for field trials with interference effects, for block sizes up to 4. In this paper we use Kushner's method (Kushner, 1997) of finding optimal approximate designs to extend the work of Kunert and Martin (2000) to optimal designs with five or more plots per block. We give an overall upper bound a*t,b,k for the trace of the information matrix of any design and show that an universally optimal approximate design will have all its sequences from merely four different equivalence classes. We further determine the efficiency of a binary type I orthogonal array under the general p-criterion. We find that these designs achieve high efficiencies of more than 0:94.
Kunert和Martin(2000)在具有干扰效应的现场试验模型中确定了最优和有效的块设计,块大小为4。在本文中,我们使用Kushner的方法(Kushner, 1997)寻找最优近似设计,将Kunert和Martin(2000)的工作扩展到每个块具有五个或更多地块的最优设计。我们给出了任何设计的信息矩阵迹的总体上界a*t,b,k,并证明了一个普遍最优的近似设计的所有序列只来自四个不同的等价类。在一般p准则下,进一步确定了二元I型正交阵列的效率。我们发现这些设计实现了超过0:94的高效率。
{"title":"Optimal designs for an interference model","authors":"J. Kunert, S. Mersmann","doi":"10.17877/DE290R-504","DOIUrl":"https://doi.org/10.17877/DE290R-504","url":null,"abstract":"Kunert and Martin (2000) determined optimal and efficient block designs in a model for field trials with interference effects, for block sizes up to 4. In this paper we use Kushner's method (Kushner, 1997) of finding optimal approximate designs to extend the work of Kunert and Martin (2000) to optimal designs with five or more plots per block. We give an overall upper bound a*t,b,k for the trace of the information matrix of any design and show that an universally optimal approximate design will have all its sequences from merely four different equivalence classes. We further determine the efficiency of a binary type I orthogonal array under the general p-criterion. We find that these designs achieve high efficiencies of more than 0:94.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84088788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Distribution hierarchies in directed networks 有向网络中的分布层次结构
Pub Date : 2009-05-01 DOI: 10.3929/ETHZ-A-006733711
Ueli Peter, T. Hrúz
Recently, Ahnert and Fink [AF08] showed that some classes of directed networks are cleanly separated in the space of the clustering signature. In this work we will study the relation hierarchy among subgraph distributions in directed networks and derive how the clustering signature ts into this hierarchy. Thereby we gather a fundamental understanding of the network dynamics and build a framework for the analysis of stochastic processes.
最近,Ahnert和Fink [AF08]证明了有向网络的某些类别在聚类签名的空间中被清晰地分离。在这项工作中,我们将研究有向网络中子图分布之间的关系层次结构,并推导聚类签名如何进入该层次结构。由此,我们对网络动力学有了基本的了解,并建立了一个分析随机过程的框架。
{"title":"Distribution hierarchies in directed networks","authors":"Ueli Peter, T. Hrúz","doi":"10.3929/ETHZ-A-006733711","DOIUrl":"https://doi.org/10.3929/ETHZ-A-006733711","url":null,"abstract":"Recently, Ahnert and Fink [AF08] showed that some classes of directed networks are cleanly separated in the space of the clustering signature. In this work we will study the relation hierarchy among subgraph distributions in directed networks and derive how the clustering signature ts into this hierarchy. Thereby we gather a fundamental understanding of the network dynamics and build a framework for the analysis of stochastic processes.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89745362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Constructing irregular histograms by penalized likelihood 用惩罚似然法构造不规则直方图
Pub Date : 2009-04-30 DOI: 10.17877/DE290R-587
Y. Rozenholc, Thoralf Mildenberger, U. Gather
We propose a fully automatic procedure for the construction of irregular histograms. For a given number of bins, the maximum likelihood histogram is known to be the result of a dynamic programming algorithm. To choose the number of bins, we propose two different penalties motivated by recent work in model selection by Castellan [6] and Massart [26]. We give a complete description of the algorithm and a proper tuning of the penalties. Finally, we compare our procedure to other existing proposals for a wide range of different densities and sample sizes.
我们提出了一种完全自动化的不规则直方图构造方法。对于给定数量的箱子,已知最大似然直方图是动态规划算法的结果。为了选择箱子的数量,我们提出了两种不同的惩罚,这是由Castellan[6]和Massart[26]最近在模型选择方面的工作所激发的。我们给出了一个完整的算法描述和适当的调整惩罚。最后,我们将我们的程序与其他现有的建议进行了比较,以适应不同密度和样本量的范围。
{"title":"Constructing irregular histograms by penalized likelihood","authors":"Y. Rozenholc, Thoralf Mildenberger, U. Gather","doi":"10.17877/DE290R-587","DOIUrl":"https://doi.org/10.17877/DE290R-587","url":null,"abstract":"We propose a fully automatic procedure for the construction of irregular histograms. For a given number of bins, the maximum likelihood histogram is known to be the result of a dynamic programming algorithm. To choose the number of bins, we propose two different penalties motivated by recent work in model selection by Castellan [6] and Massart [26]. We give a complete description of the algorithm and a proper tuning of the penalties. Finally, we compare our procedure to other existing proposals for a wide range of different densities and sample sizes.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79147064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Kernelized design of experiments 实验的核化设计
Pub Date : 2009-04-30 DOI: 10.17877/DE290R-8240
S. Rüping, C. Weihs
This paper describes an approach for selecting instances in regression problems in the cases where observations x are readily available, but obtaining labels y is hard. Given a database of observations, an algorithm inspired by statistical design of experiments and kernel methods is presented that selects a set of k instances to be chosen in order to maximize the prediction performance of a support vector machine. It is shown that the algorithm significantly outperforms related approaches on a number of real-world datasets.
本文描述了在观测值x容易获得,但标签y难以获得的情况下,在回归问题中选择实例的方法。在给定观测数据库的情况下,提出了一种受实验统计设计和核方法启发的算法,该算法从k个实例中选择一组,以最大限度地提高支持向量机的预测性能。结果表明,该算法在许多真实数据集上显著优于相关方法。
{"title":"Kernelized design of experiments","authors":"S. Rüping, C. Weihs","doi":"10.17877/DE290R-8240","DOIUrl":"https://doi.org/10.17877/DE290R-8240","url":null,"abstract":"This paper describes an approach for selecting instances in regression problems in the cases where observations x are readily available, but obtaining labels y is hard. Given a database of observations, an algorithm inspired by statistical design of experiments and kernel methods is presented that selects a set of k instances to be chosen in order to maximize the prediction performance of a support vector machine. It is shown that the algorithm significantly outperforms related approaches on a number of real-world datasets.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81177561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Online algorithms with advice 带有建议的在线算法
Pub Date : 2009-01-01 DOI: 10.3929/ETHZ-A-006733662
Hans-Joachim Bröckenhauer, D. Komm, Rastislav KráloviÄ, Richard KráloviÄ, Tobias Mömke
In online problems, the input forms a finite sequence of requests. Each request must be processed, i. e., a partial output has to be computed only depending on the requests having arrived so far, and it is not allowed to change this partial output subsequently. The aim of an online algorithm is to produce a sequence of partial outputs that optimizes some global measure. The most frequently used tool for analyzing the quality of online algorithms is the competitive analysis which compares the solution quality of an online algorithm to the optimal solution for the whole input sequence, and in fact measures the degradation in the solution quality caused by the lack of any information about the input. In this paper, we investigate to what extent the solution quality can be improved by allowing the algorithm to extract a given amount of information about the input. We consider the recently introduced notion of advice complexity where the algorithm, in addition to being fed the requests one by one, has access to a tape of advice bits that were computed by some oracle function from the complete input. The advice complexity is the number of advice bits read. We introduce an improved model of advice complexity and investigate the connections of advice complexity to the competitive ratio of both deterministic and randomized online algorithms using the paging problem, job shop scheduling, and the routing problem on a line as sample problems. Our results for all of these problems show that very small advice (only three bits in the case of paging) already suffices to significantly improve over the best deterministic algorithm. Moreover, to achieve the same competitive ratio as any randomized online algorithm, a logarithmic number of advice bits is sufficient. On the other hand, to obtain optimality, much larger advice is necessary.
在联机问题中,输入形成一个有限的请求序列。必须处理每个请求,也就是说,必须仅根据到目前为止到达的请求计算部分输出,并且不允许随后更改该部分输出。在线算法的目的是产生一个局部输出序列,以优化某些全局度量。分析在线算法质量最常用的工具是竞争分析,它将在线算法的解质量与整个输入序列的最优解进行比较,实际上是测量由于缺乏关于输入的任何信息而导致的解质量的下降。在本文中,我们研究了通过允许算法提取给定数量的输入信息,可以在多大程度上提高解的质量。我们考虑了最近引入的建议复杂性的概念,其中算法除了一个接一个地提供请求之外,还可以访问由某个oracle函数从完整输入中计算出来的建议位磁带。通知复杂度是指读取的通知位的数量。我们引入了一种改进的通知复杂度模型,并以寻呼问题、作业车间调度问题和线路上的路由问题为例,研究了通知复杂度与确定性和随机在线算法的竞争比之间的关系。我们对所有这些问题的结果表明,非常小的建议(在分页的情况下只有3位)已经足以显著改进最佳确定性算法。此外,为了达到与任何随机在线算法相同的竞争比,建议位的对数个数就足够了。另一方面,为了获得最优性,需要更大的通知。
{"title":"Online algorithms with advice","authors":"Hans-Joachim Bröckenhauer, D. Komm, Rastislav KráloviÄ, Richard KráloviÄ, Tobias Mömke","doi":"10.3929/ETHZ-A-006733662","DOIUrl":"https://doi.org/10.3929/ETHZ-A-006733662","url":null,"abstract":"In online problems, the input forms a finite sequence of requests. Each request must be processed, i. e., a partial output has to be computed only depending on the requests having arrived so far, and it is not allowed to change this partial output subsequently. The aim of an online algorithm is to produce a sequence of partial outputs that optimizes some global measure. The most frequently used tool for analyzing the quality of online algorithms is the competitive analysis which compares the solution quality of an online algorithm to the optimal solution for the whole input sequence, and in fact measures the degradation in the solution quality caused by the lack of any information about the input. In this paper, we investigate to what extent the solution quality can be improved by allowing the algorithm to extract a given amount of information about the input. We consider the recently introduced notion of advice complexity where the algorithm, in addition to being fed the requests one by one, has access to a tape of advice bits that were computed by some oracle function from the complete input. The advice complexity is the number of advice bits read. We introduce an improved model of advice complexity and investigate the connections of advice complexity to the competitive ratio of both deterministic and randomized online algorithms using the paging problem, job shop scheduling, and the routing problem on a line as sample problems. Our results for all of these problems show that very small advice (only three bits in the case of paging) already suffices to significantly improve over the best deterministic algorithm. Moreover, to achieve the same competitive ratio as any randomized online algorithm, a logarithmic number of advice bits is sufficient. On the other hand, to obtain optimality, much larger advice is necessary.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74545431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A Token-Ring for the TRM TRM的令牌环
Pub Date : 2009-01-01 DOI: 10.3929/ETHZ-A-006828754
N. Wirth
With the design of the Token-Ring for the TRM (Tiny Register Machine) I pursued mainly two aims. The first is to design a network connecting several TRM cores. The second is to go for a design that is as simple as possible, considering that the TRM project is oriented towards educational hard- and software. Featuring a ring architecture, it provides a welcome alternative to the already existing bus architecture implemented by Ling Liu, allowing to compare complexity and performance.
在为TRM(微型注册机)设计令牌环时,我主要追求两个目标。首先是设计一个连接多个TRM核心的网络。第二种是考虑到TRM项目是面向教育硬件和软件的,所以选择尽可能简单的设计。它以环形架构为特色,为Ling Liu实现的现有总线架构提供了一种受欢迎的替代方案,允许比较复杂性和性能。
{"title":"A Token-Ring for the TRM","authors":"N. Wirth","doi":"10.3929/ETHZ-A-006828754","DOIUrl":"https://doi.org/10.3929/ETHZ-A-006828754","url":null,"abstract":"With the design of the Token-Ring for the TRM (Tiny Register Machine) I pursued mainly two aims. The first is to design a network connecting several TRM cores. The second is to go for a design that is as simple as possible, considering that the TRM project is oriented towards educational hard- and software. Featuring a ring architecture, it provides a welcome alternative to the already existing bus architecture implemented by Ling Liu, allowing to compare complexity and performance.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73234434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Separation, abstraction, multiple inheritance and view shifting 分离、抽象、多重继承和视图转换
Pub Date : 2009-01-01 DOI: 10.3929/ETHZ-A-006836686
S. Staden
Inheritance is a central mechanism in object-oriented programming. Many popular object-oriented languages support multiple inheritance or limited versions thereof. This work extends a powerful modular proof system for single inheritance, which uses separation logic and abstract predicate families, to multiple inheritance. The extended system allows view shifting in the logic: the ability to view an object under different abstractions and to shift between such views. Several examples illustrate the system’s use and utility.
继承是面向对象编程的核心机制。许多流行的面向对象语言支持多重继承或其有限版本。本文将一个使用分离逻辑和抽象谓词族的强大的单继承模块化证明系统扩展到多继承。扩展系统允许逻辑中的视图转换:在不同抽象下查看对象并在这些视图之间转换的能力。几个例子说明了该系统的使用和实用性。
{"title":"Separation, abstraction, multiple inheritance and view shifting","authors":"S. Staden","doi":"10.3929/ETHZ-A-006836686","DOIUrl":"https://doi.org/10.3929/ETHZ-A-006836686","url":null,"abstract":"Inheritance is a central mechanism in object-oriented programming. Many popular object-oriented languages support multiple inheritance or limited versions thereof. This work extends a powerful modular proof system for single inheritance, which uses separation logic and abstract predicate families, to multiple inheritance. The extended system allows view shifting in the logic: the ability to view an object under different abstractions and to shift between such views. Several examples illustrate the system’s use and utility.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77988599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Specifying Access Control in Event-B 在Event-B中指定访问控制
Pub Date : 2009-01-01 DOI: 10.3929/ETHZ-A-006733720
Son Hoang
We investigate the idea of developing access control systems in Event-B by specifying separately the "insecure" target system and the security authorisation, then combining them together in order to construct a secure system. This is based on the work by Basin et. al. [6] where the chosen language is CSP-OZ. Moreover, in order to verify the secure system against some safety temporal properties, we propose an approach of constructing several abstract models corresponding to these properties, and using refinement to prove that the final system satisfies these properties.
我们研究了在Event-B中开发访问控制系统的想法,分别指定“不安全”的目标系统和安全授权,然后将它们组合在一起以构建安全系统。这是基于Basin等人[6]的工作,其中选择的语言是CSP-OZ。此外,为了验证安全系统是否符合某些安全时间性质,我们提出了一种方法,即构造几个与这些性质相对应的抽象模型,并使用改进来证明最终系统是否满足这些性质。
{"title":"Specifying Access Control in Event-B","authors":"Son Hoang","doi":"10.3929/ETHZ-A-006733720","DOIUrl":"https://doi.org/10.3929/ETHZ-A-006733720","url":null,"abstract":"We investigate the idea of developing access control systems in Event-B by specifying separately the \"insecure\" target system and the security authorisation, then combining them together in order to construct a secure system. This is based on the work by Basin et. al. [6] where the chosen language is CSP-OZ. Moreover, in order to verify the secure system against some safety temporal properties, we propose an approach of constructing several abstract models corresponding to these properties, and using refinement to prove that the final system satisfies these properties.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79938607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Formalization of JML in the Coq Proof System Coq证明系统中JML的形式化
Pub Date : 2009-01-01 DOI: 10.3929/ETHZ-A-006903145
Andreas Kägi, Hermann Lehner, Peter Müller
JML is a complex specication language for Java. Its large scale and manifold features make it hard to precisely dene its semantics in a reference manual. It is thus desirable to formally specify the syntax and semantics of JML. There are many good reasons for a formalized semantics of JML in a theorem prover: It can be used to develop a sound verication condition generator for JML constructs. By formally defining the semantics in a theorem prover, we can detect and eliminate ambiguousities in the language. When using the semantics with an operational semantics for Java source code, we can dene a runtime assertion checker and prove it's soundness with respect to the semantics in Coq. We divide the problem of dening JML in Coq into several steps. Firstly, we dene a basic JML subset that has the full expressiveness of JML, but without syntactic sugar. We define the semantics for this subset in Coq. We introduce an extended (full) JML Syntax and a syntactic rewriting function from the extended syntax into the basic syntax. Finally, we built a translation frontend that transforms a JML-annotated Java program into it's equivalent in Coq. We managed to dene the full JML and Java syntax in Coq, minus some very rare and not clearly described concepts and minus everything related to floating point numbers. We implemented a lightweight translation frontent in Java. We defined a large set of rewritings that simplify the syntax of JML without loosing any precision. We then dened the semantics of the desugared JML, using Bicolano as a basis for the semantic domain. Finally, we conducted a case study evaluating the feasibility of proving on top of the formalisation.
JML是一种复杂的Java规范语言。它的规模大,特征多,很难在参考手册中精确地判断其语义。因此,需要正式指定JML的语法和语义。在定理证明器中使用JML的形式化语义有很多很好的理由:它可以用于为JML构造开发可靠的验证条件生成器。通过形式化地定义定理证明中的语义,我们可以检测和消除语言中的歧义。当将语义与Java源代码的操作语义一起使用时,我们可以定义一个运行时断言检查器,并证明它相对于Coq中的语义是健全的。我们将在Coq中定义JML的问题分为几个步骤。首先,我们定义了一个基本的JML子集,它具有JML的全部表达能力,但没有语法糖。我们在Coq中定义了这个子集的语义。我们介绍了扩展的(完整的)JML语法和从扩展语法到基本语法的语法重写功能。最后,我们构建了一个翻译前端,用于将带有jml注释的Java程序转换为Coq中的等效程序。我们设法在Coq中定义了完整的JML和Java语法,减去了一些非常罕见的和没有明确描述的概念,并减去了与浮点数相关的所有内容。我们用Java实现了一个轻量级的翻译前端。我们定义了大量的重写,这些重写简化了JML的语法,同时又不丢失任何精度。然后,我们使用Bicolano作为语义域的基础,定义了糖化后的JML的语义。最后,我们进行了一个案例研究,评估了在形式化之上证明的可行性。
{"title":"A Formalization of JML in the Coq Proof System","authors":"Andreas Kägi, Hermann Lehner, Peter Müller","doi":"10.3929/ETHZ-A-006903145","DOIUrl":"https://doi.org/10.3929/ETHZ-A-006903145","url":null,"abstract":"JML is a complex specication language for Java. Its large scale and manifold features make it hard to precisely dene its semantics in a reference manual. It is thus desirable to formally specify the syntax and semantics of JML. There are many good reasons for a formalized semantics of JML in a theorem prover: It can be used to develop a sound verication condition generator for JML constructs. By formally defining the semantics in a theorem prover, we can detect and eliminate ambiguousities in the language. When using the semantics with an operational semantics for Java source code, we can dene a runtime assertion checker and prove it's soundness with respect to the semantics in Coq. We divide the problem of dening JML in Coq into several steps. Firstly, we dene a basic JML subset that has the full expressiveness of JML, but without syntactic sugar. We define the semantics for this subset in Coq. We introduce an extended (full) JML Syntax and a syntactic rewriting function from the extended syntax into the basic syntax. Finally, we built a translation frontend that transforms a JML-annotated Java program into it's equivalent in Coq. We managed to dene the full JML and Java syntax in Coq, minus some very rare and not clearly described concepts and minus everything related to floating point numbers. We implemented a lightweight translation frontent in Java. We defined a large set of rewritings that simplify the syntax of JML without loosing any precision. We then dened the semantics of the desugared JML, using Bicolano as a basis for the semantic domain. Finally, we conducted a case study evaluating the feasibility of proving on top of the formalisation.","PeriodicalId":10841,"journal":{"name":"CTIT technical reports series","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83195870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
CTIT technical reports series
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1