Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223007
Jianping Zhu
Discusses a householder factorization algorithm for a special type of matrix arising from the application of the Tikhnov regularization method to an ill-conditioned least square problem. The matrix involved is half dense and half sparse. The algorithm has been implemented on iPSC/860 hypercubes. By overlapping communications with computations, the code has been optimized to take advantage of the special structure of the matrix and minimize inter-node communications. Super-linear speed-up was observed in the numerical experiment for large problems. The algorithm has been used as a core routine in the program solving parameter identification problems in reservoir simulations.<>
{"title":"Householder transformation for the regularized least square problem on iPSC/860","authors":"Jianping Zhu","doi":"10.1109/IPPS.1992.223007","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223007","url":null,"abstract":"Discusses a householder factorization algorithm for a special type of matrix arising from the application of the Tikhnov regularization method to an ill-conditioned least square problem. The matrix involved is half dense and half sparse. The algorithm has been implemented on iPSC/860 hypercubes. By overlapping communications with computations, the code has been optimized to take advantage of the special structure of the matrix and minimize inter-node communications. Super-linear speed-up was observed in the numerical experiment for large problems. The algorithm has been used as a core routine in the program solving parameter identification problems in reservoir simulations.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128298703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223053
Min-Ih Chen, C. Han, Kwei-Jay Lin
The article studies real-time systems which have a set of periodic readers and writers accessing a shared data object. The sufficient scheduling conditions for these tasks to finish their executions before the end of their periods are derived. The paper also studies a scheduling technique called job coalescence which combines many reader jobs or writer jobs into a job cluster and executes the cluster as a single job.<>
{"title":"Scheduling conditions for concurrent real-time readers and writers","authors":"Min-Ih Chen, C. Han, Kwei-Jay Lin","doi":"10.1109/IPPS.1992.223053","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223053","url":null,"abstract":"The article studies real-time systems which have a set of periodic readers and writers accessing a shared data object. The sufficient scheduling conditions for these tasks to finish their executions before the end of their periods are derived. The paper also studies a scheduling technique called job coalescence which combines many reader jobs or writer jobs into a job cluster and executes the cluster as a single job.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131373448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.222996
Masato Notomi, T. Murata
Petri nets are a versatile tool for modeling and analyzing parallel and distributed computing systems. However state explosion is a major impediment to their analysis and practical applications. To cope with this problem, this paper proposes a method for constructing hierarchically organized state space (HOSS) of a bounded Petri net. Using the HOSS, the authors obtain necessary and sufficient conditions for reachability and deadlock, and algorithms to test if a given state (marking) is reachable from the initial state and if there is a deadlock state (a state with no successor states).<>
{"title":"Hierarchically organized Petri net state space for reachability and deadlock analysis","authors":"Masato Notomi, T. Murata","doi":"10.1109/IPPS.1992.222996","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222996","url":null,"abstract":"Petri nets are a versatile tool for modeling and analyzing parallel and distributed computing systems. However state explosion is a major impediment to their analysis and practical applications. To cope with this problem, this paper proposes a method for constructing hierarchically organized state space (HOSS) of a bounded Petri net. Using the HOSS, the authors obtain necessary and sufficient conditions for reachability and deadlock, and algorithms to test if a given state (marking) is reachable from the initial state and if there is a deadlock state (a state with no successor states).<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121180466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223048
Jeff Baxter, J. Patel
A new task migration strategy is presented. The input program is supplied as a task graph and an initial static assignment of tasks to processors is given. The dynamic behaviour of the static allocation is then profiled, and a set of migration destinations and migration thresholds are identified. During any subsequent execution, the profiling based algorithm makes migration decisions solely on the basis of the profiled data and the local processor load. No neighboring processor load information is required. Results are presented for the algorithm, and compared against a dynamic mapping scheme with global knowledge of the multiprocessor state.<>
{"title":"Profiling based task migration","authors":"Jeff Baxter, J. Patel","doi":"10.1109/IPPS.1992.223048","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223048","url":null,"abstract":"A new task migration strategy is presented. The input program is supplied as a task graph and an initial static assignment of tasks to processors is given. The dynamic behaviour of the static allocation is then profiled, and a set of migration destinations and migration thresholds are identified. During any subsequent execution, the profiling based algorithm makes migration decisions solely on the basis of the profiled data and the local processor load. No neighboring processor load information is required. Results are presented for the algorithm, and compared against a dynamic mapping scheme with global knowledge of the multiprocessor state.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124055237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.222998
Nelson Ge, J. Antonio, S. Vitalpur
The authors propose a simple quasi-static routing scheme for buffered mesh connected network computers. Under the assumption of uniform traffic demands, it is demonstrated numerically that their simple routing scheme achieves near-optimal performance; in the sense of minimizing the total number of outstanding packets in a Jackson-type network. Simulation results are also presented for the case of non-uniform traffic demands. These simulations illustrate that our quasi-static control scheme can dynamically adjust the values of routing probabilities so as to provide acceptable and stable network performance. The results of these simulations are displayed using advanced video graphics.<>
{"title":"Visualization of a simple routing scheme for meshes","authors":"Nelson Ge, J. Antonio, S. Vitalpur","doi":"10.1109/IPPS.1992.222998","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222998","url":null,"abstract":"The authors propose a simple quasi-static routing scheme for buffered mesh connected network computers. Under the assumption of uniform traffic demands, it is demonstrated numerically that their simple routing scheme achieves near-optimal performance; in the sense of minimizing the total number of outstanding packets in a Jackson-type network. Simulation results are also presented for the case of non-uniform traffic demands. These simulations illustrate that our quasi-static control scheme can dynamically adjust the values of routing probabilities so as to provide acceptable and stable network performance. The results of these simulations are displayed using advanced video graphics.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128598368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223057
D. Reisis
Presents a fast parallel solution to the convex hull problem on a two dimensional reconfigurable mesh. The parallel algorithm computes the convex hull of N/sup 1/2/ planar points on a N/sup 1/2/*N/sup 1/2/ reconfigurable mesh. The algorithm is very efficient and completes the convex hull computation in O(1) time. Efficient parallel sorting of N/sup 1/2/ data items on the reconfigurable mesh and improved parallel enumerating techniques utilized as basic steps in the convex hull computation are also shown.<>
{"title":"An efficient convex hull computation on the reconfigurable mesh","authors":"D. Reisis","doi":"10.1109/IPPS.1992.223057","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223057","url":null,"abstract":"Presents a fast parallel solution to the convex hull problem on a two dimensional reconfigurable mesh. The parallel algorithm computes the convex hull of N/sup 1/2/ planar points on a N/sup 1/2/*N/sup 1/2/ reconfigurable mesh. The algorithm is very efficient and completes the convex hull computation in O(1) time. Efficient parallel sorting of N/sup 1/2/ data items on the reconfigurable mesh and improved parallel enumerating techniques utilized as basic steps in the convex hull computation are also shown.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"34 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124329352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223055
Sajal K. Das, Calvin Ching-Yuen Chen
The authors design an efficient parallel algorithm for constructing a breadth-first spanning tree of an interval graph. Their novel approach is based on elegantly capturing the structure of a given collection of intervals. This structure reveals important properties of the corresponding interval graph, and is found to be instrumental in solving many other problems including the computation of a breadth-depth spanning tree, which they report for the first time. The algorithm requires O(logn) time employing O(n) processors on the EREW PRAM model.<>
{"title":"A new parallel algorithm for breadth-first search on interval graphs","authors":"Sajal K. Das, Calvin Ching-Yuen Chen","doi":"10.1109/IPPS.1992.223055","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223055","url":null,"abstract":"The authors design an efficient parallel algorithm for constructing a breadth-first spanning tree of an interval graph. Their novel approach is based on elegantly capturing the structure of a given collection of intervals. This structure reveals important properties of the corresponding interval graph, and is found to be instrumental in solving many other problems including the computation of a breadth-depth spanning tree, which they report for the first time. The algorithm requires O(logn) time employing O(n) processors on the EREW PRAM model.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"21 44","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132545490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223020
A. Sivasubramaniam, U. Ramachandran, H. Venkateswaran
The article develops a framework for message-passing architectures consisting of a machine model called communicating random access machine (CRAM) and a programming paradigm. The CRAM model serves as a vehicle for the design and analysis of message-passing algorithms. The message-passing paradigm makes the mapping of algorithms that fit this paradigm onto message-passing architectures more natural.<>
{"title":"A computational model for message-passing","authors":"A. Sivasubramaniam, U. Ramachandran, H. Venkateswaran","doi":"10.1109/IPPS.1992.223020","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223020","url":null,"abstract":"The article develops a framework for message-passing architectures consisting of a machine model called communicating random access machine (CRAM) and a programming paradigm. The CRAM model serves as a vehicle for the design and analysis of message-passing algorithms. The message-passing paradigm makes the mapping of algorithms that fit this paradigm onto message-passing architectures more natural.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123828436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223003
Zhiwei Xu, K. Hwang
Advocates a structured style for parallel program development. It is argued that a high-quality parallel computing program should be terminating (the program always halts in finite time) and determinate (the program always produces the same result for the same input). A methodology is presented for structured parallel programming, drawing on the successful philosophies of structured sequential programming. A set of structured constructs is presented to demonstrate this methodology. Conditions for termination and determinacy are then derived, which suggests a set of rules for the development of terminating and determinate programs. Infinite waiting and indeterminacy anomalies are characterized. An algorithm is described to detect these anomalies by syntactical analysis.<>
{"title":"Language constructs for structured parallel programming","authors":"Zhiwei Xu, K. Hwang","doi":"10.1109/IPPS.1992.223003","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223003","url":null,"abstract":"Advocates a structured style for parallel program development. It is argued that a high-quality parallel computing program should be terminating (the program always halts in finite time) and determinate (the program always produces the same result for the same input). A methodology is presented for structured parallel programming, drawing on the successful philosophies of structured sequential programming. A set of structured constructs is presented to demonstrate this methodology. Conditions for termination and determinacy are then derived, which suggests a set of rules for the development of terminating and determinate programs. Infinite waiting and indeterminacy anomalies are characterized. An algorithm is described to detect these anomalies by syntactical analysis.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127014242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1992-03-01DOI: 10.1109/IPPS.1992.223046
J. A. Yang, Young-il Choo
Presents a formal derivation of a highly efficient parallel implementation of the 2-D Gauss-Seidel method for machines based on the two-dimensional mesh of processors. A methodology is illustrated which formalizes the process of mapping and scheduling a high level algorithm onto a particular target parallel architecture. It starts from a simple initial program. Equational transformations are then applied: to partition the abstract problem onto processors; to make communication among processors explicit; to pipeline the computation by wave-front scheduling; and finally to map logical processors onto physical processors for perfect processor utilization. All the derivation steps preserve equalities so that the derived programs are equivalent to the initial program. Using this methodology, the paper develops efficient implementations for other parallel architectures.<>
{"title":"Formal derivation of an efficient parallel 2-D Gauss-Seidel method","authors":"J. A. Yang, Young-il Choo","doi":"10.1109/IPPS.1992.223046","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223046","url":null,"abstract":"Presents a formal derivation of a highly efficient parallel implementation of the 2-D Gauss-Seidel method for machines based on the two-dimensional mesh of processors. A methodology is illustrated which formalizes the process of mapping and scheduling a high level algorithm onto a particular target parallel architecture. It starts from a simple initial program. Equational transformations are then applied: to partition the abstract problem onto processors; to make communication among processors explicit; to pipeline the computation by wave-front scheduling; and finally to map logical processors onto physical processors for perfect processor utilization. All the derivation steps preserve equalities so that the derived programs are equivalent to the initial program. Using this methodology, the paper develops efficient implementations for other parallel architectures.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114286666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}