E1AR0002@SMUVM1.BITNET (02/14/86)
1985 TECHNICAL REPORTS (4th Quarter: October-December) UCLA COMPUTER SCIENCE DEPARTMENT University of California 3713 Boelter Hall Los Angeles, CA 90024 USA ****************************** AN EXPLANATION OF AND CURE FOR MINIMAX PATHOLOGY Bruce Abramson CSD-850034 $1.50 The minimax procedure has long been the standard method of evaluating nodes in game trees. The general assumption underlying its use in game-playing programs is that increasing search depth improves play. Recent work has shown that this assumption is not always valid; for a large class of games and evaluation functions, searching deeper decreases the probability of making a correct move. This phenomenon is called game tree pathology. Two structural properites of game trees have been suggested as causes of pathology: independence among the values of sibling nodes, and uniform depth of wins and losses. This paper examines the relationship between un- iform win depth and pathology from two angles. First, it proves mathemati- cally that as search deepens, an evaluation function that does not .ul ask whether wins can be forced from mid-game positions becomes decreasingly likely to choose forced wins. Second, it experimentally illustrates the connection between recognition of mid-game wins and pathological behavior. Two evaluation functions, which differ only in their ability to recognize wins in mid-game, are run on a series of games. Despite recognizing fewer mid-game wins than the theoretically predicted minimum needed to avoid pathology, the function that checked for them cleared up the pathological behavior of the one that did not. The analytic and empirical aspects of this paper combine to form one major result: As search deepens, so does the probability that failing to check for forced wins will change the game's outcome. This strengthens the hy- pothesis that uniform win depth is the cause of pathology. ****************************** DISTRIBUTED COMPUTER SIMULATION OF DATA COMMUNICATION NETWORKS; PROJECT REPORT III Shung Cheung, Jack W. Carlyle, Walter Karplus CSD-850037 $2.50 This is the third in a series of reports on continuing studies of discrete-event simulation using distributed processing (e.g., on networks of minicomputers or networks of microcomputer workstations), with applica- tion to performance prediction for data communication networks. Issues ex- amined include synchronization mechanism, task allocation algorithms, simu- lator implementation approaches, and deadlock prevention methods. ****************************** DISTRIBUTED ALGORITHMS FOR ELECTION IN UNIDIRECTIONAL AND COMPLETE NETWORKS Yehuda Afek Professor Leonard Kleinrock, Chair CSD-850036 $8.50 Consider a data communication network of n nodes, each of which has a unique identifier (id); otherwise the nodes are identical. The nodes are asleep and have no global information about network topology, number and ids of other nodes, etc. A distributed election algorithm is a means by which the nodes of the network distinguish one among them as the leader. The problem of distributively electing a leader in a network is viewed as a problem of synchronization among potential candidates for leadership. Each candidate tries to capture all the nodes. To guarantee that only one succeeds, all but one candidate are killed. Following this view election algorithms in a general, two component framework are designed. Component one is a capturing and termination detection mechanism, assuming only one candidate. Component two is a synchronization mechanism, to eliminate all but one candidate. In arbitrary networks the synchronization is complicated by the uncertain- ties of nodes about the network topology and the relative location of can- didates. Two network models are considered: first, a complete network in which a bidirectional communication link connects every node with every other, thus eliminating topological uncertainties; and second, the oppo- site extreme in which topological uncertainties are at maximum -- a strong- ly connected unidirectional network with some or all links transmitting messages in one direction only. The study produces an O(n.log n) messages O (log n) time synchronous and O(n.log n) messages O(n) time asynchronous election algorithm in complete networks. For unidirectional networks we derive a distributed election al- gorithm whose communication complexity is O(n.|E|+n2log n ) bits, where |E| is the total number of links. We also establish that OMEGA (n.log n) is a lower bound on the total number of messages transmitted for achieving election in synchronous complete net- works. Moreover, it is shown that the time complexity of message-optimal synchronous algorithms is OMEGA (n.log n ), hence the optimality of our synchronous complete network algorithm. It remains open whether a sub- linear time, message-optimal, asynchronous complete network election algo- rithm exists. ****************************** REDUCED REGISTER SAVING/RESTORING IN SINGLE-WINDOW REGISTER FILES Toma's Lang and Miquel Huguet CSD-850040 $1.75 In a register-oriented architecture, the sequence of instructions corresponding to the function call construct has been identified as consum- ing a significant fraction of the execution time of typical programs writ- ten in high-level languages such as C. A large reduction in this time is obtained by the use of multiple-window register files. However, this ap- proach has as disadvantages the cost of the large number of registers (in chips in a MSI/LSI implementation and in area in the VLSI case), the in- crease in cycle time in a VLSI implementation due to longer data buses, and the increase of the time for context switching. These disadvantages can be reduced by the use of multi-size windows and the implementation of the file as a set of shift registers. However, due to these disadvantages, the multiple-window approach might not be suitable for all situations. Conse- quently, it is interesting to develop architectures and compilers that reduce the cost of function call for the single-window case. In this paper we present schemes that permit the reduction of the memory traffic due to register saving and restoring, which is one of the major components of the execution time of function call/return. We also consider the implementa- tion of the selected scheme and conclude that the implementation is feasi- ble and might be a good use of resources to increase execution speed. ****************************** TASK RESPONSE TIME AND MODULE ASSIGNMENT FOR REAL-TIME DISTRIBUTED PROCESSING SYSTEMS Kin Kwong Leung Professor Wesley W. Chu, Chair CSD-850041 $14.00 Task response time is an important system performance measure for real-time systems. An analytic model is introduced to estimate task response time for loosely coupled distributed processing systems with real-time applica- tions. The model considers such factors as assignment of modules to com- puters, module precedence relationships, interprocessor communications, in- terconnection network delay and module scheduling policy. Simulation exper- iments are used to validate the model assumptions and to show the accuracy of the model. The analytic model is first employed to investigate the effects of module precedence relationships on response times. Our study reveals that the dis- tributions of module execution times and the mean execution time ratio for a pair of consecutive modules are major factors for the effects. The task response time model is then used to study module assignment for distributed systems. Based on the model, a new local search algorithm for module assignment is developed. Firstly, each module is assumed to be allo- cated to a single computer. Task response time is the optimality cri- terion, and the analytic model becomes the objective function. Search strategies are established to search for better module assignments. Furth- er, the algorithm is extended to handle module replications; that is, modules may be replicated on several computers. The design objective for replicated module assignment is to minimize task response time with the thread response time constraints. A new objective criterion which is the sum of task response time and possible penalty delay to account for the violations of thread response time constraints is introduced. With this objective function, not only module copies are optimally assigned to com- puters, the proper module multiplicities are also iteratively determined by the algorithm so as to achieve the objective. The algorithm is validated by applying to two distinct distributed systems for space defense applications. One system does not require module replica- tions while the other does. The sub-optimal module assignments generated by the algorithm provide excellent response time performance on both sys- tems since the analytic model has considered all major factors that affect task response time. Therefore, the algorithm can serve as a valuable tool for distributed systems design. ****************************** HUMAN PROBLEM UNDERSTANDING AND ADVICE GIVING: A COMPUTER MODEL Alexander Eric Quilici Professor Michael Dyer, Chair CSD-850039 $7.75 How are people able to understand someone else's problem and provide them with advice? How are people able to develop novel solutions to problems they have never seen before? The thesis presented here is a first step to- ward answering these questions, presenting a computer model of the process of problem understanding and advice giving. The problems we consider are typical planning problems that novice computer users encounter. We view advice giving as a memory search problem, guided by heuristics for problem understanding, advice generation, and plan creation. In this thesis we describe a representational system for user planning problems, show how advice can be generated using a taxonomy of planning problems and associated heuristics for advice formulation, present heuristics that can be used to repair failed plans and to create new plans by combining exist- ing plans in novel ways, and suggest a memory organization for planning knowledge that allows for efficient retrieval of relevant planning experi- ences. The theory discussed in this thesis is implemented in a computer program called AQUA (Alex Quilici's UNIX|+ Advisor). AQUA takes natural language descriptions of problems users are having with the UNIX operating system and provides natural language advice that explains their failures and sug- gests solutions. AQUA is also able to create solutions for problems that it has not been presented with before. ****************************** GRAPHOIDS: A GRAPH-BASED LOGIC FOR REASONING ABOUT RELEVANCE RELATIONS * Judea Pearl, Azaria Paz CSD-850038 $1.75 We consider 3-place relations I (x,z,y) where, x,y, and z are three non-intersecting sets of elements, (e.g., propositions), and I (x,z,y) stands for the statement: "Knowing z renders x irrelevant to y". We give sufficient conditions on I for the existence of a (minimal) graph G such that I (x,z,y) can be validated by testing whether z separates x from y in G. These conditions define a GRAPHOID. The theory of graphoids uncovers the axiomatic basis of probabilistic dependencies and ties it to vertex-separation conditions in graphs. The defining axioms can also be viewed as inference rules for deducing which propositions are relevant to each other, given a certain state of knowledge. ****************************** ANALYSIS OF ALTERNATIVES FOR A SINGULAR VALUE DECOMPOSITION PROCESSOR Jaime H. Moreno Professor Tomas Lang, Chair CSD-850035 $11.75 In this thesis we present an evaluation of different alternatives for the implementation of a digital system to compute the Singular Value Decomposi- tion (SVD), in terms of the throughput achievable with a given amount of hardware. An algorithmic model which captures the properties of the SVD computation and a methodology for the design of systems exploiting con- currency are presented and applied to the SVD. The model uses a directed graph as a description of the algorithm, where nodes correspond to subcomputations and arcs to precedences among the sub- functions. Each node is described by its execution time as a function of the number of operation units used. This model is utilized to evaluate the cost and performance of implementations with replication, pipelining and parallelism. The methodology for the design is essentially an iterative procedure con- sisting of top-down decomposition and bottom-up refinement of the nodes in the graph of the algorithm. It is shown here that, for throughput higher than a certain value which depends on the computation, a pipelined approach for the implementation of the SVD algorithm is more attractive than other alternatives currently pro- posed for such computation. The architecture devised is a multilevel pipe- lined processor, which exploits concurrency at several levels through internal pipelines and the use of the parallelism in the subcomputations. This scheme offers better performance characteristics for a given amount of hardware than the other alternatives proposed, keeps the realization at a level of complexity similar to those other alternatives, and is able to compute the decomposition of matrices of any size without hardware modifi- cations. However, implementations with the scheme presented exhibit expan- sibility characteristics such that they can be upgraded if higher throughput for larger matrices is desired. The algorithm used is a parallel version of Hestenes' one sided orthogonal- ization method proposed by Brent et al, which is adapted for implementation with few processors. The resulting scheme has the same characteristics of the original one regarding data transfers between units, and it is realiz- able with any number of parallel processors. ****************************** ****************************** Please circle the report(s) you would like to order. ALL REQUESTS MUST BE PREPAID. Postage has been included for Continental U.S. delivery only. For foreign delivery please add 15% to cover postage. Foreign checks/currency WILL NOT be accepted. Checks should be made payable to: "UCLA COMPUTER SCIENCE DEPARTMENT" Return this checklist and your check to: Ms. Brenda Ramsey UCLA Computer Science Department School of Engineering & Applied Science University of California 3731 Boelter Hall Los Angeles, CA 90024 CSD-850034 $1.00 CSD-850035 $10.75 CSD-850036 $8.25 CSD-850037 $2.00 CSD-850038 $1.50 CSD-850039 $6.75 CSD-850040 $1.50 CSD-850041 $12.25 Please let us know if you would like to have your name REMOVED or ADDED to our hard copy mailing list: Remove ( ) Add ( ) Name: Address: City: State: ZIP