[comp.os.research] Technical reports on replicated data available

darrell@midgard.ucsc.edu (Darrell Long) (05/23/89)
The following technical reports on replicated data managements are available
from UC Santa Cruz.

Please address correspondence to:

Technical Report Librarian
Baskin Center for Computer Engineering & Information Sciences
Applied Sciences Building
University of California
Santa Cruz, CA 95064


TR #88-35
Cost $4


The Effect of Failure and Repair Distributions on Consistency Protocols

                      John L. Carroll
                 San Diego State University

                     Darrell D. E. Long
            University of California, Santa Cruz

                          Abstract

The accessibility of vital information can  be  enhanced  by
replicating  the data on several sites, and employing a con-
sistency control protocol to manage the copies.

     Various protocols have been  proposed  to  ensure  that
only current copies of the data can be accessed.  The effect
these protocols have on the accessibility of the  replicated
data is investigated by simulating the operation of the net-
work and measuring the performance.  Several strategies  for
replica maintenance are considered, and the benefits of each
are analyzed.  The details of the simulations are discussed.
Measurements  of the reliability and the availability of the
replicated data are compared and contrasted.

     The sensitivity of  the  Available  Copy  and  Dynamic-
linear  Voting protocols to common patterns of site failures
and repairs is studied in detail.  Exponential, Erlang, uni-
form, and hyperexponential distributions are considered, and
the effect  the  second  moments  have  on  the  results  is
analyzed.   The  relative performance of competing protocols
is shown to be only marginally affected  by  non-exponential
distributions,  validating the robustness of the exponential
approximations.

TR #88-34
Cost $4

           Reliability of Replicated Data Objects

                     Darrell D. E. Long
            University of California, Santa Cruz

                    Jehan-Francois Paris
                   University of Houston

                      John L. Carroll
                 San Diego State University

                          Abstract

Improved  fault  tolerance  of  many  applications  can   be
achieved  by  replicating  data at several sites.  This data
redundancy requires a protocol to maintain  the  consistency
of  the  data  object in the presence of site failures.  The
most commonly used scheme is voting.  Voting and  its  vari-
ants  are  unaffected  by  network partitions.  When network
partitions cannot occur, better performance can be  achieved
with available copy protocols.

     Common measures of dependability  include  reliability,
which  is  the  probability  that  a  replicated object will
remain constantly available over a fixed  time  period.   We
investigate  the  reliability  of  replicated  data  objects
managed by voting, available copy and their variants.  Where
possible, closed-form expressions for the reliability of the
various consistency protocols  are  derived  using  standard
Markovian  assumptions.  In other cases, numerical solutions
are found and validated with simulation results.


TR #88-23
Cost $4


       Regeneration Protocols for Replicated Objects

                     Darrell D. E. Long
            University of California, Santa Cruz

                    Jehan-Francois Paris
                   University of Houston

                          Abstract

The reliability and  availability  of  replicated  data  can
often  be  increased  by  generating  new replicas when some
become inaccessible due to system malfunctions.  This  tech-
nique has been used in the Regeneration Algorithm, a replica
control protocol based on file regeneration.

     The read and write availabilities  of  replicated  data
managed  by the Regeneration Algorithm are evaluated and two
new regeneration protocols are presented that overcome  some
of  its  limitations.  The first protocol combines regenera-
tion and the Available Copy approach to improve availability
of  replicated  data.   The second combines regeneration and
the Dynamic Voting approach to guarantee data consistency in
the  presence of network partitions while maintaining a high
availability.  Expressions for the availabilities of  repli-
cated  data  managed by both protocols are derived and found
to improve significantly on the availability achieved  using
extant consistency protocols.


TR #88-07
Cost $10


   The Management of Replication in a Distributed System

                     Darrell D. E. Long
                       (Ph.D. Thesis)
            University of California, San Diego

                          Abstract

The field of consistency control  protocols  for  replicated
data  objects  has  existed  for about ten years.  Its birth
coincides with the advent of distributed data bases and  the
communications  technology  required  to support them.  When
data objects are replicated around  a  computer  network,  a
protocol  must  be  chosen to ensure a consistent view to an
accessing process.  The replicas of the data object are then
said  to  be  mutually  consistent.   The  protocols used to
insure mutual consistency are known as  replica  control  or
consistency control protocols.

     There are several advantages to  a  distributed  system
over  a  single processor system.  Among these are increased
computing power and the ability to tolerate partial failures
due to the malfunction of individual components.  The redun-
dancy present in a distributed system has been the focus  of
much  research in the area of distributed data base systems.
Another benefit of this natural redundancy, along  with  the
relatively  independent  failure modes of the processors, is
that it allows the system to continue operation  even  after
some  of  the  processors  have failed.  This can be used to
construct data objects that are robust in the face  of  par-
tial system failures.

     The focus of this dissertation is the  exploitation  of
the  redundancy  present  in distributed systems in order to
attain an  increased  level  of  fault  tolerance  for  data
objects.   The  use of replication as a method of increasing
fault tolerance  is  a  well-known  technique.   Replication
introduces  the  additional complexity of maintaining mutual
consistency among the replicas of the data object.  The pro-
tocols  that manage the replicated data and provide the user
with a single consistent view of that data are studied,  and
a  comprehensive analysis of the fault tolerance provided by
several of  the  most  promising  protocols  are  presented.
Several  techniques  are employed, including Markov analysis
and discrete event simulation.  Simulation is used  to  con-
firm  and  extend  the results obtained using analytic tech-
niques.