dole@spar.UUCP (Harry Dole) (11/14/85)
Enough of the equatorial polar bears and the machine protoplasm, time for some math. There has been some recent discussion on transcendental cardinals and I recall an old request for what category theory has to say about the Continuum Hypothesis (CH). I once did a seminar session at grad school about Tierney's proof on the independence of CH so I thought I would try to distill the essence of the proof for the readers of this net. The original proof appeared in Springer Lecture Notes series number 274 ( commonly referred to as the yellow-peril series ) under the title "Sheaf Theory and the Continuum Hypothesis." I cannot find my original copy of the article so I am relying on Peter Johnstone's book <Topos Theory> from Academic Press, pp323-329. This book is not for the casual reader of category theory but it is the single best reference on topos theory. Topos theory is essentially that part of category theory that models set theory. In fact, part of the reason for deriving topos theory to begin with was to handle the categorical version of CH ( the original topos theory work was done by Lawvere and Tierney ). I assume that not all readers of this net are familiar with categorical terms ( there certainly are plenty of mathematicians that are not familiar ) so I will attempt some deliberatly fuzzy definitions. If you know more accurate definitions then write down the dual definition and the evident diagram, apply the obvious argument and obtain the usual result. First of all, a category is a set of objects, and a set of arrows called the morphisms of the category. Any given morphism f has a domain object and a codomain object. Also given with the category is a composition rule on morphisms such that when given two morphisms f and g ( written as arrows ) such that the codomain of f is the domain of g: f g A----->B----->C then there exists a morphism gf:A----->C called the composite of g and f. The properties that must be satisfied are that composition is associative and for every object A there must be an identity morphism 1A so that composing with the identity yields the same morphism back. For example, the category of sets S is described as having for objects all sets and for morphisms all functions between sets ( note that the range of a function is contained within the codomain but is not necessarily equal to the codomain). A topos is a cartesian closed category with finite limits and a subobject classifier. Cartesian closed means that for any two given objects A and B in the category, then there is an object in the category that can be regarded as the set of all morphisms from A to B ( the real definition involves adjoint functors and I am going to avoid saying anything about them for as long as I can ). Essentially, cartesian closed means that hom-sets can be internalized. In the category of sets S the hom-sets are given for any two objects A and B as hom(A,B)={f|domain(f)=A and codomain(f)=B } and since a hom-set is a set the category of sets is cartesian closed. Typically, exponents are used to denote hom-sets in a general cartesian closed category so hom(A,B)=B**A in this ASCIIese. Finite limits means you can do finite products and pullbacks which intuitively means you can do cartesian products and (sortof) intersections of sets (actually pullbacks). A subobject classifier is a fancy way of saying you have an object in the category that can thought of as the set of truth values for the set theory that the topos models. As it happens, S is a topos with the subobject classifier being the set 2={T,F} containing T for true and F for false. There are many examples of toposes ( or topoi ) that are not Boolean like S--e.g., given any topological space U there exists a topos that is constructed by Grothendieck topologies whose truth values are the open sets of U ( warning: the math semantics of the latter instance of the word "topology" does not coincide with the former; I only mention Grothendieck here because this building of new toposes from old by Grothendieck topologies is used in the construction of a "set theory" that negates CH ). Now for the CH. What we want to come up with is a topos that looks like ordinary set theory--i.e., satisfies the axioms of set theory--but has an object X in it such that there is a one-to-one ( monomorphism ) map from the natural numbers N to X, a one-to-one map from X to the power set of N, denoted 2**N, such that there is no onto ( epimorphism ) map from P(N) to X and no onto map from X to N. Such an X implies that there is a cardinal number strictly between aleph-0 and aleph-1. This is the categorical form of Schroeder-Bernstein. The basic form of attack is to start with an internal form of the problem expressed in the internal language of an arbitrary topos as we have done somewhat in the previous paragragh. We take a two-valued model of set theory, say the S from above, choose a partially ordered set (poset) P in S ( as Johnstone puts it: normally the poset of "finite partial models" of the thing one is trying to construct ). One then constructs from S and P the topos consisting of the double-negation sheaves of the functor category S**P. This new category ( whatever it is ), denoted E, is also a topos but the subobject classifier is only known to be Boolean and not necessarily to be of just two elements as we expect for "standard" set theory. Assuming P was chosen correctly, the hypothesis to be proven ( for us, the negation of CH, or, more explicitly, the existence of that object X ) is then shown to be true in the new topos E. To get to a two-valued system of set theory, there is a general method ( the ultrapower construction ) of getting a two-valued set theory ( or topos ) from any Boolean topos such that the properties of the object X are preserved from the old topos E to the new two-valued topos, denoted S' ( this preservation of cardinals from the Boolean topos to the two-valued topos comes from the "logical" functor that maps E to S' ). The two-valued topos S' satisfies the same axioms that S did but it does not satisfy CH, thus yielding the independence of CH. Now for some details on that poset P. We assume that S contains an object N that "resembles" the natural numbers--i.e., the topos S satifies the axiom of infinity. Let I denote an object in S such that 2**N ( the power set of N ) is strictly contained in I. More explicitly, there is a one-to-one map from 2**N into I but there is no onto map from 2**N to I so that the cardinal number of 2**N is strictly less than that of I. The set P consists of all functions from a finite subset of the cartesian product of N and I to 2--i.e., the domain of any such function f consists of a finite number of ordered pairs of the form (n,i) where n is in N and i is in I, and each such ordered pair is sent by f to T or F in 2={T,F}. Given two such functions f and g in P, we say f<=g if g extends f, which means to say that the domain of g contains the domain of f and the values of g coincide with those of f on those elements that are in common. In Johnstone's words: We think of an element of P as a finite set of "forcing conditions", each of which says that a particular element n of N either is or is not in the particular element i of I. We use the notation card(x) to denote the cardinality of the object x. Without going into the technical lemmas or proofs, the following leaps of faith provide the mileposts to the independence of CH: A> Show that P satisfies the countable chain condition. This means that if A is a subset of P and no two elements of A have a common extension in P, then A is countable. B> Construct the topos of double-negation sheaves of the functor category S**P. This may be a little hard to explain. The functor category S**P is the set of all functors from P considered as a category to S. An object of the category S**P can then be thought of as attaching to each element p in P a set Yp in S such that if p<=q in P then there is a given one-to-one map from Yp to Yq in S. From general principles it follows that S**P is a topos but not necessarily a Boolean one. Since S**P is a topos it has a subobject classifier ( intuitively, the set of truth values for S**P ) so denote it by M. It is known that M is a Heyting algebra so it has a negation map not:M--->M ( the subobject classifier of a topos is always a Heyting algebra; a Boolean algebra is an example of a Heyting algebra). From the map not-not ( not composed with itself once ) we can construct a Grothendieck topology from which we obtain the topos of double-negation sheaves, denoted E. This construction is "nice" so that many facts that hold in S also hold in E. More to the point, E is a Boolean topos even though it is not a two-valued model of set theory ( remember, we get to the two-valued one via the ultrapower construction ). In particular, the construction provides a mapping ( functor ) from S to E and it can be shown that this mapping preserves cardinals. Denote this mapping as G:S--->E. C> Let W denote the subobject classifier in E. We transport the set I from S to E by the mapping G to obtain the object G(I) in E. It can be shown that there is a one-to-one mapping of G(I) to the object W**N ( this gives card(G(I)) <= card(W**N) ). The object W**N can be thought of ( in E, not S! ) as the set of subsets of the natural numbers ( the natural numbers in the topos E look very different from the natural numbers in S but still satisfy properties of natural numbers ). In the set theory E card(N) is aleph-0 and card(W**N) is aleph-1. This step is where the countable chain condition properties of the poset P get exercised. Remember that in the standard set theory topos S we have card(2**N) <= card(I). This should lead you to believe that G(2) is not the same as W in the topos E so it is clear that G does *not* preserve subobject classifiers. D> Next it has to be shown that in the topos E, the object G(2**N) negates the Continuum Hypothesis ( we get this when we can assert card(N) < card(G(2**N)) < card(W**N) ). Note that in both toposes S and E we use the same symbol N for the natural numbers despite the fact that they are quite different. This can be forgiven since G(N in S) is isomorphic to N in E ( isomorphism means intuitively that there is a one-to-one and onto map between the two; more to the point, G preserves natural number objects). First, we have the following sequence of one-to-one maps ( monomorphisms ): N iso G(N) -----> G(2**N) -----> G(I) -----> W**N where the first two come from the assumptions about I in S and the last one comes from step C above. From the above we get: card(N)=card(G(N))<=card(G(2**N))<=card(G(I))<=card(W**N) All we need now is that the first two of the above inequalities are strict and that the last is an equality. Let Epi(A,B) denote the set of epimorphisms from the set A to the set B. Note that Epi(A,B) is empty iff the cardinal number of A is strictly less than that of B. From our previous assumptions in S, we know that Epi(N,2**N) and Epi(2**N,I) are both empty. It follows in E that Epi(N,G(2**N)) and Epi(G(2**N),G(I)) are both empty ( the functor G does preserves epimorphisms). At this point we know card(N) < card(G(2**N)) < card(G(I)). By suitable magic ( injectives and global elements ) we can obtain an onto map ( epimorphism ) q:W**N---->G(I). This last epimorphism gives card(G(I))=card(W**N). Thus, in the Boolean topos E ( or Boolean set theory if you prefer ), we have card(N) < card(G(2**N)) < card(W**N). E> Using the ultrapower construction, we can reduce the Boolean topos E to a two-valued topos S'. This gives a mapping L from the topos E to S' that preserves cardinals *and* preserves subobject classifiers. Thus, L(W)=2 where this 2 is the subobject classifier in S', and moreover, L(N)=N where this last N is the natural number object in S'. It follows that card(L(N))=card(N) < card(L(G(2**N)) < card(L(W**N)) = card(L(W)**L(N)) = card(2**N). We now set X=L(G(2**N)). Thus, we have constructed a set theory S'=L(G(S)) that has a cardinal number strictly between aleph-0 and aleph-1. Now that the smoke has cleared and the reader's brain is suitably fused, one might ask what is the difference between this "proof" and Cohen's. By and large, the difference is that the fundamental terms in Cohen's come from set theory and those here come from category theory. So depending on what you already know, the set theory version may make more sense. However, many of the constructions used in the categorical version are standard and should be known by most users in the field ( perhaps not the ultrapower construction, but the functor categories for sure and most should at least know of Grothendieck topologies ). I would maintain that the categorical version of CH does not bring any new light on CH itself, but for someone who is versed in category theory as myself, the categorical version of the proof is far easier to read than the original Cohen paper ( yes, I had to read both for the seminar that I gave so I could tell the interested spectator what the differences were ). Moreover, the categorical techniques are quite easily generalizable and one can hope that other independence results can be categorically "forced" rather than having to deal with the drudgery of set theory. So far to my knowledge, only one other result has been shown to be independent via category theory and that was Souslin's Hypothesis shown by Marta Bunge at McGill University, Montreal. I did a seminar on that one as well but there is a glaring error that had passed others by: An arbitrary union of finite trees is a finite tree. Since I was giving the seminar, I was expected by prof Mac Lane ) to fix the error. I did this by reading the original independence proof written in set theory and laboriously performing a translation to categorical terms. The resulting categorical argument was smaller than the original ( blessed is the hindsight ) but this categorical treatment did not win any points with the logicians at my school. In other words, those people who make a living at "forcing" do not use categorical techniques. Long Live the Categorical Imperative. Harry Dole {decwrl,hplabs}!spar!dole
colonel@sunybcs.UUCP (Col. G. L. Sicherman) (12/13/85)
["Antiset: a mathematical structure that contains none of its elements."] I've seen models of ^CH that were easier than Cohen's but didn't use category theory either. (Of course they used _some_ algebra! e.g., ultrafilters.) -- Col. G. L. Sicherman UU: ...{rocksvax|decvax}!sunybcs!colonel CS: colonel@buffalo-cs BI: csdsicher@sunyabva