Now we’ll discuss the backbone of all modern math, set theory. Math can be thought of as, in it’s most basic form, the study of collections; a series of advanced ways of counting. A set is effectively a well-define collection of objects; however, it is actually quite difficult to specify what “well-defined” means. There was a time when a set was taken to be any collection of objects which could be described; however, this leads to paradoxes. For example, let denote the collection of all collections which do not contain themselves. Is in ? If it is, then contains itself, and so cannot be an element of , a contradiction. But if it is not, then does not contain , so must be in , another contradiction. The only possibility is that “membership”, the relation that specifies if is in the collection , is not well defined, and so the collection is not well-defined.
The schema where any collection of objects is a set is called “naive set theory”, and has some pretty clear paradoxes. To remedy this, a plethora of set theories were invented; these take the form of a set of axioms or definitions (depending on how you like to look at things) which outline how sets behave. Zermello-Fraenkel Choice (ZFC) set theory is what one might call the “industry standard” set theory, the one most mathemeticians use. The first two words (Zermello-Fraenkel) refer to the two mathemeticians who first defined it, while the “Choice” part refers to a particularly interesting axiom which ZFC includes (called the axiom of choice).
We’ll be studying the properties of ZFC specifically for the next two or three lessons, although all of math can be thought of as consequences of ZFC. Today, we’ll introduce the axioms of Zermelo set theory. Next lesson we’ll play with those axioms and see what constructions we can create from them. Finally, two lessons from now, we’ll discuss two additional axioms to make Zermelo set theory into ZFC.
Language of ZFC
In set theory, the objects we deal with are called sets. In addition to the logical connectives and the quantifiers , we specify an equality symbol and a membership symbol . Two sets and are said to be equal whenever can be replaced by in any logical statement without changing it’s truth value; in this case we say . If , we say that is a member of , or lies in . The meaning of this statement is dictated by the axioms of ZFC. For more information, see Wikipedia: Zermello-Fraenkel Set Theory - Formal Language. We will define in the axioms the intersection and union of two sets.
Notational Remark
For this article, I use lowercase letters to represent sets. In math, almost everything is a set, but it’s useful to view some sets as numbers, some sets as functions, some sets as relations, etc. When we do this, we’ll often keep track of what things we want to deal with as sets and what things we want to deal with otherwise by writing the sets as uppercase letters and the elements with lowercase letters. We’ll start doing this next week and remind you then.
Axioms of Zermelo Set Theory
We’ll begin by discussing Zermelo’s set theory, then add on axioms which Fraenkel included, and finally discuss the axiom of choice. These axioms are somewhat redundant; in reality, only nine are necessary. I’ve endeavored to put the ones I find most useful or use the most often first; the ones at the end are technical specifications to avoid paradoxes, which I almost never use. For more information, see here. Zermelo set theory includes, in addition to the axioms listed here, an axiom known as the axiom of choice, which we’ll discuss later. I’m drawing some of my descriptions of the axioms from the relevent descriptions here.
-
Empty Set. There is a set , called the empty set, which contains no element. Formally, we say
-
Extensionality. Two sets are equal if and only if they contain the same elements. This means whenever sets and include the same element, we can replace for in any logical statement and obtain the same truth-value. This can be written formally as
Note that this implies the empty set is unique, i.e. if a set has , then . (Show this to be true!).
-
Schema of Restricted Comprehension. Let be a set. For any predicate , the collection of all such that holds is a set. This is technically speaking an axiom for every predicate , but we’ll generally ignore that logical technicality. (That is, however, the reason for the name “schema”). We write this set using “set builder notation” as
and this is read “the set of all in such that (holds)”. A special case of this is the intersection. For sets and we define the intersection to be
i.e. the set of all in such that is (also) in .
-
Pairing. If and are sets, then there is a set which contains and as elements. Formally, we say
-
Power Set. If is a set, we say is a subset of (or , or ) if every element of is an element of . Formally,
The axiom of power set states that for each set there is a set such that if , then . Formally,
-
Union. If is a set, there is a set which contains for each and for each . We can write this as
If and are sets, we apply this axiom to (which is a set by the axiom of pairing) and denote the result .
-
Infinity. Let be a set. We know that is a set by the axiom of pairing; then the set exists by the axiom of pairing as well. Define to be
there is a set containing both the empty set and the set for each set . (This axiom effectively asserts the existence of the natural numbers, as we’ll see next week).
-
Regularity. Every non-empty set contains a member such that . If we say that and are disjoint. We can write this logically as
This exists to ensure that paradoxes don’t occur. In particular, no set can be an element of itself; if there was a set such that ,