Definition 1.1 (Sample space): The sample space Ω is a non-empty set.
Note: The sample space Ω is the space of all outcomes ω∈Ω of a random experiment.
Definition 1.2 (Complement): For any subset A⊆Ω, the complement is Ac=Ω∖A.
We will use the notion of collections and families of subsets of Ω frequently.
Definition 1.3 (Collection): The set C⊆P(Ω) is called a collection.
Definition 1.4 (Familiy): Let I be a non-empty index set. A family of subsets of Ω is a mapping I↦P(Ω). We denote it by (Ai)i∈I.
Note:
We write (Ai)i∈I⊆C to denote ∀i∈I:Ai∈C.
If a collection is the image of a family (Ai)i∈I, we denote it with {Ai}i∈I={S∈P(Ω)∣∃i∈I:S=Ai}.
We introduce the concept of a σ-algebra.
Definition 1.5 (σ-algebra): A collection F is a σ-algebra on Ω if
Ω∈F
If A∈F, then Ac∈F
If (An)n∈N⊆F, then ⋃n∈NAn∈F
Note:
The elements A∈F are called measurable sets or events.
An arbitrary intersection of σ-algebras is a σ-algebra.
Proposition 1.6 (Stability under countable intersections): If (An)n∈N⊆F, then ⋂n∈NAn∈F.
Note: The tuple (Ω,F) is called a measurable space.
Definition 1.7 (Probability measure): The function P:F→[0,1] is a probability measure on the measurable space (Ω,F) if
P(Ω)=1
Let (An)n∈N⊆F with Ai,Aj pairwise disjoint for any i=j, then
P(n∈N⋃An)=n∈N∑P(An)
Note: For an event A∈F, the quantity P(A) measures the probability that an outcome ω∈Ω lies in A.
We introduce the concept of a probability space.
Definition 1.8 (Probability space): A probability space is a triple (Ω,F,P), where
Ω is a sample space
F is a σ-algebra on Ω
P is a probability measure on (Ω,F)
Note: In this text, unless noted otherwise, we consider the probability space as fixed to a specific triple (Ω,F,P).
We present some important properties of the probability measure.
Proposition 1.9 (Probability of complement): Let A∈F, then P(Ac)=1−P(A).
Proposition 1.10 (Continuity of probability measure): Let (An)n∈N⊆F.
If ∀n∈N:An⊆An+1, then
P(n→∞limAn)=n→∞limP(An)
i.e. P is continuous from below.
If ∀n∈N:An+1⊆An, then
P(n→∞limAn)=n→∞limP(An)
i.e. P is continuous from above.
Proposition 1.11 (Union bound): Let (An)n∈N⊆F, then
P(n∈N⋃An)≤n∈N∑P(An)
Proposition 1.12 (Bonferoni inequality): Let (An)n∈N⊆F, then
P(n∈N⋂An)≥1−n∈N∑P(Anc)
1.2 Collection-Generated σ-Algebra
The σ-algebra generated by a collection C is the collection of all events whose occurence is determined once we know whether ω in A for all A∈C. We formalize this concept.
Definition 1.13 (Collection-generated σ-algebra): Let C be a collection, then
σ(C)=⋂{F~⊆P(Ω)C⊆F~ and F~ is a σ-algebra}
is the σ\sigmaσ-algebra generated by C\colCC.
Note:
σ(C)\sigma(\colC)σ(C) is the smallest σ\sigmaσ-algebra containing C\colCC.
If F\sigmaFF is a σ\sigmaσ-algebra, then σ(F)=F\sigma(\sigmaF) = \sigmaFσ(F)=F.
If C⊆C~\colC \subset \tilde{\colC}C⊆C~, then σ(C)⊆σ(C~)\sigma(\colC) \subset \sigma(\tilde{\colC})σ(C)⊆σ(C~).
If C\colCC is finite, σ(C)\sigma(\colC)σ(C) is the set of all possible unions of atoms of C\colCC.
We take a closer look at a special case of collection-generated σ\sigmaσ-algebras.
Recap (Topological space): A topological space is a tuple (E,T)(\setE, \colT)(E,T) where E\setEE is a non-empty set and T\colTT is a collection of subsets satisfying
∅,E∈T\varnothing, \setE \in \colT∅,E∈T
If (Oi)i∈I⊆T\family*{\setO}{i}{\setI} \subset \colT(Oi)i∈I⊆T, then ⋃i∈IOi∈T\Union_{i \in \setI} \setO_i \in \colT⋃i∈IOi∈T
If (Oi)i∈I⊆T\family*{\setO}{i}{\setI} \subset \colT(Oi)i∈I⊆T with I\setII finite, then ⋂i∈IOi∈T\Intersect_{i \in \setI} \setO_i \in \colT⋂i∈IOi∈T
Definition 1.14 (Borel σ\sigmaσ-algebra): Let (E,T)(\setE, \colT)(E,T) be a topological space. Then B(E)=σ(T)\borelB(\setE) = \sigma(\colT)B(E)=σ(T) is the Borel σ\sigmaσ-algebra of E\setEE.
Proposition 1.15 (Borel σ\sigmaσ-algebra of R\RR): Let (R,{(a,b)|a,b∈R})(\R, \set{(a,b) \mid a,b \in \R})(R,{(a,b)∣a,b∈R}) be the standard topology on R\RR, we have
B(R)=σ({(a,b)|a,b∈R})
\borelB(\R) = \sigma(\set{(a,b) \mid a,b \in \R})
B(R)=σ({(a,b)∣a,b∈R})
Note: There are multiple equivalent definitions of B(R)\borelB(\R)B(R) such as:
B(R)=σ({(−∞,a)|a∈R})\borelB(\R) = \sigma(\set{(-\infty,a) \mid a \in \R})B(R)=σ({(−∞,a)∣a∈R})
B(R)=σ({(−∞,a]|a∈R})\borelB(\R) = \sigma(\set{(-\infty,a] \mid a \in \R})B(R)=σ({(−∞,a]∣a∈R})
1.3 λ\lambdaλ- and π\piπ-Systems
Definition 1.16 (λ\lambdaλ-system): A collection D\colDD is a λ\lambdaλ-sytem if
Ω∈D\Omega \in \colDΩ∈D
If A∈D\setA \in \colDA∈D, then Ac∈D\setA^c \in \colDAc∈D
If (An)n∈N⊆D\family{\setA} \subset \colD(An)n∈N⊆D pairwise disjoint, then ⋃n∈NAn∈D\UnionN \setA_n \in \colD⋃n∈NAn∈D
Note: If D\colDD is a σ\sigmaσ-algebra, then D\colDD is a λ\lambdaλ-system. The converse does not hold in general. To see that, let for example Ω={1,2,3,4}\Omega = \set{1,2,3,4}Ω={1,2,3,4} and D={∅,Ω,{1,2},{3,4},{1,3},{2,4}}\colD = \set{\varnothing, \Omega, \set{1,2}, \set{3,4}, \set{1,3}, \set{2,4}}D={∅,Ω,{1,2},{3,4},{1,3},{2,4}}. Then D\colDD is a λ\lambdaλ-system but not a σ\sigmaσ-algebra.
The notion of a λ\lambdaλ-sytem is very similar to a σ\sigmaσ-algebra. The only difference lies in the third condition, where it suffices that D\colDD is stable under disjoint countable unions.
Definition 1.17 (Collection-generated λ\lambdaλ-system): Let C\colCC be a collection, then
λ(C)=⋂{D~⊆P(Ω)|C⊆D~ and D~ is a λ-system}
\lambda(\colC) = \Intersect \set{ \tilde{\colD} \subset \powerset(\Omega) \mid \text{C⊆D~\colC \subset \tilde{\colD}C⊆D~ and D~\tilde{\colD}D~ is a λ\lambdaλ-system} }
λ(C)=⋂{D~⊆P(Ω)C⊆D~ and D~ is a λ-system}
is the λ\lambdaλ-system generated by C\colCC.
Note:
λ(C)\lambda(\colC)λ(C) is the smallest λ\lambdaλ-system containing C\colCC.
If D\colDD is a λ\lambdaλ-system, then λ(D)=D\lambda(\colD) = \colDλ(D)=D.
If C⊆C~\colC \subset \tilde{\colC}C⊆C~, then λ(C)⊆λ(C~)\lambda(\colC) \subset \lambda(\tilde{\colC})λ(C)⊆λ(C~).
As σ(C)\sigma(\colC)σ(C) is a λ\lambdaλ-system containing C\colCC, we have λ(C)⊆σ(C)\lambda(\colC) \subset \sigma(\colC)λ(C)⊆σ(C).
The converse of the latter property does not hold in general. To see that let Ω\OmegaΩ and D\colDD be as defined previously. Then λ(D)=D\lambda(\colD) = \colDλ(D)=D but σ(D)=P(Ω)\sigma(\colD) = \powerset(\Omega)σ(D)=P(Ω).
Definition 1.18 (π\piπ-system): A collection D\colDD is a π\piπ-system if for any (Ai)i∈I⊆D\family*{\setA}{i}{\setI} \subset \colD(Ai)i∈I⊆D with I\setII finite, we have ⋂{Ai}i∈I∈D\Intersect \collection*{\setA}{i}{\setI} \in \colD⋂{Ai}i∈I∈D.
Note:
In other words, the only condition for a collection to be a π\piπ-system is that it is closed under finite intersections.
By induction, it is sufficient to show if A,B∈C\setA, \setB \in \colCA,B∈C then A∩B∈C\setA \intersect \setB \in \colCA∩B∈C.
Proposition 1.19 (π\piπ and λ\lambdaλ implies σ\sigmaσ): Let D\colDD be a π\piπ-system. Then it is a σ\sigmaσ-algebra if and only if it is a λ\lambdaλ-system.
Dynkin's Theorem asserts that the inclusion λ(C)⊆σ(C)\lambda(\colC) \subset \sigma(\colC)λ(C)⊆σ(C) is an equality when C\colCC is a π\piπ-system.
Theorem 1.20 (Dynkin's Theorem): Let C\colCC be a π\piπ-sytem. Then, λ(C)=σ(C)\lambda(\colC) = \sigma(\colC)λ(C)=σ(C).
Proof: TODO
Dynkin's Theorem simplifies proofs for generated σ\sigmaσ-algebras σ(C)\sigma(\colC)σ(C) by allowing one to verify a property merely for the λ\lambdaλ-system λ(C)\lambda(\colC)λ(C), which requires the easier condition of stability under disjoint countable unions rather than general countable unions. Consequently, if C\colCC is stable under finite intersection, the theorem guarantees that any property holding for λ(C)\lambda(\colC)λ(C) is valid for σ(C)\sigma(\colC)σ(C). An example application is shown in the following.
Example (Uniqueness of measure): Let μ\muμ and ν\nuν be two measures on the measurable space (R,B(R))(\R, \borelB(\R))(R,B(R)) for which ∀a∈R:μ((−∞,a])=ν((−∞,a])\forall a \in \R : \mu((-\infty,a]) = \nu((-\infty,a])∀a∈R:μ((−∞,a])=ν((−∞,a]). We want to prove that μ\muμ and ν\nuν are equal on the entirety of B(R)\borelB(\R)B(R). We note
∀a∈R:μ((−∞,a])=ν((−∞,a])⟺∀A∈C:μ(A)=ν(A)
\forall a \in \R : \mu((-\infty,a]) = \nu((-\infty,a]) \iff \forall \setA \in \colC : \mu(\setA) = \nu(\setA)
∀a∈R:μ((−∞,a])=ν((−∞,a])⟺∀A∈C:μ(A)=ν(A)
for C={(−∞,a]|a∈R}\colC = \set{(-\infty,a] \mid a \in \R}C={(−∞,a]∣a∈R}. We show that C\colCC is a π\piπ-system and that the property of interest, the equality of μ\muμ and ν\nuν, holds on λ(C)\lambda(\colC)λ(C).
C\colCC is a π\piπ-system: Let A1=(−∞,a1]∈C\setA_1 = (-\infty,a_1] \in \colCA1=(−∞,a1]∈C and A2=(−∞,a2]∈C\setA_2 = (-\infty,a_2] \in \colCA2=(−∞,a2]∈C. Without loss of generality, let a1≤a2a_1 \leq a_2a1≤a2. Then, A1∩A2=A1∈C\setA_1 \intersect \setA_2 = \setA_1 \in \colCA1∩A2=A1∈C.
μ=ν\mu = \nuμ=ν holds on λ(C)\lambda(\colC)λ(C): Let L={A∈B(R)|μ(A)=ν(A)}\colL = \set{\setA \in \borelB(\R) \mid \mu(\setA) = \nu(\setA)}L={A∈B(R)∣μ(A)=ν(A)} be the collection of all subsets where the measures agree. By hypothesis, C⊆L\colC \subset \colLC⊆L. We prove that L\colLL is a λ\lambdaλ-system. First, Ω∈L\Omega \in \colLΩ∈L since μ(Ω)=ν(Ω)\mu(\Omega) = \nu(\Omega)μ(Ω)=ν(Ω). Second, if A∈L\setA \in \colLA∈L, then μ(Ac)=1−μ(A)=1−ν(A)=ν(Ac)\mu(\setA^c) = 1 - \mu(\setA) = 1 - \nu(\setA) = \nu(\setA^c)μ(Ac)=1−μ(A)=1−ν(A)=ν(Ac), so Ac∈L\setA^c \in \colLAc∈L. Third, if (An)n∈N⊆L\family{\setA} \subset \colL(An)n∈N⊆L pairwise disjoint, then
μ (⋃n∈NAn)=∑n∈Nμ(An)=∑n∈Nν(An)=ν (⋃n∈NAn)
\mu\of{\UnionN \setA_n} = \sumN \mu(\setA_n) = \sumN \nu(\setA_n) = \nu\of{\UnionN \setA_n}
μ(n∈N⋃An)=n∈N∑μ(An)=n∈N∑ν(An)=ν(n∈N⋃An)
so ⋃n∈NAn∈L\UnionN \setA_n \in \colL⋃n∈NAn∈L. Since L\colLL is a λ\lambdaλ-system containing C\colCC, we have λ(C)⊆L\lambda(\colC) \subset \colLλ(C)⊆L. Therefore, μ=ν\mu = \nuμ=ν on λ(C)\lambda(\colC)λ(C).
By Dynkin's Theorem, λ(C)=σ(C)=B(R)\lambda(\colC) = \sigma(\colC) = \borelB(\R)λ(C)=σ(C)=B(R). Thus μ=ν\mu = \nuμ=ν holds on B(R)\borelB(\R)B(R).
Note: The technique used in step 2. is called the Good Sets Principle. It allows one to prove that a property A\mathfrak{A}A holds for all sets in λ(C)\lambda(\colC)λ(C) without constructing the sets explicitly. The logic proceeds in three steps.
One defines the collection of good sets L={A∈σ(C)|A(A) is true}\colL = \set{ \setA \in \sigma(\colC) \mid \mathfrak{A}(\setA) \text{ is true}}L={A∈σ(C)∣A(A) is true}.
One shows that C\colCC is contained in L\colLL and that L\colLL forms a λ\lambdaλ-system.
By the minimality of generated systems, one concludes that λ(C)⊆L\lambda(\colC) \subseteq \mathcal{L}λ(C)⊆L, implying A\mathfrak{A}A holds for λ(C)\lambda(\colC)λ(C).
1.4 Independence of Collections
The language of probability theory relies on measure theory, together with the concept of independence. We introduce the concept of independence for a collection of events.
Definition 1.21 (Independent collection): Let I\setII be an arbitrary index set. A collection C={Ai}i∈I⊆F\colC = \collection*{\setA}{i}{\setI} \subset \sigmaFC={Ai}i∈I⊆F is called independent if for each finite index set J⊆I\setJ \subset \setIJ⊆I we have
P(⋂j∈JAj)=∏j∈JP (Aj)
\probP*{\Intersect_{j \in \setJ} \evA_j} = \prod_{j \in \setJ} \probP{\evA_j}
P(j∈J⋂Aj)=j∈J∏P(Aj)
We extend the concept of independence to family of collections.
Definition 1.22 (Independent family of collections): Let K\setKK be an arbitrary index set and for each k∈Kk \in \setKk∈K, let Ck⊆F\colC_k \subset \sigmaFCk⊆F be a collection of events. The family of collections (Ck)k∈K\family*{\colC}{k}{\setK}(Ck)k∈K is called independent if for each finite index set {k1,…,kn}⊆K\set{k_1, \ldots, k_n} \subset \setK{k1,…,kn}⊆K we have
∀Ak1∈Ck1⋯∀Akn∈Ckn:P(⋂i=1nAki)=∏i=1nP (Aki)
\forall \evA_{k_1} \in \colC_{k_1} \sep \cdots \sep \forall \evA_{k_n} \in \colC_{k_n} : \probP*{\Intersect_{i = 1}^n \evA_{k_i}} = \prod_{i = 1}^n \probP{\evA_{k_i}}
∀Ak1∈Ck1⋯∀Akn∈Ckn:P(i=1⋂nAki)=i=1∏nP(Aki)
Note:
The independence of a family of collections (Ck)k∈K\family*{\colC}{k}{\setK}(Ck)k∈K does not imply that each individual collection Ck\colC_kCk is an independent collection.
If each family is made of a single event, i.e. Ck={Ak}\colC_k = \set{\setA_k}Ck={Ak}, then the independence of the families family of collections (Ck)k∈K\family*{\colC}{k}{\setK}(Ck)k∈K is equivalent to the independence of the collection C=⋃k∈KCk\colC = \Union_{k \in \setK} \colC_kC=⋃k∈KCk.
(Ck)k∈K\family*{\colC}{k}{\setK}(Ck)k∈K is independent if and only if (Cki)ki∈K~\family*{\colC}{k_i}{\tilde{\setK}}(Cki)ki∈K~ is independent for each finite index set K~⊆K\tilde{\setK} \subset \setKK~⊆K.
Let n∈Nn \in \Nn∈N. The finite family of collections C1,…,Cn⊆F\colC_1, \ldots, \colC_n \subset \sigmaFC1,…,Cn⊆F is independent if and only if
∀A1∈C1∪{Ω}⋯∀An∈Cn∪{Ω}:P(⋂i=1nAi)=∏i=1nP (Ai)
\forall \evA_{1} \in \colC_{1} \union \set{\Omega} \sep \cdots \sep \forall \evA_{n} \in \colC_{n} \union \set{\Omega} : \probP*{\Intersect_{i = 1}^n \evA_{i}} = \prod_{i = 1}^n \probP{\evA_{i}}
∀A1∈C1∪{Ω}⋯∀An∈Cn∪{Ω}:P(i=1⋂nAi)=i=1∏nP(Ai)
The following proposition asserts that to establish independence of σ\sigmaσ-algebras, it suffices to prove independence of generating π\piπ-systems.
Proposition 1.23 (σ\sigmaσ-algebras inherit independence): Let I\setII be an arbitrary index set and for each i∈Ii \in \setIi∈I, let Ci⊆F\colC_i \subset \sigmaFCi⊆F be a π\piπ-system. The family of collections (Ci)i∈I\family*{\colC}{i}{\setI}(Ci)i∈I is independent if and only if the family of collections (σ(Ci))i∈I(\sigma(\colC_i))_{i \in \setI}(σ(Ci))i∈I is independent.
Proof: TODO
The following is a direct consequence of the previous proposition.
Proposition 1.24 (Independent collection of complements): Let I\setII be an arbitrary index set. The collection {Ai}i∈I\collection*{\setA}{i}{\setI}{Ai}i∈I is independent if and only if the collection {Aic}i∈I\{\setA_i^c\}_{i \in \setI}{Aic}i∈I is independent.