2.1 Functions
2.1.1 Elementary Conventions
Definition 2.1 (Function): Let 𝐴 and 𝐵 be two sets. A function 𝑓 :𝐴 →𝐵 is a rule which assigns to each element 𝑎 ∈𝐴 exactly one element 𝑓(𝑎) ∈𝐵.
Definition 2.2 (Image, Preimage)
: Let
𝑓 :𝐴 →𝐵 be a function. We define the following sets:
Image: The image of 𝑓 under 𝐶 ⊂𝐴 is the set 𝑓(𝐶) ={𝑓(𝑎):𝑎∈𝐶}.
Preimage: The preimage of 𝑓 under 𝐷 is the set 𝑓−1(𝐷) ={𝑎:𝑓(𝑎)∈𝐷}.
Example (Image): Let 𝑓 :ℝ →ℝ, 𝑓(𝑥) =𝑥2. We have that 𝑓(ℝ) =[0,∞). To see it, it is clear that 𝑓(ℝ) ⊂[0,∞). For the other inclusion, take any 𝑦 ∈[0,∞). If we set 𝑥 =√𝑦, then 𝑥2 =𝑦. Thus, 𝑦 ∈𝑓(ℝ) and [0,∞) ⊂𝑓(ℝ).
Example (Vector valued function): A function 𝑓 :𝐴 →ℝ𝑘 assigns to each element 𝑎 ∈𝐴 a vector 𝑓(𝑎) ∈ℝ𝑘. We use the notation 𝑓(𝑎) =(𝑓1(𝑎),…,𝑓𝑘(𝑎)) for the value of 𝑓 at 𝑎, where 𝑓𝑖(𝑎), 𝑖 =1,…,𝑘, are referred to as the coordinate functions of 𝑓.
To indicate that an element 𝑎 is assigned to an element 𝑓(𝑎) we often use the notation 𝑎 ↦𝑓(𝑎) to define the assignment, i.e. the function. The following definition clarifies some notation for algebraic operations on functions.
Definition 2.3 (Sum, product, quotient)
: Let
𝑓,𝑔 :𝐴 →ℝ be two real-valued functions.
Sum: 𝑓 +𝑔 is the function 𝑎 ↦(𝑓 +𝑔)(𝑎) =𝑓(𝑎) +𝑔(𝑎).
Product: 𝑓𝑔 is the function 𝑎 ↦(𝑓𝑔)(𝑎) =𝑓(𝑎)𝑔(𝑎).
Quotient: If 𝑔(𝑎) ≠0 ∀𝑎 ∈𝐴, then 𝑓/𝑔 is the function 𝑎 ↦(𝑓/𝑔)(𝑎) =𝑓(𝑎)/𝑔(𝑎).
If a function takes values in the extended real numbers, we rely on the following definition regarding the infimum and supremum of the image.
Definition 2.4 (Infimum and supremum of image): Let 𝑓 :𝐴 →–ℝ be a function and 𝐸 ⊂𝐴. For the image 𝑓(𝐸) we use the notation inf𝑓(𝐸) =inf{𝑓(𝑒):𝑒∈𝐸} =inf𝑒∈𝐸𝑓(𝑒) and sup𝑓(𝐸) =sup{𝑓(𝑒):𝑒∈𝐸} =sup𝑒∈𝐸𝑓(𝑒).
2.1.2 Invertible Mappings
Definition 2.5 (Surjective, injective, bijective)
: Let
𝑓 :𝐴 →𝐵 be a function.
Surjective: 𝑓 is called surjective, if 𝑓(𝐴) =𝐵.
Injective: 𝑓 is called injective, if 𝑎 ≠𝑎2 ⟹ 𝑓(𝑎1) ≠𝑓(𝑎2).
Bijective: 𝑓 is called bijective if it is surjective and injective.
Example (Surjection): Let 𝑓 :ℝ →[0,∞), 𝑓(𝑥) =𝑥2. We already know that 𝑓 is surjective. However, it is not injective, since for 𝑥1 = −1 and 𝑥2 =1, 𝑓(𝑥1) =𝑓(𝑥2) =1. This shows that 𝑥 ↦𝑥2 is not bijective.
Definition 2.6 (Restriction): Let 𝑓 :𝐴 →𝐵 be a function and 𝐸 ⊂𝐴. The restriction of 𝑓 to 𝐸 is the function 𝑓|𝐸 :𝐸 →𝐵 given by 𝑓|𝐸(𝑎) =𝑓(𝑎) for any 𝑎 ∈𝐸.
Example (Restriction): The function 𝑓 :ℝ →[0,∞), 𝑓(𝑥) =𝑥2 is not bijective. However, its restriction to [0,∞) is.
Definition 2.7 (Composition): Let 𝑓 :𝐴 →𝐵 and 𝑔 :𝐵 →𝐶 be two functions. The composition 𝑔 ∘𝑓 or 𝑔(𝑓) is defined pointwise, i.e. (𝑔 ∘𝑓)(𝑎) =𝑔(𝑓)(𝑎) =𝑔(𝑓(𝑎)). Thus, 𝑔 ∘𝑓 :𝐴 →𝐶.
Definition 2.8 (Inverse): Let 𝑓 :𝐴 →𝐵 be a function. A function 𝑔 :𝐵 →𝐴 is called an inverse of 𝑓 if 𝑔(𝑓(𝑎)) =𝑎 ∀𝑎 ∈𝐴 and 𝑓(𝑔(𝑏)) =𝑏 ∀𝑏 ∈𝐵. If 𝑓 has an inverse it is called invertible.
A characterization of invertible functions is given by the notions of surjectivity and injectivity. This is summarized in the following result.
Proposition 2.9 (Invertible iff bijection): Let 𝑓 :𝐴 →𝐵 be a function. Then 𝑓 is bijective if and only if it is invertible.
Example (Exponential function): Let 𝑓(𝑥) =𝑒𝑥, 𝑥 ∈ℝ, i.e. 𝑓 is the natural exponential function. One can show that 𝑓 :ℝ →(0,∞) is bijective where the inverse is given by the natural logarithm log(𝑦) :(0,∞) →ℝ.
Example (Trigonometric functions): The trigonometric functions sin :ℝ →ℝ and cos :ℝ →ℝ are clearly not bijective. However, sin|[−𝜋/2,𝜋/2] :[ −𝜋/2,𝜋/2] →[ −1,1] and cos|[0,𝜋] :[0,𝜋] →[ −1,1] are bijective with inverse arcsin :[ −1,1] →[ −𝜋/2,𝜋/2] and arccos :[ −1,1] →[0,𝜋], respectively. As a further example, the cotangent cot :(0,𝜋) →ℝ, cot(𝜃) =cos(𝜃)sin(𝜃), is bijective with inverse arccot :ℝ →(0,𝜋).
2.1.3 Set Operations for Image and Preimage
Proposition 2.10 (Set operations for image and preimage)
: Let
𝑓 :𝐴 →𝐵 be a function. Let
𝐵∗ ⊂𝐵. Then
𝑓−1(𝐵𝑐∗) =𝑓−1(𝐵∗)𝑐.
Let 𝐼 and 𝐽 be some sets and 𝐴𝑖 ⊂𝐴, 𝑖 ∈𝐼, and 𝐵𝑗 ⊂𝐵, 𝑗 ∈𝐽, be collections of sets from 𝐴 and 𝐵 respectively. Then
- 𝑓(⋃𝑖∈𝐼𝐴𝑖) =⋃𝑖∈𝐼𝑓(𝐴𝑖)
- 𝑓−1(⋃𝑗∈𝐽𝐵𝑗) =⋃𝑗∈𝐽𝑓−1(𝐵𝑗)
- 𝑓−1(⋂𝑗∈𝐽𝐵𝑗) =⋂𝑗∈𝐽𝑓−1(𝐵𝑗)
2.1.4 Polar Coordinate Representation
Introduce the set 𝑈 ={(𝜌,𝜃)∈ℝ2:𝜌>0,0<𝜃<2𝜋} =(0,∞) ×(0,2𝜋). Define the function 𝑇(𝜌,𝜃) =(𝜌cos(𝜃),𝜌sin(𝜃)), (𝜌,𝜃) ∈𝑈. Clearly, 𝑇(𝑈) ⊂ℝ2. The identification 𝑥 =𝜌cos(𝜃) and 𝑦 =𝜌sin(𝜃) is known as a polar coordinate representation of the point (𝑥,𝑦) ∈ℝ2. Define 𝑉 =𝑅2 ([0,∞) ×0).
Proposition 2.11 (Bijection of polar coordinates): The mapping 𝑇 :𝑈 →𝑉 is bijective.
Notice that for any point 𝑠 =(𝑥,0), 𝑥 ≥0, 𝑠 ∉𝑇(𝑈). In particular, 𝑇 :𝑈 →ℝ2 can not be bijective.
2.1.5 Monotonic Function
Definition 2.12 (Increasing, decreasing, monotonic)
: Let
𝑓 :𝐼 →ℝ,
𝐼 ⊂ℝ be a function.
Increasing, decreasing: 𝑓 is called increasing, resp. decreasing, if 𝑥1 ≤𝑥2 ⟹ 𝑓(𝑥1) ≤𝑓(𝑥2), resp. 𝑥1 ≤𝑥2 ⟹ 𝑓(𝑥1) ≥𝑓(𝑥2).
Strictly increasing, decreasing: 𝑓 is called strictly increasing, resp. strictly decreasing, if 𝑥1 <𝑥2 ⟹ 𝑓(𝑥1) <𝑓(𝑥2), resp. 𝑥1 <𝑥2 ⟹ 𝑓(𝑥1) >𝑓(𝑥2).
Monotonic: 𝑓 is called monotonic, resp. strictly monotonic, if it is either increasing or decreasing, resp. strictly increasing or decreasing.
Proposition 2.13 (Strict monotonicity and inverse): Let 𝑓 :𝐼 →ℝ, 𝐼 ⊂ℝ be a strictly monotonic function. Then, 𝑓 :𝐼 →𝑓(𝐼) is a bijection. Further, if 𝑓 is strictly increasing, resp. strictly decreasing, on 𝐼, then an inverse of 𝑓 is strictly increasing, resp. strictly decreasing, on 𝑓(𝐼).
Example (Strict monotonicity and inverse): Let 𝑓 :[0,∞) →[0,∞), 𝑓(𝑥) =𝑥2. We know that 𝑓([0,∞)) =[0,∞). Further, if 𝑥1 <𝑥2, then 𝑓(𝑥1) <𝑥21 <𝑥22 =𝑓(𝑥2). Thus, 𝑓 is strictly increasing and therefore we know that 𝑓 has an inverse 𝑓−1 :[0,∞) →[0,∞) which is also strictly increasing. In this case, we know that 𝑓−1(𝑦) =√𝑦 is the unique inverse of 𝑓, since 𝑦 =𝑥2 has only one non-negative solution.
Proposition 2.14 (Monotonicity and derivatives)
: Let
𝑎,𝑏 ∈ℝ and
𝑓 :(𝑎,𝑏) →ℝ be a function that is differentiable on
(𝑎,𝑏). That is, the derivative
𝑓′ of
𝑓 exists on
(𝑎,𝑏). Then:
- 𝑓′(𝑥) ≥0 ∀𝑥 ∈(𝑎,𝑏) implies 𝑓|(𝑎,𝑏) is increasing
- 𝑓′(𝑥) >0 ∀𝑥 ∈(𝑎,𝑏) implies 𝑓|(𝑎,𝑏) is strictly increasing
- 𝑓′(𝑥) ≤0 ∀𝑥 ∈(𝑎,𝑏) implies 𝑓|(𝑎,𝑏) is decreasing
- 𝑓′(𝑥) <0 ∀𝑥 ∈(𝑎,𝑏) implies 𝑓|(𝑎,𝑏) is strictly decreasing
2.2 Cardinality of Sets
2.2.1 Cardinality
Definition 2.15 (Finite and infinte sets): Let 𝐴 be a set. 𝐴 is said to be finite if it contains a finite number of elements. Otherwise, if the number of elements 𝐴 is not finite, 𝐴 is referred to as infinite.
Example (Infinite sets): The sets ℕ, ℤ, ℚ, ℝ and ℂ are all infinite.
Definition 2.16 (Cardinality equality): Let 𝐴 and 𝐵 be two arbitrary sets. 𝐴 and 𝐵 are said to have the same cardinality, #𝐴 =#𝐵, if and only if there exists a bijection 𝑓 :𝐴 →𝐵.
Example (Cardinality of finite sets): If 𝐴 and 𝐵 are finite, then #𝐴 =#𝐵 precisely means that 𝐴 and 𝐵 have the same number of elements. In particular #∅ =0.
Definition 2.17 (Cardinality order): Let 𝐴 and 𝐵 be two sets and 𝐶 ⊂𝐵. If there exists a bijection 𝑔 :𝐴 →𝐶 we write #𝐴 ≤#𝐵. If there exists a bijection 𝑔 :𝐴 →𝐶 but no bijection 𝑓 :𝐴 →𝐵, we write #𝐴 <#𝐵.
Example (Cardinality of subsets): Let 𝐴 and 𝐵 with 𝐴 ⊂𝐵, then #𝐴 ≤#𝐵. This is because the identity map 𝑔 :𝐴 →𝐴, 𝑔(𝑎) =𝑎, is a bijection.
Proposition 2.18 (Cardinality relations): We have that #ℕ =#ℤ =#ℚ, #ℝ >#ℕ and that any interval 𝐼 ⊂ℝ is s.t. #𝐼 >#ℕ.
2.2.2 Countability
Definition 2.19 (Countable and uncountable sets): Let 𝐴 be a set. If #𝐴 ≤#ℕ, then 𝐴 is countable, otherwise it is uncountable.
Proposition 2.20 (Countably infinite sets): Let 𝐴 be a set. If 𝐴 is countable but not finite, then #𝐴 =#ℕ and 𝐴 is said to be countably infinite.
2.2.3 Cardinality of Set Operations
Proposition 2.21 (Union of countable sets): Let {𝐴𝑖:𝑖∈ℕ} be a collection of sets s.t. for any 𝑖 ∈ℕ, 𝐴𝑖 is countable. Then the union ⋃𝑖∈ℕ𝐴𝑖 is countable as well.
Proposition 2.22 (Cartesian product of countable sets): Let 𝐴𝑖, 𝑖 =1,…,𝑛, be countable sets. Then, their Cartesian product ×𝑛𝑖=1𝐴𝑖 is countable.
2.2.4 Heine-Borel Theorem
Theorem 2.23 (Heine-Borel theorem for intervals): Let [𝑎,𝑏], 𝑎,𝑏 ∈𝑅, 𝑎 <𝑏, be any closed interval. Let 𝐼 be some countable set and assume that there exists a family of open intervals (𝑎𝑖,𝑏𝑖), 𝑎𝑖,𝑏𝑖 ∈ℝ, 𝑖 ∈𝐼, s.t. [𝑎,𝑏] ⊂⋃𝑖∈𝐼(𝑎𝑖,𝑏𝑖). Then, there exists 𝑁 ∈ℕ and 𝑖1,…,𝑖𝑁 ∈𝐼 s.t. [𝑎,𝑏] ⊂⋃𝑁𝑗=1(𝑎𝑖𝑗,𝑏𝑖𝑗).
The latter result shows that any closed interval on the real line which is covered by a countable collection of open intervals can be covered by a finite sub-collection.
2.3 Open and Closed Sets in Real Coordinate Spaces
2.3.1 Euclidean Norm
Definition 2.24 (Euclidean norm): Let 𝐱 ∈ℝ𝑘, 𝐱 =(𝑥1,…,𝑥𝑘). The Euclidean norm of 𝐱 is given by ‖𝐱‖ =√𝑥21+…+𝑥2𝑘.
‖𝐱−𝐲‖ gives the Euclidean distance between two points 𝐱,𝐲 ∈ℝ𝑘.
Example (Euclidean norm in one dimension): If 𝑘 =1, then we write ‖𝑥‖ =|𝑥|, where the function 𝑥 ↦|𝑥| maps 𝑥 ∈ℝ to its absolute value √𝑥2 =|𝑥| ∈[0,∞).
For the next proposition, we use the notation 𝟎 for the vector in ℝ𝑘, 𝑘 ∈ℕ, with all of its coordinates equal to zero.
Proposition 2.25 (Properties of the Euclidean norm)
: For any
𝐱,𝐲 ∈ℝ𝑘, the Euclidean norm satisfies:
- ‖𝐱‖ ≥0
- ‖𝐱‖ =0 ⟺ 𝐱 =𝟎
- ‖𝜆𝐱‖ =𝜆‖𝐱‖ for any 𝜆 ∈ℝ
- ‖𝐱−𝐲‖ =‖𝐲−𝐱‖
- ‖𝐱+𝐲‖ ≤‖𝐱‖ +‖𝐲‖, the triangular inequality
- ‖𝐱−𝐲‖ ≥|‖𝐱‖−‖𝐲‖|, the reverse triangular inequality
2.3.2 Open and Closed Balls and Boundedness
Definition 2.26 (Open and closed balls): The open, resp. closed, ball of radius 𝑟 >0 with center 𝐲 ∈ℝ𝑘 is 𝐵𝑟(𝐲) ={𝐱∈ℝ𝑘:‖𝐲−𝐱‖<𝑟}, resp. 𝐵𝑟[𝐲] ={𝐱∈ℝ𝑘:‖𝐲−𝐱‖≤𝑟}.
Definition 2.27 (Open set): A set 𝑈 ⊂ℝ𝑘 is called open if for any point 𝐱 ∈𝑈 there exists an 𝜀 >0 s.t. 𝐵𝜀(𝐱) ⊂𝑈.
Example (Intersection of open sets is open): Let 𝑈1, 𝑈2 be two open subsets of ℝ𝑘. By definition, for an arbitrary 𝑥 ∈𝑈1 ∩𝑈2 there exist 𝐵𝜀1(𝐱) and 𝐵𝜀2(𝐱) s.t. 𝐵𝜀1(𝐱) ⊂𝑈1 and 𝐵𝜀2(𝐱). Set 𝜀 =min{𝜀1,𝜀2} and we obtain 𝐵𝜀(𝐱) ⊂𝑈1 ∩𝑈2. Hence, 𝑈1 ∩𝑈2 is open.
Example (Open interval is an open set): Let 𝑎,𝑏 ∈ℝ, 𝑎 <𝑏 s.t. (𝑎,𝑏) is an open interval. Let 𝑥 ∈(𝑎,𝑏). Note that |𝑥−𝑎| =𝑥 −𝑎 and |𝑥−𝑏| =𝑏 −𝑥. Let 𝜀 <min{𝑥−𝑎,𝑏−𝑥} and take any 𝑦 ∈(𝑥 −𝜀,𝑥 +𝜀) =𝐵𝜀(𝑥). We have |𝑥−𝑦| <𝜀 and thus 𝑥 −𝑦 ≤|𝑥−𝑦| <𝜀 <𝑥 −𝑎 and 𝑦 −𝑥 ≤|𝑥−𝑦| <𝜀 <𝑏 −𝑥. Hence 𝑎 <𝑦 <𝑏 and 𝑦 ∈(𝑎,𝑏) and 𝐵𝜀(𝑥) ⊂(𝑎,𝑏).
Example (Open ball is an open set): Let 𝐵𝑟(𝐲) ⊂ℝ𝑘 and 𝐱 ∈𝐵𝑟(𝐲) be an arbitrary point. Set 𝛿 =‖𝐱−𝐲‖. By definition of 𝐵𝑟(𝐲) we have 𝛿 <𝑟. Let 𝜀 =𝑟 −𝛿. For any 𝐳 ∈𝐵𝜀(𝐱) we have by the triangle inequality ‖𝐳−𝐲‖ ≤‖𝐳−𝐱‖ +‖𝐱−𝐲‖ <𝜀 +𝛿 =𝑟. Hence 𝐵𝜀(𝐱) ⊂𝐵𝑟(𝐲).
Definition 2.28 (Closed set): A set 𝑉 ⊂ℝ𝑘 is said to be closed if 𝑉𝑐 is open.
Definition 2.29 (Bounded set): A set 𝐴 ⊂ℝ𝑘 is said to be bounded if there exists an 𝑟 >0 and 𝐲 ∈𝐴 s.t. 𝐴 ⊂𝐵𝑟[𝐲].
Definition 2.30 (Bounded function): Let 𝑓 :𝐴 →–ℝ and 𝐸 ⊂𝐴. THe function 𝑓 is said to be bounded on 𝐸 if thre exists a 0 ≤𝑀 <∞ s.t. |𝑓(𝑎)| ≤𝑀 for any 𝑎 ∈𝐸.
2.3.3 Open Set Characterization of Continuity
Continuous functions on Euclidean spaces are characterized in terms of open sets.
Definition 2.31 (Continuous function on Euclidean spaces): A function 𝑓 :ℝ𝑚 →ℝ𝑘 is continuous if for any open set 𝑈 ⊂ℝ𝑘 the preimage 𝑓−1(𝑈) is open in ℝ𝑚.
Example (Continuity of the identity function): Let 𝑓 :ℝ →ℝ, 𝑓(𝑥) =𝑥. Take any open set 𝑈 ⊂ℝ. Then 𝑓−1(𝑈) =𝑈. Hence, 𝑓−1(𝑈) is an open subset of ℝ and 𝑓 is continuous.
In terms of functions defined on subsets of Euclidean spaces ℝ𝑚, continuity is characterized as follows.
Proposition 2.32 (Generalized notion of continuity): Let 𝐸 ⊂ℝ𝑚, 𝐸 ≠∅. A function 𝑓 :𝐸 →ℝ𝑘 is continuous iff for any open set 𝑈 ⊂ℝ𝑘 we have 𝑓−1(𝑈) ∈{𝐺∩𝐸:𝐺 open in ℝ𝑚}.
Another characterization of continuity, known as the sequence criterion for continuous functions, will be introduced in the next chapter.