Financial Engineering 2025-03-20

4. Itô’s Calculus

4.1 Differentiation

In the previous chapter, we have observed that a sample Brownian path is nowhere differentiable. In other words, the differentiation $\frac{d W_t}{dt}$ does not exist in a classical calculus sense. However, while studying Brownian motions, or when using Brownian motion as a model, the situation of estimating the difference of a function $f(W_t)$ over an infinitesimal interval $[t, t + \Delta t]$ arises frequently.

Theorem 4.1 (Itô’s Lemma): Let

f(t,x)

be a differentiable function, and let

X_t

be a stochastic process satisfying

\dd X_t = \mu_t \dd t + \sigma_t \dd W_t

for a Brownian motion

W_t

Then

\dd f(t,X_t) = \pa{\frac{\partial f}{\partial t} + \mu_t \frac{\partial f}{\partial x} + \frac{1}{2} \sigma_t^2 \frac{\partial^2 f}{\partial x^2}} \dd t + \sigma_t \frac{\partial f}{\partial x} \dd W_t

Proof: We expand

f(t + \Delta t, x + \Delta x)

using Taylor

\begin{align*} & f(t + \Delta t, x + \Delta x) - f(t,x) \\ ={} & \frac{\partial f}{\partial t} \Delta t + \frac{\partial f}{\partial x} \Delta x + \frac{1}{2} \frac{\partial^2 f}{\partial t^2} (\Delta t)^2 + \frac{\partial^2 f}{\partial t \partial x} \Delta t \Delta x + \frac{1}{2} \frac{\partial^2 f}{\partial x^2} (\Delta x)^2 + \ldots \end{align*}

For

\Delta t \to \dd t

and

\Delta x \to \dd x

we have

\begin{align*} \dd f(t,x) &= \frac{\partial f}{\partial t} \dd t + \frac{\partial f}{\partial x} \dd x + \frac{1}{2} \frac{\partial^2 f}{\partial t^2} \dd t^2 + \frac{\partial^2 f}{\partial t \partial x} \dd t \dd x + \frac{1}{2} \frac{\partial^2 f}{\partial x^2} \dd x^2 + \ldots \\ &= \frac{\partial f}{\partial t} \dd t + \frac{\partial f}{\partial x} \dd x + \frac{1}{2} \frac{\partial^2 f}{\partial x^2} \dd x^2 + o(\mathrm{d} t) \end{align*}

\dd X_t = \mu_t \dd t + \sigma_t \dd W_t

we have

\begin{align*} \dd X_t^2 &= (\mu_t \dd t + \sigma_t \dd W_t)^2 \\ &= \mu_t^2 \dd t^2 + 2 \mu_t \sigma_t \dd t \dd W_t + \sigma_t^2 \dd W_t^2 \\ &= \sigma_t^2 \dd t + o(\mathrm{d} t) \end{align*}

where we used

\dd W_t \to 0

as Brownian paths are continuous as well as the fact that the local quadratic variation

\dd W_t^2

\dd t

Therefore, omitting

o(\mathrm{d} t)

we have

\begin{align*} \dd f(t,X_t) &= \frac{\partial f}{\partial t} \dd t + \frac{\partial f}{\partial x} \dd X_t + \frac{1}{2} \frac{\partial^2 f}{\partial x^2} \dd X_t^2 \\ &= \frac{\partial f}{\partial t} \dd t + \frac{\partial f}{\partial x} \pa{\mu_t \dd t + \sigma_t \dd W_t} + \frac{1}{2} \frac{\partial^2 f}{\partial x^2} \pa{\sigma_t^2 \dd t} \\ &= \pa{\frac{\partial f}{\partial t} + \mu_t \frac{\partial f}{\partial x} + \frac{1}{2} \sigma_t^2 \frac{\partial^2 f}{\partial x^2}} \dd t + \sigma_t \frac{\partial f}{\partial x} \dd W_t \end{align*}

Note: The stochastic process

X_t = \mu t + \sigma W_t

is known as the Brownian motion with drift

\mu

and volatility

\sigma

Proposition 4.2 (Itô’s Lemma for Brownian Motion): Let

X_t = W_t

then Itô’s Lemma simplifies to

\dd f(t,X_t) = \frac{\partial f}{\partial t} \dd t + \frac{\partial f}{\partial x} \dd W_t + \frac{1}{2} \frac{\partial^2 f}{\partial x^2} \dd t

Example (Univariate quadratic function): Consider the function

f(t,x) = \frac{1}{2} x^2

and

X_t = W_t

Then

\begin{align*} \dd f(t,X_t) &= \frac{\partial f}{\partial t} \dd t + \frac{\partial f}{\partial x} \dd W_t + \frac{1}{2} \frac{\partial^2 f}{\partial x^2} \dd t \\ &= W_t \dd W_t + \frac{1}{2} \dd t \end{align*}

Note how this contradicts classical derivation rules, i.e.

\frac{\dd f(W_t)}{\dd W_t} \neq W_t

Example (Multivariate quadratic function): Consider the function

f(t,x) = t^2 + x^2

Then

\begin{align*} \dd f(t,X_t) &= \pa{\frac{\partial f}{\partial t} + \mu_t \frac{\partial f}{\partial x} + \frac{1}{2} \sigma_t^2 \frac{\partial^2 f}{\partial x^2}} \dd t + \sigma_t \frac{\partial f}{\partial x} \dd W_t \\ &= \pa{2t + 2\mu_t X_t + \sigma_t^2} \dd t + 2\sigma_t X_t \dd W_t \end{align*}

Example (Exponential function): Consider the function

f(t,x) = \exp{X_t}

Then

\begin{align*} \dd f(t,X_t) &= \pa{\frac{\partial f}{\partial t} + \mu_t \frac{\partial f}{\partial x} + \frac{1}{2} \sigma_t^2 \frac{\partial^2 f}{\partial x^2}} \dd t + \sigma_t \frac{\partial f}{\partial x} \dd W_t \\ &= \pa{\mu_t + \frac{1}{2} \sigma_t^2} \exp{X_t} \dd t + \sigma_t \exp{X_t} \dd W_t \\ &= \pa{\mu_t \dd t + \frac{1}{2} \sigma_t^2 \dd t + \sigma_t \dd W_t} \exp{X_t} \end{align*}

Fixing

\sigma_t = \sigma

and

\mu_t = \mu - \frac{1}{2} \sigma^2

we obtain

\dd f(t,X_t) = \pa{\mu \dd t + \sigma \dd W_t} f(t, X_t)

where

X_t = \pa{\mu - \frac{1}{2} \sigma^2} t + \sigma W_t

Note: Given

X_t

from the last example, we define the geometric Brownian motion

S_t = s_0 \cdot \exp{X_t} = s_0 \cdot \exp{\pa{\mu - \frac{1}{2} \sigma^2} t + \sigma W_t}

where

s_0 \in \R

is the constant initial value. Note that

S_t

solves the stochastic differential equation of the return price model

\frac{\dd S_t}{S_t} = \mu \dd t + \sigma \dd W_t

introduced in the previous chapter.

4.2 Integration

We aim to provide an intuitive understanding of integration in the context of stochastic processes.

Recap (

L^2

-norm of a random variable): The

L^2

-norm of a random variable

Y

is defined as

\norm{Y} = \sqrt{\mathbb{E}[X^2]}

In similar fashion, we define the $L^2$ -norm of a process

\cb{X_t}_{t \in [0,T]}

Definition 4.3 (

L^2

-norm of a stochastic process): The

L^2

-norm of a process

\cb{X_t}_{t \in [0,T]}

is defined as

\norm{X} = \sqrt{\int_0^T \mathbb{E}\bk{X_t^2} \dd t}.

We define the concept of a random step function to approximate a stochastic process.

Definition 4.4 (Random step function): A stochastic process

\cb{X_t}_{t \in [0,T]}

can be approximated by any degree of accuracy by a random step function

\cb{X_{t_k}^{(n)}}_{k \in \cb{0, \ldots, n}}

with intervals of length

\frac{T}{n}

Note: Approximated by any degree of accuracy means that the sequence

\cb{X^{(n)}}_{n \in \N}

is Cauchy, i.e it converges to

X

in the

L^2

-norm.

With the random step function definition, we can define the discrete stochastic integral.

Definition 4.5 (Discrete stochastic integral): Let

\cb{X_t}_{t \in [0,T]}

be a stochastic process with

\norm{X} < \infty

The discrete stochastic integral is defined as

I\pa{X^{(n)}} = \sum_{k=0}^{n-1} X_{t_{k}}^{(n)} \pa{W_{t_{k+1}} - W_{t_k}}

With that and with convergence in $L^2$ -norm, we can define the Itô integral.

Definition 4.6 (Itô integral): Let

\cb{X_t}_{t \in [0,T]}

be a stochastic process. The Itô stochastic integral

I(X) = \int_0^T X_t \dd W_t

is defined as the limit in

L^2

-norm of the discrete stochastic integral

I(X^{(n)})

i.e.

I(X) \lneq{2} \lim_{n \to \infty} I\pa{X^{(n)}}

Note: We note that the stochastic integral depends on the choice of the random step function evaluation. To show that, let

X_{t_k}^{(n)} = W_{\tau_k}

with

\tau_k = (1 - \alpha) t_k + \alpha t_{k+1}

Then one can prove that

\int_0^T W_t \dd W_t = \frac{1}{2} W_T^2 + \pa{\alpha - \frac{1}{2}} T

For the Itô integral we have

\alpha = 0

This integral is non-anticipating and therefore works well in a financial setting.

Proposition 4.7 (Itô Integral for separable integrand): If the integrand

\dd Y_t

is separable, i.e.

\dd Y_t = g(t) \dd t + h(W_t) \dd W_t

for a Brownian motion

W_t

then

\int_{t_a}^{t_b} \dd Y_t = G(t_b) - G(t_a) + H(W_{t_b}) - H(W_{t_a})

where

G(t) = \int g(t) \dd t

and

H(x) = \int h(x) \dd x

Example (Itô integral of Brownina motion): We want to figure out the Itô integral of a Brownian motion from

t_0

T

i.e.

\int_{t_0}^T W_t \dd W_t

i.e.

\dd Y_t = W_t \dd W_t

We define

f(t,x) = x^2

and

X_t = W_t

and note using Itô’s Lemma for differentiation that

\begin{align*} & \dd W_t^2 = \dd f(t,X_t) = 2 W_t \dd W_t + \dd t \\ \implies ~ & W_t \dd W_t = \frac{1}{2} \dd W_t^2 - \frac{1}{2} \dd t \end{align*}

Hence the integrand is separable and we conclude

\int_0^T W_t \dd W_t = \frac{1}{2} \pa{W_T^2 - W_0^2} - \frac{1}{2} (T - t_0)

We take a look at the properties of the Itô integral.

Proposition 4.8 (Isometry of the Itô integral): Let

\cb{X_t}_{t \in [0,T]}

be a stochastic process. Then

\E\bk{\pa{\int_0^T X_t \dd W_t}^2} = \int_0^T \E\bk{X_t^2} \dd t

or equivalently written in norms

\norm{I(X)} = \norm{X}

Proposition 4.9 (Properties of the Itô integral): Let

\cb{X_{t}}_{t \in [0,T]}

and

\{\tilde{X}_{t}\}_{t \in [0,T]}

be stochastic processes for which the Itô integral is well-defined. The following properties hold:

linearity: $\int_0^T \alpha X_{t} + \beta \tilde{X}_{t} \dd W_t = \alpha \int_0^T X_{t} \dd W_t + \beta \int_0^T \tilde{X}_{t} \dd W_t$
expectation:
\E\bk{\int_0^T X_t \dd W_t} = 0
variance:
\Var\bk{\int_0^T X_t \dd W_t} = \int_0^T \E\bk{X_t^2} \dd t
martingale property:
\E\bk{\int_0^T X_t \dd W_t \mid F_u} = \int_0^u X_t \dd W_t
for $u \in [0,T]$
product property:
\E\bk{\int_0^T X_t \dd W_t ~ \int_0^T \tilde{X}_t \dd W_t} = \int_0^T \E\bk{X_t \tilde{X}_t} \dd t

4.3 Generalization

Theorem 4.10 (Itô’s lemma for multidimensional processes): Let

f: \mathbb{R}^N \to \mathbb{R}

be a function in

\mathcal{C}^2

and

\cb{\boldsymbol{X}_t}_{t} = \cb{\pa{X_t^{(1)}, \ldots, X_t^{(N)}}}_{t}

be an

N

-dimensional stochastic process. Then

\dd f(\boldsymbol{X}_t)=\sum_{i=1}^N \frac{\partial f}{\partial x_i} \dd X_t^{(i)}+\frac{1}{2} \sum_{i=1}^N \sum_{j=1}^N \frac{\partial^2 f}{\partial x_i \partial x_j} \dd [X^{(i)}, X^{(j)}]_t

or equivalently

f(\boldsymbol{X}_{t_b}) = f(\boldsymbol{X}_{t_a}) + \sum_{i=1}^N \int_{t_a}^{t_b} \frac{\partial f}{\partial x_i}(\boldsymbol{X}_s) \dd X_s^{(i)} + \frac{1}{2} \sum_{i=1}^N \sum_{j=1}^N \int_{t_a}^{t_b} \frac{\partial^2 f}{\partial x_i \partial x_j}(\boldsymbol{X}_s) \dd [X^{(i)}, X^{(j)}]_s

Proposition 4.11 (Itô’s product rule): Let

X_t

and

Y_t

be two stochastic processes. Then

\dd (X_t Y_t) = X_t \dd Y_t + Y_t \dd X_t + \dd [X,Y]_t