This is the first post in a series on extending analysis from real-valued functions to functions valued in Banach spaces. Our first topic is the Bochner integral. That is, the extension of the Lebesgue integral to functions valued in Banach spaces.
Three line summary
-
The Bochner integral is a way of integrating functions $f$ from a measure space to a Banach space.
-
Like the Lebesgue integral, it is first constructed for piecewise constant functions $\mathcal{A}$ and extended continuously to the completion $\overline{\mathcal{A}}$.
-
The completion $\mathcal{A}$ can be explicitly described as the space of functions with separable image and with finite $L^1$ norm. This naturally leads to the definition of $L^p$ spaces.
Notation
-
We consider a measure space $(\Omega,\mathcal{F},\mu )$ which may not be $\sigma$-finite, and a Banach space $(X,\mathcal{B}(X))$ where $\mathcal{B}$ is the Borel sigma-algebra (that is, the smallest $\sigma$-algebra on $X$ containing all of the open sets) of $X$.
-
We denote the dual space of $X$ as $X^\star $ and given $x^\star \in X^\star $ we denote the pairing of $x\in X$ and $x^\star $ as $(x,x^\star ):= x^\star (x)$.
-
Given a subset $A\subset \Omega$ we denote the indicator function of $A$ as $1\U A$.
-
Given $f: \Omega \to \mathbb{R}$ and $x \in X$ we define the function $f\otimes x: \Omega \to X$ by
Strong measurability
Our goal is to define integration for functions valued in a Banach space
When working with measure spaces, we are, in general, not interested in what happens on sets of measure $0$. As a result, we are contented with properties of interest holding perhaps not everywhere but almost everywhere.
Definition 1. A property is said to hold $\mu$-almost everywhere if there exists a set $N\in \mathcal{F}$ with $\mu(N)=0$ such that the property holds on $\Omega\setminus N$.
As anticipated, we first consider the class of simple functions
We can define their integral quite naturally as
If we take equivalence classes and identify functions that are equal $\mu$ almost everywhere, we can define the norm
A verification shows that integration is linear and for all $f\in\mathcal{A}$.
That is, integration is a linear and continuous map
Since $X$ is a complete, we can linearly extend integration in a unique way to the completion $\overline{\mathcal{A}}$ of $(\mathcal{A},\left\lVert \cdot \right\rVert\U A)$. The space $\overline{\mathcal{A}}$, which can be built through taking limits of simple functions, is thus the space of functions that we can integrate. Our next step is to figure out what this is.
Definition 2. We say a function $f:(\Omega,\mathcal{F})\to (X,\mathcal{B}(X))$ is $\mu$-strongly measurable if there exists a sequence of simple functions $f\U n$ such that
Since the simple $f\U n$ are separately valued (that is, $f\U N(\Omega )\subset X$ is separable), $f$ will also be (almost everywhere) separably valued. As a result we will always end up working with separable Banach spaces. The following properties are of use
Exercise 1 . Let $X$ be a separable Banach space with dual $X^\star $ , show that
a) There exists $\left\{x\U n^\star \right\}\U {n=1}^\infty\subset X^\star $ such that
Such a sequence is called a norming sequence.
b) The Borel $\sigma$-algebra $\mathcal{B}(X)$ is equal to the $\sigma$-algebra generated by $\left\{x\U n^\star \right\}\U {n=1}^\infty$, and by $X^\star $. That is,
If $X$ is not separable, the inclusion $\mathcal{B}(X)\subset \sigma(X^\star )$ may fail. See Vakhania, 2012 page 23 for a counterexample.
c) A function $f: \Omega \to X$ is measurable if and only if it is weakly measurable. That is, if and only if for all $x^\star \in X^\star $ the function $(f,x^\star ): \Omega \to \mathbb{R}$ is measurable.
d) The dual $X^\star $ with the weak-$^\star $ topology (the topology generated by $X$ viewed as a subset of $X^{\star \star }$) is separable .
The following characterizes the space of strongly measurable functions
Theorem 3 (Pettis measurability theorem). A function $f:\Omega\to X$ is $\mu$-strongly measurable if and only if $f$ is $\mu$-almost everywhere separately valued and $(f, x^\star )$ is $\mu$ strongly measurable for all $x^\star \in X^\star $.
Proof. We first prove the implication. For some $x\U k^{(n)} \in X , A\U k^{(n)} \in \mathcal{F}$,
As a result, $f$ takes almost everywhere values in the separable space spanned by the countably many $x\U k^{(n)}$. To see that $(f,x^\star )$ is $\mu$-strongly measurable we note that $g\U n :=(f\U n,x^\star )$ are simple functions and
We now prove the reverse implication. By assumption, there exists $\Omega \U 1 \subset \Omega$ such that $\mu (\Omega \U 1^c)=0$ and $X\U 1=\overline{f(\Omega\U 1 )}$ is separable. As a result, and by point $1$ of exercise 1 there exists a norming sequence $\left\{x\U n^\star \right\}\U {n=1}^\infty \subset X\U 1$.
Since we only need to prove limits almost everywhere, we can suppose that $f$ is separably valued by restricting to $\Omega \U 1$. Now, by assumption the functions $g\U n:=(f,x\U n^\star )$ are $\mu$-strongly measurable. Let $K\U n$ be the support of $g\U n$ and $K=\cup\U {n \geq 1}K\U n$. By definition of strongly $\mu$-measurable function and since the union of countable sets is countable, $K$ is $\sigma$-finite. Since $x\U n^\star $ separate points, $f$ is $0$ outside of $K$, and by restricting to $K$ we can suppose that $\mu$ is $\sigma$-finite.
Let $\left\{x\U n\right\}\U {n=1}^\infty$ be a countable dense subset of $X\U 1$. Given $n \in \mathbb{N}$, we define $\varphi\U n (x)$ to be the $x\U k \in \left\{x\U j\right\}\U {j=1}^ n$ which is the closest to $x$, and where in the case of a tie we take the one with the smallest index $j$. We can now define the simple functions
The function $F\U n$ takes at most $n$-different values $x\U 1,\dots, x\U n$ with
Since $(f-x\U k,x\U n^\star )$ is $\mu$-strongly measurable, we deduce that the following function is measurable almost everywhere.
So, without loss of generality, we may suppose they are measurable by restricting once more. Then, by (\ref{sv}) $F\U n$ is measurable. Since we had restricted $u$ to be $\sigma$-finite, we may take a partition $\left\{\Omega \U n\right\}\U {n=1}^\infty$ of $\Omega$ with finite measure. Consider $f\U n:=F\U n 1\U {\Omega \U n}$, we have that $f\U n$ is $\mu$ simple (we need to multiply by $1\U {\Omega \U n}$ so that the support of the indicators have finite measure) and converges to $f$ almost everywhere. ◻
If we include $x\U 0=0$ in the norming sequence of Theorem 3, we have that $\left\lVert f\U n-f \right\rVert$ as in the proof above is bounded by $\left\lVert f \right\rVert$ almost everywhere and obtain the following corollary which will be used later to show the density of simple functions in the integrable ones.
Corollary 4 . Let $f : \Omega \to X$ be $\mu$-strongly measurable. Then, there exists a sequence of simple functions $f\U n$ converging to $f$ almost everywhere and such that
Most typically, one works in the case where $\mu$ is $\sigma$-finite (for example, if $\mu$ is the Lebesgue measure or any probability measure) and identifies functions that are equal almost everywhere. In this case, the following more simple statement holds.
Theorem 5 (Pettis theorem for $\sigma$-finite measures). Let $(\Omega, \mu, \mathcal{F})$ be a $\sigma$-finite measure space and identify functions that are equal almost everywhere. Then $f$ is $\mu$-strongly measurable if and only if $f$ is separately valued and measurable.
Proof. Let $f$ be $\mu$-strongly measurable. Then, $f$ is the limit of separately valued and measurable functions. As a result, $f$ is separately valued and measurable.
Suppose now that $f$ is separately valued and measurable. Then, the same is true for $f 1\U {\Omega\U n}$ where $\left\{\Omega\U n\right\}\U {n=1}^\infty$ is a partition of $\Omega$ with finite measure. Moreover, these functions are strongly measurable as the measure of $\Omega \U n$ is finite (one can start from $f 1\U {\Omega\U n}$ and form $F\U n$ as in the previous proof converging to $f 1\U {\Omega \U n}$) and since they form a partition of $\Omega$,
By a basic argument, the limit of $\mu$-strongly measurable functions is $\mu$-strongly measurable and we conclude that $f$ is $\mu$-strongly measurable, as desired. ◻
Construction of the Bochner integral
Since the norm $\left\lVert \cdot \right\rVert$ is a continuous function, given a measurable function $f: \Omega \to X$, the real-valued function $\left\lVert f \right\rVert: \Omega \to \mathbb{R}$ is also measurable. As a result, we can define the Lebesgue integrals
Likewise, the sets $\left\lVert f \right\rVert>r$ are in the $\sigma$-algebra $\mathcal{F}$ and we can define
where the infimum is defined to be $\infty$ if the set is empty.
Knowledge of the real-valued case shows that (\ref{Lp}) and (\ref{Linfty}) define a seminorm on the spaces of measurable function where they are finite, and they define a norm if we define the equivalence relation
Following common practice, we will identify functions that are equal almost everywhere from now on and make no distinction between functions and their equivalence classes. Finally, we arrive at the following definition.
Definition 6 . Given a measure space $(\Omega,\mathcal{F},\mu)$, a Banach space $(X,\mathcal{B}(X))$ we define the space of $p$-integrable functions as (the equivalence classes)
If one wishes to be explicit about the underlying measure space one can also write $L^p(\Omega,\mathcal{F},\mu, X)$. As in the real case, the $L^p$ spaces are complete.
Theorem 7 (Fischer-Riesz). The space $L^p(\Omega\to X)$ is a Banach space for all $p\in [1,\infty].$
Proof. The proof follows along the lines of the real case, substituting the absolute value in $\mathbb{R}$ by the norm in $X$ as necessary. We first prove the case $p \in [1,\infty)$ Recall that a normed space is complete if and only if every absolutely convergent series converges.That is, we need to show that if $f\U n \in L^p(\Omega \to X)$ is such that
Then there exists $f\in L^p(\Omega\to X)$ such that
To do so, one first applies Minkowski’s inequality for real-valued functions to show that
Thus, the sum is finite almost everywhere. Since $X$ is complete, we have that the above sum converges pointwise almost everywhere to some function
Furthermore we have that $f$ is strongly measurable as it is the limit of strongly measurable functions. Finally, using Fatou’s lemma for real-valued functions and the triangle inequality for norms shows that
Which shows convergence in $L^p(\Omega\to X)$ for $p\in [1,\infty)$.
For the case $p=\infty$ consider a Cauchy sequence $f\U n \in L^p(\Omega \to X)$ and write
By construction, $A\U {nm}$ and thus $\mathbb{A}^c$ have measure zero and $f\U n$ converges uniformly on $A$. As a result, $f\U n$ converges almost everywhere to some $f\in L^\infty(\Omega\to X)$. This completes the proof. ◻
Just as in the case of Lebesgue integrals, the proof of the completeness of $L^p(\Omega\to X)$ serves to show that every convergent sequence must have a subsequence converging almost everywhere. This proposition is not necessary for the rest of the constructions, it’s just a nice property to have in reserve.
Proposition 8. Let $f\U n\to f\in L^p(\Omega\to X)$, then there exists a subsequence $f\U {n\U k}$ converging to $f$ almost everywhere.
Proof. In the proof of the above proposition, we saw that for any absolutely convergent sum converges almost everywhere to its limit. Further, since $f\U n$ is Cauchy, we can extract a subsequence $f\U {n\U k}$ with $|f\U {n\U k}-f\U {n\U {k-1}}|\leq 2^{-k}$. By construction, the sequence
is normally convergent and converges in $f$. By the above discussion we conclude the proof. ◻
Ok, so we’ve constructed some spaces of $p$-integrable functions and shown that they are complete. You know where this is going. Next stop is density town. In the standard construction of the Lebesgue integral, it is used that every measurable function to $\mathbb{R}$ can be pointwise approximated by simple functions. One can achieve the same result for arbitrary metric spaces if the image of $f$ is separable.
Proposition 9. Every function in $L^p(\Omega\to X)$ is the limit almost everywhere and in the norm of a sequence of simple functions.
Proof. By Corollary 4, there exists a sequence of simple functions $f\U n$ converging to $f$ almost everywhere and such that $\left\lVert f\U n-f \right\rVert<\left\lVert f \right\rVert$ almost everywhere. By the dominated convergence theorem for real valued functions, we have that
◻
As a corollary, we obtain the following
Corollary 10. The simple functions $\mathcal{A}$ are a dense subset of $L^p(\Omega\to X)$.
Since $L^1(\Omega\to X)$ is complete, we have that the closure of $\overline{\mathcal{A}}$ with the norm $\left\lVert \cdot \right\rVert\U {L^1(\Omega \to X)}$ is $L^1(\Omega\to X)$. Furthermore, by the triangle inequality (\ref{triangle}) , integration is continuous with this norm. This continuity allows us to extend integration to $L^1(\Omega \to X)$ and shows that the space of integrable functions is $L^1(\Omega\to X)$.
Definition 11 . We define the integral on $L^1(\Omega\to X)$ as the unique continuous extension with the norm $\left\lVert \cdot \right\rVert\U {L^1(\Omega\to X)}$ of the integral on $\mathcal{A}$. That is, given $f\in L^1(\Omega\to X)$ we define
Where $f\U n\in \mathcal{A}$ is any sequence such that $\left\lVert f-f\U n \right\rVert\U {L^1(\Omega\to X)}\to 0$.
Observation 1. We could also work with the spaces
These spaces are once more complete, however, they do not contain simple functions as a dense subset. As a result, given $f \in \hat{L}^1(\Omega \to X)$ it is not possible to make sense of the expression $\int\U \Omega f \,\mathrm{d}\mu$.
Familiar properties
Many properties of integration hold for the Bochner integral as well. For example, the following is a result of the definition of the Bochner integral and a passage to the limit, as it holds for simple functions.
Corollary 12 . Let $f\in L^1(\Omega,X)$ with $X$ a Banach space, then
-
$\left\lVert \int\U \Omega f \,\mathrm{d}\mu \right\rVert\leq\int\U \Omega{\left\lVert f \right\rVert\,\mathrm{d}\mu}$
-
Let $Y$ be another Banach space and $L\in L(X,Y)$. Then
Knowledge of scalar-valued results also goes a long way; for example, we can prove the following
Exercise 2 (Fubini-Tonelli). Let $(\Omega\U 1, \mathcal{F}\U 1 , \mu \U 1 )$ and $(\Omega\U 2, \mathcal{F}\U 2 , \mu \U 2 )$ be $\sigma$-finite measure spaces and consider the space $\Omega\U 1\times \Omega\U 2$ with the $\sigma$-algebra $\mathcal{F}\U 1 \otimes \mathcal{F}\U 2$ generated by sets of the form $A\U 1 \times A\U 2$ with $A\U i \in \Omega \U i$ and the unique measure $\mu \U 1 \otimes \mu \U 2$ such that $(\mu \U 1 \otimes \mu \U 2)(A\U 1 \times A\U 2)=\mu \U 1(A\U 1)\mu \U 2(A\U 2)$. Let $X$ be a Banach space and $f:\Omega\U 1\times \Omega\U 2 \to X$ be strongly measurable. Then
-
The functions $f(\omega\U 1,\cdot )$ and $f(\cdot,\omega\U 2)$ are strongly measurable.
-
If any of the following integrals is finite
Then all of the integrals in (\ref{norm ints}) are equal, and
Sometimes the following theorem is more useful when instead of using the product measure $\mu\U 1 \otimes \mu\U 2$ on $X \times Y$, we use its completion $\overline{\mu\U 1 \times \mu\U 2}$. In this case, Fubini-Tonelli’s Theorem 2 still holds. It is only necessary to note that the sections of $f$ are now almost always measurable (see Tao, 2011 page 203 for more details).
Exercise 3 (Minkowski’s integral inequality). Show that given $p \in [1,\infty)$ and $f \in L^1(\Omega\U 1 \to L^p(\Omega\U 2 \to Y))$ it holds that
Exercise 4 (Dominated convergence theorem). Let $f\U n,f\in L^1(\Omega\to X)$ be such that $f\U n\to f$ almost everywhere and there exists $g\in L^1(\Omega\to X)$ such that $\left\lVert f\U n \right\rVert\leq g$ almost everywhere. Then
The dominated convergence theorem is a powerful tool that allows us to pass to the limit under the integral sign. For example, it can be used to show that, under necessary conditions, if $f(t,\omega )$ is continuous (differentiable) in $t$ then so is $\int f(t,\omega )\,\mathrm{d}\mu(\omega)$ (see here for a proof). These results, together with the density of simple functions, can be used to prove the standard approximation theorems.
Theorem 13 (Convolution, regularization and smooth approximation). Let $p \in [1,\infty)$ and consider $f\in L^p(\mathbb{R}^d\to X)$ and $\phi \in L^1(\mathbb{R}^d)$.
a) Young’s inequality. The convolution
is well-defined almost everywhere and satisfies
b) Mollifiers. Define $\phi\U \epsilon(x):=\epsilon^{-d}\phi(x/\epsilon)$. Then,
c) Smoothing. If $\phi \in C^k\U c(\mathbb{R}^d)$ then $f\star \phi\U \epsilon \in C^k(\mathbb{R}^d\to X)$ with
d) Smooth approximation. The space $C\U c^\infty(\mathbb{R}^d)$ is dense in $L^p(\mathbb{R}^d\to X)$.
Ok, that’s it. This post was a bit more technical than some of the others, but you get the picture. Define an integral for simple functions, and figure out what can be approximated by simple functions. As we saw, the extra requirement that appears over the Lebesgue case is that the function $f$ is separately valued and justifies why, as we will see in future posts on SPDEs, the image of $f$ is often taken to be some separable Hilbert space. Until the next time!
A (possibly not updated) pdf of version of this page is provided here.