One of the most influential websites for me was John Baez’s This Week’s Finds. I learned many things from these blog posts and it helped me realize which scientific or mathematical topics interested me most. Lately, I have been spending quite some time studying several topics, usually a few weeks each, and it occurred to me that it might be fun and interesting to write about them (even though everything hasn’t been worked out yet).

In a future post, I am going to talk a bit more about the mathematics of financial markets (I am working in finance after all). One of the important theorems there is that of Girsanov’s theorem. During the study of this topic, however, I stumbled upon a related result (a predecessor to be exact), the Cameron–Martin formula. This post will introduce some notions and generalizations that occur when studying translations of Gaussian measures.


This post is structured as follows:

  • Short introduction to Gaussian measures.
  • Some necessary ideas from linear algebra and topology.
  • Doing probability theory on abstract vector spaces.
  • Some extras and applications.

One of the most archetypical probability distributions1 is the (standard) normal distribution (also called Gaussian distribution). In 1D, it is given by the following probability density function:

N(x)=12πexp(x22).

In higher dimensions (nN), the generalization is given by:

(1)N(x)=1(2π)n/2exp(|x|22).

A common question in physics and mathematics is what the transformations are that leave a certain structure invariant (cf. symmetry transformations as in this post). Since the (centered2) normal distribution only depends on the norm of the point x, it is clearly invariant under orthogonal transformations (i.e. rotations). However, next to rotations, also translations

xx+a

are of major importance in the study of vector spaces. However, a famous result by Haar implies that, up to a constant factor, the standard volume measure λ (i.e. Lebesgue measure) is the only measure on Rn that is invariant under translations. Nonetheless, the measures λ and aλ, where the latter is the pushforward of λ under the translation defined above, are equivalent in that they are absolutely continuous with respect to each other. The (classical) Cameron–Martin theorem states that the Radon–Nikodym derivative in this case is given by the following formula:

(2)daλdλ(x)=exp(xa|a|22).

The essence of this month’s blog post will be to work through the extension of this theorem to Gaussian measures on so-called Banach and Hilbert spaces.


We first take some steps back and look at the algebraic structure of Rn (or any other n-dimensional vector space since these are linearly isomorphic). Euclidean space comes equipped with the structure of an inner product space and, consequently, that of a normed space. These are formally defined as follows.

A vector space V equipped with a bilinear operation :V×VR such that the following conditions are satisfied:

  1. Nondegeneracy: vV:vw=0w=0.
  2. Positive-definiteness: vV:vv0.
  3. Symmetry: v,wV:vw=wv.

A vector space V equipped with a nonnegative function :VR+ such that the following conditions are satisfied:

  1. Nondegeneracy: v=0v=0.
  2. Positive-homogeneity: vV,λR:λv=|λ|v.
  3. Triangle inequality: v,wV:v+wv+w.

Every inner product induces an associated norm by the following formula:

v:=vv.

Using this convention, any finite-dimensional inner-product space gives rise to a canonical Gaussian measure as in Eq. (1).

In the realm of infinite-dimensional vector spaces, as usually treated in functional analysis, everything gets slightly more tricky due to the presence of topological subtleties. For the purpose of this post, three notions will be of interest: Separability, Banach spaces and Hilbert spaces. (The first one is purely topological, the latter two are also algebraic.)

A topological space is said to be separable if it admits a countable dense subset.

A complete normed space V, i.e. a normed space (V,) such that V with the induced metric

d(v,w):=vw

is a complete metric space.

An inner product space that is also a Banach space.

A very important result in functional analysis states that every separable Hilbert space is either

  1. finite dimensional, or
  2. isomorphic to 2, the space of square-summable sequences.

Assume that Eq. (1) would make sense on a separable infinite-dimensional Hilbert space H and choose an orthonormal basis (en)nN. Under the would-be Gaussian measure γH, the random variables

Xn:HR:xxen

would be independent standard normal random variables. Now, by the strong law of large numbers, the norm of the random vector x=nNXn(x)en would diverge a.e.:

x2=nNXn(x)2+.

Of course, this cannot be true. The idea, introduced by Gross, is that H is actually too small to support a Gaussian measure and one should endow H with a less conservative norm and consider its associated completion.

To this end, one needs to consider a suitable σ-algebra on a (separable) vector space. Consider a vector space V and its continuous dual3 V’. (In fact, any dual pairing (X,Y) suffices.) For every finite collection ϕ1,,ϕnV, consider the sets

(3)Cϕ1,,ϕn(B):={vV(ϕ1(v),,ϕn(v))B},

where B ranges over the Borel sets in Rn. Sets of this form are called cylinder sets. Note that the collection of all cylinder sets, ranging over both indexing functionals and Borel sets, does not form a σ-algebra. Only when keeping the indexing functionals fixed, a σ-algebra Cϕ1,,ϕn(V) is obtained. However, the σ-algebra generated by all cylinder sets is called the cylindrical σ-algebra C(V). For separable Banach spaces, it was shown by Mourier that the cylindrical algebra coincides with the natural Borel algebra B(V).

Now, a large part of my time invested in understanding this topic was spent on the following property. In the seminal paper by Gross, and some subsequent work such as by Dudley, a different convention for the cylinder sets was used:

(4)CF(B~):={vVπF(v)B~}=B~+F,

where F is a finite-dimensional subspace of V, B~ is a Borel set in FV/FF and πF:VV/F is the canonical projection with the complement F defined as follows:

F:={vVϕF:ϕ(v)=0}.

To actually see the equivalence of these two constructions, I needed quite some pen-and-paper work, so let us go through it together.

Proof
For the cylinder set Cϕ1,,ϕn(B) in Eq. (3), it is clear that adding any element of ϕ1,,ϕn, where indicates the linear span of a set of vectors, would leave the set invariant. As such, one obtains Cϕ1,,ϕn(B)={vV/ϕ1,,ϕn(ϕ1(v),,ϕn(v))B}+ϕ1,,ϕn={vϕ1,,ϕn(ϕ1(v),,ϕn(v))B}+ϕ1,,ϕn. Since the map (ϕ1,,ϕn) is linear (and acts between finite-dimensional spaces), it is also continuous and, hence, the set in the first term is equal to some Borel set B~B(ϕ1,,ϕn). Accordingly, it is of the form of Eq. (4).

Conversely, consider any basis {ϕ1,,ϕn} of the finite-dimensional subspace FV. By reversing the steps above, it is not hard to see that CF is equivalent to Cϕ1,,ϕn.

Given a cylindrical algebra C(V), a cylindrical measure (or cylinder set measure) is a nonnegative, finitely additive set function μ:C(V)R+ that restricts to a proper measure on every sub-σ-algebra Cϕ1,,ϕn(V).

Although the Gaussian measure in Eq. (1) does not make sense as a measure on C(H), with H again a separable, infinite-dimensional Hilbert space, it does give rise to a cylindrical measure μ. To obtain a proper measure, the Hilbert norm H is now replaced by a norm that interacts nicely with respect to μ. A (continuous) norm on H is said to be measurable (with respect to μ) if, for every ε>0, there exists a finite-dimensional subspace VεH such that the following relation holds for all finite-dimensional subspaces VH:

VVεμ({vVv>ε})<ε,.

Now, choose such a measurable norm (e.g. any norm induced by an injective Hilbert–Schmidt operator4) and consider the (Banach) completion B of H with respect to this norm. The triple (H,B,μ) is called an abstract Wiener space. The cylindrical measure μ can be simply extended to B as follows:

μ(B):=μ(BH)

for all BB(B). It can now be shown that μ can actually be enhanced to a proper probability measure γ on B (as in the Kolmogorov–Daniell extension theorem).5 Given this measure γ, one can define the moments of functionals:

E[ϕ]:=Bϕ(x)dγ(x),Cov[ϕ,ψ]:=B(ϕ(x)E[ϕ])(ψ(x)E[ψ])dγ(x).

Using these operators, one can express the characteristic function of γ as follows:

γ~(ϕ)=exp(iE[ϕ]Cov[ϕ,ϕ]2).

On B, one can now consider the following norm:

xγ:=sup{ϕ(x)ϕBCov[ϕ,ϕ]1}.

Note that when the Gaussian measure was extended through the choice of a measurable norm , then

x=xγ

for all xB. Moreover, in this case, it can be shown that the subspace

Hγ:={xBxγ<+}

is isomorphic to H. H is also called the Cameron–Martin space of the abstract Wiener space (B,H,γ) for the following reason. The translated measure hγ, where hH, is absolutely continuous with respect to γ with Radon–Nikodym derivative

dhλdλ(x)=exp(xhhγ22).

Comparing this expression to Eq. (2) should explain the terminology.

As a last note, it should be said that although it is the Cameron–Martin space H that essentially gives all the structure to the abstract Wiener space (H,B,γ), it is ‘very small’ in that

γ(H)=0

whenever dim(H)=+.



  1. For a refresher on measure and probability theory, see this appendix

  2. Centered distributions are those with mean 0. 

  3. For finite-dimensional spaces, this is simply the linear dual V. However, for general topological vector spaces, the continuous dual V is the subset of the linear dual V consisting of continuous functionals. 

  4. See e.g. the paper by Sato: “Gaussian measure on a Banach space and abstract Winer measure”. 

  5. It was shown by Sato that the new norm actually does not have to be measurable. This is only a sufficient condition. Only for Hilbert norms, i.e. those induced by an inner product, do ‘admissibility’ and ‘measurability’ coincide.