Is Stoch enriched in Met?

Question

Let $\mathsf{Stoch}$ denote the Kleisli category of the Giry monad. That is, $\mathsf{Stoch}$ is a category whose objects are measurable spaces and for which a morphism $f\in\mathsf{Stoch}(X,Y)$ is a map sending points in $X$ to probability measures on $Y$; see nLab: Giry monad for details.

Let $\mathsf{Met}$ denote the symmetric monoidal category of metric spaces $(X,d)$ and short maps (functions $f\colon X\to Y$ satisfying $d(x_1,x_2)\geq d(fx_1,fx_2)$ for all $x_1,x_2\in X$).

Question: Is there a way to enrich $\mathsf{Stoch}$ in $\mathsf{Met}$?

In other words, I'm looking for a way to endow each hom-set $\mathsf{Stoch}(X,Y)$ with a metric space structure in such a way that the composition operation is short: $$ d(f_1,f_2)+d(g_1,g_2) \geq d(g_1\circ f_1, g_2\circ f_2) $$

The upshot would be that when working in $\mathsf{Stoch}$, diagrams would not simply either commute or not commute; they would commute up to some distance, and this distance would behave sensibly under "diagram chasing".

Paolo Perrone · Accepted Answer · 2022-11-30 15:35:41Z

A way to enrich $\mathsf{Stoch}$ could be as follows.

Let $X$ be a measurable space and let $PX$ be the space of probability measures over $X$. We can equip $PX$ with the total variational distance, which is given by the following equivalent forms. $$ d(p,q) \;=\; \sup_{A\subseteq X} |p(A) - q(A)| \;=\; \sup_{f:X\to[0,1]} \left| \int_X f\, dp - \int_X f\; dq \right| . $$ Above, and in what follows, the subsets $A\subseteq X$ and the functions $f:X\to\Bbb{R}$ are assumed measurable.

Now first of all let $f:X \to Y$ be a measurable (deterministic) function. Denote by $f_*:PX\to PY$ the pushforward of measures (for category theorists, the action of the functor $P$ on the morphisms). We have \begin{align*} d( f_*p, f_*q ) \;&=\; \sup_{B\subseteq Y} |f_*p(B) - f_*q(B)| \\\ \;&=\; \sup_{B\subseteq Y} |p(f^{-1}(B)) - q(f^{-1}(B))| \\\ \;&\le\; \sup_{A\subseteq X} |p(A) - q(A)| \\\ \;&=\; d( p, q ) , \end{align*} which means that $f_*:PX\to PY$ is short for the metric just defined. This will be of use later.

We can equip now the sets $\mathsf{Stoch}(X,Y)$ with the "product" or "$L^\infty$" metric construction induced by the one on $PY$, as follows. We use the following notation, a stochastic map $k:X\to PY$ evaluated at a point $x\in X$ gives the measure $k_x\in PY$. Now for stochastic maps $k,h:X\to PY$, we set $$ d(k,h) \;=\; \sup_{x\in X} d\big( k_x, h_y \big) . $$ This number is bounded by $1$. (Thanks to Martin Hairer for pointing this out, see the comments.)

We now have to prove that the (Kleisli) composition of $\mathsf{Stoch}$ is short. Given $h:X \to PY$ and $l: Y \to PZ$, their Kleisli composition, which we denote by $lh$, is given by the Chapman-Kolmogorov formula. Explicitly, for every measurable $C \subseteq Z$, we have $$ lh_x(C) \;=\; \int_{PZ} q(C) \,d(l_*h_x)(q) \;=\; \int_{Y} l_y(C) \,dh_x(y) . $$

Now, to see that postcomposition is short, let $h,k:X \to PY$ and $l: Y \to PZ$. We have \begin{align*} d( lh, lk ) \;&=\; \sup_{x\in X} \sup_{C\subseteq Z} | lh_x(C) - lk_x(C) | \\\ \;&=\; \sup_{x\in X} \sup_{C\subseteq Z} \left| \int_{PZ} q(C) \,d(l_*h_x)(q) - \int_{PZ} q(C) \,d(l_*k_x)(q) \right| \\\ \;&\le\; \sup_{x\in X} d\big( l_*h_x , l_*k_x \big) \\\ \;&\le\; \sup_{x\in X} d( h_x , k_x ) \\\ \;&=\; d( h, k ) . \end{align*} (We used the fact that the evaluation map $PX\to \Bbb{R}$ given by $p\mapsto p(A)$ for a measurable set $A\subseteq X$ is measurable for the $\sigma$-algebra on $PX$ in the construction of the Giry monad.)

To see that precomposition is short, let also $g: W \to PX$. Then \begin{align*} d( hg, kg ) \;&=\; \sup_{w\in W} \sup_{B\subseteq Y} | hg_w(B) - kg_w(B) | \\\ \;&=\; \sup_{w\in W} \sup_{B\subseteq Y} \left| \int_{X} h_x(B) \,dg_w(x) - \int_{X} k_x(B) \,dg_w(x) \right| \\\ \;&\le\; \sup_{w\in W} \int_{X} \sup_{B\subseteq Y} | h_x(B) - k_x(B) | \,d(g_w)(x) \\\ \;&=\; \sup_{w\in W} \int_{X} d( h_x, k_x ) \,d(g_w)(x) \\\ \;&\le\; d( h_x, k_x ) . \end{align*}

Consider now the set $\mathsf{Stoch}(X,Y)\times\mathsf{Stoch}(Y,Z)$. We can equip it either with the metric $$ d \big( (h_1,l_1), (h_2,l_2) \big) \;=\; \max\{ d(h_1,h_2), d(l_1,l_2) \}, $$ which corresponds to the metric of the cartesian (categorical) product of $\mathsf{Met}$, or with the metric $$ d \big( (h_1,l_1), (h_2,l_2) \big) \;=\; d(h_1,h_2) + d(l_1,l_2) , $$ which is the metric of the closed monoidal product of $\mathsf{Met}$ as an (i.e. exhibiting a hom-tensor adjunction).

For either choice, the map $\circ: \mathsf{Stoch}(X,Y)\times\mathsf{Stoch}(Y,Z) \to \mathsf{Stoch}(X,Z)$ given by composition is short. This gives the desired enrichment.

By the way, if we want talk about diagrams "commuting up to $\varepsilon$", then by definition of the metric, $\varepsilon$ has to be a uniform bound. For example, for the diagram $\require{AMScd}$ \begin{CD} A @>f>> B\\ @V m V V @VV n V\\ C @>>g> D \end{CD} to commute up to $\varepsilon$, in the sense that $d(n\circ f,g\circ m)\lt\varepsilon$, we need that $$ \sup_{a\in A} d\big(n(f(a)),g(m(a))\big) \;\lt\;\varepsilon, $$ where the distance above is the one of $D$. For that to happen, $\varepsilon$ needs to be independent of $a$, which is quite a strong condition in practice (if $A$ is not compact).

There are no infinite distances here, $d$ is bounded by $1$. (Your dual representation of TV is incorrect, you want $f \colon X \to [0,1]$.) Note also that the 'non-boring' case $d \neq 1$ is a fancy way of talking about Doeblin's condition which is 'rarely' verified 'in practice'. (Of course, maybe the OP has in mind some specific situation where it is relevant.) — Martin Hairer, Nov 23, 2019 at 14:00

alesia · Accepted Answer · 2019-11-23 03:07:19Z

3

A boring way is to set $d(f,g)=1$ if $f$ and $g$ are different.

A slightly less boring way is to take $d(f,g)$ to be the supremum over $x$ of the probability that two independent variables with laws $f(x)$ and $g(x)$ differ. Different functions may then have a small distance if their image consists of distributions with atoms.

answered Nov 23, 2019 at 3:07

alesia

2,5728 silver badges19 bronze badges

Add a comment |

kirk sturtz · Accepted Answer · 2019-11-23 03:29:36Z

If you're willing to work with generalized metric spaces, then the answer is yes. $\mathsf{Stoch}$ is enriched over the category of convex spaces, which in turn is enriched over $V$-cat as there exists a closed functor from convex spaces to $V$-cat, where $V=([0,\infty], \ge)$ is the SMCC whose objects can be viewed as generalized metric spaces, and arrows viewed as short maps. This all follows from Meng's thesis, proposition 5. (Available at nLab)

You can actually say a little more by noting that $\mathsf{Stoch}$ is enriched over the category of super convex spaces, which differs from convex spaces in that one considers countable sums of one, rather than finite sums, and requires the morphisms to preserve the countable sums. (The proof of enrichment over super convex spaces is essentially identical to that found in Meng.) The benefit of working with (coseparated) super convex spaces is that it is a $*$-autonomous category.

Stack Exchange Network

Is Stoch enriched in Met?

3 Answers 3

Your Answer

Not the answer you're looking for? Browse other questions tagged
pr.probability
ct.category-theory
mg.metric-geometry
measure-theory
or ask your own question.

Is Stoch enriched in Met?

3 Answers 3

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged pr.probabilityct.category-theorymg.metric-geometrymeasure-theory or ask your own question.

Related

Not the answer you're looking for? Browse other questions tagged
pr.probability
ct.category-theory
mg.metric-geometry
measure-theory
or ask your own question.