category theory


To continue from the previous post

Twisted Differential Cohomology

Ulrich Bunke gave a talk introducing differential cohomology theories, and Thomas Nikolaus gave one about a twisted version of such theories (unfortunately, perhaps in the wrong order). The idea here is that cohomology can give a classification of field theories, and if we don’t want the theories to be purely topological, we would need to refine this. A cohomology theory is a (contravariant) functorial way of assigning to any space X, which we take to be a manifold, a \mathbb{Z}-graded group: that is, a tower of groups of “cocycles”, one group for each n, with some coboundary maps linking them. (In some cases, the groups are also rings) For example, the group of differential forms, graded by degree.

Cohomology theories satisfy some axioms – for example, the Mayer-Vietoris sequence has to apply whenever you cut a manifold into parts. Differential cohomology relaxes one axiom, the requirement that cohomology be a homotopy invariant of X. Given a differential cohomology theory, one can impose equivalence relations on the differential cocycles to get a theory that does satisfy this axiom – so we say the finer theory is a “differential refinement” of the coarser. So, in particular, ordinary cohomology theories are classified by spectra (this is related to the Brown representability theorem), whereas the differential ones are represented by sheaves of spectra – where the constant sheaves represent the cohomology theories which happen to be homotopy invariants.

The “twisting” part of this story can be applied to either an ordinary cohomology theory, or a differential refinement of one (though this needs similarly refined “twisting” data). The idea is that, if R is a cohomology theory, it can be “twisted” over X by a map \tau: X \rightarrow Pic_R into the “Picard group” of R. This is the group of invertible R-modules (where an R-module means a module for the cohomology ring assigned to X) – essentially, tensoring with these modules is what defines the “twisting” of a cohomology element.

An example of all this is twisted differential K-theory. Here the groups are of isomorphism classes of certain vector bundles over X, and the twisting is particularly simple (the Picard group in the topological case is just \mathbb{Z}_2). The main result is that, while topological twists are classified by appropriate gerbes on X (for K-theory, U(1)-gerbes), the differential ones are classified by gerbes with connection.

Fusion Categories

Scott Morrison gave a talk about Classifying Fusion Categories, the point of which was just to collect together a bunch of results constructing particular examples. The talk opens with a quote by Rutherford: “All science is either physics or stamp collecting” – that is, either about systematizing data and finding simple principles which explain it, or about collecting lots of data. This talk was unabashed stamp-collecting, on the grounds that we just don’t have a lot of data to systematically understand yet – and for that very reason I won’t try to summarize all the results, but the slides are well worth a look-over. The point is that fusion categories are very useful in constructing TQFT’s, and there are several different constructions that begin “given a fusion category \mathcal{C}“… and yet there aren’t all that many examples, and very few large ones, known.

Scott also makes the analogy that fusion categories are “noncommutative finite groups” – which is a little confusing, since not all finite groups are commutative anyway – but the idea is that the symmetric fusion categories are exactly the representation categories of finite groups. So general fusion categories are a non-symmetric generalization of such groups. Since classifying finite groups turned out to be difficult, and involve a laundry-list of sporadic groups, it shouldn’t be too surprising that understanding fusion categories (which, for the symmetric case, include the representation categories of all these examples) should be correspondingly tricky. Since, as he points out, we don’t have very many non-symmetric examples beyond rank 12 (analogous to knowing only finite groups with at most 12 elements), it’s likely that we don’t have a very good understanding of these categories in general yet.

There were a couple of talks – one during the workshop by Sonia Natale, and one the previous week by Sebastian Burciu, whom I also had the chance to talk with that week – about “Equivariantization” of fusion categories, and some fairly detailed descriptions of what results. The two of them have a paper on this which gives more details, which I won’t summarize – but I will say a bit about the construction.

An “equivariantization” of a category C acted on by a group G is supposed to be a generalization of the notion of the set of fixed points for a group acting on a set.  The category C^G has objects which consist of an object x \in C which is fixed by the action of G, together with an isomorphism \mu_g : x \rightarrow x for each g \in G, satisfying a bunch of unsurprising conditions like being compatible with the group operation. The morphisms are maps in C between the objects, which form commuting squares for each g \in G. Their paper, and the talks, described how this works when C is a fusion category – namely, C^G is also a fusion category, and one can work out its fusion rules (i.e. monoidal structure). In some cases, it’s a “group theoretical” fusion category (it looks like Rep(H) for some group H) – or a weakened version of such a thing (it’s Morita equivalent to ).

A nice special case of this is if the group action happens to be trivial, so that every object of C is a fixed point. In this case, C^G is just the category of objects of C equipped with a G-action, and the intertwining maps between these. For example, if C = Vect, then C^G = Rep(G) (in particular, a “group-theoretical fusion category”). What’s more, this construction is functorial in G itself: given a subgroup H \subset G, we get an adjoint pair of functors between C^G and C^H, which in our special case are just the induced-representation and restricted-representation functors for that subgroup inclusion. That is, we have a Mackey functor here. These generalize, however, to any fusion category C, and to nontrivial actions of G on C. The point of their paper, then, is to give a good characterization of the categories that come out of these constructions.

Quantizing with Higher Categories

The last talk I’d like to describe was by Urs Schreiber, called Linear Homotopy Type Theory for Quantization. Urs has been giving evolving talks on this topic for some time, and it’s quite a big subject (see the long version of the notes above if there’s any doubt). However, I always try to get a handle on these talks, because it seems to be describing the most general framework that fits the general approach I use in my own work. This particular one borrows a lot from the language of logic (the “linear” in the title alludes to linear logic).

Basically, Urs’ motivation is to describe a good mathematical setting in which to construct field theories using ingredients familiar to the physics approach to “field theory”, namely… fields. (See the description of Kevin Walker’s talk.) Also, Lagrangian functionals – that is, the notion of a physical action. Constructing TQFT from modular tensor categories, for instance, is great, but the fields and the action seem to be hiding in this picture. There are many conceptual problems with field theories – like the mathematical meaning of path integrals, for instance. Part of the approach here is to find a good setting in which to locate the moduli spaces of fields (and the spaces in which path integrals are done). Then, one has to come up with a notion of quantization that makes sense in that context.

The first claim is that the category of such spaces should form a differentially cohesive infinity-topos which we’ll call \mathbb{H}. The “infinity” part means we allow morphisms between field configurations of all orders (2-morphisms, 3-morphisms, etc.). The “topos” part means that all sorts of reasonable constructions can be done – for example, pullbacks. The “differentially cohesive” part captures the sort of structure that ensures we can really treat these as spaces of the suitable kind: “cohesive” means that we have a notion of connected components around (it’s implemented by having a bunch of adjoint functors between spaces and points). The “differential” part is meant to allow for the sort of structures discussed above under “differential cohomology” – really, that we can capture geometric structure, as in gauge theories, and not just topological structure.

In this case, we take \mathbb{H} to have objects which are spectral-valued infinity-stacks on manifolds. This may be unfamiliar, but the main point is that it’s a kind of generalization of a space. Now, the sort of situation where quantization makes sense is: we have a space (i.e. \mathbb{H}-object) of field configurations to start, then a space of paths (this is WHERE “path-integrals” are defined), and a space of field configurations in the final system where we observe the result. There are maps from the space of paths to identify starting and ending points. That is, we have a span:

A \leftarrow X \rightarrow B

Now, in fact, these may all lie over some manifold, such as B^n(U(1)), the classifying space for U(1) (n-1)-gerbes. That is, we don’t just have these “spaces”, but these spaces equipped with one of those pieces of cohomological twisting data discussed up above. That enters the quantization like an action (it’s WHAT you integrate in a path integral).

Aside: To continue the parallel, quantization is playing the role of a cohomology theory, and the action is the twist. I really need to come back and complete an old post about motives, because there’s a close analogy here. If quantization is a cohomology theory, it should come by factoring through a universal one. In the world of motives, where “space” now means something like “scheme”, the target of this universal cohomology theory is a mild variation on just the category of spans I just alluded to. Then all others come from some functor out of it.

Then the issue is what quantization looks like on this sort of scenario. The Atiyah-Singer viewpoint on TQFT isn’t completely lost here: quantization should be a functor into some monoidal category. This target needs properties which allow it to capture the basic “quantum” phenomena of superposition (i.e. some additivity property), and interference (some actual linearity over \mathbb{C}). The target category Urs talked about was the category of E_{\infty}-rings. The point is that these are just algebras that live in the world of spectra, which is where our spaces already lived. The appropriate target will depend on exactly what \mathbb{H} is.

But what Urs did do was give a characterization of what the target category should be LIKE for a certain construction to work. It’s a “pull-push” construction: see the link way above on Mackey functors – restriction and induction of representations are an example . It’s what he calls a “(2-monoidal, Beck-Chevalley) Linear Homotopy-Type Theory”. Essentially, this is a list of conditions which ensure that, for the two morphisms in the span above, we have a “pull” operation for some and left and right adjoints to it (which need to be related in a nice way – the jargon here is that we must be in a Wirthmuller context), satisfying some nice relations, and that everything is functorial.

The intuition is that if we have some way of getting a “linear gadget” out of one of our configuration spaces of fields (analogous to constructing a space of functions when we do canonical quantization over, let’s say, a symplectic manifold), then we should be able to lift it (the “pull” operation) to the space of paths. Then the “push” part of the operation is where the “path integral” part comes in: many paths might contribute to the value of a function (or functor, or whatever it may be) at the end-point of those paths, because there are many ways to get from A to B, and all of them contribute in a linear way.

So, if this all seems rather abstract, that’s because the point of it is to characterize very generally what has to be available for the ideas that appear in physics notions of path-integral quantization to make sense. Many of the particulars – spectra, E_{\infty}-rings, infinity-stacks, and so on – which showed up in the example are in a sense just placeholders for anything with the right formal properties. So at the same time as it moves into seemingly very abstract terrain, this approach is also supposed to get out of the toy-model realm of TQFT, and really address the trouble in rigorously defining what’s meant by some of the standard practice of physics in field theory by analyzing the logical structure of what this practice is really saying. If it turns out to involve some unexpected math – well, given the underlying issues, it would have been more surprising if it didn’t.

It’s not clear to me how far along this road this program gets us, as far as dealing with questions an actual physicist would like to ask (for the most part, if the standard practice works as an algorithm to produce results, physicists seldom need to ask what it means in rigorous math language), but it does seem like an interesting question.

So it’s been a while since I last posted – the end of 2013 ended up being busy with a couple of visits to Jamie Vicary in Oxford, and Roger Picken in Lisbon. In the aftermath of the two trips, I did manage to get a major revision of this paper submitted to a journal, and put this one out in public. A couple of others will be coming down the pipeline this year as well.

I’m hoping to get back to a post about motives which I planned earlier, but for the moment, I’d like to write a little about the second paper, with Roger Picken.

Global and Local Symmetry

The upshot is that it’s about categorifying the concept of symmetry. More specifically, it’s about finding the analog in the world of categories for the interplay between global and local symmetry which occurs in the world of set-based structures (sets, topological spaces, vector spaces, etc.) This distinction is discussed in a nice way by Alan Weinstein in this article from the Notices of the AMS from

The global symmetry of an object X in some category \mathbf{C} can be described in terms of its group of automorphisms: all the ways the object can be transformed which leave it “the same”. This fits our understanding of “symmetry” when the morphisms can really be interpreted as transformations of some sort. So let’s suppose the object is a set with some structure, and the morphisms are set-maps that preserve the structure: for example, the objects could be sets of vertices and edges of a graph, so that morphisms are maps of the underlying data that preserve incidence relations. So a symmetry of an object is a way of transforming it into itself – and an invertible one at that – and these automorphisms naturally form a group Aut(X). More generally, we can talk about an action of a group G on an object X, which is a map \phi : G \rightarrow Aut(X).

“Local symmetry” is different, and it makes most sense in a context where the object X is a set – or at least, where it makes sense to talk about elements of X, so that X has an underlying set of some sort.

Actually, being a set-with-structure, in a lingo I associate with Jim Dolan, means that the forgetful functor U : \mathbf{C} \rightarrow \mathbf{Sets} is faithful: you can tell morphisms in \mathbf{C} (in particular, automorphisms of X) apart by looking at what they do to the underlying set. The intuition is that the morphisms of \mathbf{C} are exactly set maps which preserve the structure which U forgets about – or, conversely, that the structure on objects of \mathbf{C} is exactly that which is forgotten by U. Certainly, knowing only this information determines \mathbf{C} up to equivalence. In any case, suppose we have an object like this: then knowing about the symmetries of X amounts to knowing about a certain group action, namely the action of Aut(X), on the underlying set U(X).

From this point of view, symmetry is about group actions on sets. The way we represent local symmetry (following Weinstein’s discussion, above) is to encode it as a groupoid – a category whose morphisms are all invertible. There is a level-slip happening here, since X is now no longer seen as an object inside a category: it is the collection of all the objects of a groupoid. What makes this a representation of “local” symmetry is that each morphism now represents, not just a transformation of the whole object X, but a relationship under some specific symmetry between one element of X and another. If there is an isomorphism between x \in X and y \in X, then x and y are “symmetric” points under some transformation. As Weinstein’s article illustrates nicely, though, there is no assumption that the given transformation actually extends to the entire object X: it may be that only part of X has, for example, a reflection symmetry, but the symmetry doesn’t extend globally.

Transformation Groupoid

The “interplay” I alluded to above, between the global and local pictures of symmetry, is to build a “transformation groupoid” (or “action groupoid“) associated to a group G acting on a set X. The result is called X // G for short. Its morphisms consist of pairs such that  (g,x) : x \rightarrow (g \rhd x) is a morphism taking x to its image under the action of g \in G. The “local” symmetry view of X // G treats each of these symmetry relations between points as a distinct bit of data, but coming from a global symmetry – that is, a group action – means that the set of morphisms comes from the product G \times X.

Indeed, the “target” map in X // G from morphisms to objects is exactly a map G \times X \rightarrow X. It is not hard to show that this map is an action in another standard sense. Namely, if we have a real action \phi : G \rightarrow Hom(X,X), then this map is just \hat{\phi} : G \times X \rightarrow X, which moves one of the arguments to the left side. If \phi was a functor, then $\hat{\phi}$ satisfies the “action” condition, namely that the following square commutes:

actionsquare

(Here, m is the multiplication in G, and this is the familiar associativity-type axiom for a group action: acting by a product of two elements in G is the same as acting by each one successively.

So the starting point for the paper with Roger Picken was to categorify this. It’s useful, before doing that, to stop and think for a moment about what makes this possible.

First, as stated, this assumed that X either is a set, or has an underlying set by way of some faithful forgetful functor: that is, every morphism in Aut(X) corresponds to a unique set map from the elements of X to itself. We needed this to describe the groupoid X // G, whose objects are exactly the elements of X. The diagram above suggests a different way to think about this. The action diagram lives in the category \mathbf{Set}: we are thinking of G as a set together with some structure maps. X and the morphism \hat{\phi} must be in the same category, \mathbf{Set}, for this characterization to make sense.

So in fact, what matters is that the category X lived in was closed: that is, it is enriched in itself, so that for any objects X,Y, there is an object Hom(X,Y), the internal hom. In this case, it’s G = Hom(X,X) which appears in the diagram. Such an internal hom is supposed to be a dual to \mathbf{Set}‘s monoidal product (which happens to be the Cartesian product \times): this is exactly what lets us talk about \hat{\phi}.

So really, this construction of a transformation groupoid will work for any closed monoidal category \mathbf{C}, producing a groupoid in \mathbf{C}. It may be easier to understand in cases like \mathbf{C}=\mathbf{Top}, the category of topological spaces, where there is indeed a faithful underlying set functor. But although talking explicitly about elements of X was useful for intuitively seeing how X//G relates global and local symmetries, it played no particular role in the construction.

Categorify Everything

In the circles I run in, a popular hobby is to “categorify everything“: there are different versions, but what we mean here is to turn ideas expressed in the world of sets into ideas in the world of categories. (Technical aside: all the categories here are assumed to be small). In principle, this is harder than just reproducing all of the above in any old closed monoidal category: the “world” of categories is \mathbf{Cat}, which is a closed monoidal 2-category, which is a more complicated notion. This means that doing all the above “strictly” is a special case: all the equalities (like the commutativity of the action square) might in principle be replaced by (natural) isomorphisms, and a good categorification involves picking these to have good properties.

(In our paper, we left this to an appendix, because the strict special case is already interesting, and in any case there are “strictification” results, such as the fact that weak 2-groups are all equivalent to strict 2-groups, which mean that the weak case isn’t as much more general as it looks. For higher n-categories, this will fail – which is why we include the appendix to suggest how the pattern might continue).

Why is this interesting to us? Bumping up the “categorical level” appeals for different reasons, but the ones matter most to me have to do with taking low-dimensional (or -codimensional) structures, and finding analogous ones at higher (co)dimension. In our case, the starting point had to do with looking at the symmetries of “higher gauge theories” – which can be used to describe the transport of higher-dimensional surfaces in a background geometry, the way gauge theories can describe the transport of point particles. But I won’t ask you to understand that example right now, as long as you can accept that “what are the global/local symmetries of a category like?” is a possibly interesting question.

So let’s categorify the discussion about symmetry above… To begin with, we can just take our (closed monoidal) category to be \mathbf{Cat}, and follow the same construction above. So our first ingredient is a 2-group \mathcal{G}. As with groups, we can think of a 2-group either as a 2-category with just one object \star, or as a 1-category with some structure – a group object in \mathbf{Cat}, which we’ll call C(\mathcal{G}) if it comes from a given 2-group. (In our paper, we keep these distinct by using the term “categorical group” for the second. The group axioms amount to saying that we have a monoidal category (\mathcal{G}, \otimes, I). Its objects are the morphisms of the 2-group, and the composition becomes the monoidal product \otimes.)

(In fact, we often use a third equivalent definition, that of crossed modules of groups, but to avoid getting into that machinery here, I’ll be changing our notation a little.)

2-Group Actions

So, again, there are two ways to talk about an action of a 2-group on some category \mathbf{C}. One is to define an action as a 2-functor \Phi : \mathcal{G} \rightarrow \mathbf{Cat}. The object being acted on, \mathbf{C} \in \mathbf{Cat}, is the unique object \Phi(\star) – so that the 2-functor amounts to a monoidal functor from the categorical group C(\mathcal{G}) into Aut(\mathbf{C}). Notice that here we’re taking advantage of the fact that \mathbf{Cat} is closed, so that the hom-“sets” are actually categories, and the automorphisms of \mathbf{C} – invertible functors from \mathbf{C} to itself – form the objects of a monoidal category, and in fact a categorical group. What’s new, though, is that there are also 2-morphisms – natural transformations between these functors.

To begin with, then, we show that there is a map \hat{\Phi} : \mathcal{G} \times \mathbf{C} \rightarrow \mathbf{C}, which corresponds to the 2-functor \Phi, and satisfies an action axiom like the square above, with \otimes playing the role of group multiplication. (Again, remember that we’re only talking about the version where this square commutes strictly here – in an appendix of the paper, we talk about the weak version of all this.) This is an intuitive generalization of the situation for groups, but it is slightly more complicated.

The action \Phi directly gives three maps. First, functors \Phi(\gamma) : \mathbf{C} \rightarrow \mathbf{C} for each 2-group morphism \gamma – each of which consists of a function between objects of \mathbf{C}, together with a function between morphisms of \mathbf{C}. Second, natural transformations \Phi(\eta) : \Phi(\gamma) \rightarrow \Phi(\gamma ') for 2-morphisms \eta : \gamma \rightarrow \gamma' in the 2-group – each of which consists of a function from objects to morphisms of \mathbf{C}.

On the other hand, \hat{\Phi} : \mathcal{G} \times \mathbf{C} \rightarrow \mathbf{C} is just a functor: it gives two maps, one taking pairs of objects to objects, the other doing the same for morphisms. Clearly, the map (\gamma,x) \mapsto x' is just given by x' = \Phi(\gamma)(x). The map taking pairs of morphisms (\eta,f) : (\gamma,x) \rightarrow (\gamma ', y) to morphisms of \mathbf{C} is less intuitively obvious. Since I already claimed \Phi and \hat{\Phi} are equivalent, it should be no surprise that we ought to be able to reconstruct the other two parts of \Phi from it as special cases. These are morphism-maps for the functors, (which give \Phi(\gamma)(f) or \Phi(\gamma ')(f)), and the natural transformation maps (which give \Phi(\eta)(x) or \Phi(\eta)(y)). In fact, there are only two sensible ways to combine these four bits of information, and the fact that \Phi(\eta) is natural means precisely that they’re the same, so:

\hat{\Phi}(\eta,f) = \Phi(\eta)(y) \circ \Phi(\gamma)(f) = \Phi(\gamma ')(f) \circ \Phi(\eta)(x)

Given the above, though, it’s not so hard to see that a 2-group action really involves two group actions: of the objects of \mathcal{G} on the objects of \mathbf{C}, and of the morphisms of \mathcal{G} on objects of \mathbf{C}. They fit together nicely because objects can be identified with their identity morphisms: furthermore, \Phi being a functor gives an action of \mathcal{G}-objects on \mathbf{C}-morphisms which fits in between them nicely.

But what of the transformation groupoid? What is the analog of the transformation groupoid, if we repeat its construction in \mathbf{Cat}?

The Transformation Double Category of a 2-Group Action

The answer is that a category (such as a groupoid) internal to \mathbf{Cat} is a double category. The compact way to describe it is as a “category in \mathbf{Cat}“, with a category of objects and a category of morphisms, each of which of course has objects and morphisms of its own. For the transformation double category, following the same construction as for sets, the object-category is just \mathbf{C}, and the morphism-category is \mathcal{G} \times \mathbf{C}, and the target functor is just the action map \hat{\Phi}. (The other structure maps that make this into a category in \mathbf{Cat} can similarly be worked out by following your nose).

This is fine, but the internal description tends to obscure an underlying symmetry in the idea of double categories, in which morphisms in the object-category and objects in the morphism-category can switch roles, and get a different description of “the same” double category, denoted the “transpose”.

A different approach considers these as two different types of morphism, “horizontal” and “vertical”: they are the morphisms of horizontal and vertical categories, built on the same set of objects (the objects of the object-category). The morphisms of the morphism-category are then called “squares”. This makes a convenient way to draw diagrams in the double category. Here’s a version of a diagram from our paper with the notation I’ve used here, showing what a square corresponding to a morphism (\chi,f) \in \mathcal{G} \times \mathbf{C} looks like:

squarepic

The square (with the boxed label) has the dashed arrows at the top and bottom for its source and target horizontal morphisms (its images under the source and target functors: the argument above about naturality means they’re well-defined). The vertical arrows connecting them are the source and target vertical morphisms (its images under the source and target maps in the morphism-category).

Horizontal and Vertical Slices of \mathbf{C} // \mathcal{G}

So by construction, the horizontal category of these squares is just the object-category \mathbf{C}.  For the same reason, the squares and vertical morphisms, make up the category \mathcal{G} \times \mathbf{C}.

On the other hand, the vertical category has the same objects as \mathbf{C}, but different morphisms: it’s not hard to see that the vertical category is just the transformation groupoid for the action of the group of \mathbf{G}-objects on the set of \mathbf{C}-objects, Ob(\mathbf{C}) // Ob(\mathcal{G}). Meanwhile, the horizontal morphisms and squares make up the transformation groupoid Mor(\mathbf{C}) // Mor(\mathcal{G}). These are the object-category and morphism-category of the transpose of the double-category we started with.

We can take this further: if squares aren’t hip enough for you – or if you’re someone who’s happy with 2-categories but finds double categories unfamiliar – the horizontal and vertical categories can be extended to make horizontal and vertical bicategories. They have the same objects and morphisms, but we add new 2-cells which correspond to squares where the boundaries have identity morphisms in the direction we’re not interested in. These two turn out to feel quite different in style.

First, the horizontal bicategory extends \mathbf{C} by adding 2-morphisms to it, corresponding to morphisms of \mathcal{G}: roughly, it makes the morphisms of \mathbf{C} into the objects of a new transformation groupoid, based on the action of the group of automorphisms of the identity in \mathcal{G} (which ensures the square has identity edges on the sides.) This last point is the only constraint, and it’s not a very strong one since Aut(1_G) and G essentially determine the entire 2-group: the constraint only relates to the structure of \mathcal{G}.

The constraint for the vertical bicategory is different in flavour because it depends more on the action \Phi. Here we are extending a transformation groupoid, Ob(\mathbf{C}) // Ob(\mathcal{G}). But, for some actions, many morphisms in \mathcal{G} might just not show up at all. For 1-morphisms (\gamma, x), the only 2-morphisms which can appear are those taking \gamma to some \gamma ' which has the same effect on x as \gamma. So, for example, this will look very different if \Phi is free (so only automorphisms show up), or a trivial action (so that all morphisms appear).

In the paper, we look at these in the special case of an adjoint action of a 2-group, so you can look there if you’d like a more concrete example of this difference.

Speculative Remarks

The starting point for this was a project (which I talked about a year ago) to do with higher gauge theory – see the last part of the linked post for more detail. The point is that, in gauge theory, one deals with connections on bundles, and morphisms between them called gauge transformations. If one builds a groupoid out of these in a natural way, it turns out to result from the action of a big symmetry group of all gauge transformations on the moduli space of connections.

In higher gauge theory, one deals with connections on gerbes (or higher gerbes – a bundle is essentially a “0-gerbe”). There are now also (2-)morphisms between gauge transformations (and, in higher cases, this continues further), which Roger Picken and I have been calling “gauge modifications”. If we try to repeat the situation for gauge theory, we can construct a 2-groupoid out of these, which expresses this local symmetry. The thing which is different for gerbes (and will continue to get even more different if we move to n-gerbes and the corresponding (n+1)-groupoids) is that this is not the same type of object as a transformation double category.

Now, in our next paper (which this one was written to make possible) we show that the 2-groupoid is actually very intimately related to the transformation double category: that is, the local picture of symmetry for a higher gauge theory is, just as in the lower-dimensional situation, intimately related to a global symmetry of an entire moduli 2-space, i.e. a category. The reason this wasn’t obvious at first is that the moduli space which includes only connections is just the space of objects of this category: the point is that there are really two special kinds of gauge transformations. One should be thought of as the morphisms in the moduli 2-space, and the other as part of the symmetries of that 2-space. The intuition that comes from ordinary gauge theory overlooks this, because the phenomenon doesn’t occur there.

Physically-motivated theories are starting to use these higher-categorical concepts more and more, and symmetry is a crucial idea in physics. What I’ve sketched here is presumably only the start of a pattern in which “symmetry” extends to higher-categorical entities. When we get to 3-groups, our simplifying assumptions that use “strictification” results won’t even be available any more, so we would expect still further new phenomena to show up – but it seems plausible that the tight relation between global and local symmetry will still exist, but in a way that is more subtle, and refines the standard understanding we have of symmetry today.

This is the 100th entry on this blog! It’s taken a while, but we’ve arrived at a meaningless but convenient milestone. This post constitutes Part III of the posts on the topics course which I shared with Susama Agarwala. In the first, I summarized the core idea in the series of lectures I did, which introduced toposes and sheaves, and explained how, at least for appropriate sites, sheaves can be thought of as generalized spaces. In the second, I described the guest lecture by John Huerta which described how supermanifolds can be seen as an example of that notion.

In this post, I’ll describe the machinery I set up as part of the context for Susama’s talks. The connections are a bit tangential, but it gives some helpful context for what’s to come. Namely, my last couple of lectures were on sheaves with structure, and derived categories. In algebraic geometry and elsewhere, derived categories are a common tool for studying spaces. They have a cohomological flavour, because they involve sheaves of complexes (or complexes of sheaves) of abelian groups. Having talked about the background of sheaves in Part I, let’s consider how these categories arise.

Structured Sheaves and Internal Constructions in Toposes

The definition of a (pre)sheaf as a functor valued in Sets is the basic one, but there are parallel notions for presheaves valued in categories other than Sets – for instance, in Abelian groups, rings, simplicial sets, complexes etc. Abelian groups are particularly important for geometry/cohomology.

But for the most part, as long as the target category can be defined in terms of sets and structure maps (such as the multiplication map for groups, face maps for simplicial sets, or boundary maps in complexes), we can just think of these in terms of objects “internal to a category of sheaves”. That is, we have a definition of “abelian group object” in any reasonably nice category – in particular, any topos. Then the category of “abelian group objects in Sh(\mathcal{T})” is equivalent to a category of “abelian-group-valued sheaves on \mathcal{T}“, denoted Sh((\mathcal{T},J),\mathbf{AbGrp}). (As usual, I’ll omit the Grothendieck topology J in the notation from now on, though it’s important that it is still there.)

Sheaves of abelian groups are supposed to generalize the prototypical example, namely sheaves of functions valued in abelian groups, (indeed, rings) such as \mathbb{Z}, \mathbb{R}, or \mathbb{C}.

To begin with, we look at the category Sh(\mathcal{T},\mathbf{AbGrp}), which amounts to the same as the category of abelian group objects in  Sh(\mathcal{T}). This inherits several properties from \mathbf{AbGrp} itself. In particular, it’s an abelian category: this gives us that there is a direct sum for objects, a zero object, exact sequences split, all morphisms have kernels and cokernels, and so forth. These useful properties all hold because at each U \in \mathcal{T}, the direct sum of sheaves of abelian group just gives (A \oplus A')(U) = A(U) \oplus A'(U), and all the properties hold locally at each U.

So, sheaves of abelian groups can be seen as abelian groups in a topos of sheaves Sh(\mathcal{T}). In the same way, other kinds of structures can be built up inside the topos of sheaves, and there are corresponding “external” point of view. One good example would be simplicial objects: one can talk about the simplicial objects in Sh(\mathcal{T},\mathbf{Set}), or sheaves of simplicial sets, Sh(\mathcal{T},\mathbf{sSet}). (Though it’s worth noting that since simplicial sets model infinity-groupoids, there are more sophisticated forms of the sheaf condition which can be applied here. But for now, this isn’t what we need.)

Recall that simplicial objects in a category \mathcal{C} are functors S \in Fun(\Delta^{op},\mathcal{C}) – that is, \mathcal{C}-valued presheaves on \Delta, the simplex category. This \Delta has nonnegative integers as its objects, and the morphisms from n to m are the order-preserving functions from \{ 1, 2, \dots, n \} to \{ 1, 2, \dots, m \}. If \mathcal{C} = \mathbf{Sets}, we get “simplicial sets”, where S(n) is the “set of n-dimensional simplices”. The various morphisms in \Delta turn into (composites of) the face and degeneracy maps. Simplicial sets are useful because they are a good model for “spaces”.

Just as with abelian groups, simplicial objects in Sh(\mathcal{T}) can also be seen as sheaves on \mathcal{T} valued in the category \mathbf{sSet} of simplicial sets, i.e. objects of Sh(\mathcal{T},\mathbf{sSet}). These things are called, naturally, “simplicial sheaves”, and there is a rather extensive body of work on them. (See, for instance, the canonical book by Goerss and Jardine.)

This correspondence is just because there is a fairly obvious bunch of isomorphisms turning functors with two inputs into functors with one input returning another functor with one input:

Fun(\Delta^{op} \times \mathcal{T}^{op},\mathbf{Sets}) \cong Fun(\Delta^{op}, Fun(\mathcal{T}^{op}, \mathbf{Sets}))

and

Fun(\Delta^{op} \times \mathcal{T}^{op},\mathbf{Sets}) \cong Fun(\mathcal{T}^{op},Fun(\Delta^{op},\mathbf{Sets})

(These are all presheaf categories – if we put a trivial topology on \Delta, we can refine this to consider only those functors which are sheaves in every position, where we use a certain product topology on \Delta \times \mathcal{T}.)

Another relevant example would be complexes. This word is a bit overloaded, but here I’m referring to the sort of complexes appearing in cohomology, such as the de Rahm complex, where the terms of the complex are the sheaves of differential forms on a space, linked by the exterior derivative. A complex X^{\bullet} is a sequence of Abelian groups with boundary maps \partial^i : X^i \rightarrow X^{i+1} (or just \partial for short), like so:

\dots \rightarrow^{\partial} X^0 \rightarrow^{\partial} X^1 \rightarrow^{\partial} X^2 \rightarrow^{\partial} \dots

with the property that \partial^{i+1} \circ \partial^i = 0. Morphisms between these are sequences of morphisms between the terms of the complexes (\dots,f_0,f_1,f_2,\dots) where each f_i : X^i \rightarrow Y^i which commute with all the boundary maps. These all assemble into a category of complexes C^{\bullet}(\mathbf{AbGrp}). We also have C^{\bullet}_+ and C^{\bullet}_-, the (full) subcategories of complexes where all the negative (respectively, positive) terms are trivial.

One can generalize this to replace \mathbf{AbGrp} by any category enriched in abelian groups, which we need to make sense of the requirement that a morphism is zero. In particular, one can generalize it to sheaves of abelian groups. This is an example where the above discussion about internalization can be extended to more than one structure at a time: “sheaves-of-(complexes-of-abelian-groups)” is equivalent to “complexes-of-(sheaves-of-abelian-groups)”.

This brings us to the next point, which is that, within Sh(\mathcal{T},\mathbf{AbGrp}), the last two examples, simplicial objects and complexes, are secretly the same thing.

Dold-Puppe Correspondence

The fact I just alluded to is a special case of the Dold-Puppe correspondence, which says:

Theorem: In any abelian category \mathcal{A}, the category of simplicial objects Fun(\Delta^{op},\mathcal{A}) is equivalent to the category of positive chain complexes C^{\bullet}_+(\mathcal{A}).

The better-known name “Dold-Kan Theorem” refers to the case where \mathcal{A} = \mathbf{AbGrp}. If \mathcal{A} is a category of \mathbf{AbGrp}-valued sheaves, the Dold-Puppe correspondence amounts to using Dold-Kan at each U.

The point is that complexes have only coboundary maps, rather than a plethora of many different face and boundary maps, so we gain some convenience when we’re looking at, for instance, abelian groups in our category of spaces, by passing to this equivalent description.

The correspondence works by way of two maps (for more details, see the book by Goerss and Jardine linked above, or see the summary here). The easy direction is the Moore complex functor, N : Fun(\Delta^{op},\mathcal{A} \rightarrow C^{\bullet}_+(\mathcal{A}). On objects, it gives the intersection of all the kernels of the face maps:

(NS)_k = \bigcap_{j=1}^{k-1} ker(d_i)

The boundary map from this is then just \partial_n = (-1)^n d_n. This ends up satisfying the “boundary-squared is zero” condition because of the identities for the face maps.

The other direction is a little more complicated, so for current purposes, I’ll leave you to follow the references above, except to say that the functor \Gamma from complexes to simplicial objects in \mathcal{A} is defined so as to be adjoint to N. Indeed, N and \Gamma together form an adjoint equivalence of the categories.

Chain Homotopies and Quasi-Isomorphisms

One source of complexes in mathematics is in cohomology theories. So, for example, there is de Rahm cohomology, where one starts with the complex with \Omega^n(M) the space of smooth differential n-forms on some smooth manifold M, with the exterior derivatives as the coboundary maps. But no matter which complex you start with, there is a sequence of cohomology groups, because we have a sequence of cohomology functors:

H^k : C^{\bullet}(\mathcal{A}) \rightarrow \mathcal{A}

given by the quotients

H^k(A^{\bullet}) = Ker(\partial_k) / Im(\partial_{k-1})

That is, it’s the cocycles (things whose coboundary is zero), up to equivalence where cocycles are considered equivalent if their difference is a coboundary (i.e. something which is itself the coboundary of something else). In fact, these assemble into a functor H^{\bullet} : C^{\bullet}(\mathcal{A}) \rightarrow C^{\bullet}(\mathcal{A}), since there are natural transformations between these functors

\delta^k(A^{\bullet}) : H^k(A^{\bullet} \rightarrow H^{k+1}(A^{\bullet})

which just come from the restrictions of the \partial^k to the kernel Ker(\partial^k). (In fact, this makes the maps trivial – but the main point is that this restriction is well-defined on equivalence classes, and so we get an actual complex again.) The fact that we get a functor means that any chain map f^{\bullet} : A^{\bullet} \rightarrow B^{\bullet} gives a corresponding H^{\bullet}(f^{\bullet}) : H^{\bullet}(A^{\bullet}) \rightarrow H^{\bullet}(B^{\bullet}).

Now, the original motivation of cohomology for a space, like the de Rahm cohomology of a manifold M, is to measure something about the topology of M. If M is trivial (say, a contractible space), then its cohomology groups are all trivial. In the general setting, we say that A^{\bullet} is acyclic if all the H^k(A^{\bullet}) = 0. But of course, this doesn’t mean that the chain itself is zero.

More generally, just because two complexes have isomorphic cohomology, doesn’t mean they are themselves isomorphic, but we say that f^{\bullet} is a quasi-isomorphism if H^{\bullet}(f^{\bullet}) is an isomorphism. The idea is that, as far as we can tell from the information that coholomology detects, it might as well be an isomorphism.

Now, for spaces, as represented by simplicial sets, we have a similar notion: a map between spaces is a quasi-isomorphism if it induces an isomorphism on cohomology. Then the key thing is the Whitehead Theorem (viz), which in this language says:

Theorem: If f : X \rightarrow Y is a quasi-isomorphism, it is a homotopy equivalence.

That is, it has a homotopy inverse f' : Y \rightarrow X, which means there is a homotopy h : f' \circ f \rightarrow Id.

What about for complexes? We said that in an abelian category, simplicial objects and complexes are equivalent constructions by the Dold-Puppe correspondence. However, the question of what is homotopy equivalent to what is a bit more complicated in the world of complexes. The convenience we gain when passing from simplicial objects to the simpler structure of complexes must be paid for it with a little extra complexity in describing what corresponds to homotopy equivalences.

The usual notion of a chain homotopy between two maps f^{\bullet}, g^{\bullet} : A^{\bullet} \rightarrow B^{\bullet} is a collection of maps which shift degrees, h^k : A^k \rightarrow B^{k-1}, such that f-g = \partial \circ h. That is, the coboundary of h is the difference between f and g. (The “co” version of the usual intuition of a homotopy, whose ingoing and outgoing boundaries are the things which are supposed to be homotopic).

The Whitehead theorem doesn’t work for chain complexes: the usual “naive” notion of chain homotopy isn’t quite good enough to correspond to the notion of homotopy in spaces. (There is some discussion of this in the nLab article on the subject. That is the reason for…

Derived Categories

Taking “derived categories” for some abelian category can be thought of as analogous, for complexes, to finding the homotopy category for simplicial objects. It compensates for the fact that taking a quotient by chain homotopy doesn’t give the same “homotopy classes” of maps of complexes as the corresponding operation over in spaces.

That is, simplicial sets, as a model category, know everything about the homotopy type of spaces: so taking simplicial objects in \mathcal{C} is like internalizing the homotopy theory of spaces in a category \mathcal{C}. So, if what we’re interested in are the homotopical properties of spaces described as simplicial sets, we want to “mod out” by homotopy equivalences. However, we have two notions which are easy to describe in the world of complexes, which between them capture the notion “homotopy” in simplicial sets. There are chain homotopies and quasi-isomorphisms. So, naturally, we mod out by both notions.

So, suppose we have an abelian category \mathcal{A}. In the background, keep in mind the typical example where \mathcal{A} = Sh( (\mathcal{T},J), \mathbf{AbGrp} ), and even where \mathcal{T} = TOP(X) for some reasonably nice space X, if it helps to picture things. Then the derived category of \mathcal{A} is built up in a few steps:

  1. Take the category C^{\bullet} ( \mathcal{A} ) of complexes. (This stands in for “spaces in \mathcal{A}” as above, although we’ve dropped the “+“, so the correct analogy is really with spectra. This is a bit too far afield to get into here, though, so for now let’s just ignore it.)
  2. Take morphisms only up to homotopy equivalence. That is, define the equivalence relation with f \sim g whenever there is a homotopy h with f-g = \partial \circ h.  Then K^{\bullet}(\mathcal{A}) = C^{\bullet}(\mathcal{A})/ \sim is the quotient by this relation.
  3. Localize at quasi-isomorphisms. That is, formally throw in inverses for all quasi-isomorphisms f, to turn them into actual isomorphisms. The result is D^{\bullet}(\mathcal{A}).

(Since we have direct sums of complexes (componentwise), it’s also possible to think of the last step as defining D^{\bullet}(\mathcal{A}) = K^{\bullet}(\mathcal{A})/N^{\bullet}(\mathcal{A}), where N^{\bullet}(\mathcal{A}) is the category of acyclic complexes – the ones whose cohomology complexes are zero.)

Explicitly, the morphisms of D^{\bullet}(\mathcal{A}) can be thought of as “zig-zags” in K^{\bullet}(\mathcal{A}),

X^{\bullet}_0 \leftarrow X^{\bullet}_1 \rightarrow X^{\bullet}_2 \leftarrow \dots \rightarrow X^{\bullet}_n

where all the left-pointing arrows are quasi-isomorphisms. (The left-pointing arrows are standing in for their new inverses in D^{\bullet}(\mathcal{A}), pointing right.) This relates to the notion of a category of spans: in a reasonably nice category, we can always compose these zig-zags to get one of length two, with one leftward and one rightward arrow. In general, though, this might not happen.

Now, the point here is that this is a way of extracting “homotopical” or “cohomological” information about \mathcal{A}, and hence about X if \mathcal{A} = Sh(TOP(X),\mathbf{AbGrp}) or something similar. In the next post, I’ll talk about Susama’s series of lectures, on the subject of motives. This uses some of the same technology described above, in the specific context of schemes (which introduces some extra considerations specific to that world). It’s aim is to produce a category (and a functor into it) which captures all the cohomological information about spaces – in some sense a universal cohomology theory from which any other can be found.

John Huerta visited here for about a week earlier this month, and gave a couple of talks. The one I want to write about here was a guest lecture in the topics course Susama Agarwala and I were teaching this past semester. The course was about topics in category theory of interest to geometry, and in the case of this lecture, “geometry” means supergeometry. It follows the approach I mentioned in the previous post about looking at sheaves as a kind of generalized space. The talk was an introduction to a program of seeing supermanifolds as a kind of sheaf on the site of “super-points”. This approach was first proposed by Albert Schwartz, though see, for instance, this review by Christophe Sachse for more about this approach, and this paper (comparing the situation for real and complex (super)manifolds) for more recent work.

It’s amazing how many geometrical techniques can be applied in quite general algebras once they’re formulated correctly. It’s perhaps less amazing for supermanifolds, in which commutativity fails in about the mildest possible way.  Essentially, the algebras in question split into bosonic and fermionic parts. Everything in the bosonic part commutes with everything, and the fermionic part commutes “up to a negative sign” within itself.

Supermanifolds

Supermanifolds are geometric objects, which were introduced as a setting on which “supersymmetric” quantum field theories could be defined. Whether or not “real” physics has this symmetry (the evidence is still pending, though ), these are quite nicely behaved theories. (Throwing in extra symmetry assumptions tends to make things nicer, and supersymmetry is in some sense the maximum extra symmetry we might reasonably hope for in a QFT).

Roughly, the idea is that supermanifolds are spaces like manifolds, but with some non-commuting coordinates. Supermanifolds are therefore in some sense “noncommutative spaces”. Noncommutative algebraic or differential geometry start with various dualities to the effect that some category of spaces is equivalent to the opposite of a corresponding category of algebras – for instance, a manifold M corresponds to the C^{\infty} algebra C^{\infty}(M,\mathbb{R}). So a generalized category of “spaces” can be found by dropping the “commutative” requirement from that statement. The category \mathbf{SMan} of supermanifolds only weakens the condition slightly: the algebras are \mathbb{Z}_2-graded, and are “supercommutative”, i.e. commute up to a sign which depends on the grading.

Now, the conventional definition of supermanifolds, as with schemes, is to say that they are spaces equipped with a “structure sheaf” which defines an appropriate class of functions. For ordinary (real) manifolds, this would be the sheaf assigning to an open set U the ring C^{\infty}(U,\mathbb{R}) of all the smooth real-valued functions. The existence of an atlas of charts for the manifold amounts to saying that the structure sheaf locally looks like C^{\infty}(V,\mathbb{R}) for some open set V \subset \mathbb{R}^p. (For fixed dimension p).

For supermanifolds, the condition on the local rings says that, for fixed dimension (p \bar q ), a p|q-dimensional supermanifold has structure sheaf in which $they look like

\mathcal{O}(\mathcal{U}) \cong C^{\infty}(V,\mathbb{R}) \otimes \Lambda_q

In this, V is as above, and the notation

\Lambda_q = \Lambda ( \theta_1, \dots , \theta_q )

refers to the exterior algebra, which we can think of as polynomials in the \theta_i, with the wedge product, which satisfies \theta_i \wedge \theta_j = - \theta_j \wedge \theta_i. The idea is that one is supposed to think of this as the algebra of smooth functions on a space with p ordinary dimensions, and q “anti-commuting” dimensions with coordinates \theta_i. The commuting variables, say x_1,\dots,x_p, are called “bosonic” or “even”, and the anticommuting ones are “fermionic” or “odd”. (The term “fermionic” is related to the fact that, in quantum mechanics, when building a Hilbert space for a bunch of identical fermions, one takes the antisymmetric part of the tensor product of their individual Hilbert spaces, so that, for instance, v_1 \otimes v_2 = - v_2 \otimes v_1).

The structure sheaf picture can therefore be thought of as giving an atlas of charts, so that the neighborhoods locally look like “super-domains”, the super-geometry equivalent of open sets V \subset \mathbb{R}^p.

In fact, there’s a long-known theorem of Batchelor which says that any real supermanifold is given exactly by the algebra of “global sections”, which looks like \mathcal{O}(M) = C^{\infty}(M_{red},\mathbb{R}) \otimes \Lambda_q. That is, sections in the local rings (“functions on” open neighborhoods of M) always glue together to give a section in \mathcal{O}(M).

Another way to put this is that every supermanifold can be seen as just bundle of exterior algebras. That is, a bundle over a base manifold M_{red}, whose fibres are the “super-points” \mathbb{R}^{0|q} corresponding to \Lambda_q. The base space M_{red} is called the “reduced” manifold. Any such bundle gives back a supermanifold, where the algebras in the structure sheaf are the algebras of sections of the bundle.

One shouldn’t be too complacent about saying they are exactly the same, though: this correspondence isn’t functorial. That is, the maps between supermanifolds are not just bundle maps. (Also, Batchelor’s theorem works only for real, not for complex, supermanifolds, where only the local neighborhoods necessarily look like such bundles).

Why, by the way, say that \mathbb{R}^{0|q} is a super “point”, when \mathbb{R}^{p|0} is a whole vector space? Since the fermionic variables are anticommuting, no term can have more than one of each \theta_i, so this is a finite-dimensional algebra. This is unlike C{\infty}(V,\mathbb{R}), which suggests that the noncommutative directions are quite different. Any element of \Lambda_q is nilpotent, so if we think of a Taylor series for some function – a power series in the (x_1,\dots,x_p,\theta_1,\dots,\theta_q) – we see note that no term has a coefficient for \theta_i greater than 1, or of degree higher than q in all the \theta_i – so imagines that only infinitesimal behaviour in these directions exists at all. Thus, a supermanifold M is like an ordinary p-dimensional manifold M_{red}, built from the ordinary domains V, equipped with a bundle whose fibres are a sort of “infinitesimal fuzz” about each point of the “even part” of the supermanifold, described by the \Lambda_q.

But this intuition is a bit vague. We can sharpen it a bit using the functor of points approach…

Supermanifolds as Manifold-Valued Sheaves

As with schemes, there is also a point of view that sees supermanifolds as “ordinary” manifolds, constructed in the topos of sheaves over a certain site. The basic insight behind the picture of these spaces, as in the previous post, is based on the fact that the Yoneda lemma lets us think of sheaves as describing all the “probes” of a generalized space (actually an algebra in this case). The “probes” are the objects of a certain category, and are called “superpoints“.

This category is just \mathbf{Spt} = \mathbf{Gr}^{op}, the opposite of the category of Grassman algebras (i.e. exterior algebras) – that is, polynomial algebras in noncommuting variables, like \Lambda(\theta_1,\dots,\theta_q). These objects naturally come with a \mathbb{Z}_2-grading, which are spanned, respectively, by the monomials with even and odd degree: \Lambda_q = latex \mathbf{SMan}$ (\Lambda_q)_0 \oplus (\Lambda_q)_1$

(\Lambda_q)_0 = span( 1, \theta_i \theta_j, \theta_{i_1}\dots\theta{i_4}, \dots )

and

(\Lambda_q)_1 = span( \theta_i, \theta_i \theta_j \theta_k, \theta_{i_1}\dots\theta_{i_5},\dots )

This is a \mathbb{Z}_2-grading since the even ones commute with anything, and the odd ones anti-commute with each other. So if f_i and f_j are homogeneous (live entirely in one grade or the other), then f_i f_j = (-1)^{deg(i)deg(j)} f_j f_i.

The \Lambda_q should be thought of as the (0|q)-dimensional supermanifold: it looks like a point, with a q-dimensional fermionic tangent space (the “infinitesimal fuzz” noted above) attached. The morphisms in \mathbf{Spt} from \Lambda_q to $llatex \Lambda_r$ are just the grade-preserving algebra homomorphisms from \Lambda_r to \Lambda_q. There are quite a few of these: these objects are not terminal objects like the actual point. But this makes them good probes. Thi gets to be a site with the trivial topology, so that all presheaves are sheaves.

Then, as usual, a presheaf M on this category is to be understood as giving, for each object A=\Lambda_q, the collection of maps from \Lambda_q to a space M. The case q=0 gives the set of points of M, and the various other algebras A give sets of “A-points”. This term is based on the analogy that a point of a topological space (or indeed element of a set) is just the same as a map from the terminal object 1, the one point space (or one element set). Then an “A-point” of a space X is just a map from another object A. If A is not terminal, this is close to the notion of a “subspace” (though a subspace, strictly, would be a monomorphism from A). These are maps from A in \mathbf{Spt} = \mathbf{Gr}^{op}, or as algebra maps, M_A consists of all the maps \mathcal{O}(M) \rightarrow A.

What’s more, since this is a functor, we have to have a system of maps between the M_A. For any algebra maps A \rightarrow A', we should get corresponding maps M_{A'} \rightarrow M_A. These are really algebra maps \Lambda_q \rightarrow \Lambda_{q'}, of which there are plenty, all determined by the images of the generators \theta_1, \dots, \theta_q.

Now, really, a sheaf on \mathbf{Spt} is actually just what we might call a “super-set”, with sets M_A for each A \in \mathbf{Spt}. To make super-manifolds, one wants to say they are “manifold-valued sheaves”. Since manifolds themselves don’t form a topos, one needs to be a bit careful about defining the extra structure which makes a set a manifold.

Thus, a supermanifold M is a manifold constructed in the topos Sh(\mathbf{Spt}). That is, M must also be equipped with a topology and a collection of charts defining the manifold structure. These are all construed internally using objects and morphisms in the category of sheaves, where charts are based on super-domains, namely those algebras which look like C^{\infty}(V) \otimes \Lambda_q, for V an open subset of \mathbb{R}^p.

The reduced manifold M_{red} which appears in Batchelor’s theorem is the manifold of ordinary points M_{\mathbb{R}}. That is, it is all the \mathbb{R}-points, where \mathbb{R} is playing the role of functions on the zero-dimensional domain with just one point. All the extra structure in an atlas of charts for all of M to make it a supermanifold amounts to putting the structure of ordinary manifolds on the M_A – but in compatible ways.

(Alternatively, we could have described \mathbf{SMan} as sheaves in Sh(\mathbf{SDom}), where \mathbf{SDom} is a site of “superdomains”, and put all the structure defining a manifold into \mathbf{SDom}. But working over super-points is preferable for the moment, since it makes it clear that manifolds and supermanifolds are just manifestations of the same basic definition, but realized in two different toposes.)

The fact that the manifold structure on the M_A must be put on them compatibly means there is a relatively nice way to picture all these spaces.

Values of the Functor of Points as Bundles

The main idea which I find helps to understand the functor of points is that, for every superpoint \mathbb{R}^{0|n} (i.e. for every Grassman algebra A=\Lambda_n), one gets a manifold M_A. (Note the convention that q is the odd dimension of M, and n is the odd dimension of the probe superpoint).

Just as every supermanifold is a bundle of superpoints, every manifold M_A is a perfectly conventional vector bundle over the conventional manifold M_{red} of ordinary points. So for each A, we get a bundle, M_A \rightarrow M_{red}.

Now this manifold, M_{red}, consists exactly of all the “points” of M – this tells us immediately that \mathbf{SMan} is not a category of concrete sheaves (in the sense I explained in the previous post). Put another way, it’s not a concrete category – that would mean that there is an underlying set functor, which gives a set for each object, and that morphisms are determined by what they do to underlying sets. Non-concrete categories are, by nature, trickier to understand.

However, the functor of points gives a way to turn the non-concrete M into a tower of concrete manifolds M_A, and the morphisms between various M amount to compatible towers of maps between the various M_A for each A. The fact that the compatibility is controlled by algebra maps \Lambda_q \rightarrow \Lambda_{q'} explains why this is the same as maps between these bundles of superpoints.

Specifically, then, we have

M_A = \{ \mathcal{O}(M) \rightarrow A \}

This splits into maps of the even parts, and of the odd parts, where the grassman algebra A = \Lambda_n has even and odd parts: A = A_0 \oplus A_1, as above. Similarly, \mathcal{O}(M) splits into odd and even parts, and since the functions on M_{red} are entirely even, this is:

( \mathcal{O}(M))_0 = C^{\infty}(M_{red}) \otimes ( \Lambda_q)_0

and

( \mathcal{O}(M))_1 = C^{\infty}(M_{red}) \otimes (\Lambda_q)_1)

Now, the duality of “hom” and tensor means that Hom(\mathcal{O}(M),A) \cong \mathcal{O}(M) \otimes A, and algebra maps preserve the grading. So we just have tensor products of these with the even and odd parts, respectively, of the probe superpoint. Since the even part A_0 includes the multiples of the constants, part of this just gives a copy of U itself. The remaining part of A_0 is nilpotent (since it’s made of even-degree polynomials in the nilpotent \theta_i, so what we end up with, looking at the bundle over an open neighborhood U \subset M_{red}, is:

U_A = U \times ( (\Lambda_q)_0 \otimes A^{nil}_0) \times ((\Lambda_q)_1 \otimes A_1)

The projection map U_A \rightarrow U is the obvious projection onto the first factor. These assemble into a bundle over M_{red}.

We should think of these bundles as “shifting up” the nilpotent part of M (which are invisible at the level of ordinary points in M_{red}) by the algebra A. Writing them this way makes it clear that this is functorial in the superpoints A = \Lambda_n: given choices n and n', and any morphism between the corresponding A and A', it’s easy to see how we get maps between these bundles.

Now, maps between supermanifolds are the same thing as natural transformations between the functors of points. These include maps of the base manifolds, along with maps between the total spaces of all these bundles. More, this tower of maps must commute with all those bundle maps coming from algebra maps A \rightarrow A'. (In particular, since A = \Lambda_0, the ordinary point, is one of these, they have to commute with the projection to M_{red}.) These conditions may be quite restrictive, but it leaves us with, at least, a quite concrete image of what maps of supermanifolds

Super-Poincaré Group

One of the main settings where super-geometry appears is in so-called “supersymmetric” field theories, which is a concept that makes sense when fields live on supermanifolds. Supersymmetry, and symmetries associated to super-Lie groups, is exactly the kind of thing that John has worked on. A super-Lie group, of course, is a supermanifold that has the structure of a group (i.e. it’s a Lie group in the topos of presheaves over the site of super-points – so the discussion above means it can be thought of as a big tower of Lie groups, all bundles over a Lie group G_{red}).

In fact, John has mostly worked with super-Lie algebras (and the connection between these and division algebras, though that’s another story). These are \mathbb{Z}_2-graded algebras with a Lie bracket whose commutation properties are the graded version of those for an ordinary Lie algebra. But part of the value of the framework above is that we can simply borrow results from Lie theory for manifolds, import it into the new topos PSh(\mathbf{Spt}), and know at once that super-Lie algebras integrate up to super-Lie groups in just the same way that happens in the old topos (of sets).

Supersymmetry refers to a particular example, namely the “super-Poincaré group”. Just as the Poincaré group is the symmetry group of Minkowski space, a 4-manifold with a certain metric on it, the super-Poincaré group has the same relation to a certain supermanifold. (There are actually a few different versions, depending on the odd dimension.) The algebra is generated by infinitesimal translations and boosts, plus some “translations” in fermionic directions, which generate the odd part of the algebra.

Now, symmetry in a quantum theory means that this algebra (or, on integration, the corresponding group) acts on the Hilbert space \mathcal{H} of possible states of the theory: that is, the space of states is actually a representation of this algebra. In fact, to make sense of this, we need a super-Hilbert space (i.e. a graded one). The even generators of the algebra then produce grade-preserving self-maps of \mathcal{H}, and the odd generators produce grade-reversing ones. (This fact that there are symmetries which flip the “bosonic” and “fermionic” parts of the total \mathcal{H} is why supersymmetric theories have “superpartners” for each particle, with the opposite parity, since particles are labelled by irreducible representations of the Poincaré group and the gauge group).

To date, so far as I know, there’s no conclusive empirical evidence that real quantum field theories actually exhibit supersymmetry, such as detecting actual super-partners for known particles. Even if not, however, it still has some use as a way of developing toy models of quite complicated theories which are more tractable than one might expect, precisely because they have lots of symmetry. It’s somewhat like how it’s much easier to study computationally difficult theories like gravity by assuming, for instance, spherical symmetry as an extra assumption. In any case, from a mathematician’s point of view, this sort of symmetry is just a particularly simple case of symmetries for theories which live on noncommutative backgrounds, which is quite an interesting topic in its own right. As usual, physics generates lots of math which remains both true and interesting whether or not it applies in the way it was originally suggested.

In any case, what the functor-of-points viewpoint suggests is that ordinary and super- symmetries are just two special cases of “symmetries of a field theory” in two different toposes. Understanding these and other examples from this point of view seems to give a different understanding of what “symmetry”, one of the most fundamental yet slippery concepts in mathematics and science, actually means.

This semester, Susama Agarwala and I have been sharing a lecture series for graduate students. (A caveat: there are lecture notes there, by student request, but they’re rough notes, and contain some mistakes, omissions, and represent a very selective view of the subject.) Being a “topics” course, it consists of a few different sections, loosely related, which revolve around the theme of categorical tools which are useful for geometry (and topology).

What this has amounted to is: I gave a half-semester worth of courses on toposes, sheaves, and the basics of derived categories. Susama is now giving the second half, which is about motives. This post will talk about the part of the course I gave. Though this was a whole series of lectures which introduced all these topics more or less carefully, I want to focus here on the part of the lecture which built up to a discussion of sheaves as spaces. Nothing here, or in the two posts to follow, is particularly new, but they do amount to a nice set of snapshots of some related ideas.

Coming up soon: John Huerta is currently visiting Hamburg, and on  July 8, he gave a guest-lecture which uses some of this machinery to talk about supermanifolds, which will be the subject of the next post in this series. In a later post, I’ll talk about Susama’s lectures about motives and how this relates to the discussion here (loosely).

Grothendieck Toposes

The first half of our course was about various aspects of Grothendieck toposes. In the first lecture, I talked about “Elementary” (or Lawvere-Tierney) toposes. One way to look at these is to say that they are categories \mathcal{E} which have all the properties of the category of Sets which make it useful for doing most of ordinary mathematics. Thus, a topos in this sense is a category with a bunch of properties – there are various equivalent definitions, but for example, toposes have all finite limits (in particular, products), and all colimits.

More particularly, they have “power objects”. That is, if A and B are objects of \mathcal{E}, then there is an object B^A, with an “evaluation map” B^A \times A \rightarrow B, which makes it possible to think of B^A as the object of “morphisms from A to B”.

The other main thing a topos has is a “subobject classifier”. Now, a subobject of A \in \mathcal{E} is an equivalence class of monomorphisms into A – think of sets, where this amounts to specifying the image, and the monomorphisms are the various inclusions which pick out the same subset as their image. A classifier for subobjects should be thought of as something like the two-element set is Sets, whose elements we can tall “true” and “false”. Then every subset of A corresponds to a characteristic function A \rightarrow \mathbf{2}. In general, a subobject classifies is an object \Omega together with a map from the terminal object, T : 1 \rightarrow \Omega, such that every inclusion of subobject is a pullback of T along a characteristic function.

Now, elementary toposes were invented chronologically later than Grothendieck toposes, which are a special class of example. These are categories of sheaves on (Grothendieck) sites. A site is a category \mathcal{T} together with a “topology” J, which is a rule which, for each U \in \mathcal{T}, picks out J(U), a set of collections of maps into U, called seives for U. They collections J(U) have to satisfy certain conditions, but the idea can be understood in terms of the basic example, \mathcal{T} = TOP(X). Given a topological space, TOP(X) is the category whose objects are the open sets U \subset X, and the morphisms are all the inclusions. Then  that each collection in J(U) is an open cover of U – that is, a bunch of inclusions of open sets, which together cover all of U in the usual sense.

(This is a little special to TOP(X), where every map is an inclusion – in a general site, the J(U) need to be closed under composition with any other morphism (like an ideal in a ring). So for instance, \mathcal{T} = Top, the category of topological spaces, the usual choice of J(U) consists of all collections of maps which are jointly surjective.)

The point is that a presheaf on \mathcal{T} is just a functor \mathcal{T}^{op} \rightarrow Sets. That is, it’s a way of assigning a set to each U \in \mathcal{T}. So, for instance, for either of the cases we just mentioned, one has B : \mathcal{T}^{op} \rightarrow Sets, which assigns to each open set U the set of all bounded functions on U, and to every inclusion the restriction map. Or, again, one has C : \mathcal{T}^{op} \rightarrow Sets, which assigns the set of all continuous functions.

These two examples illustrate the condition which distinguishes those presheaves S which are sheaves – namely, those which satisfy some “gluing” conditions. Thus, suppose we’re, given an open cover \{ f_i : U_i \rightarrow U \}, and a choice of one element x_i from each S(U_i), which form a “matching family” in the sense that they agree when restricted to any overlaps. Then the sheaf condition says that there’s a unique “amalgamation” of this family – that is, one element x \in S(U) which restricts to all the x_i under the maps S(f_i) : S(U) \rightarrow S(U_i).

Sheaves as Generalized Spaces

There are various ways of looking at sheaves, but for the purposes of the course on categorical methods in geometry, I decided to emphasize the point of view that they are a sort of generalized spaces.

The intuition here is that all the objects and morphisms in a site \mathcal{T} have corresponding objects and morphisms in Psh(\mathcal{T}). Namely, the objects appear as the representable presheaves, U \mapsto Hom(-,U), and the morphisms U \rightarrow V show up as the induced natural transformations between these functors. This map y : \mathcal{T} \rightarrow Psh(\mathcal{T}) is called the Yoneda embedding. If \mathcal{T} is at all well-behaved (as it is in all the examples we’re interested in here), these presheaves will always be sheaves: the image of y lands in Sh(\mathcal{T}).

In this case, the Yoneda embedding embeds \mathcal{T} as a sub-category of Sh(\mathcal{T}). What’s more, it’s a full subcategory: all the natural transformations between representable presheaves come from the morphisms of \mathcal{T}-objects in a unique way. So  Sh(\mathcal{T}) is, in this sense, a generalization of \mathcal{T} itself.

More precisely, it’s the Yoneda lemma which makes sense of all this. The idea is to start with the way ordinary \mathcal{T}-objects (from now on, just call them “spaces”) S become presheaves: they become functors which assign to each U the set of all maps into S. So the idea is to turn this around, and declare that even non-representable sheaves should have the same interpretation. The Yoneda Lemma makes this a sensible interpretation: it says that, for any presheaf F \in Psh(\mathcal{T}), and any U \in \mathcal{T}, the set F(U) is naturally isomorphic to Hom(y(U),F): that is, F(U) literally is the collection of morphisms from U (or rather, its image under the Yoneda embedding) and a “generalized space” F. (See also Tom Leinster’s nice discussion of the Yoneda Lemma if this isn’t familiar.) We describe U as a “probe” object: one probes the space F by mapping U into it in various ways. Knowing the results for all U \in \mathcal{T} tells you all about the “space” F. (Thus, for instance, one can get all the information about the homotopy type of a space if you know all the maps into it from spheres of all dimensions up to homotopy. So spheres are acting as “probes” to reveal things about the space.)

Furthermore, since Sh(\mathcal{T}) is a topos, it is often a nicer category than the one you start with. It has limits and colimits, for instance, which the original category might not have. For example, if the kind of spaces you want to generalize are manifolds, one doesn’t have colimits, such as the space you get by gluing together two lines at a point. The sheaf category does. Likewise, the sheaf category has exponentials, and manifolds don’t (at least not without the more involved definitions needed to allow infinite-dimensional manifolds).

These last remarks about manifolds suggest the motivation for the first example…

Diffeological Spaces

The lecture I gave about sheaves as spaces used this paper by John Baez and Alex Hoffnung about “smooth spaces” (they treat Souriau’s diffeological spaces, and the different but related Chen spaces in the same framework) to illustrate the point. They describe In that case, the objects of the sites are open (or, for Chen spaces, convex) subsets of \mathbb{R}^n, for all choices of n, the maps are the smooth maps in the usual sense (i.e. the sense to be generalized), and the covers are jointly surjective collections of maps.

Now, that example is a somewhat special situation: they talk about concrete sheaves, on concrete sites, and the resulting categories are only quasitoposes – a slightly weaker condition than being a topos, but one still gets a useful collection of spaces, which among other things include all manifolds. The “concreteness” condition – that \mathcal{T} has a terminal object to play the role of “the point”. Being a concrete sheaf then means that all the “generalized spaces” have an underlying set of points (namely, the set of maps from the point object), and that all morphisms between the spaces are completely determined by what they do to the underlying set of points. This means that the “spaces” really are just sets with some structure.

Now, if the site happens to be TOP(X), then we have a slightly intuition: the “generalized” spaces are something like generalized bundles over X, and the “probes” are now sections of such a bundle. A simple example would be an actual sheaf of functions: these are sections of a trivial bundle, since, say, \mathbb{C}-valued functions are sections of the bundle \pi: X \times \mathbb{C} \rightarrow X. Given a nontrivial bundle \pi : M \rightarrow X, there is a sheaf of sections – on each U, one gets F_M(U) to be all the one-sided inverses s : U \rightarrow M which are one-sided inverses of \pi. For a generic sheaf, we can imagine a sort of “generalized bundle” over X.

Schemes

Another example of the fact that sheaves can be seen as spaces is the category of schemes: these are often described as topological spaces which are themselves equipped with a sheaf of rings. “Scheme” is to algebraic geometry what “manifold” is to differential geometry: a kind of space which looks locally like something classical and familiar. Schemes, in some neighborhood of each point, must resemble varieties – i.e. the locus of zeroes of some algebraic function on $\mathbb{k}^n$. For varieties, the rings attached to neighborhoods are rings of algebraic functions on this locus, which will be a quotient of the ring of polynomials.

But another way to think of schemes is as concrete sheaves on a site whose objects are varieties and whose morphisms are algebraic maps. This is dual to the other point of view, just as thinking of diffeological spaces as sheaves is dual to a viewpoint in which they’re seen as topological spaces equipped with a notion of “smooth function”.

(Some general discussion of this in a talk by Victor Piercey)

Generalities

These two viewpoints (defining the structure of a space by a class of maps into it, or by a class of maps out of it) in principle give different definitions. To move between them, you really need everything to be concrete: the space has an underlying set, the set of probes is a collection of real set-functions. Likewise, for something like a scheme, you’d need the ring for any open set to be a ring of actual set-functions. In this case, one can move between the two descriptions of the space as long as there is a pre-existing concept of the right kind of function  on the “probe” spaces. Given a smooth space, say, one can define a sheaf of smooth functions on each open set by taking those whose composites with every probe are smooth. Conversely, given something like a scheme, where the structure sheaf is of function rings on each open subspace (i.e. the sheaf is representable), one can define the probes from varieties to be those which give algebraic functions when composed with every function in these rings. Neither of these will work in general: the two approaches define different categories of spaces (in the smooth context, see Andrew Stacey’s comparison of various categories of smooth spaces, defined either by specifying the smooth maps in, or out, or both). But for very concrete situations, they fit together neatly.

The concrete case is therefore nice for getting an intuition for what it means to think of sheaves as spaces. For sheaves which aren’t concrete, morphisms aren’t determined by what they do to the underlying points i.e. the forgetful “underlying set” functor isn’t faithful. Here, we might think of a “generalized space” which looks like two copies of the same topological space: the sheaf gives two different elements of F(U) for each map of underlying sets. We could think of such generalized space as built from sets equipped with extra “stuff” (say, a set consisting of pairs (x,i) \in X \times \{ blue , green \} – so it consists of a “blue” copy of X and a “green” copy of X, but the underlying set functor ignores the colouring.

Still, useful as they may be to get a first handle on this concept of sheaf as generalized space, one shouldn’t rely on these intuitions too much: if \mathcal{T} doesn’t even have a “point” object, there is no underlying set functor at all. Eventually, one simply has to get used to the idea of defining a space by the information revealed by probes.

In the next post, I’ll talk more about this in the context of John Huerta’s guest lecture, applying this idea to the category of supermanifolds, which can be seen as manifolds built internal to the topos of (pre)sheaves on a site whose objects are called “super-points”.

Well, it’s been a while, but it’s now a new semester here in Hamburg, and I wanted to go back and look at some of what we talked about in last semester’s research seminar. This semester, Susama Agarwala and I are sharing the teaching in a topics class on “Category Theory for Geometry“, in which I’ll be talking about categories of sheaves, and building up the technology for Susama to talk about Voevodsky’s theory of motives (enough to give a starting point to read something like this).

As for last semester’s seminar, one of the two main threads, the one which Alessandro Valentino and I helped to organize, was a look at some of the material needed to approach Jacob Lurie’s paper on the classification of topological quantum field theories. The idea was for the research seminar to present the basic tools that are used in that paper to a larger audience, mostly of graduate students – enough to give a fairly precise statement, and develop the tools needed to follow the proof. (By the way, for a nice and lengthier discussion by Chris Schommer-Pries about this subject, which includes more details on much of what’s in this post, check out this video.)

So: the key result is a slightly generalized form of the Cobordism Hypothesis.

Cobordism Hypothesis

The sort of theory which the paper classifies are those which “extend down to a point”. So what does this mean? A topological field theory can be seen as a sort of “quantum field theory up to homotopy”, which abstract away any geometric information about the underlying space where the fields live – their local degrees of freedom.  We do this by looking only at the classes of fields up to the diffeomorphism symmetries of the space.  The local, geometric, information gets thrown away by taking this quotient of the space of solutions.

In spite of reducing the space of fields this way, we want to capture the intuition that the theory is still somehow “local”, in that we can cut up spaces into parts and make sense of the theory on those parts separately, and determine what it does on a larger space by gluing pieces together, rather than somehow having to take account of the entire space at once, indissolubly. This reasoning should apply to the highest-dimensional space, but also to boundaries, and to any figures we draw on boundaries when cutting them up in turn.

Carrying this on to the logical end point, this means that a topological quantum field theory in the fully extended sense should assign some sort of data to every geometric entity from a zero-dimensional point up to an n-dimensional cobordism.  This is all expressed by saying it’s an n-functor:

Z : Bord^{fr}_n(n) \rightarrow nAlg.

Well, once we know what this means, we’ll know (in principle) what a TQFT is.  It’s less important, for the purposes of Lurie’s paper, what nAlg is than what Bord^){fr}_n(n) is.  The reason is that we want to classify these field theories (i.e. functors).  It will turn out that Bord_n(n) has the sort of structure that makes it easy to classify the functors out of it into any target n-category \mathcal{C}.  A guess about what kind of structure is actually there was expressed by Baez and Dolan as the Cobordism Hypothesis.  It’s been slightly rephrased from the original form to get a form which has a proof.  The version Lurie proves says:

The (\infty,n)-category Bord^{fr}_n(n) is equivalent to the free symmetric monoidal (\infty,n)-category generated by one fully-dualizable object.

The basic point is that, since Bord^{fr}_n(n) is a free structure, the classification means that the extended TQFT’s amount precisely to the choice of a fully-dualizable object of \mathcal{C} (which includes a choice of a bunch of morphisms exhibiting the “dualizability”). However, to make sense of this, we need to have a suitable idea of an (\infty,n)-category, and know what a fully dualizable object is. Let’s begin with the first.

(\infty,n)-Categories

In one sense, the Cobordism Hypothesis, which was originally made about n-categories at a time when these were only beginning to be defined, could be taken as a criterion for an acceptable definition. That is, it expressed an intuition which was important enough that any definition which wouldn’t allow one to prove the Cobordism Hypothesis in some form ought to be rejected. To really make it work, one had to bring in the “infinity” part of (\infty,n)-categories. The point here is that we are talking about category-like structures which have morphisms between objects, 2-morphisms between morphisms, and so on, with j-morphisms between j-1-morphisms for every possible degree. The inspiration for this comes from homotopy theory, where one has maps, homotopies of maps, homotopies of homotopies, etc.

Nowadays, there are several possible concrete models for (\infty,n)-categories (see this survey article by Julie Bergner for a summary of four of them). They are all equivalent definitions, in a suitable up-to-homotopy way, but for purposes of the proof, Lurie is taking the definition that an (\infty,n)-category is an n-fold complete Segal space. One theme that shows up in all the definitions is that of simplicial methods. (In our seminar, we started with a series of two talks introducing the notions of simplicial sets, simplicial objects in a category, and Kan complexes. If you don’t already know this, essentially everything we need is nicely explained in here.)

One of the underlying ideas is that a category C can be associated with a simplicial set, its nerve N(C)_{\bullet}, where the set N(C)_k of k-dimensional simplexes is just the set of composable k-tuples of morphisms in C. If C is a groupoid (everything is invertible), then the simplicial set is a Kan complex – it satisfies some filling conditions, which ensure that any morphism has an inverse. Not every Kan complex is the nerve of a groupoid, but one can think of them as weak versions of groupoids – \infty-groupoids, or (\infty,0)-categories – where the higher morphisms may not be completely trivial (as with a groupoid), but where at least they’re all invertible. This leads to another desirable feature in any definition of (\infty,n)-category, which is the Homotopy Hypothesis: that the (\infty,1)-category of (\infty,0)-categories, also called \infty-groupoids, should be equivalent (in the same weak sense) to a category of Hausdorff spaces with some other nice properties, which we call \mathbf{Top} for short. This is true of Kan complexes.

Thus, up to homotopy, specifying an \infty-groupoid is the same as specifying a space.

The data which defines a Segal space (which was however first explicitly defined by Charlez Rezk) is a simplicial space X_{\bullet}: for each n, there are spaces X_n, thought of as the space of composable n-tuples of morphisms. To keep things tame, we suppose that X_0, the space of objects, is discrete – that is, we have only a set of objects. Being a simplicial space means that the X_n come equipped with a collection of face maps d_i : X_n \rightarrow X_{n-1}, which we should think of as compositions: to get from an n-tuple to an (n-1)-tuple of morphisms, one can compose two morphisms together at any of (n-1) positions in the tuple.

One condition which a simplicial space has to satisfy to be a Segal space has to do with the “weakening” which makes a Segal space a weaker notion than just a category lies in the fact that the X_n cannot be arbitrary, but must be homotopy equivalent to the “actual” space of n-tuples, which is a strict pullback X_1 \times_{X_0} \dots \times_{X_0} X_1. That is, in a Segal space, the pullback which defines these tuples for a category is weakened to be a homotopy pullback. Combining this with the various face maps, we therefore get a weakened notion of composition: X_1 \times_{X_0} \dots \times_{X_0} X_1 \cong X_n \rightarrow X_1. Because we start by replacing the space of n-tuples with the homotopy-equivalent X_n, the composition rule will only satisfy all the relations which define composition (associativity, for instance) up to homotopy.

To be complete, the Segal space must have a notion of equivalence for X_{\bullet} which agrees with that for Kan complexes seen as \infty-groupoids. In particular, there is a sub-simplicial object Core(X_{\bullet}), which we understand to consist of the spaces of invertible k-morphisms. Since there should be nothing interesting happening above the top dimension, we ask that, for these spaces, the face and degeneracy maps are all homotopy equivalences: up to homotopy, the space of invertible higher morphisms has no new information.

Then, an n-fold complete Segal space is defined recursively, just as one might define n-categories (without the infinitely many layers of invertible morphisms “at the top”). In that case, we might say that a double category is just a category internal to \mathbf{Cat}: it has a category of objects, and a category of morphims, and the various maps and operations, such as composition, which make up the definition of a category are all defined as functors. That turns out to be the same as a structure with objects, horizontal and vertical morphisms, and square-shaped 2-cells. If we insist that the category of objects is discrete (i.e. really just a set, with no interesting morphisms), then the result amounts to a 2-category. Then we can define a 3-category to be a category internal to \mathbf{2Cat} (whose 2-category of objects is discrete), and so on. This approach really defines an n-fold category (see e.g. Chapter 5 of Cheng and Lauda to see a variation of this approach, due to Tamsamani and Simpson), but imposing the condition that the objects really amount to a set at each step gives exactly the usual intuition of a (strict!) n-category.

This is exactly the approach we take with n-fold complete Segal spaces, except that some degree of weakness is automatic. Since a C.S.S. is a simplicial object with some properties (we separately define objects of k-tuples of morphisms for every k, and all the various composition operations), the same recursive approach leads to a definition of an “n-fold complete Segal space” as simply a simplicial object in (n-1)-fold C.S.S.’s (with the same properties), such that the objects form a set. In principle, this gives a big class of “spaces of morphisms” one needs to define – one for every n-fold product of simplexes of any dimension – but all those requirements that any space of objects “is just a set” (i.e. is homotopy-equivalent to a discrete set of points) simplifies things a bit.

Cobordism Category as (\infty,n)-Category

So how should we think of cobordisms as forming an (\infty,n)-category? There are a few stages in making a precise definition, but the basic idea is simple enough. One starts with manifolds and cobordisms embedded in some fixed finite-dimensional vector space V \times \mathbb{R}^n, and then takes a limit over all V. In each V \times \mathbb{R}^n, the coordinates of the \mathbb{R}^n factor give n ways of cutting the cobordism into pieces, and gluing them back together defines composition in a different direction. Now, this won’t actually produce a complete Segal space: one has to take a certain kind of completion. But the idea is intuitive enough.

We want to define an n-fold C.S.S. of cobordisms (and cobordisms between cobordisms, and so on, up to n-morphisms). To start with, think of the case n=1: then the space of objects of Bord^{fr}_1(1) consists of all embeddings of a (d-1)-dimensional manifold into V. The space of k-simplexes (of k-tuples of morphisms) consists of all ways of cutting up a d-dimensional cobordism embedded in V \times \mathbb{R} by choosing t_0, \dots , t_{k-2}, where we think of the cobordism having been glued from two pieces, where at the slice V \times {t_i}, we have the object where the two pieces were composed. (One has to be careful to specify that the Morse function on the cobordisms, got by projection only \mathbb{R}, has its critical points away from the t_i – the generic case – to make sure that the objects where gluing happens are actual manifolds.)

Now, what about the higher morphisms of the (\infty,1)-category? The point is that one needs to have an \infty-groupoid – that is, a space! – of morphisms between two cobordisms M and N. To make sense of this, we just take the space Diff(M,N) of diffeomorphisms – not just as a set of morphisms, but including its topology as well. The higher morphisms, therefore, can be thought of precisely as paths, homotopies, homotopies between homotopies, and so on, in these spaces. So the essential difference between the 1-category of cobordisms and the (\infty,1)-category is that in the first case, morphisms are diffeomorphism classes of cobordisms, whereas in the latter, the higher morphisms are made precisely of the space of diffeomorphisms which we quotient out by in the first case.

Now, (\infty,n)-categories, can have non-invertible morphisms between morphisms all the way up to dimension n, after which everything is invertible. An n-fold C.S.S. does this by taking the definition of a complete Segal space and copying it inside (n-1)-fold C.S.S’s: that is, one has an (n-1)-fold Complete Segal Space of k-tuples of morphisms, for each k, they form a simplicial object, and so forth.

Now, if we want to build an (\infty,n)-category Bord^{fr}_n(n) of cobordisms, the idea is the same, except that we have a simplicial object, in a category of simplicial objects, and so on. However, the way to define this is essentially similar. To specify an n-fold C.S.S., we have to specify a whole collection of spaces associated to cobordisms equipped with embeddings into V \times \mathbb{R}^n. In particular, for each tuple (k_1,\dots,k_n), we have the space of such embeddings, such that for each i = 1 \dots n one has k_i special points t_{i,j} along the i^{th} coordinate axis. These are the ways of breaking down a given cobordism into a composite of k_i +1 pieces. Again, one has to make sure that these critical points of the Morse functions defined by the projections onto these coordinate axes avoid these special t_{i,j} which define the manifolds where gluing takes place. The composition maps which make these into a simplical object are quite natural – they just come by deleting special points.

Finally, we take a limit over all V (to get around limits to embeddings due to the dimension of V). So we know (at least abstractly) what the (\infty,n)-category of cobordisms should be. The cobordism hypothesis claims it is equivalent to one defined in a free, algebraically-flavoured way, namely as the free symmetric monoidal (\infty,n)-category on a fully-dualizable object. (That object is “the point” – which, up to the kind of homotopically-flavoured equivalence that matters here, is the only object when our highest-dimensional cobordisms have dimension n).

Dualizability

So what does that mean, a “fully dualizable object”?

First, to get the idea, let’s think of the 1-dimensional example.  Instead of “(\infty,n)-category”, we would like to just think of this as a statement about a category.  Then Bord^{fr}_1(1) is the 1-category of framed bordisms. For a manifold (or cobordism, which is a manifold with boundary), a framing is a trivialization of the tangent bundle.  That is, it amounts to a choice of isomorphism at each point between the tangent space there and the corresponding \mathbb{R}^n.  So the objects of Bord^{fr}_1(1) are collections of (signed) points, and the morphisms are equivalence classes of framed 1-dimensional cobordisms.  These amount to oriented 1-manifolds with boundary, where the points (objects) on the boundary are the source and target of the cobordism.

Now we want to classify what TQFT’s live on this category.  These are functors Z : Bord^{fr}_1(1).  We have two generating objects, + and -, the two signed points.  A TQFT must assign these objects vector spaces, which we’ll call V and W.  Collections of points get assigned tensor products of all the corresponding vector spaces, since the functor is monoidal, so knowing these two vector spaces determines what Z does to all objects.

What does Z do to morphisms?  Well, some generating morphsims of interest are cups and caps: these are lines which connect a positive to a negative point, but thought of as cobordisms taking two points to the empty set, and vice versa.  That is, we have an evaluation:This statement is what is generalized to say that n-dimensional TQFT’s are classified by “fully” dualizable objects.

ev: W \otimes V \rightarrow \mathbb{C}

and a coevaluation:

coev: \mathbb{C} \rightarrow V \otimes W

Now, since cobordisms are taken up to equivalence, which in particular includes topological deformations, we get a bunch of relations which these have to satisfy.  The essential one is the “zig-zag” identity, reflecting the fact that a bent line can be straightened out, and we have the same 1-morphism in Born^{fr}_1(1).  This implies that:

(ev \otimes id) \circ (id \otimes coev) : W \rightarrow W \otimes V \otimes W \rightarrow W

is the same as the identity.  This in turn means that the evaluation and coevaluation maps define a nondegenerate pairing between V and W.  The fact that this exists means two things.  First, W is the dual of V: W \cong V*.  Second, this only makes sense if both V and its dual are finite dimensional (since the evaluation will just be the trace map, which is not even defined on the identity if V is infinite dimensional).

On the other hand, once we know, V, this determines W \cong V* up to isomorphism, as well as the evaluation and coevaluation maps.  In fact, this turns out to be enough to specify Z entirely.  The classification then is: 1-D TQFT’s are classified by finite-dimensional vector spaces V.  Crucially, what made finiteness important is the existence of the dual V* and the (co)evaluation maps which express the duality.

In an (\infty,n)-category, to say that an object is “fully dualizable” means more that the object has a dual (which, itself, implies the existence of the morphisms ev and coev). It also means that ev and coev have duals themselves – or rather, since we’re talking about morphisms, “adjoints”. This in turn implies the existence of 2-morphisms which are the unit and counit of the adjunctions (the defining properties are essentially the same as those for morphisms which define a dual). In fact, every time we get a morphism of degree less than n in this process, “fully dualizable” means that it too must have a dual (i.e. an adjoint).

This does run out eventually, though, since we only require this goes up to dimension (n-1): the n-morphisms which this forces to exist (quite a few) aren’t required to have duals. This is good, because if they were, since all the higher morphisms available are invertible, this would mean that the dual n-morphisms would actually be weak inverses (that is, their composite is isomorphic to the identity)… But that would mean that the dual (n-1)-morphisms which forced them to exist would also be weak inverses (their composite would be weakly isomorphic to the identity)… and so on! In fact, if the property of “having duals” didn’t stop, then everything would be weakly invertible: we’d actually have a (weak) \infty-groupoid!

Classifying TQFT

So finally, the point of the Cobordism Hypothesis is that a (fully extended) TQFT is a functor Z out of this nBord^{fr}_n(n) into some target (\infty,1)-category \mathcal{C}. There are various options, but whatever we pick, the functor must assign something in \mathcal{C} to the point, say Z(pt), and something to each of ev and coev, as well as all the higher morphisms which must exist. Then functoriality means that all these images have to again satisfy the properties which make Z(pt) a fully dualizable object. Furthermore, since nBord^{fr}_n(n) is the free gadget with all these properties on the single object pt, this is exactly what it means that Z is a functor. Saying that Z(pt) is fully dualizable, by implication, includes all the choices of morphisms like Z(ev) etc. which show it as fully dualizable. (Conceivably one could make the same object fully dualizable in more than one way – these would be different functors).

So an extended n-dimensional TQFT is exactly the choice of a fully dualizable object Z(pt) \in \mathcal{C}, for some (\infty,n)-category \mathcal{C}. This object is “what the TQFT assigns to a point”, but if we understand the structure of the object as a fully dualizable object, then we know what the TQFT assigns to any other manifold of any dimension up to n, the highest dimension in the theory. This is how this algebraic characterization of cobordisms helps to classify such theories.

Hamburg

Since I moved to Hamburg,   Alessandro Valentino and I have been organizing one series of seminar talks whose goal is to bring people (mostly graduate students, and some postdocs and others) up to speed on the tools used in Jacob Lurie’s big paper on the classification of TQFT and proof of the Cobordism Hypothesis.  This is part of the Forschungsseminar (“research seminar”) for the working groups of Christoph Schweigert, Ingo Runkel, and Christoph Wockel.  First, I gave one introducing myself and what I’ve done on Extended TQFT. In our main series We’ve had a series of four so far – two in which Alessandro outlined a sketch of what Lurie’s result is, and another two by Sebastian Novak and Marc Palm that started catching our audience up on the simplicial methods used in the theory of (\infty,n)-categories which it uses.  Coming up in the New Year, Nathan Bowler and I will be talking about first (\infty,1)-categories, and then (\infty,n)-categories.   I’ll do a few posts summarizing the talks around then.

Some people in the group have done some work on quantum field theories with defects, in relation to which, there’s this workshop coming up here in February!  The idea here is that one could have two regions of space where different field theories apply, which are connected along a boundary. We might imagine these are theories which are different approximations to what’s going on physically, with a different approximation useful in each region.  Whatever the intuition, the regions will be labelled by some category, and boundaries between regions are labelled by functors between categories.  Where different boundary walls meet, one can have natural transformations.  There’s a whole theory of how a 3D TQFT can be associated to modular tensor categories, in sort of the same sense that a 2D TQFT is associated to a Frobenius algebra. This whole program is intimately connected with the idea of “extending” a given TQFT, in the sense that it deals with theories that have inputs which are spaces (or, in the case of defects, sub-spaces of given ones) of many different dimensions.  Lurie’s paper describing the n-dimensional cobordism category, is very much related to the input to a theory like this.

Brno Visit

This time, I’d like to mention something which I began working on with Roger Picken in Lisbon, and talked about for the first time in Brno, Czech Republic, where I was invited to visit at Masaryk University.  I was in Brno for a week or so, and on Thursday, December 13, I gave this talk, called “Higher Gauge Theory and 2-Group Actions”.  But first, some pictures!

This fellow was near the hotel I stayed in:

Image

Since this sculpture is both faceless and hard at work on nonspecific manual labour, I assume he’s a Communist-era artwork, but I don’t really know for sure.

The Christmas market was on in Náměstí Svobody (Freedom Square) in the centre of town.  This four-headed dragon caught my eye:

Image

On the way back from Brno to Hamburg, I met up with my wife to spend a couple of days in Prague.  Here’s the Christmas market in the Old Town Square of Prague:

Image

Anyway, it was a good visit to the Czech Republic.  Now, about the talk!

Moduli Spaces in Higher Gauge Theory

The motivation which I tried to emphasize is to define a specific, concrete situation in which to explore the concept of “2-Symmetry”.  The situation is supposed to be, if not a realistic physical theory, then at least one which has enough physics-like features to give a good proof of concept argument that such higher symmetries should be meaningful in nature.  The idea is that Higher Gauge theory is a field theory which can be understood as one in which the possible (classical) fields on a space/spacetime manifold consist of maps from that space into some target space X.  For the topological theory, they are actually just homotopy classes of maps.  This is somewhat related to Sigma models used in theoretical physics, and mathematically to Homotopy Quantum Field Theory, which considers these maps as geometric structure on a manifold.  An HQFT is a functor taking such structured manifolds and cobordisms into Hilbert spaces and linear maps.  In the paper Roger and I are working on, we don’t talk about this stage of the process: we’re just considering how higher-symmetry appears in the moduli spaces for fields of this kind, which we think of in terms of Higher Gauge Theory.

Ordinary topological gauge theory – the study of flat connections on G-bundles for some Lie group G, can be looked at this way.  The target space X = BG is the “classifying space” of the Lie group – homotopy classes of maps in Hom(M,BG) are the same as groupoid homomorphisms in Hom(\Pi_1(M),G).  Specifically, the pair of functors \Pi_1 and B relating groupoids and topological spaces are adjoints.  Now, this deals with the situation where X = BG is a homotopy 1-type, which is to say that it has a fundamental groupoid \Pi_1(X) = G, and no other interesting homotopy groups.  To deal with more general target spaces X, one should really deal with infinity-groupoids, which can capture the whole homotopy type of X – in particular, all its higher homotopy groups at once (and various relations between them).  What we’re talking about in this paper is exactly one step in that direction: we deal with 2-groupoids.

We can think of this in terms of maps into a target space X which is a 2-type, with nontrivial fundamental groupoid \Pi_1(X), but also interesting second homotopy group \pi_2(X) (and nothing higher).  These fit together to make a 2-groupoid \Pi_2(X), which is a 2-group if X is connected.  The idea is that X is the classifying space of some 2-group \mathcal{G}, which plays the role of the Lie group G in gauge theory.  It is the “gauge 2-group”.  Homotopy classes of maps into X = B \mathcal{G} correspond to flat connections in this 2-group.

For practical purposes, we use the fact that there are several equivalent ways of describing 2-groups.  Two very directly equivalent ways to define them are as group objects internal to \mathbf{Cat}, or as categories internal to \mathbf{Grp} – which have a group of objects and a group of morphisms, and group homomorphisms that define source, target, composition, and so on.  This second way is fairly close to the equivalent formulation as crossed modules (G,H,\rhd,\partial).  The definition is in the slides, but essentially the point is that G is the group of objects, and with the action G \rhd H, one gets the semidirect product G \ltimes H which is the group of morphisms.  The map \partial : H \rightarrow G makes it possible to speak of G and H acting on each other, and that these actions “look like conjugation” (the precise meaning of which is in the defining properties of the crossed module).

The reason for looking at the crossed-module formulation is that it then becomes fairly easy to understand the geometric nature of the fields we’re talking about.  In ordinary gauge theory, a connection can be described locally as a 1-form with values in Lie(G), the Lie algebra of G.  Integrating such forms along curves gives another way to describe the connection, in terms of a rule assigning to every curve a holonomy valued in G which describes how to transport something (generally, a fibre of a bundle) along the curve.  It’s somewhat nontrivial to say how this relates to the classic definition of a connection on a bundle, which can be described locally on “patches” of the manifold via 1-forms together with gluing functions where patches overlap.  The resulting categories are equivalent, though.

In higher gauge theory, we take a similar view. There is a local view of “connections on gerbes“, described by forms and gluing functions (the main difference in higher gauge theory is that the gluing functions related to higher cohomology).  But we will take the equivalent point of view where the connection is described by G-valued holonomies along paths, and H-valued holonomies over surfaces, for a crossed module (G,H,\rhd,\partial), which satisfy some flatness conditions.  These amount to 2-functors of 2-categories \Pi_2(M) \rightarrow \mathcal{G}.

The moduli space of all such 2-connections is only part of the story.  2-functors are related by natural transformations, which are in turn related by “modifications”.  In gauge theory, the natural transformations are called “gauge transformations”, and though the term doesn’t seem to be in common use, the obvious term for the next layer would be “gauge modifications”. It is possible to assemble a 2-groupoid Hom(\Pi_2(M),\mathcal{G}, whose space of objects is exactly the moduli space of 2-connections, and whose 1- and 2-morphisms are exactly these gauge transformations and modifications.  So the question is, what is the meaning of the extra information contained in the 2-groupoid which doesn’t appear in the moduli space itself?

Our claim is that this information expresses how the moduli space carries “higher symmetry”.

2-Group Actions and the Transformation Double Category

What would it mean to say that something exhibits “higher” symmetry? A rudimentary way to formalize the intuition of “symmetry” is to say that there is a group (of “symmetries”) which acts on some object. One could get more subtle, but this should be enough to begin with. We already noted that “higher” gauge theory uses 2-groups (and beyond into n-groups) in the place of ordinary groups.  So in this context, the natural way to interpret it is by saying that there is an action of a 2-group on something.

Just as there are several equivalent ways to define a 2-group, there are different ways to say what it means for it to have an action on something.  One definition of a 2-group is to say that it’s a 2-category with one object and all morphisms and 2-morphisms invertible.  This definition makes it clear that a 2-group has to act on an object of some 2-category \mathcal{C}. For our purposes, just as we normally think of group actions on sets, we will focus on 2-group actions on categories, so that \mathcal{C} = \mathbf{Cat} is the 2-category of interest. Then an action is just a map:

\Phi : \mathcal{G} \rightarrow \mathbf{Cat}

The unique object of \mathcal{G} – let’s call it \star, gets taken to some object \mathbf{C} = \Phi(\star) \in \mathbf{Cat}.  This object \mathbf{C} is the thing being “acted on” by \mathcal{G}.  The existence of the action implies that there are automorphisms \Phi(g) : \mathbf{C} \rightarrow \mathbf{C} for every morphism in \mathbf{G} (which correspond to the elements of the group G of the crossed module).  This would be enough to describe ordinary symmetry, but the higher symmetry is also expressed in the images of 2-morphisms \Phi( \eta : g \rightarrow g') = \Phi(\eta) : \Phi(g) \rightarrow \Phi(g'), which we might call 2-symmetries relating 1-symmetries.

What we want to do in our paper, which the talk summarizes, is to show how this sort of 2-group action gives rise to a 2-groupoid (actually, just a 2-category when the \mathbf{C} being acted on is a general category).  Then we claim that the 2-groupoid of connections can be seen as one that shows up in exactly this way.  (In the following, I have to give some credit to Dany Majard for talking this out and helping to find a better formalism.)

To make sense of this, we use the fact that there is a diagrammatic way to describe the transformation groupoid associated to the action of a group G on a set S.  The set of morphisms is built as a pullback of the action map, \rhd : (g,s) \mapsto g(s).

pullback

This means that morphisms are pairs (g,s), thought of as going from s to g(s).  The rule for composing these is another pullback.  The diagram which shows how it’s done appears in the slides.  The whole construction ends up giving a cubical diagram in \mathbf{Sets}, whose top and bottom faces are mere commuting diagrams, and whose four other faces are all pullback squares.

To construct a 2-category from a 2-group action is similar. For now we assume that the 2-group action is strict (rather than being given by \Phi a weak 2-functor).  In this case, it’s enough to think of our 2-group \mathcal{G} not as a 2-category, but as a group-object in \mathbf{Cat} – the same way that a 1-group, as well as being a category, can be seen as a group object in \mathbf{Set}.  The set of objects of this category is the group G of morphisms of the 2-category, and the morphisms make up the group G \ltimes H of 2-morphisms.  Being a group object is the same as having all the extra structure making up a 2-group.

To describe a strict action of such a \mathcal{G} on \mathbf{C}, we just reproduce in \mathbf{Cat} the diagram that defines an action in \mathbf{Sets}:

action

The fact that \rhd is an action just means this commutes. In principle, we could define a weak action, which would mean that this commutes up to isomorphism, but we won’t be looking at that here.

Constructing the same diagram which describes the structure of a transformation groupoid (p29 in the slides for the talk), we get a structure with a “category of objects” and a “category of morphisms”.  The construction in \mathbf{Set} gives us directly a set of morphisms, while S itself is the set of objects. Similarly, in \mathbf{Cat}, the category of objects is just \mathbf{C}, while the construction gives a category of morphisms.

The two together make a category internal to \mathbf{Cat}, which is to say a double category.  By analogy with S / \!\! / G, we call this double category \mathbf{C} / \!\! / \mathcal{G}.

We take \mathbf{C} as the category of objects, as the “horizontal category”, whose morphisms are the horizontal arrows of the double category. The category of morphisms of \mathbf{C} /\!\!/ \mathcal{G} shows up by letting its objects be the vertical arrows of the double category, and its morphisms be the squares.  These look like this:

squares

The vertical arrows are given by pairs of objects (\gamma, x), and just like the transformation 1-groupoid, each corresponds to the fact that the action of \gamma takes x to \gamma \rhd x. Each square (morphism in the category of morphisms) is given by a pair ( (\gamma, \eta), f) of morphisms, one from \mathcal{G} (given by an element in G \rtimes H), and one from \mathbf{C}.

The horizontal arrow on the bottom of this square is:

(\partial \eta) \gamma \rhd f \circ \Phi(\gamma,\eta)_x = \Phi(\gamma,\eta)_y \circ \gamma \rhd f

The fact that these are equal is exactly the fact that \Phi(\gamma,\eta) is a natural transformation.

The double category \mathbf{C} /\!\!/ \mathcal{G} turns out to have a very natural example which occurs in higher gauge theory.

Higher Symmetry of the Moduli Space

The point of the talk is to show how the 2-groupoid of connections, previously described as Hom(\Pi_2(M),\mathcal{G}), can be seen as coming from a 2-group action on a category – the objects of this category being exactly the connections. In the slides above, for various reasons, we did this in a discretized setting – a manifold with a decomposition into cells. This is useful for writing things down explicitly, but not essential to the idea behind the 2-symmetry of the moduli space.

The point is that there is a category we call \mathbf{Conn}, whose objects are the connections: these assign G-holonomies to edges of our discretization (in general, to paths), and H-holonomies to 2D faces. (Without discretization, one would describe these in terms of Lie(G)-valued 1-forms and Lie(H)-valued 2-forms.)

The morphisms of \mathbf{Conn} are one type of “gauge transformation”: namely, those which assign H-holonomies to edges. (Or: Lie(H)-valued 1-forms). They affect the edge holonomies of a connection just like a 2-morphism in \mathcal{G}.  Face holonomies are affected by the H-value that comes from the boundary of the face.

What’s physically significant here is that both objects and morphisms of \mathbf{Conn} describe nonlocal geometric information.  They describe holonomies over edges and surfaces: not what happens at a point.  The “2-group of gauge transformations”, which we call \mathbf{Gauge}, on the other hand, is purely about local transformations.  If V is the vertex set of the discretized manifold, then \mathbf{Gauge} = \mathcal{G}^V: one copy of the gauge 2-group at each vertex.  (Keeping this finite dimensional and avoiding technical details was one main reason we chose to use a discretization.  In principle, one could also talk about the 2-group of \mathcal{G}-valued functions, whose objects and morphisms, thinking of it as a group object in \mathbf{Cat}, are functions valued in morphisms of \mathcal{G}.)

Now, the way \mathbf{Gauge} acts on \mathbf{Conn} is essentially by conjugation: edge holonomies are affected by pre- and post-multiplication by the values at the two vertices on the edge – whether objects or morphisms of \mathbf{Gauge}.  (Face holonomies are unaffected).  There are details about this in the slides, but the important thing is that this is a 2-group of purely local changes.  The objects of \mathbf{Gauge} are gauge transformations of this other type.  In a continuous setting, they would be described by G-valued functions.  The morphisms are gauge modifications, and could be described by H-valued functions.

The main conceptual point here is that we have really distinguished between two kinds of gauge transformation, which are the horizontal and vertical arrows of the double category \mathbf{Conn} /\!\!/ \mathbf{Gauge}.  This expresses the 2-symmetry by moving some gauge transformations into the category of connections, and others into the 2-group which acts on it.  But physically, we would like to say that both are “gauge transformations”.  So one way to do this is to “collapse” the double category to a bicategory: just formally allow horizontal and vertical arrows to compose, so that there is only one kind of arrow.  Squares become 2-cells.

So then if we collapse the double category expressing our 2-symmetry relation this way, the result is exactly equivalent to the functor category way of describing connections.  (The morphisms will all be invertible because \mathbf{Conn} is a groupoid and \mathbf{Gauge} is a 2-group).

I’m interested in this kind of geometrical example partly because it gives a good way to visualize something new happening here.  There appears to be some natural 2-symmetry on this space of fields, which is fairly easy to see geometrically, and distinguishes in a fundamental way between two types of gauge transformation.  This sort of phenomenon doesn’t occur in the world of \mathbf{Sets} – a set S has no morphisms, after all, so the transformation groupoid for a group action on it is much simpler.

In broad terms, this means that 2-symmetry has qualitatively new features that familiar old 1-symmetry doesn’t have.  Higher categorical versions – n-groups acting on n-groupoids, as might show up in more complicated HQFT – will certainly be even more complicated.  The 2-categorical version is just the first non-trivial situation where this happens, so it gives a nice starting point to understand what’s new in higher symmetry that we didn’t already know.

Since the last post, I’ve been busily attending some conferences, as well as moving to my new job at the University of Hamburg, in the Graduiertenkolleg 1670, “Mathematics Inspired by String Theory and Quantum Field Theory”.  The week before I started, I was already here in Hamburg, at the conference they were organizing “New Perspectives in Topological Quantum Field Theory“.  But since I last posted, I was also at the 20th Oporto Meeting on Geometry, Topology, and Physics, as well as the third Higher Structures in China workshop, at Jilin University in Changchun.  Right now, I’d like to say a few things about some of the highlights of that workshop.

Higher Structures in China III

So last year I had a bunch of discussions I had with Chenchang Zhu and Weiwei Pan, who at the time were both in Göttingen, about my work with Jamie Vicary, which I wrote about last time when the paper was posted to the arXiv.  In that, we showed how the Baez-Dolan groupoidification of the Heisenberg algebra can be seen as a representation of Khovanov’s categorification.  Chenchang and Weiwei and I had been talking about how these ideas might extend to other examples, in particular to give nice groupoidifications of categorified Lie algebras and quantum groups.

That is still under development, but I was invited to give a couple of talks on the subject at the workshop.  It was a long trip: from Lisbon, the farthest-west of the main cities of (continental) Eurasia all the way to one of the furthest-East.   (Not quite the furthest, but Changchun is in the northeast of China, just a few hours north of Korea, and it took just about exactly 24 hours including stopovers to get there).  It was a long way to go for a three day workshop, but as there were also three days of a big excursion to Changbai Mountain, just on the border with North Korea, for hiking and general touring around.  So that was a sort of holiday, with 11 other mathematicians.  Here is me with Dany Majard, in a national park along the way to the mountains:

Here’s me with Alex Hoffnung, on Changbai Mountain (in the background is China):

And finally, here’s me a little to the left of the previous picture, where you can see into the volcanic crater.  The lake at the bottom is cut out of the picture, but you can see the crater rim, of which this particular part is in North Korea, as seen from China:

Well, that was fun!

Anyway, the format of the workshop involved some talks from foreigners and some from locals, with a fairly big local audience including a good many graduate students from Jilin University.  So they got a chance to see some new work being done elsewhere – mostly in categorification of one kind or another.  We got a chance to see a little of what’s being done in China, although not as much as we might have. I gather that not much is being done yet that fit the theme of the workshop, which was part of the reason to organize the workshop, and especially for having a session aimed specially at the graduate students.

Categorified Algebra

This is a sort of broad term, but certainly would include my own talk.  The essential point is to show how the groupoidification of the Heisenberg algebra is a representation of Khovanov’s categorification of the same algebra, in a particular 2-category.  The emphasis here is on the fact that it’s a representation in a 2-category whose objects are groupoids, but whose morphisms aren’t just functors, but spans of functors – that is, composites of functors and co-functors.  This is a pretty conservative weakening of “representations on categories” – but it lets one build really simple combinatorial examples.  I’ve discussed this general subject in recent posts, so I won’t elaborate too much.  The lecture notes are here, if you like, though – they have more detail than my previous post, but are less technical than the paper with Jamie Vicary.

Aaron Lauda gave a nice introduction to the program of categorifying quantum groups, mainly through the example of the special case U_q(sl_2), somewhat along the same lines as in his introductory paper on the subject.  The story which gives the motivation is nice: one has knot invariants such as the Jones polynomial, based on representations of groups and quantum groups.  The Jones polynomial can be categorified to give Khovanov homology (which assigns a complex to a knot, whose graded Euler characteristic is the Jones polynomial) – but also assigns maps of complexes to cobordisms of knots.  One then wants to categorify the representation theory behind it – to describe actions of, for instance, quantum sl_2 on categories.  This starting point is nice, because it can work by just mimicking the construction of sl_2 and U_q(sl_2) representations in terms of weight spaces: one gets categories V_{-N}, \dots, V_N which correspond to the “weight spaces” (usually just vector spaces), and the E and F operators give functors between them, and so forth.

Finding examples of categories and functors with this structure, and satisfying the right relations, gives “categorified representations” of the algebra – the monoidal categories of diagrams which are the “categorifications of the algebra” then are seen as the abstraction of exactly which relations these are supposed to satisfy.  One such example involves flag varieties.  A flag, as one might eventually guess from the name, is a nested collection of subspaces in some n-dimensional space.  A simple example is the Grassmannian Gr(1,V), which is the space of all 1-dimensional subspaces of V (i.e. the projective space P(V)), which is of course an algebraic variety.  Likewise, Gr(k,V), the space of all k-dimensional subspaces of V is a variety.  The flag variety Fl(k,k+1,V) consists of all pairs W_k \subset W_{k+1}, of a k-dimensional subspace of V, inside a (k+1)-dimensional subspace (the case k=2 calls to mind the reason for the name: a plane intersecting a given line resembles a flag stuck to a flagpole).  This collection is again a variety.  One can go all the way up to the variety of “complete flags”, Fl(1,2,\dots,n,V) (where V is n-dimenisonal), any point of which picks out a subspace of each dimension, each inside the next.

The way this relates to representations is by way of geometric representation theory. One can see those flag varieties of the form Fl(k,k+1,V) as relating the Grassmanians: there are projections Fl(k,k+1,V) \rightarrow Gr(k,V) and Fl(k,k+1,V) \rightarrow Gr(k+1,V), which act by just ignoring one or the other of the two subspaces of a flag.  This pair of maps, by way of pulling-back and pushing-forward functions, gives maps between the cohomology rings of these spaces.  So one gets a sequence H_0, H_1, \dots, H_n, and maps between the adjacent ones.  This becomes a representation of the Lie algebra.  Categorifying this, one replaces the cohomology rings with derived categories of sheaves on the flag varieties – then the same sort of “pull-push” operation through (derived categories of sheaves on) the flag varieties defines functors between those categories.  So one gets a categorified representation.

Heather Russell‘s talk, based on this paper with Aaron Lauda, built on the idea that categorified algebras were motivated by Khovanov homology.  The point is that there are really two different kinds of Khovanov homology – the usual kind, and an Odd Khovanov Homology, which is mainly different in that the role played in Khovanov homology by a symmetric algebra is instead played by an exterior (antisymmetric) algebra.  The two look the same over a field of characteristic 2, but otherwise different.  The idea is then that there should be “odd” versions of various structures that show up in the categorifications of U_q(sl_2) (and other algebras) mentioned above.

One example is the fact that, in the “even” form of those categorifications, there is a natural action of the Nil Hecke algebra on composites of the generators.  This is an algebra which can be seen to act on the space of polynomials in n commuting variables, \mathbb{C}[x_1,\dots,x_n], generated by the multiplication operators x_i, and the “divided difference operators” based on the swapping of two adjacent variables.  The Hecke algebra is defined in terms of “swap” generators, which satisfy some q-deformed variation of the relations that define the symmetric group (and hence its group algebra).   The Nil Hecke algebra is so called since the “swap” (i.e. the divided difference) is nilpotent: the square of the swap is zero.  The way this acts on the objects of the diagrammatic category is reflected by morphisms drawn as crossings of strands, which are then formally forced to satisfy the relations of the Nil Hecke algebra.

The ODD Nil Hecke algebra, on the other hand, is an analogue of this, but the x_i are anti-commuting, and one has different relations satisfied by the generators (they differ by a sign, because of the anti-commutation).  This sort of “oddification” is then supposed to happen all over.  The main point of the talk was to to describe the “odd” version of the categorified representation defined using flag varieties.  Then the odd Nil Hecke algebra acts on that, analogously to the even case above.

Marco Mackaay gave a couple of talks about the sl_3 web algebra, describing the results of this paper with Weiwei Pan and Daniel Tubbenhauer.  This is the analog of the above, for U_q(sl_3), describing a diagram calculus which accounts for representations of the quantum group.  The “web algebra” was introduced by Greg Kuperberg – it’s an algebra built from diagrams which can now include some trivalent vertices, along with rules imposing relations on these.  When categorifying, one gets a calculus of “foams” between such diagrams.  Since this is obviously fairly diagram-heavy, I won’t try here to reproduce what’s in the paper – but an important part of is the correspondence between webs and Young Tableaux, since these are labels in the representation theory of the quantum group – so there is some interesting combinatorics here as well.

Algebraic Structures

Some of the talks were about structures in algebra in a more conventional sense.

Jiang-Hua Lu: On a class of iterated Poisson polynomial algebras.  The starting point of this talk was to look at Poisson brackets on certain spaces and see that they can be found in terms of “semiclassical limits” of some associative product.  That is, the associative product of two elements gives a power series in some parameter h (which one should think of as something like Planck’s constant in a quantum setting).  The “classical” limit is the constant term of the power series, and the “semiclassical” limit is the first-order term.  This gives a Poisson bracket (or rather, the commutator of the associative product does).  In the examples, the spaces where these things are defined are all spaces of polynomials (which makes a lot of explicit computer-driven calculations more convenient). The talk gives a way of constructing a big class of Poisson brackets (having some nice properties: they are “iterated Poisson brackets”) coming from quantum groups as semiclassical limits.  The construction uses words in the generating reflections for the Weyl group of a Lie group G.

Li Guo: Successors and Duplicators of Operads – first described a whole range of different algebra-like structures which have come up in various settings, from physics and dynamical systems, through quantum field theory, to Hopf algebras, combinatorics, and so on.  Each of them is some sort of set (or vector space, etc.) with some number of operations satisfying some conditions – in some cases, lots of operations, and even more conditions.  In the slides you can find several examples – pre-Lie and post-Lie algebras, dendriform algebras, quadri- and octo-algebras, etc. etc.  Taken as a big pile of definitions of complicated structures, this seems like a terrible mess.  The point of the talk is to point out that it’s less messy than it appears: first, each definition of an algebra-like structure comes from an operad, which is a formal way of summing up a collection of operations with various “arities” (number of inputs), and relations that have to hold.  The second point is that there are some operations, “successor” and “duplicator”, which take one operad and give another, and that many of these complicated structures can be generated from simple structures by just these two operations.  The “successor” operation for an operad introduces a new product related to old ones – for example, the way one can get a Lie bracket from an associative product by taking the commutator.  The “duplicator” operation takes existing products and introduces two new products, whose sum is the previous one, and which satisfy various nice relations.  Combining these two operations in various ways to various starting points yields up a plethora of apparently complicated structures.

Dany Majard gave a talk about algebraic structures which are related to double groupoids, namely double categories where all the morphisms are invertible.  The first part just defined double categories: graphically, one has horizontal and vertical 1-morphisms, and square 2-morphsims, which compose in both directions.  Then there are several special degenerate cases, in the same way that categories have as degenerate cases (a) sets, seen as categories with only identity morphisms, and (b) monoids, seen as one-object categories.  Double categories have ordinary categories (and hence monoids and sets) as degenerate cases.  Other degenerate cases are 2-categories (horizontal and vertical morphisms are the same thing), and therefore their own special cases, monoidal categories and symmetric monoids.  There is also the special degenerate case of a double monoid (and the extra-special case of a double group).  (The slides have nice pictures showing how they’re all degenerate cases).  Dany then talked about some structure of double group(oids) – and gave a list of properties for double groupoids, (such as being “slim” – having at most one 2-cell per boundary configuration – as well as two others) which ensure that they’re equivalent to the semidirect product of an abelian group with the “bicrossed product”  H \bowtie K of two groups H and K (each of which has to act on the other for this to make sense).  He gave the example of the Poincare double group, which breaks down as a triple bicrossed product by the Iwasawa decomposition:

Poinc = (SO(3) \bowtie (SO(1; 1) \bowtie N)) \ltimes \mathbb{R}_4

(N is certain group of matrices).  So there’s a unique double group which corresponds to it – it has squares labelled by \mathbb{R}_4, and the horizontial and vertical morphisms by elements of SO(3) and N respectively.  Dany finished by explaining that there are higher-dimensional analogs of all this – n-tuple categories can be defined recursively by internalization (“internal categories in (n-1)-tuple-Cat”).  There are somewhat more sophisticated versions of the same kind of structure, and finally leading up to a special class of n-tuple groups.  The analogous theorem says that a special class of them is just the same as the semidirect product of an abelian group with an n-fold iterated bicrossed product of groups.

Also in this category, Alex Hoffnung talked about deformation of formal group laws (based on this paper with various collaborators).  FGL’s are are structures with an algebraic operation which satisfies axioms similar to a group, but which can be expressed in terms of power series.  (So, in particular they have an underlying ring, for this to make sense).  In particular, the talk was about formal group algebras – essentially, parametrized deformations of group algebras – and in particular for Hecke Algebras.  Unfortunately, my notes on this talk are mangled, so I’ll just refer to the paper.

Physics

I’m using the subject-header “physics” to refer to those talks which are most directly inspired by physical ideas, though in fact the talks themselves were mathematical in nature.

Fei Han gave a series of overview talks intorducing “Equivariant Cohomology via Gauged Supersymmetric Field Theory”, explaining the Stolz-Teichner program.  There is more, using tools from differential geometry and cohomology to dig into these theories, but for now a summary will do.  Essentially, the point is that one can look at “fields” as sections of various bundles on manifolds, and these fields are related to cohomology theories.  For instance, the usual cohomology of a space X is a quotient of the space of closed forms (so the k^{th} cohomology, H^{k}(X) = \Omega^{k}, is a quotient of the space of closed k-forms – the quotient being that forms differing by a coboundary are considered the same).  There’s a similar construction for the K-theory K(X), which can be modelled as a quotient of the space of vector bundles over X.  Fei Han mentioned topological modular forms, modelled by a quotient of the space of “Fredholm bundles” – bundles of Banach spaces with a Fredholm operator around.

The first two of these examples are known to be related to certain supersymmetric topological quantum field theories.  Now, a TFT is a functor into some kind of vector spaces from a category of (n-1)-dimensional manifolds and n-dimensional cobordisms

Z : d-Bord \rightarrow Vect

Intuitively, it gives a vector space of possible fields on the given space and a linear map on a given spacetime.  A supersymmetric field theory is likewise a functor, but one changes the category of “spacetimes” to have both bosonic and fermionic dimension.  A normal smooth manifold is a ringed space (M,\mathcal{O}), since it comes equipped with a sheaf of rings (each open set has an associated ring of smooth functions, and these glue together nicely).  Supersymmetric theories work with manifolds which change this sheaf – so a d|\delta-dimensional space has the sheaf of rings where one introduces some new antisymmetric coordinate functions \theta_i, the “fermionic dimensions”:

\mathcal{O}(U) = C^{\infty}(U) \otimes \bigwedge^{\ast}[\theta_1,\dots,\theta_{\delta}]

Then a supersymmetric TFT is a functor:

E : (d|\delta)-Bord \rightarrow STV

(where STV is the category of supersymmetric topological vector spaces – defined similarly).  The connection to cohomology theories is that the classes of such field theories, up to a notion of equivalence called “concordance”, are classified by various cohomology theories.  Ordinary cohomology corresponds then to 0|1-dimensional extended TFT (that is, with 0 bosonic and 1 fermionic dimension), and K-theory to a 1|1-dimensional extended TFT.  The Stoltz-Teichner Conjecture is that the third example (topological modular forms) is related in the same way to a 2_1-dimensional extended TFT – so these are the start of a series of cohomology theories related to various-dimension TFT’s.

Last but not least, Chris Rogers spoke about his ideas on “Higher Geometric Quantization”, on which he’s written a number of papers.  This is intended as a sort of categorification of the usual ways of quantizing symplectic manifolds.  I am still trying to catch up on some of the geometry This is rooted in some ideas that have been discussed by Brylinski, for example.  Roughly, the message here is that “categorification” of a space can be thought of as a way of acting on the loop space of a space.  The point is that, if points in a space are objects and paths are morphisms, then a loop space L(X) shifts things by one categorical level: its points are loops in X, and its paths are therefore certain 2-morphisms of X.  In particular, there is a parallel to the fact that a bundle with connection on a loop space can be thought of as a gerbe on the base space.  Intuitively, one can “parallel transport” things along a path in the loop space, which is a surface given by a path of loops in the original space.  The local description of this situation says that a 1-form (which can give transport along a curve, by integration) on the loop space is associated with a 2-form (giving transport along a surface) on the original space.

Then the idea is that geometric quantization of loop spaces is a sort of higher version of quantization of the original space. This “higher” version is associated with a form of higher degree than the symplectic (2-)form used in geometric quantization of X.   The general notion of n-plectic geometry, where the usual symplectic geometry is the case n=1, involves a (n+1)-form analogous to the usual symplectic form.  Now, there’s a lot more to say here than I properly understand, much less can summarize in a couple of paragraphs.  But the main theorem of the talk gives a relation between n-plectic manifolds (i.e. ones endowed with the right kind of form) and Lie n-algebras built from the complex of forms on the manifold.  An important example (a theorem of Chris’ and John Baez) is that one has a natural example of a 2-plectic manifold in any compact simple Lie group G together with a 3-form naturally constructed from its Maurer-Cartan form.

At any rate, this workshop had a great proportion of interesting talks, and overall, including the chance to see a little more of China, was a great experience!

This blog has been on hiatus for a while, as I’ve been doing various other things, including spending some time in Hamburg getting set up for the move there. Another of these things has been working with Jamie Vicary on our project on the groupoidified Quantum Harmonic Oscillator (QHO for short). We’ve now put the first of two papers on the arXiv – this one is a relatively nonrigorous look at how this relates to categorification of the Heisenberg Algebra. Since John Baez is a high-speed blogging machine, he’s already beaten me to an overview of what the paper says, and there’s been some interesting discussion already. So I’ll try to say some different things about what it means, and let you take a look over there, or read the paper, for details.

I’ve given some talks about this project, but as we’ve been writing it up, it’s expanded considerably, including a lot of category-theoretic details which are going to be in the second paper in this series. But the basic point of this current paper is essentially visual and, in my opinion, fairly simple. The groupoidification of the QHO has a nice visual description, since it is all about the combinatorics of finite sets. This was described originally by Baez and Dolan, and in more detail in my very first paper. The other visual part here is the relation to Khovanov’s categorification of the Heisenberg algebra using a graphical calculus. (I wrote about this back when I first became aware of it.)

As a Representation

The scenario here actually has some common features with my last post. First, we have a monoidal category with duals, let’s say C presented in terms of some generators and relations. Then, we find some concrete model of this abstractly-presented monoidal category with duals in a specific setting, namely Span(Gpd).

Calling this “concrete” just refers to the fact that the objects in Span(Gpd) have some particular structure in terms of underlying sets and so on. By a “model” I just mean a functor C \rightarrow Span(Gpd) (“model” and “representation” mean essentially the same thing in this context). In fact, for this to make sense, I think of C as a 2-category with one object. Then a model is just some particular choices: a groupoid to represent the unique object, spans of groupoids to represent the generating morphisms, spans of spans to represent the generating 2-morphisms, all chosen so that the defining relations hold.

In my previous post, C was a category of cobordisms, but in this case, it’s essentially Khovanov’s monoidal category H' whose objects are (oriented) dots and whose morphisms are certain classes of diagrams. The nice fact about the particular model we get is that the reasons these relations hold are easy to see in terms of a the combinatorics of sets. This is why our title describes what we got as “a combinatorial representation” Khovanov’s category H' of diagrams, for which the ring of isomorphism classes of objects is the integral form of the algebra. This uses that Span(Gpd) is not just a monoidal category: it can be a monoidal 2-category. What’s more, the monoidal category H' “is” also a 2-category – with one object. The objects of H' are really the morphisms of this 2-category.

So H' is in some sense a universal theory (because it’s defined freely in terms of generators and relations) of what a categorification of the Heisenberg algebra must look like. Baez-Dolan groupoidification of the QHO then turns out to be a representation or model of it. In fact, the model is faithful, so that we can even say that it provides a combinatorial interpretation of that category.

The Combinatorial Model

Between the links above, you can find a good summary of the situation, so I’ll be a bit cursory. The model is described in terms of structures on finite sets. This is why our title calls this a “combinatorial representation” of Khovanov’s categorification.

This means that the one object of H (as a 2-category) is taken to the groupoid FinSet_0 of finite sets and bijections (which we just called S in the paper for brevity). This is the “Fock space” object. For simplicity, we can take an equivalent groupoid, which has just one n-element set for each n.

Now, a groupoid represents a system, whose possible configurations are the objects and whose symmetries are the morphisms. In this case, the possible configurations are the different numbers of “quanta”, and the symmetries (all set-bijections) show that all the quanta are interchangeable. I imagine a box containing some number of ping-pong balls.

A span of groupoids represents a process. It has a groupoid whose objects are histories (and morphisms are symmetries of histories). This groupoid has a pair of maps: to the system the process starts in, and to the system it ends in. In our model, the most important processes (which generate everything else) are the creation and annihilation operators, a^{\dagger} and a – and their categorified equivalents, A and A^{\dagger}. The spans that represent them are very simple: they are processes which put a new ball into the box, or take one out, respectively. (Algebraically, they’re just a way to organize all the inclusions of symmetric groups S_n \subset S_{n+1}.)

The “canonical commutation relation“, which we write without subtraction thus:

A A^{\dagger} = A^{\dagger} A + 1

is already understood in the Baez-Dolan story: it says that there is one more way to remove a ball from a box after putting a new one into it (one more history for the process A A^{\dagger}) than to remove a ball and then add a new one (histories for a^{\dagger} a). This is fairly obvious: in the first instance, you have one more to choose from when removing the ball.

But the original Baez-Dolan story has no interesting 2-morphisms (the actual diagrams which are the 1-morphisms in H), whereas these are absolutely the whole point of a categorification in the sense Khovanov gets one, since the 1-morphisms of H' determine what the isomorphism classes of objects even are.

So this means that we need to figure out what the 2-morphisms in Span(Gpd) need to be – first in general, and second in our particular representation of H.

In general, a 2-morphism in Span(Gpd) is a span of span-maps. You’ll find other people who take it to be a span-map. This would be a functor between the groupoids of histories: roughly, a map which assigns a history in the source span to a history in the target span (and likewise for symmetries), in a way that respects how they’re histories. But we don’t want just a map: we want a process which has histories of its own. We want to describe a “movie of processes” which change one process into another. These can have many histories of their own.

In fact, they’re not too complicated. Here’s one of Khovanov’s relation in H' which forms part of how the commutation relation is expressed (shuffled to get rid of negatives, which we constantly need to do in the combinatorial model since we have no negative sets):

We read an upward arrow as “add a ball to the box”, and a downward arrow as “remove a ball”, and read right-to-left.  Both processes begin and end with“add then remove”. The right-hand side just leaves this process alone: it’s the identity.

The left-hand side shows a process-movie whose histories have two different cases. Suppose we begin with a history for which we add x and then remove y. The first case is that x = y: we remove the same ball we put in. This amounts to doing nothing, so the first part of the movie eliminates all the adding and removing. The second part puts the add-remove pair back in.

The second case ensures that x \neq y, since it takes the initial history to the history (of a different process!) in which we remove y and then add x (impossible if y = x, since we can’t remove this ball before adding it). This in turn is taken to the history (of the original process!) where we add x and then remove y; so this relates every history to itself, except for the case that x = y. Overall the sum of these relations give the identity on histories, which is the right hand side.

This picture includes several of the new 2-morphisms that we need to add to the Baez-Dolan picture: swapping the order of two generators, and adding or removing a pair of add/remove operations. Finding spans of spans which accomplish this (and showing they satisfy the right relations) is all that’s needed to finish up the combinatorial model.  So, for instance, the span of spans which adds a “remove-then-add” pair is this one:

If this isn’t clear, well, it’s explained in more detail in the paper.  (Do notice, though, that this is a diagram in groupoids: we need to specify that there are identity 2-cells in the span, rather than some other 2-cells.)

So this is basically how the combinatorial model works.

Adjointness

But in fact this description is (as often happens) chronologically backwards: what actually happened was that we had worked out what the 2-morphisms should be for different reasons. While trying to to understand what kind of structure this produced, we realized (thanks to Marco Mackaay) that the result was related to H, which in turn shed more light on the 2-morphisms we’d found.

So far so good. But what makes it possible to represent the kind of monoidal category we’re talking about in this setting is adjointness. This is another way of saying what I meant up at the top by saying we start with a monoidal category with duals.  This means morphisms each have a partner – a dual, or adjoint – going in the opposite direction.  The representations of the raising and lowering operators of the Heisenberg algebra on the Hilbert space for the QHO are linear adjoints. Their categorifications also need to be adjoints in the sense of adjoint 1-morphisms in a 2-category.

This is an abstraction of what it means for two functors F and G to be adjoint. In particular, it means there have to be certain 2-cells such as the unit \eta : Id \Rightarrow G \circ F and counit \epsilon : F \circ G \Rightarrow Id satisfying some nice relations. In fact, this only makes F a left adjoint and G a right adjoint – in this situation, we also have another pair which makes F a right adjoint and G a left one. That is, they should be “ambidextrous adjoints”, or “ambiadjoints” for short. This is crucial if they’re going to represent any graphical calculus of the kind that’s involved here (see the first part of this paper by Aaron Lauda, for instance).

So one of the theorems in the longer paper will show concretely that any 1-morphism in Span(Gpd) has an ambiadjoint – which happens to look like the same span, but thought of as going in the reverse direction. This is somewhat like how the adjoint of a real linear map, expressed as a matrix relative to well-chosen bases, is just the transpose of the same matrix. In particular, A and A^{\dagger} are adjoints in just this way. The span-of-span-maps I showed above is exactly the unit for one side of this ambi-adjunction – but it is just a special case of something that will work for any span and its adjoint.

Finally, there’s something a little funny here. Since the morphisms of Span(Gpd) aren’t functors or maps, this combinatorial model is not exactly what people often mean by a “categorified representation”. That would be an action on a category in terms of functors and natural transformations. We do talk about how to get one of these on a 2-vector space out of our groupoidal representation toward the end.

In particular, this amounts to a functor into 2Vect – the objects of 2Vect being categories of a particular kind, and the morphisms being functors that preserve all the structure of those categories. As it turns out, the thing about this setting which is good for this purpose is that all those functors have ambiadjoints. The “2-linearization” that takes Span(Gpd) into 2Vect is a 2-functor, and this means that all the 2-cells and equations that make two morphisms ambiadjoints carry over. In 2Vect, it’s very easy for this to happen, since all those ambiadjoints are already present. So getting representations of categorified algebras that are made using these monoidal categories of diagrams on 2-vector spaces is fairly natural – and it agrees with the usual intuition about what “representation” means.

Anything I start to say about this is in danger of ballooning, but since we’re already some 40 pages into the second paper, I’ll save the elaboration for that…

I’ve written here before about building topological quantum field theories using groupoidification, but I haven’t yet gotten around to discussing a refinement of this idea, which is in the most recent version of my paper on the subject.  I also gave a talk about this last year in Erlangen. The main point of the paper is to pull apart some constructions which are already fairly well known into two parts, as part of setting up a category which is nice for supporting models of fairly general physical systems, using an extension of the  concept of groupoidification. So here’s a somewhat lengthy post which tries to unpack this stuff a bit.

Factoring TQFT

The older version of this paper talked about the untwisted version of the Dijkgraaf-Witten (DW for short) model, which is a certain kind of TQFT based on a gauge theory with a finite gauge group.  (Freed and Quinn put it as: “Chern-Simons theory with finite gauge group”).  The new version gets the general – that is, the twisted – form in the same way: factoring the theory into two parts. So, the DW model, which was originally described by Dijkgraaf and Witten in terms of a state-sum, is a functor

Z : 3Cob \rightarrow Vect

The “twisting” is the point of their paper, “Topological Gauge Theories and Group Cohomology”.  The twisting has to do with the action for some physical theory. Now, for a gauge theory involving flat connections, the kind of gauge-theory actions which involve the curvature of a connection make no sense: the curvature is zero.  So one wants an action which reflects purely global features of connections.  The cohomology of the gauge group is where this comes from.

Now, the machinery I describe is based on a point of view which has been described in a famous paper by Freed, Hopkins, Lurie and Teleman (FHLT for short – see further discussion here) in terms in which the two stages are called the “classical field theory” (which has values in groupoids), and the “quantization functor”, which takes one into Hilbert spaces.

Actually, we really want to have an “extended” TQFT: a TQFT gives a Hilbert space for each 2D manifold (“space”), and a linear map for a 3D cobordism (“spacetime”) between them. An extended TQFT will assign (higher) algebraic data to lower-dimension boundaries still.  My paper talks only about the case where we’ve extended down to codimension 2, whereas FHLT talk about extending “down to a point”. The point of this first stopping point is to unpack explicitly and computationally what the factorization into two parts looks like at the first level beyond the usual TQFT.

In the terminology I use, the classical field theory is:

A^{\omega} : nCob_2 \rightarrow Span_2(Gpd)^{U(1)}

This depends on a cohomology class [\omega] \in H^3(G,U(1)). The “quantization functor” (which in this case I call “2-linearization”):

\Lambda^{U(1)} : Span_2(Gpd)^{U(1)} \rightarrow 2Vect

The middle stage involves the monoidal 2-category I call Span_2(Gpd)^{U(1)}.  (In FHLT, they use different terminology, for instance “families” rather than “spans”, but the principle is the same.)

Freed and Quinn looked at the quantization of the “extended” DW model, and got a nice geometric picture. In it, the action is understood as a section of some particular line-bundle over a moduli space. This geometric picture is very elegant once you see how it works, which I found was a little easier in light of a factorization through Span_2(Gpd).

This factorization isolates the geometry of this particular situation in the “classical field theory” – and reveals which of the features of their setup (the line bundle over a moduli space) are really part of some more universal construction.

In particular, this means laying out an explicit definition of both Span_2(Gpd)^{U(1)} and \Lambda^{U(1)}.

2-Linearization Recalled

While I’ve talked about it before, it’s worth a brief recap of how 2-linearization works with a view to what happens when you twist it via groupoid cohomology. Here we have a 2-category Span(Gpd), whose objects are groupoids (A, B, etc.), whose morphisms are spans of groupoids:

A \stackrel{s}{\leftarrow} X \stackrel{t}{\rightarrow} B

and whose 2-morphisms are spans of span-maps (taken up to isomorphism), which look like so:

span of span maps

(And, by the by: how annoying that WordPress doesn’t appear to support xypic figures…)

These form a (symmetric monoidal) 2-category, where composition of spans works by taking weak pullbacks.  Physically, the idea is that a groupoid has objects which are configurations (in the cause of gauge theory, connections on a manifold), and morphisms which are symmetries (gauge transformations, in this case).  Then a span is a groupoid of histories (connections on a cobordism, thought of as spacetime), and the maps s,t pick out its starting and ending configuration.  That is, A = A_G(S) is the groupoid of flat G-connections on a manifold S, and X = A_G(\Sigma) is the groupoid of flat G-connections on some cobordism \Sigma, of which S is part of the boundary.  So any such connection can be restricted to the boundary, and this restriction is s.

Now 2-linearization is a 2-functor:

\Lambda : Span_2(Gpd)^{U(1)} \rightarrow 2Vect

It gives a 2-vector space (a nice kind of category) for each groupoid G.  Specifically, the category of its representations, Rep(G).  Then a span turns into a functor which comes from “pulling” back along s (the restricted representation where X acts by first applying s then the representation), then “pushing” forward along t (to the induced representation).

What happens to the 2-morphisms is conceptually more complicated, but it depends on the fact that “pulling” and “pushing” are two-sided adjoints. Concretely, it ends up being described as a kind of “sum over histories” (where “histories” are the objects of Y), which turns out to be exactly the path integral that occurs in the TQFT.

Or at least, it’s the path integral when the action is trivial! That is, if S=0, so that what’s integrated over paths (“histories”) is just e^{iS}=1. So one question is: is there a way to factor things in this way if there’s a nontrivial action?

Cohomological Twisting

The answer is by twisting via cohomology. First, let’s remember what that means…

We’re talking about groupoid cohomology for some groupoid G (which you can take to be a group, if you like).  “Cochains” will measure how much some nice algebraic fact, such as being a homomorphism, or being associative, “fails to occur”.  “Twisting by a cocycle” is a controlled way to force some such failure to happen.

So, an n-cocycle is some function of n composable morphisms of G (or, if there’s only one object, “group elements”, which amounts to the same thing).  It takes values in some group of coefficients, which for us is always U(1)

The trivial case where n=0 is actually slightly subtle: a 0-cocycle is an invariant function on the objects of a groupoid. (That is, it takes the same value on any two objects related by an (iso)morphism. (Think of the object as a sequence of zero composable morphisms: it tells you where to start, but nothing else.)

The case n=1 is maybe a little more obvious. A 1-cochain f \in Z^1_{gpd}(G,U(1)) can measure how a function h on objects might fail to be a 0-cocycle. It is a U(1)-valued function of morphisms (or, if you like, group elements).  The natural condition to ask for is that it be a homomorphism:

f(g_1 \circ g_2) = f(g_1) f(g_2)

This condition means that a cochain f is a cocycle. They form an abelian group, because functions satisfying the cocycle condition are closed under pointwise multiplication in U(1). It will automatically by satisfied for a coboundary (i.e. if f comes from a function h on objects as f(g) = \delta h (g) = h(t(g)) - h(s(g))). But not every cocycle is a coboundary: the first cohomology H^1(G,U(1)) is the quotient of cocycles by coboundaries. This pattern repeats.

It’s handy to think of this condition in terms of a triangle with edges g_1, g_2, and g_1 \circ g_2.  It says that if we go from the source to the target of the sequence (g_1, g_2) with or without composing, and accumulate f-values, our f gives the same result.  Generally, a cocycle is a cochain satisfying a “coboundary” condition, which can be described in terms of an n-simplex, like this triangle. What about a 2-cocycle? This describes how composition might fail to be respected.

So, for instance, a twisted representation R of a group is not a representation in the strict sense. That would be a map into End(V), such that R(g_1) \circ R(g_2) = R(g_1 \circ g_2).  That is, the group composition rule gets taken directly to the corresponding rule for composition of endomorphisms of the vector space V.  A twisted representation \rho only satisfies this up to a phase:

\rho(g_1) \circ \rho(g_2) = \theta(g_1,g_2) \rho(g_1 \circ g_2)

where \theta : G^2 \rightarrow U(1) is a function that captures the way this “representation” fails to respect composition.  Still, we want some nice properties: \theta is a “cocycle” exactly when this twisting still makes \rho respect the associative law:

\rho(g_1) \rho( g_2 \circ g_3) = \rho( g_1 \circ g_2) \circ \rho( g_3)

Working out what this says in terms of \theta, the cocycle condition says that for any composable triple (g_1, g_2, g_3) we have:

\theta( g_1, g_2 \circ g_3) \theta (g_2,g_3) = \theta(g_1,g_2) \theta(g_1 \circ g_2, g_3)

So H^2_{grp}(G,U(1)) – the second group-cohomology group of G – consists of exactly these \theta which satisfy this condition, which ensures we have associativity.

Given one of these \theta maps, we get a category Rep^{\theta}(G) of all the \theta-twisted representations of G. It behaves just like an ordinary representation category… because in fact it is one! It’s the category of representations of a twisted version of the group algebra of G, called C^{\theta}(G). The point is, we can use \theta to twist the convolution product for functions on G, and this is still an associative algebra just because \theta satisfies the cocycle condition.

The pattern continues: a 3-cocycle captures how some function of 2 variable may fail to be associative: it specifies an associator map (a function of three variables), which has to satisfy some conditions for any four composable morphisms. A 4-cocycle captures how a map might fail to satisfy this condition, and so on. At each stage, the cocycle condition is automatically satisfied by coboundaries. Cohomology classes are elements of the quotient of cocycles by coboundaries.

So the idea of “twisted 2-linearization” is that we use this sort of data to change 2-linearization.

Twisted 2-Linearization

The idea behind the 2-category Span(Gpd)^{U(1)} is that it contains Span(Gpd), but that objects and morphisms also carry information about how to “twist” when applying the 2-linearization \Lambda.  So in particular, what we have is a (symmetric monoidal) 2-category where:

  • Objects consist of (A, \theta), where A is a groupoid and $\theta \in Z^2(A,U(1))$
  • Morphisms from A to B consist of a span (X,s,t) from A to B, together with \alpha \in Z^1(X,U(1))
  • 2-Morphisms from X_1 to X_2 consist of a span (Y,\sigma,\tau) from X, together with \beta \in Z^0(Y,U(1))

The cocycles have to satisfy some compatibility conditions (essentially, pullbacks of the cocycles from the source and target of a span should land in the same cohomology class).  One way to see the point of this requirement is to make twisted 2-linearization well-defined.

One can extend the monoidal structure and composition rules to objects with cocycles without too much trouble so that Span(Gpd) is a subcategory of Span(Gpd)^{U(1)}. The 2-linearization functor extends to \Lambda^{U(1)} : Span(Gpd)^{U(1)} \rightarrow 2Vect:

  • On Objects: \Lambda^{U(1)} (A, \theta) = Rep^{\theta}(A), the category of \theta-twisted representation of A
  • On Morphisms: \Lambda^{U(1)} ( (X,s,t) , \alpha ) comes by pulling back a twisted representation in Rep^{\theta_A}(A) to one in Rep^{s^{\ast}\theta_A}(X), pulling it through the algebra map “multiplication by \alpha“, and pushing forward to Rep^{\theta_B}(B)
  • On 2-Morphisms: For a span of span maps, one uses the usual formula (see the paper for details), but a sum over the objects y \in Y picks up a weight of \beta(y) at each object

When the cocycles are trivial (evaluate to 1 always), we get back the 2-linearization we had before. Now the main point here is that the “sum over histories” that appears in the 2-morphisms now carries a weight.

So the twisted form of 2-linearization uses the same “pull-push” ideas as 2-linearization, but applied now to twisted representations. This twisting (at the object level) uses a 2-cocycle. At the morphism level, we have a “twist” between “pull” and “push” in constructing . What the “twist” actually means depends on which cohomology degree we’re in – in other words, whether it’s applied to objects, morphisms, or 2-morphisms.

The “twisting” by a 0-cocycle just means having a weight for each object – in other words, for each “history”, or connection on spacetime, in a big sum over histories. Physically, the 0-cocycle is playing the role of the Lagrangian functional for the DW model. Part of the point in the FHLT program can be expressed by saying that what Freed and Quinn are doing is showing how the other cocycles are also the Lagrangian – as it’s seen at higher codimension in the more “local” theory.

For a TQFT, the 1-cocycles associated to morphisms describe how to glue together values for the Lagrangian that are associated to histories that live on different parts of spacetime: the action isn’t just a number. It is a number only “locally”, and when we compose 2-morphisms, the 0-cocycle on the composite picks up a factor from the 1-morphism (or 0-morphism, for a horizontal composite) where they’re composed.

This has to do with the fact that connections on bits of spacetime can be glued by particular gauge transformations – that is, morphisms of the groupoid of connections. Just as the gauge transformations tell how to glue connections, the cocycles associated to them tell how to glue the actions. This is how the cohomological twisting captures the geometric insight that the action is a section of a line bundle – not just a function, which is a section of a trivial bundle – over the moduli space of histories.

So this explains how these cocycles can all be seen as parts of the Lagrangian when we quantize: they explain how to glue actions together before using them in a sum-over histories. Gluing them this way is essential to make sure that \Lambda^{U(1)} is actually a functor. But if we’re really going to see all the cocycles as aspects of “the action”, then what is the action really? Where do they come from, that they’re all slices of this bigger thing?

Twisting as Lagrangian

Now the DW model is a 3D theory, whose action is specified by a group-cohomology class [\omega] \in H^3_{grp}(G,U(1)). But this is the same thing as a class in the cohomology of the classifying space: [\omega] \in H^3(BG,U(1)). This takes a little unpacking, but certainly it’s helpful to understand that what cohomology classes actually classify are… gerbes. So another way to put a key idea of the FHLT paper, as Urs Schreiber put it to me a while ago, is that “the action is a gerbe on the classifying space for fields“.

What does this mean?

This map is given as a path integral over all connections on the space(-time) S, which is actually just a sum, since the gauge group is finite and so all the connections are flat.  The point is that they’re described by assigning group elements to loops in S:

A : \pi_1(M) \rightarrow G

But this amounts to the same thing as a map into the classifying space of G:

f_A : M \rightarrow BG

This is essentially the definition of BG, and it implies various things, such as the fact that BG is a space whose fundamental group is G, and has all other homotopy groups trivial. That is, BG is the Eilenberg-MacLane space K(G,1). But the point is that the groupoid of connections and gauge transformations on S just corresponds to the mapping space Maps(S,BG). So the groupoid cohomology classes we get amount to the same thing as cohomology classes on this space. If we’re given [\omega] \in H^3(BG,U(1)), then we can get at these by “transgression” – which is very nicely explained in a paper by Simon Willerton.

The essential idea is that a 3-cocycle \omega (representing the class [\omega]) amounts to a nice 3-form on BG which we can integrate over a 3-dimentional submanifold to get a number. For a d-dimensional S, we get such a 3-manifold from a (3-d)-dimensional submanifold of Maps(S,BG): each point gives a copy of S in BG. Then we get a (3-d)-cocycle on Maps(S,BG) whose values come from integrating \omega over this image. Here’s a picture I used to illustrate this in my talk:

Now, it turns out that this gives 2-cocycles for 1-manifolds (the objects of 3Cob_2, 1-cocycles on 2D cobordisms between them, and 0-cocycles on 3D cobordisms between these cobordisms. The cocycles are for the groupoid of connections and gauge transformations in each case. In fact, because of Stokes’ theorem in BG, these have to satisfy all the conditions that make them into objects, morphisms, and 2-morphisms of Span^{U(1)}(Gpd). This is the geometric content of the Lagrangian: all the cocycles are really “reflections” of \omega as seen by transgression: pulling back along the evaluation map ev from the picture. Then the way you use it in the quantization is described exactly by \Lambda^{U(1)}.

What I like about this is that \Lambda^{U(1)} is a fairly universal sort of thing – so while this example gets its cocycles from the nice geometry of BG which Freed and Quinn talk about, the insight that an action is a section of a (twisted) line bundle, that actions can be glued together in particular ways, and so on… These presumably can be moved to other contexts.

Next Page »

Follow

Get every new post delivered to your Inbox.

Join 45 other followers