quantum mechanics


I’d just like to post something about a conceptual clarification that came up recently. Last week I gave the first of a couple of talks in the Algebra seminar in our department, about the ideas of structure types and stuff types, more or less as outlined in this paper which I put out a couple of years ago. It summarizes and traipses a little way beyond the matter of the 2003/2004 quantum gravity seminar at UCR, whence on this paper by John Baez and Jim Dolan, and even further back on work by André Joyal, particularly in the paper “Foncteurs analytiques et espèces de structures“, which regrettably doesn’t seem to be available either online. (I gave a blackboard version of the talk, but it was an expanded form of this one hour version.)

(Semantic side note: these espèces de structures are often referred to as “combinatorial species” in English. This is the more common translation than “structure type”, but unfortunately, it doesn’t capture the modifier “de structures“, instead choosing the more generic “combinatorial”, which makes it hard to distinguish “structure types” from “stuff types” in the Baez-Dolan sense. Also, “species” is probably over-specific as a translation of “espèces” in a way that “type” isn’t. The generic sense of “species” as “a kind of” in English is a bit recherché.)

In any case, what I’m interested in this post is the sense in which stuff types give a “categorification” of a vector space. In a nutshell, a stuff type is a groupoid over FinSet_0 (the groupoid whose objects are finite sets, and whose morphisms are bijections). That is, it’s really a functor X \stackrel{\psi}{\longrightarrow} FinSet_0, which we call the “underlying set” functor. For example, consider the groupoid T of all binary trees, where the underlying set is the set of nodes (or, a different example, the set of leaves). Any isomorphism between two such trees gives a bijection between the underlying sets, so this actually is a functor. Or one could take the functor FinSet_0 \times FinSet_0 \stackrel{\pi_1}{\longrightarrow} FinSet_0, where the “underlying set” of a pair of sets (S_1,S_2) is just S_1, and likewise for morphisms. (Notice that different bijections “up above” in the bundle may give the same bijection “below” - in cases where this doesn’t happen, we have one of Joyal’s “structure types”). In some ways, it’s better to think of it as a bundle of groupoids - one fibre over each object in FinSet_0

The thing is, that map gives an invariant for objects in the category of groupoids, but not a complete invariant. Unlike, say, finite sets and the natural numbers. Natural numbers correspond exactly to isomorphism classes of sets - not so with groupoid cardinalities. So there’s an equivalence relation, and reducing the object set modulo that equivalence relation gives a structure - but it’s not the minimal throwing-away of information about objects that taking isomorphism classes would be.

But in any case, it’s the whole category of groupoids (over FinSet_0) which gets “decategorified” down to a vector space, in that world. There is a concept of groupoid cardinality, which is given by Baez and Dolan in the paper above, and which is also linked to Tom Leinster’s definition of the Euler characteristic of a category. This adds up, over all the isomorphism classes of objects, \frac{1}{|Aut(x)|}, the reciprocals of the sizes of automorphism groups. Reasons why this is the nicest concept of cardinality are described in some of those references, but all that really matters here is that groupoid cardinality gets along with disjoint unions of groupoids (corresponding to sums of cardinalitys), and products of groupoids (which get the product of the two cardinalities). That is, the categorical coproduct and product, respectively, define operations on the set of cardinalities!

In particular, taking stuff types - groupoids over FinSet_0, we can take the cardinalities of the fibres over sets of each size n giving the n^{th} coordinate in a vector. So then is, the slice category \mathbf{Grpd}/FinSet_0 has this “cardinality” on objects into a set, and the structure of the category gives well-defined operations on this set, turning it into a vector space. In fact, there’s an operation (weak pullback) which makes it an inner product space. (To make this work in complex cardinalities takes some fudging with phases in U(1), but it can be done.)

The details are interesting, and I’m coming back to looking at some of this again, but what I want to point out at the moment is a more fundamental point, which has to do with the offhanded use of the handy, but imprecise, term “categorify”. With the category of (U(1)-) stuff types, we have a category with a “decategorification” map that compresses it into a vector space. This sure sounds like a “categorified vector space”. In fact, this seems to be what people who hear the term “categorification” often want it to mean: I look for a categorification of mathematical object X by finding a category which, secretly, looks like X.

The problem is, there’s another concept attached to the phrase “categorified vector space”, namely that of 2-vector space in the sense of Kapranov and Voevodski, as discussed, say, here. There’s a different level of abstraction at work here. The specific category of stuff types provides a categorification (if that indeed is the right word to use) of a specific vector space. The concept of a KV 2-vector space categorifies the concept of a regular vector space in a particular way: putting “additive” structure on objects, and “C-linear” structure on morphisms. (The Baez-Crans version does the same job in a different way).

You don’t think of a specific KV 2-vector space “decategorifying to” a specific vector space. Indeed, just taking the “minimal” equivalence relation - isomorphism classes of objects - what we get from a KV 2-vector space is more like an \mathbb{N}-module (over a rig, not a ring). Basically, 2-vectors have components which are vector spaces, and therefore classified by their dimension. The relationship between THIS kind of 2-vector space and the non-categorified concept is that real vector spaces show up as the hom-sets in a KV 2-vector space.

Elucidating exactly what’s going on with these two forms of categorification would be nice - perhaps somebody’s done it, but if so, I don’t know who. I also don’t know any nice conditions that tell you when you have a “category that can be mistaken for a vector space”, like stuff types: a good characterization of these things would be nice. Or again: both versions of “categorification” of vector space have special relationships to groupoids - but of two very different natures (in one, the groupoids can be interpreted as 2-vectors - in the other, there are whole 2-vector spaces associated to groupoids). Just a coincidence?

Another possibility that comes to mind would be to form some kind of hybrid structure - where the “vector spaces” which show up in the hom-sets in a KV 2-v.s. are secretly this fake-vector space type of category. Since both types seem to have physics-y ambitions, such a setup that combines both approaches is appealing, rather than a muddled and confusing competition for the term “categorification”.

I don’t have a good ending to this story, which is why this is a blog, not a book.

In the past couple of weeks, Masoud Khalkhali and I have been reading and discussing this paper by Marcolli and Al-Yasry. Along the way, I’ve been explaining some things I know about bicategories, spans, cospans and cobordisms, and so on, while Masoud has been explaining to me some of the basic ideas of noncommutative geometry, and (today) K-theory and cyclic cohomology. I find the paper pretty interesting, especially with a bit of that background help to identify and understand the main points. Noncommutative geometry is fairly new to me, but a lot of the material that goes into it turns out to be familiar stuff bearing unfamiliar names, or looked at in a somewhat different way than the one I’m accustomed to. For example, as I mentioned when I went to the Groupoidfest conference, there’s a theme in NCG involving groupoids, and algebras of \mathbb{C}-linear combinations of “elements” in a groupoid. But these “elements” are actually morphisms, and this picture is commonly drawn without objects at all. I’ve mentioned before some ideas for how to deal with this (roughly: \mathbb{C} is easy to confuse with the algebra of 1 \times 1 matrices over \mathbb{C}), but anything special I have to say about that is something I’ll hide under my hat for the moment.

I must say that, though some aspects of how people talk about it, like the one I just mentioned, seem a bit off, to my mind, I like NCG in many respects. One is the way it ties in to ideas I know a bit about from the physics end of things, such as algebras of operators on Hilbert spaces. People talk about Hamiltonians, concepts of time-evolution, creation and annihilation operators, and so on in the algebras that are supposed to represent spaces. I don’t yet understand how this all fits together, but it’s definitely appealing.

Another good thing about NCG is the clever elegance of Connes’ original idea of yet another way to generalize the concept “space”. Namely, there was already a duality between spaces (in the usual sense) and commutative algebras (of functions on spaces), so generalizing to noncommutative algebras should give corresponding concepts of “spaces” which are different from all the usual ones in fairly profound ways. I’m assured, though I don’t really know how it all works, that one can do all sorts of things with these “spaces”, such as finding their volumes, defining derivatives of functions on them, and so on. They do lack some qualities traditionally associated with space - for instance, many of them don’t have many, or in some cases any, points. But then, “point” is a dubious concept to begin with, if you want a framework for physics - nobody’s ever seen one, physically, and it’s not clear to me what seeing one would consist of…

(As an aside - this is different from other versions of “pointless” topology, such as the passage from ordinary topologies to, sites in the sense of Grothendieck. The notion of “space” went through some fairly serious mutations during the 20th century: from Einstein’s two theories of relativity, to these and other mathematicians’ generalizations, the concept of “space” has turned out to be either very problematic, or wonderfully flexible. A neat book is Max Jammer’s “Concepts of Space“: though it focuses on physics and stops in the 1930’s, you get to appreciate how this concept gradually came together out of folk concepts, went through several very different stages, and in the 20th century started to be warped out of all recognition. It’s as if - to adapt Dan Dennett - “their word for milk became our word for health”.I would like to see a comparable history of mathematicians’ more various concepts, covering more of the 20th century. Plus, one could probably write a less Eurocentric genealogy nowadays than Jammer did in 1954.)

Anyway, what I’d like to say about the Marcolli and Al-Yasry paper at the moment has to do with the setup, rather than the later parts, which are also interesting. This has to do with the idea of a correspondence between noncommutative spaces. Masoud explained to me that, related to the matter of not having many points, such “spaces” also tend to be short on honest-to-goodness maps between them. Instead, it seems that people often use correspondences. Using that duality to replace spaces with algebras, a recurring idea is to think of a category where morphism from algebra A to algebra B is not a map, but a left-right (A,B)-bimodule, _AM_B. This is similar to the business of making categories of spans.

Let me describe briefly what Marcolli and Al-Yasry describe in the paper. They actually have a 2-category. It has:

Objects: An object is a copy of the 3-sphere S^3 with an embedded graph G.

Morphisms: A morphism is a span of branched covers of 3-manifolds over S^3:

G_1 \subset S^3 \stackrel{\pi_1}{\longleftarrow} M \stackrel{\pi_2}{\longrightarrow} S^3 \supset G_2

such that each of the maps \pi_i is branched over a graph containing G_i (perhaps strictly). In fact, as they point out, there’s a theorem (due to Alexander) proving that ANY 3-manifold M can be realized as a branched cover over the 3-sphere, branched at some graph (though perhaps not including a given G, and certainly not uniquely).

2-Morphisms: A 2-morphism between morphisms M_1 and M_2 (together with their \pi maps) is a cobordism M_1 \rightarrow W \leftarrow M_2, in a way that’s compatible with the structure of the $lateux M_i$ as branched covers of the 3-sphere. The M_i are being included as components of the boundary \partial W - I’m writing it this way to emphasize that a cobordism is a kind of cospan. Here, it’s a cospan between spans.

This is somewhat familiar to me, though I’d been thinking mostly about examples of cospans between cospans - in fact, thinking of both as cobordisms. From a categorical point of view, this is very similar, except that with spans you compose not by gluing along a shared boundary, but taking a fibred product over one of the objects (in this case, one of the spheres). Abstractly, these are dual - one is a pushout, and the other is a pullback - but in practice, they look quite different.

However, this higher-categorical stuff can be put aside temporarily - they get back to it later, but to start with, they just collapse all the hom-categories into hom-sets by taking morphisms to be connected components of the categories. That is, they think about taking morphisms to be cobordism classes of manifolds (in a setting where both manifolds and cobordisms have some branched-covering information hanging around that needs to be respected - they’re supposed to be morphisms, after all).

So the result is a category. Because they’re writing for noncommutative geometry people, who are happy with the word “groupoid” but not “category”, they actually call it a “semigroupoid” - but as they point out, “semigroupoid” is essentially a synonym for (small) “category”.

Apparently it’s quite common in NCG to do certain things with groupoids \mathcal{G} - like taking the groupoid algebra \mathbb{C}[\mathcal{G}] of \mathbb{C}-linear combinations of morphisms, with a product that comes from multiplying coefficients and composing morphisms whenever possible. The corresponding general thing is a categorical algebra. There are several quantum-mechanical-flavoured things that can be done with it. One is to let it act as an algebra of operators on a Hilbert space.

This is, again, a fairly standard business. The way it works is to define a Hilbert space \mathcal{H}(G) at each object G of the category, which has a basis consisting of all morphisms whose source is G. Then the algebra acts on this, since any morphism M' which can be post-composed with one M starting at G acts (by composition) to give a new morphism M' \circ M starting at G - that is, it acts on basis elements of \mathcal{H}(G) to give new ones. Extending linearly, algebra elements (combinations of morphisms) also act on \mathcal{H}(G).

So this gives, at each object G, an algebra of operators acting on a Hilbert space \mathcal{H}(G) - the main components of a noncommutative space (actually, these need to be defined by a spectral triple: the missing ingredient in this description is a special Dirac operator). Furthermore, the morphisms (which in this case are, remember, given by those spans of branched covers) give correspondences between these.

Anyway, I don’t really grasp the big picture this fits into, but reading this paper with Masoud is interesting. It ties into a number of things I’ve already thought about, but also suggests all sorts of connections with other topics and opportunities to learn some new ideas. That’s nice, because although I still have plenty of work to do getting papers written up on work already done, I was starting to feel a little bit narrowly focused.

One of the first things I did after arriving at PI on Wednesday (and having lunch) was to attend the colloquium talk which was being given by Robert Spekkens. It was called “Why the Quantum?”, but as he described it, the real point of the talk was to take a close look at the features of quantum physics that are commonly considered “weird” or “mysterious” and see what’s really innovative in the departure from classical physics. For the most part, “physics” here means “mechanics”, but he also touched on optics, theory of computation, and briefly on electromagnetism and gravity in a more speculative way.

The main message of his talk is that very few of the things about quantum physics which seem strange are really all that innovative. He showed this by describing a kind of classical theory that has many of them - interference, noncommuting observables, entanglement, “wavefunction collapse”, wave-particle duality, teleportation and a no-cloning theorem, superposition of states, and so forth. All of these, he told us, will show up in a model based on a classical mechanical system, where the “quantum” theory is a theory of probability distributions (or, equivalently, of the knowledge of observers about a classical system) subject to a restriction about what distributions are allowed.

The point is to start with some classical system: let’s say it’s a mechanical system of some moving particles. Then there’s a configuration space of all the possible (classical) configurations of the system - one point in this space for each configuration. Classical mechanics is then about defining a “flow” on this space, which tells you where a point will move over time (how the system will go from one configuration to another). Then Liouville mechanics is about probability distributions in this space: you might not know exactly which configuration the system is in, but you have a way of estimating the probabilities. Then you impose the restriction that the only allowed probability distributions are ones for which the products of the variances for conjugate variables are at least Planck’s constant. (Actually, I think Spekkens formulated this differently, but that’s about what it amounts to, as I understand it.) The result is equivalent to “Gaussian quantum mechanics” - one where probability distributions are all Gaussians.

This also puts limits on what the rule for evolving states can be: any rule for how individual states evolve over time also gives a result for how probability distributions evolve over time. (Picture a cloud of ink, with varying density, flowing along in moving water - knowing the flow lines tells you where the cloud goes.) If there are restrictions on what kind of probability distributions can be set up, these have to be preserved over time - otherwise, you could set up an allowed distribution, and then wait until it evolves into a disallowed one. In particular, for Gaussian quantum mechanics, he told us that systems with a quadratic Hamiltonian will satisfy this condition.

The important fact here is that this is a “realist” interpretation. It says the quantum mechanical uncertainty reflects that QM is a theory about your knowledge of the state of the system, which, however, really exists. Often in quantum mechanics, one defines a “wave function” as a function living on configuration space (complex-valued, not real-valued like a probability density, but a function nonetheless). However, it’s now pretty standard to think of this wave function as the “real” state of the system - the view that it represents a state of knowledge was popular for a while, but ran into various problems in the form of experiments that are hard to account for, such as Bell inequality violations. The point of the talk was to see just how many of the “strange” features of quantum mechanics are genuine problems for this view, and to show the answer is “not many”.

The features he claimed are really mysterious from this point of view are fairly few: Bell inequality violations, some no-go theorems for models of physics involving local hidden variables such as the Kochen-Specker Theorem, and a few others. So Spekkens’ suggestion was that this concept of quantum mechanics as a theory of probability with an “epistemic” restriction (i.e. limits on what’s knowable) might be salvaged if the underlying classical theory were non-local - and perhaps had some other odd features yet to be precisely delineated - to begin with. However, it might not have to be terribly strange apart from that, since quantum mechanical features like interference and superposition of states all show up in the restricted statistical picture.

The gist of his argument then seemed to be that to really straighten out some foundational issues in quantum physics, one approach would be: (a) come up with a well-founded justification for the assumption about restrictions on possible probability distributions, and (b) come up with at least one (and as few as possible) other principles to account for the remaining mysterious things - he also suggested they all seem to have something to do with “contextuality”. As I understand it, this last is the idea that an observable might have definite, but multiple, values - and that which values are seen depend on which groups of observables are measured together. I don’t know what, if anything, to make of that oddball-sounding idea.

However, he did argue that in some cases at least, the restriction can be justified by the observer effect: you have to look at a system using some apparatus, whose state you don’t know completely, and which interferes with the system in order to observe it (for instance, measuring the position of a particle by scattering it off another one, whose state is partly unknown, and imparts an unknown momentum).

My overall reaction to the talk is that it’s interesting to know that realist interpretations of quantum physics (where the “reality” is more or less classical, and quantum effects some kind of afterthought, or epistemic effect) aren’t as dead as they might have seemed. However, the view that says classical physics emerges as some kind of limiting case of quantum effects seems better developed, at least mathematically, than the reverse. As for his claim that we “understand” the classical picture “physically”, whereas it’s not so for the quantum picture - I personally can only agree that’s true for me, but I don’t entirely see what you can conclude from that.

The bottom line seems to be that there are still problems in epistemology. I suspected as much already - though I’m not sure if I “knew” it, whatever that means.