A “mother of all groups”? What kind of structures have "mother of all"s?

The surreal numbers exhibit much stronger universal properties than you have mentioned, for they also exhibit very strong homogeneity and saturation properties. For example, every automorphism of a set-sized elementary substructure of the surreals extends to an automorphism of the entire surreal numbers, and every set-sized type over the surreals that is consistent with the theory of the surreal numbers is realized in the surreal numbers. That is, any first-order property that could be true about an object as it relates to some surreal numbers, which is consistent with the theory of the surreal numbers, is already true about some surreal number.

For any complete first-order theory $T$, one can consider the concept of a monster model of the theory. This is a model $\mathcal{M}$ of $T$, such that first, every other set-sized model of $T$ embeds as an elementary substructure of $\mathcal{M}$--not merely as a substructure, but as a substructure in which the truth of any first-order statement has the same truth value in the substructure as in big model--and second, such that every automorphism of a set-sized substructure of $\mathcal{M}$ extends to an automorphism of $\mathcal{M}$.

One may also use approximations to these proper class monster models by considering extremely large set-sized models, of some size $\kappa$, such that the embedding and homogeneity properties hold with respect to substructures of size less than $\kappa$.

Every consistent complete first order theory $T$ has such monster models, and they are used pervasively in model theory. Model theorists find it convenient, when considering types over and extensions of a fixed model, to work inside a fixed monster model, considering only extensions that arise as submodels of the fixed monster model.

Your topic also has a strong affinity with the concept of the Fraïssé limit of a collection of finitely-generated (or $\kappa$-generated) structures, which one aims to be the age of the limit structure, the collection of finitely-generated ($\kappa$-generated) substructures of the limit structure. Fraïssé limits are often built to exhibit the same saturation and homogeneity properties of the surreal numbers. As a linear order, the surreal numbers are the Fraïssé limit of the collection of all set-sized linear orders. And I believe that one can also put the field structure in here.


Update. When one has the global axiom of choice, the set-homogeneous property of the generalized Fraïssé limit allows one to establish universality for proper class structures. Basically, using global AC one realizes a given proper class structure as a union of a tower of set structures, and gradually maps them into the homogeneous structure. The homogeneity property is exactly what you need to keep extending the embedding, and so one gets the whole proper class structure mapping in. This kind of argument, I believe, shows that what you want to consider is homogeneity rather than merely the universal property itself. (This argument is an analogue of the idea that when one has homogeneity for countable substructures, then one gets universality for structures of size $\aleph_1$.)

Regarding your specific construction, here is a simpler way to undertake the same idea, which avoids the need for the equivalence relation: Let $G$ be the proper class of all fixed-point-free permutations of a set. This class supports a natural group operation, which is to compose them, regarding elements outside the domain as fixed by the permutation, and then cast out any newly-created fixed points. The identity element of $G$ is the empty function, which is really a stand-in for the identity function on the universal class.

It is clear that every group finds an isomorphic copy inside $G$, without using the axiom of choice, since every group is naturally isomorphic to a group of permutations, and these are naturally embedded into $G$, simply by casting out fixed-points. This does not use the axiom of choice.

The class group $G$ is a natural presentation of the set-support symmetric group $\text{Sym}_{\text{set}}(V)$ of the set-theoretic universe $V$, the class of all permutations of $V$ having set support. Any such permutation of $V$ is represented in $G$ by restricting to the non-fixed-points.

Note finally that in the case of the surreal numbers, one doesn't need the axiom of choice in order to construct the surreal numbers, and one can get many universal properties for well-orderable set structures and in general from the axiom of choice for all set-sized structures. But to get the universal property that you mention for class-sized structures, mere AC is not enough, for one needs the global axiom of choice, which is the assertion that there is a proper class well-ordering of the universe. This does not follow from AC, for there are models of Gödel-Bernays set theory GB that have AC, but not global choice. But global AC is sufficient to carry out the embedding of any class linear order into No. A similar phenomenon arises with many other class-sized class-homogeneous set-saturated models, which require global AC to get the universality for class structures.


Let $\mathcal{G}$ denote the class of all groups. Define

$$\mathcal{M} = \left\{ f : \mathcal{G} \dashrightarrow \bigcup\mathcal{G} : \left(\forall G \in \mathrm{dom}(f)\right)\left(f(G) \in G\setminus\{\mathrm{id}_G\}\right)\right\}$$

In other words, $\mathcal{M}$ consists of all set-sized partial choice functions on the class of all groups, excluding choice functions that ever pick the identity. The empty function is the identity of this class group, the inverse of a function in $\mathcal{M}$ is its point-wise inverse, and multiplication is defined as follows:

  • $\mathrm{dom}(fg) = \mathrm{dom}(f) \cup \mathrm{dom}(g) \setminus \{G \in \mathcal{G} : f(G)g(G) = \mathrm{id}_G\}$
  • $(fg)(G) = f(G)g(G)$ if $G\in\mathrm{dom}(f)\cap\mathrm{dom}(g)$ and $f(G)g(G)\neq\mathrm{id}_G$
  • $(fg)(G) = f(G)$ if $G \in \mathrm{dom}(f)\setminus\mathrm{dom}(g)$
  • $(fg)(G) = g(G)$ if $G \in \mathrm{dom}(g)\setminus\mathrm{dom}(f)$

The intuitive idea here is you want to take the Cartesian product of all the groups. In other words, your mother group would consist of all the class-sized choice functions on $\mathcal{G}$. Of course, a class can't have class-sized elements, so we try to settle for taking the class of all set-sized partial choice functions on $\mathcal{G}$. The problem with that boils down to what to take as the identity element? Or in other words, how do you differentiate a partial choice function from one that is similar, but additionally picks out some identity elements from groups in its domain? The solution is to "disallow" choosing the identity. This leads us to make the identity of this mother group the empty choice function, and to define multiplication essentially like point-wise multiplication, except that we remove any identities in the result.


This is a footnote to Joel’s nice answer.

Motivated by the ordered field No of surreal numbers, in 1989 I extended the theory of saturated models to class structures in “Absolutely Saturated Models” (Fundamenta Mathematica 133, pp. 39-46). Working in NBG (with global choice)—where truth is not in general definable in class structures and induction cannot in general be applied freely in such structures to assertions involving global truth—it is shown (among other things) that: if the language L of a theory T is a set and T is a complete, model complete theory with an infinite model, then T has (up to isomorphism) a unique absolutely saturated (i.e. On-saturated) model which coincides (up to isomorphism) with the homogeneous-universal model of T. Additional On-saturated models can be constructed in NBG, but there are limitations due to the considerations having to do with truth in class structures.

If one moves to Kelly-Morse set theory, which unlike NBG is not a conservative extension of ZFC, the entire classical theory of saturated models can be extended to class structures.