Informal lecture notes January 28 Intro: basic definitions and questions.

Illustrated by Fano plane $\Pi_2$ of 7 “points” and 7 “lines” (diagram again; get properties from class, including):

– Each line has 3 points
– Any 2 points on a unique line

That makes it a 2-(7,3,1) design. In general:

Definition 1.1 (p.1) A $t$-design with parameters $(v,k,\lambda)$, a.k.a. a $t$-$(v,k,\lambda)$ design, is a pair ${\mathfrak D} = (X,{\mathfrak B})$ where

@ $X$ is a set of $v$ “points”
@ $\mathfrak B$ is a collection of $k$-element “blocks”, each a subset of $X$
@ any $t$ points contained in exactly $\lambda$ blocks.

E.g. $\Pi_2$ is also a 1-(7,3,3) design, and Petersen is 1-(10,2,3).
Trivial example: any collection of $b$ blocks is a 0-$(v,k,b)$ design. Is $\Pi_2$ a 3-design?

Even more trivial: If $v \lt k$ or $k \lt t$ then $\lambda=0$ (and if $v \lt t$ then $\lambda$ is undefined); we don’t care about such examples, nor indeed the example where $\lambda=0$ because $\mathfrak B$ is empty.

We do care very much about the special case

$\lambda = 1$ (as with $\Pi_2$);

that’s a Steiner system (a.k.a. $S(t,k,v)$).

[Further examples: the $S(2,3,81)$ of the game “Set”; after Prop. 1.4 and its Corollaries: the S(3,4,8) obtained by adding a point to $\Pi_2$ etc. – more on this later in the term]

Aim: describe all designs, up to equivalence.

That’s asking too much!

But it can be done, with interesting results, for many particular cases, some of which will appear shortly. E.g. 2-(7,3,1) is unique.

Constructing $t$-designs gets harder as $t$ increases, with one exception …

Only slightly less trivial: the set of all $v \choose k$ subsets is a trivial design: the symmetric group $S_v$ acts transitively on $t$-element subsets, so each is contained in the same number (namely $v-t \choose k-t$). That’s a complete block design; we’re interested in incomplete ones. The $t$-design condition says that the set of blocks is “balanced” among all elements, pairs, …, $t$-subsets. This is sometimes called a

BIBD: balanced incomplete block design.

(At least when $t \geq 2$, since $t=1$ is still too easy to satisfy without additional conditions, e.g. $k=2$ just gives regular graphs; more on that soon.) Jan.30 the incidence matrix of a design; Fisher’s theorem and duality

Reminder: first HW due the midnight before next class

Def. 1.10 (p.4) Incidence matrix of design (or even t-structure):
columns ↔ points
rows ↔ blocks
entries = 1 if point in block, 0 else.

Example (the RED BUOY / YOUR BED description of $\Pi_2$):

       B D E O R U Y
  BUD  1 1 0 0 0 1 0
  BYE  1 0 1 0 0 0 1
  DOE  0 1 1 1 0 0 0
  DRY  0 1 0 0 1 0 1
  ORB  1 0 0 1 1 0 0
  RUE  0 0 1 0 1 1 0
  YOU  0 0 0 1 0 1 1

Warning: depends on ordering of $X$ and $\mathfrak B$; also, some authors use a definition with rows and columns switched, resulting in transpose of our matrix. To do it more canonically, we may think of the associated linear transformation between $F^X$ and $F^{\mathfrak B}$ where $F$ is the ground field (and $F^X,F^{\mathfrak B}$ are the vectors spaces of maps from $X,\mathfrak B$ respectively to $F$); for instance a map $f: X \to F$ goes to the map ${\mathfrak B} \to F$ taking any block $B$ to $\sum_{x \in B} f(x)$.

1.11 (p.4), generalized slightly:

If it’s a $t$-$(v,k,\lambda)$ design then:

@ each block has $k$ points ⇔ each row dots with all-1’s column to $k$ ⇔ $M$ * (all-1’s column) = $k$*(all-1’s column)

@ if $t \geq 1$: each point is in $r=\lambda_1$ blocks ⇔ each column dots with all-1’s row to r ⇔ (all-1’s row) * M = r*(all-1’s row)

The fact that rows and columns are interchangeable so far suggests defining the dual ${\mathfrak D}^{\sf T}$ of a design ${\mathfrak D} = (X,{\mathfrak B})$ [see p.5, I’m changing the text’s order of exposition] so that the incidence matrix of ${\mathfrak D}^{\sf T}$ is is the transpose of the incidence matrix of $\mathfrak D$ (as the notation suggests). Morally speaking ${\mathfrak D}^{\sf T} = ({\mathfrak B},X)$. Since $X$ does not consist of subsets of $\mathfrak B$, we interpret this as $$ {\mathfrak D}^{\sf T} = (X^{\sf T}, {\mathfrak B}^{\sf T}) = ({\mathfrak B}, \{\beta_x : x \in X\}) $$ where $$ \beta_x := \{B \in {\mathfrak B}: x \in B\} $$ NB in general this might be only a “structure”, not a design. A structure $\mathfrak D$ is a design if the adjacency matrix has no repeated rows, and ${\mathfrak D}^{\sf T}$ is a design if there are no repeated columns. So, for instance, consider

1100
1100
0011
0011

($\mathfrak D$ and ${\mathfrak D}^{\sf T}$ are 1-(4,2,2) structures, both isomorphic with the regular graph-with-multiplicity @=@ @=@), and

110000
001100
000011

($\mathfrak D$ is the 1-(6,2,1) design @-@ @-@ @-@, and ${\mathfrak D}^{\sf T}$ is a 1-(3,1,2) structure; that’s a degenerate example because it“s twice the complete design — can you find a nondegenerate example?) In any case the dual of a 1-structure is a 1-structure, and — as ought to be the case with a notion of duality —

$({\mathfrak D}^{\sf T})^{\sf T}$ is canonically isomorphic with $\mathfrak D$.

OK: what if we actually have a 2-design, such as $\Pi_2$? Check that for $\Pi_2$ the dual ${\mathfrak D}^{\sf T}$ is also a 2-design, indeed with the same parameters. This isn’ always true, but it does suggest something that is. Let’s see:

@ if also $t=2$: each pair of points in $\lambda$ blocks ⇔ any two different columns have dot product $\lambda$. The dot product of a column with itself is again $r4$, so ...

1.11 iii) $M^{\sf T} M = (r-\lambda) I + \lambda J$ ($v \times v$ matrices)

where $J$ = all-1’s matrix. (Likewise if $t \geq 2$, replacing $\lambda$ by $\lambda_2$.) Note that this excludes repeated columns unless $r = \lambda$, which is (not surprisingly) impossible except in trivial cases as we soon see.

The first two properties in (1.11) can also be expressed in terms of $J$:

1.11 ii) $M J = k J$ ($v \times v$ matrices)
1.11 i) $J M = r J$ ($b \times b$ matrices) That might seem like just bookkeeping, but (iii) lets us actually do nontrivial things using linear algebra (and then (i) and (ii) will help too). We need:

Lemma 1.12 (p.4): for the identity and all-1 matrices $I,J$ of order $n$ we have $$ det(xI+yJ) = (x+ny) x^{n-1}. $$ In particular, $xI+yJ$ has rank n iff $x$ and $x+ny$ are nonzero.

[NB it's a priori a homogeneous polynomial of degree $n$; also ${\rm det}(I+yM)$ is a polynomial in $y$ of degree rank($M$), and since $J$ has rank 1 it had to be linear in $y$.]

Pf.: $xI+yJ$ symmetric ⇔ spectral theorem applies (orthonormal eigenvectors, det = product of eigenvalues). All-1’s has eigenvalue x+ny and its orth. complement has dimension $n-1$ and eigenvalue $x$, etc.

[Yes, Lemma 1.12 can also be obtained directly by row operations; but we'll soon make similar applications of the spectral theorem in settings where a more direct approach is not available.]

In our case x+ny is certainly nonzero: it’s $r+(v-1)\lambda$ and both terms are positive. What about $x=r-\lambda$? Well we saw that

(1.9) $r (k-1) = (v-1) \lambda$

so, $$ (r-\lambda) (k-1) = (v-1) \lambda - (k-1) \lambda = (v-k) \lambda $$ so as long as $k\gt 1$ (remember we must have $k\geq t$ for nontriviality) and $k \lt v$ (it’s not just the complete design $(X,\{X\})$) we have $r-\lambda \gt 0$. Hence:

Theorem 1.14 (p.5) [Fisher’s inequality] In a nonempty 2-design with $k \lt v$ we have $b \geq v$.

Proof: We have written a nondegenerate $v \times v$ matrix as a product of two matrices each of rank at most $b$. QED

What if equality holds? First of all $M$ is a square matrix, and (1.8) $bk=vr$ means that $b=v$ iff $k=r$, so (1.11) means $MJ=JM$. But then $M$ commutes with $M^{\sf T} M$ since that’s a linear combination of $I$ and $J$, so $M M^{\sf T} M = M^{\sf T} M M$, and since $M$ is invertible we conclude $M M^{\sf T} = M^{\sf T} M$, which is already known (1.11 again) to be $(r-\lambda) I + \lambda J$. But this means that every two blocks have $\lambda$ points in common! That in turn means that ${\mathfrak D}^{\sf T}$ is also a 2-design. To summarize:

Theorem 1.15 (p.5) In a 2-design with $k \leq v$, TFAE:
(a) $b=v$
(b) $r=k$
(c) any two blocks have $\lambda$ points in common
(d) any two blocks have a constant number of points in common.
[(e) ${\mathfrak D}^{\sf T}$ is a 2-design.]

Proof: a ⇔ b follows from $bk=rv$ ($b/v=r/k$). We’ve just shown that these imply c, and c ⇒ d ⇔ e are obvious. Finally for e ⇒ a, if ${\mathfrak D}^{\sf T}$ is a 2-design then it satisfies the hypotheses of Fisher’s inequality (notably the condition $k \lt; v$ becomes $r \lt; b$, which is equivalent because bk=rv), so we may apply Fisher to both $\mathfrak D$ and ${\mathfrak D}^{\sf T}$ to get $b \geq v \geq b$, whence $b=v$. QED

A 2-design satisfying these conditions is said to be square (because the incidence matrix is square; again the book warns that the literature also contains other terminology for this). You might find problem 2 on the homework easier now.

It“s harder to construct $t$-designs as $t$ increases because

Cor. 1.6 (p.3): a $t$-($v,k,\lambda$) design is automatically also an $s$-($v,k,\lambda_s$) design for each $s \leq t$, with $\lambda_s$ given by:

Prop. 1.4: If $S$ is a subset of $X$ with $s = |S| \in [0,t]$ then the number of blocks containing $S$ is $$ \lambda_s := \left( {v-s \choose t-s} \left/ {k-s \choose t-s}\right. \right) \lambda. $$ Proof: double count pairs $(B,T)$ where $B$ is a block and $S \subset T \subset X$ with $|T|=t$.

Cor. 1.7: If a $t$-($v,k,\lambda$) design exists then $$ {k-s \choose t-s} \left| {v-s \choose t-s} \lambda \right. $$ for each $s=0,1,\ldots,t-1.$

Example:

$b := \lambda_0 =$ # blocks and
$r := \lambda_1 =$ # blocks containing a given point [if $t \gt 0$...]

satisfy $$ b k = v r. $$ Likewise if $t=2$ then we have $$ r = \left({v-1 \choose 1} \left/ {k-1 \choose 1}\right.\right) \lambda = \frac{v-1}{k-1} \lambda $$ so $(v-1)\lambda = r(k-1)$ [p4, (1.9)].

Even for $\lambda=1$, infinitely many examples known with $t=2,3$ but only finitely many for $t=4,5$ (we’ll see all the \ldquo;classical\rdquo; ones) and none known with 6 or more! [Update: though thanks to Keevash, as of 2014 we know they exist.]

What does “up to equivalence” mean?

p.3 isomorphism from $(X,{\mathfrak B})$ to $(X',{\mathfrak B}')$: bijection $f: X \to X'$ that takes $\mathfrak B$ to ${\mathfrak B}'$.

Automorphism of $(X,{\mathfrak B})$ = isomorphism $(X,{\mathfrak B}) \to (X,{\mathfrak B})$; these form a group.

If $G$ is $t$-transitive, or even transitive on $t$-subsets, then $(X,{\mathfrak B})$ is automatically a $t$-design. Many $t$-designs explained this way, including $\Pi_2$ (and Petersen).

Finally, as promised: why don’t we work with “designs with multiplicity”, a.k.a. “$t$-structures” (see p.1)? Even the bridges-of-Königsberg graph has multiplicity… But again it&rsquos too easy to construct $t$-structures:

Prop. 1.2 (p.2): If $t \lt k \lt v-t$ then there is a $t$-($v,k,\lambda$) structure for some $\lambda$, in which not every $k$-set is incident with a block.

Proof: More variables than $\bf Q$-linear equations; clear denominators and add the smallest multiple of the all-1’s vector.

[Another way to say this: the variables are the multiplicities $m_B$ of the $v \choose k$ $k$-subsets, plus $\lambda$ itself; the equations are the $v \choose t$ conditions that each $t$-subset be contained in $\lambda$ blocks counted with multiplicity; so there’s a $\gt 1$ dimensional space of solutions, which includes the vector $x_1$ where all the $m_B$ equal 1. Let $x_2$ be an independent vector; multiply by a common denominator to get all the coordinates of $x_2$ to be integers; and subtract $m x_1$, where $m$ is the minimal coordinate $m_B$ of $x_2$, to get a nonzero solution where all the $m_B$ are nonnegative and at least one of them vanishes.] [The vector space of solutions with $\lambda=0$ may still be of interest for the construction of irreducible representations of the symmetric group of permutations of $v$ objects, but that’s a topic for a different kind of combinatorics class.] Feb.4 More about square designs: BRC theorem; Fisher re-proved; duality and polarity.

Recall: Fisher’s inequality (1.14) says a 2-$(v,k,\lambda)$ design with $k \lt v$ has $b \geq v$; proved using

1.11 iii) $M^{\sf T} M = (r-\lambda) I + \lambda J$ ($v\times v$ matrices)

and the fact that the RHS is invertible. We showed that equality holds if the dual design ${\mathfrak D}^{\sf T}$ is also a 2-design, and noted that in that case $M$ is a square matrix. A further necessary condition (part of the project of deciding which parameters $v,k,\lambda$ are realized by a $t$-design):

Theorem 1.21 (Bruck-Ryser-Chowla; p.7) Suppose there exists a square 2-$(v,k,\lambda)$ design. Then:

i) $v$ even ⇒ $k-\lambda$ is a square [NB not “$k$ is square”; that’s a typo for “$n$ is square” where CvL define $n=k-\lambda$]
ii) $v$ odd ⇒ there exist nonzero integers $x,y,z$ such that $z^2 = (k-\lambda) x^2 + (-1)^{(v-1)/2} \lambda y^2.$

As you can imagine from the statement, (i) is much easier. We’ll show only that case; you might base your final project on one or more of the known proofs of (ii), about which we’ll make some remarks to give a bit of context to that strange-looking condition (but you still needn’t memorize it for an exam).

Proof of (i): Recall that $\det(xI+yJ) = (x+yn) x^{n-1}$ [Lemma 1.12]. If $xI+yJ = M^{\sf T} M$ for a square matrix $M$ then $$ \det(xI+yJ) = \det M^{\sf T} \det M = \det M \det M = (\det M)^2 $$ and since $M$ is an integer matrix $\det(M)$ is an integer. So, a necessary condition is that $\det(xI+yJ)$ be a square. Now in our case $n = v$, and $x + yn = r - \lambda + v\lambda = r + (v-1) \lambda$, and we already saw (1.9) that $(v-1) \lambda = r(k-1)$ so it’s $rk$ which for a square design is $k^2$ (recall 1.15b). So $\det(M) = k (k-\lambda)^{(v-1)/2}$, which is an integer iff v is odd or $k-\lambda$ is a square. So if $v$ is even then $k-\lambda$ must be a square, as claimed.

As for (ii): the rows are vectors in ${\bf Z}^v$ whose Gram matrix of inner products is the invertible matrix $(r-\lambda) I + \lambda J$. So a necessary condition is that such vectors exist in ${\bf Q}^v$, which is to say that the quadratic form with Gram matrix $(r-\lambda) I + \lambda J$ be equivalent to the standard form with Gram matrix $I$. A necessary condition is square determinant, and the arithmetic theory of quadratic forms gives the additional condition (ii). The names BRC are not in alphabetical order because BR proved it in 1949 for the special case $\lambda=1$ of square Steiner systems (= projective planes of order $q=k-1$), and Chowla generalized in 1950 to arbitrary $\lambda$. That condition can in turn be checked “locally” modulo powers of primes dividing $k-\lambda$ and $\lambda$; for instance if $\lambda=1$, so $(v,k) = (q^2+q+1, q+1)$, we find that $(-1)^{(v-1)/2} = (-1)^{(q^2+q)/2}$ is 1 if $q$ is 0 or $3 \bmod 4$, and $-1$ if q is 1 or $2 \bmod 4$; in the former case the condition is automatically satisfied (let $x=0, y=z$) and in the latter we ask that $q = (y/z)^2 + (x/z)^2$, which by Fermat is equivalent to $q$ = sum of two integer squares and can be checked from the prime factorization of $q$. This condition is necessary but not sufficient ($q=10$, which may be still the only known case where BRC allows parameters that have been proved impossible).


We can re-prove Fisher without linear algebra as a special case of:

Thm. 1.20 (p.6): let $B$ be a block of a 2-($v,k,\lambda)$ design. Then there are at least $k(r-1)^2 / ((k-1)(\lambda-1) + (r-1))$ blocks intersecting $B$ nontrivially [other than $B$ itself]; equality ⇔ every block other than $B$ meets $B$ in a constant number of points, in which case that constant is $1 + (k-1)(\lambda-1)/(r-1)$.

[Note that for a square design, $k=r$ simplifies that constant to $\lambda$ as expected. To recover Fisher, show that the lower bound of Thm. 1.20 is at least $b-1$ using (1.8) $bk=vr$ and (1.9) $r(k-1)=(v-1)\lambda$, see top of page 7.]

Happily it’s not the specific formula that we’ll use (we won’t cover chapter 7) — you certainly need not memorize that result for the exam! — but you should know the technique (called the “variance trick” by the textbook’s authors): obtain the first few “moments” $\sum 1, \sum i, \sum i^2$ (usually by some kind of double counting) to get at the mean and variance, which must be positive (in other words, apply Cauchy-Schwarz: the average of $i^2$ is at least the square of the average of $i$); since we have only a 2-design we can get only the zeroth, first, and second moment, but in other contexts one may be able to calculate an exploit higher moments too.

Proof of Thm. 1.20: For a block $B'$ other than $B$, let $i(B') = \#(B \cap B')$. Let $d \leq b-1$ be the number of $B'$ for which $i(B') \gt 0$, so the intersection is nonempty. We’ll find the sums of $1, i, i^2$ over those $B'$ to get the mean and variance. (The book lets $n_i$ be the number of solutions of $i(B)=i$, and finds the sum over $i$ of $n_i$, $i n_i$, and $(i^2-i)n_i$; that’s the same as summing $1, i, i^2-i$, and thus yields the sum of $1, i, i^2$ respectively.)

Thus the sum of $i^2$ is the sum of $i^2-i$ plus the sum of $i$, which is $$ (k^2-k)(\lambda-1) + k(r-1) = k ((k-1)(\lambda-1) + (r-1)). $$ Now Cauchy-Schwarz says $$ \left(\sum 1\right) \left(\sum i^2\right) \geq \left(\sum i\right)^2, $$ with equality iff all i are equal. Therefore $d = sum(1) \geq \left(\sum i\right)^2 / \left(\sum i^2\right)$, which is exactly what we claimed; in the case of equality, the constant value of $i$ is $$ \frac{\sum i^2}{\sum i} = k \frac{(k-1)(\lambda-1) + (r-1)}{k (r-1)}, $$ which is exactly $1 + \frac{(k-1)(\lambda-1)}{r-1}$ as claimed. QED


Duality and polarity of a square design (1.18, 1.19 on page 6): If $\mathfrak D$ is a square design that is isomorphic with the dual design $\mathfrak D^{\sf T}$, then we can call $\mathfrak D$ self-dual, and any such isomorphism is called a duality of $\mathfrak D$; explicitly, it consists of bijections $\sigma: X \to {\mathfrak B}, \tau: {\mathfrak B} \to X$ such that $x \in B$ iff $\tau(B) \in \sigma(x)$. [NB the book uses exponential notation $B^\tau, x^\sigma$, which switches the order of composition of bijections; also there’s a typo in the first display of p.6: each $\mathfrak B$ (the collection of blocks) should be plain $B$ (a single block of $\mathfrak B$).] In terms of the incidence matrix $M$, the maps $\sigma,\tau$ switch rows and columns; alternatively, they are a permutation of the rows and columns that takes $M$ to $M^{\sf T}$. Composing two dualities $(\sigma,\tau)$ and $(\sigma',\tau')$ yields an automorphism $(\tau' \sigma, \sigma' \tau)$ of ${\mathfrak D} = (X,{\mathfrak B})$; also, a duality can be composed from either side with an automorphism of $\mathfrak D$ to yield another duality. So the dualities of a self-dual design $\mathfrak D$ extend ${\rm Aut}({\mathfrak D})$ to a bigger group containing ${\rm Aut}({\mathfrak D})$ with index 2.

A duality that’s an involution in this group, i.e. for which $\tau\sigma$ is the identity permutation of $X$ (and thus $\sigma\tau$ is the identity permutation of $\mathfrak B$), is called a polarity. In terms of $M$, there’s a polarity iff we can permute the rows (and columns) to get a symmetric matrix — this makes the $i$-th block the image under $\sigma$ of the $i$-th point for each $i$. For example, $\Pi_2$ is polar:

   1 1 0 1 0 0 0
   1 0 1 0 0 0 1
   0 1 0 0 0 1 1
   1 0 0 0 1 1 0
   0 0 0 1 1 0 1
   0 0 1 1 0 1 0
   0 1 1 0 1 0 0

Important examples of designs I: projective planes and higher-dimensional projective spaces

Usually when we introduce a new kind of mathematical structure we give a selection of important examples, and transformations that construct new examples from known ones. But for $t$-designs, so far we’ve given several necessary conditions on the parameters but precious few examples with $t \gt 1$ — only $\Pi_2$, a couple of similar designs relegated to the first problem set, and the 3-(8,4,1) design constructed from $\Pi_2$ — and no transformations except for the facts that a $t$-design is also an $s$-design for each $s\lt t$, and the dual of a square 2-design is a square 2-design, which have yet to give us any new examples. Today we remedy this by introducing the first of several batches of important examples of 2-designs: algebraic projective spaces, including notably projective planes; and square designs coming from Hadamard matrices.

While there’s (still?) only one example is known of parameters for a square 2-design that pass the BRC condition (1.21, p.7) but do not arise for any design, there’s also (still!?) no value of $\lambda$ known to arise infinitely often — except for $\lambda;=1$. A square 2-design with $\lambda=1$ (i.e. a square 2-design that’s also a Steiner 2-system)projective plane, and its blocks are called lines, because (as with the prototypical example of $\Pi_2$) any two of its points determine a unique line, and any two lines meet in a unique point. In a square 2-$(t,k,1)$ design let $k=n+1$, so $t=n^2+n+1$, either using $r(k-1)=(v-1)\lambda$ or directly because a point determines $n+1$ lines each of which contains $n$ further points and no two of these $n(n+1)$ points coincide [that’;s actually a special case of our argument for $r(k-1)=(v-1)\lambda$]. We then say that the projective plane has order n; e.g. $\Pi_2$ has order 2. (Not order $n+1$ as might be expected; we already saw that $n=r-\lambda$ is an important parameter in Fisher’s inequality (where it was the coefficient of $I$ in $M^{\sf T} M = (r-\lambda) I + \lambda J)$ and in the BRC theorem, and this is even more decisively the case for projective planes.

Now $n=1$ is possible but yields the complete 2-(3,2,1) design. For each of $n=2,3,4,5$ we shall show that there is a unique projective plane of order $n$ and determine its automorphism group. These will all be special cases of the following construction, and applicable whenever $n$ is a prime power, producing a projective plane with more than $n^8$ automorphisms. Warning: for some such n it is known that there are non-isomorphic projective planes of order $n$. (It seems that the first example is for $n=9$, when there are four such planes [Lam, Kolesova, and Theil 1988, computer-assisted].) It is a famous open question whether there is a finite projective plane whose order is not a prime power; the first open case is $n=12$ (BRC disposes of $6$ but not $10$).

Construction of $\Pi_2$ that exhibits all its automorphisms. We claimed that there are 168 automorphisms, forming a group isomorphic with ${\rm GL}_3({\bf Z}/2{\bf Z})$. [Recall the formula $\prod_{i=0}^{n-1} (q^n-q^i)$ for the size of ${\rm GL}_n(k)$ for a finite field $k$ of $q$ elements; here that gives $(8-1)(8-2)(8-4)$ which indeed is $7 \times 6 \times 4 = 168$.] So, let $k$ be the 2-element field ${\bf Z}/2{\bf Z}$, and let $V$ be a 3-dimensional space over $k$. Then $V$ has $2^3 = 8$ vectors, including zero. So, the set $X$ of points of&nbps;$\Pi_2$ will be the 7 nonzero vectors, and the lines will be triples $\{x,y,z\}$ of vectors with $x+y+z = 0$. Check by explicit identification that this is equivalent with our initial picture of $\Pi_2$. Because the definition uses only the vector space structure, any automorphism of $V$ yields the same structure, and the group of these automorphisms is ${\rm GL}_3({\bf Z}/2{\bf Z})$.

Proof of uniqueness of the 2-(7,3,1) design: as is often the case with highly symmetric combinatorial structures $S$, we can identify any other structure $S'$ having the same parameters with $S$ even if we specify some parts $p,p'$ of $S,S'$ respectively and require that the identification take $p'$ to $p$. Taking $S'=S$, this then shows that the automorphism group ${\rm Aut}(S)$ “acts transitively on the $p$’s”, and lets us count the automorphisms. In particular, if $p$ pins down $S$ completely then the number of $p$’s equals the number of automorphisms, and then ${\rm Aut}(S)$ “acts simply transitively” on thef $p$’s. If we already have a group $G$ acting on $S$, we can then check whether $G={\rm Aut}(S)$ by comparing cardinalities.

Here we argue as follows. Let $\mathfrak D$ be any 2-(7,3,1) design. We’ll show $\mathfrak D$ is isomorphic to $\Pi_2$. Fix a point $P$ of $\mathfrak D$, an ordering $(l_1,l_2,l_3)$ of the three lines through $P$, and one of the other lines $l'$ of $\mathfrak D$. This can be done in $7 \times 3! \times 4 = 168$ ways. Map them arbitrarily to corresponding elements of $\Pi_2$. This tells us the images of all seven points: there’s $P$ itself; the points $P_1,P_2,P_3$ where $l'$ meets $l_1,l_2,l_3$ respectively; and the points $Q_1,Q_2,Q_3$ of $l_1,l_2,l_3$ which are neither $p$ nor on $l'$. We’ve also found four of the lines, and readily check that the others are $\{P_1,Q_2,Q_3\}, \{Q_1,P_2,Q_3\}, \{Q_1,Q_2,P_3\}.$ This gives an isomorphism of $\mathfrak D$ with $\Pi_2$ (and a construction of $\Pi_2$ if we don’t know one already) and shows that $\#({\rm Aut}(\Pi_2)) = 168$. Moreover we already know that $\Pi_2$ has automorphisms by ${\rm GL}_3({\bf Z}/2{\bf Z})$ [the group of invertible $3 \times 3$ matrices mod 2 — “GL” stands for “general linear”]. Thus these are all the automorphisms, and we’re done.


Feb. 25 – intro to strongly regular graphs (chapter 2)

Def. 2.1 (p.29): Graph: $G = (V,E)$ where $V$ is a finite set of “vertices” (singular = “vertex”) and $E$ is a collection of two-element subsets of $V$ (“edges”). Equivalently, a finite set $V$ with a relation that is symmetric and never reflexive. So we’re dealing only with undirected graphs – edge $\{v,v'\}$ is the same as $\{v',v\}$ – without loops (there’s no “$\{v,v\}$”) or multiple edges (each $\{v,v'\}$ can appear at most once). We say $v$ is adjacent to $v'$, or a neighbor of $v'$, in the graph iff $\{v,v'\}$ is an edge; the latter term suggests: the neighborhood of $v$ is the set of all vertices $v'$ adjacent to $v'$. We denote the neighborhood of $v$ by $G(v)$. Also we can say $v$ is “incident” with an edge $e$ iff $v$ one of its two vertices.

[Isomorphism and automorphism of graphs are defined as expected: isomorphism $G=(V,E) \to G'=(V',E')$ is a bijection $V \to V'$ that induces a bijection $E \to E'$; if $G=G'$ this is an automorphism, and the automorphisms of $G$ form a group.]

So a graph is at least a 0-$(n,2,\#E)$ design, and never a 2-$(n,2,\lambda)$ design except in the trivial cases of the null graph ($E = \emptyset$, $\lambda=0$) or the complete graph (in which $E$ consists of all 2-element subsets and $\lambda=1$). Graphs that are 1-$(n,2,d)$ designs are called regular of degree $d$. (In general the degree of a vertex $v$ of $G$ is the number of edges containing it; the text calls this the “valency” (p.30), which makes some sense if you think about chemistry and forget double (and triple etc.) bonds. So a graph is regular iff all vertices have the same degree.) But we already know that we don’t expect to do much with 1-designs. How to impose more structure than $t=1$ (too flabby, except for degree 1 or 2 and their complements) but less than $t=2$ (only trivial examples)?

As suggested by the Petersen graph:

Def. 2.4 (p.32) a graph $G$ that is neither null nor complete is strongly regular iff the number of common neighbors of any two vertices $v,v'$ depends only on whether $v$ and $v'$ are equal, adjacent or non-adjacent.

In the first case that number is just $\deg(v)$ so the graph must be regular. So we have two parameters: $n=\#V$ and $k={\rm degree}$. The other two cases give us two more parameters. We say $G = (V,E)$ is strongly regular with parameters $(n,k,\lambda,\mu)$ if:

There are $n$ vertices (i.e. $\#V = n$);
$G$ is regular of degree $k$;
any two adjacent vertices have $\lambda$ common neighbors; and
any two non-adjacent vertices have $\mu$ common neighbors.

[NB: i) excluding null or complete graphs relieves us from the conundrum of an ill-defined $\lambda$ or $\mu$ respectively.
ii) The book uses $\Gamma$, and you’re welcome to do so too, but $G$ is easier to type or TeX (though no easier to handwrite).
iii) I use American spellings but you won’t be penalized for using the text’s British spellings such as "neighbours" ;-) ]

Examples:

The complement $\bar G$ of a graph $G = (V,E)$ is $(V,\bar{E})$ where $\bar E$ is the set of all two-element subsets of $V$ not in $E$. (Same as the complement of a $t$-design.) $G$ is regular of degree $k$ iff $\bar G$ is regular of degree $\#V-k-1$. $G$ is strongly regular with parameters $(n,k,\lambda,\mu)$ iff $\bar G$ is strongly regular with parameters $(n, n-k-1, \bar\lambda, \bar\mu)$ with $$ \bar\lambda = n-2 + \mu - 2k, \quad \bar\mu = n + \lambda - 2k $$ (Prop. 2.7 (p.33), using Inclusion-Exclusion; sanity check: $\bar{\bar\lambda} = \lambda$ and $\bar{\bar\mu} = \mu$ — be sure to change $k$ to $n-k-1$ when doing this. As the text notes, since $\bar\lambda$ and $\bar\mu$ are nonnegative we must have $n \geq 2k-\lambda, n \geq 2k - \mu + 2$.)

(2.8) i) The “triangular graph” $T(m)$ has ${m \choose 2} = (m^2-m)/2$ vertices, indexed by 2-element subsets of an $m$-element set $S$; two are adjacent iff they have an element in common. (So the complement has two pairs adjacent if disjoint.) Parameters: $((m^2-m)/2, 2m-4, m-2, 4)$ [why?]. We must take $m \geq 4$, else the graph is complete (and also null if $m \lt 3$…).
$T(4)$ is the octahedron, complement of $3.K_2$: 12–34 13–24 14–23.
$T(5)$ is the complement of the Petersen graph, as you can check from its labeling in the introductory handout.
These examples may be misleading: for $m$ large enough, $T(m)$ has smaller degree than its complement (how large is “large enough”?). See the second picture on p.35 (Fig. 2.3) for $T(9)$, and note the convention (p.34) for drawing this graph so the picture doesn’t become an ugly clutter.
Automorphism group contains the symmetric group $S_m$, and usually equals $S_m$, though with a couple of exceptions – trivial for $m=2$, a bit more interesting for $m=4$. We’ll use this to study the structure of $S_m$ later in the course.

ii) “Square lattice graph” $L_2(m)$: vertices = $S \times S$ with $\#S = m$; two vertices are adjacent iff they have a common coordinate; parameters $(m^2, 2m-2, m-2, 2)$. Why doesn’t $S \times T$ work if $\#S \neq \#T$? Must have $m \gt 1$; $L_2(2)$ is a 4-cycle and the complement of $2.K_2$ (the only cycles that are strongly regular with our definitions are the 4- and 5-cycle); for $L_2(3)$ see Figure 2.2 (p.34); this graph is its own complement (Problem 5ii = page 45). Automorphism group? It contains (and we’ll soon easily show it equals) the wreath product $S_m\ {\rm wr}\ S_2$, with $2 m!^2$ elements.

iii) (2.9, p.34-35) for $r \gt 1$ and $m \gt 1$ the disjoint union $r.K_m$ of $r$ complete graphs of order $m$ is strongly regular with parameters $(rm, m-1, m-2, 0)$ — the only strongly regular graphs with $\mu = 0$. See page 34-35 for some further quaint terminology involving some of these graphs and their complements.

iv) (2.10, p.35): what happens to Paley’s construction of Hadamard matrices when $q \equiv 1 \bmod 4$?) We get the Paley graph $P(q)$ with vertices $V = k = {\bf F}_q$ and $x,y$ adjacent iff $x-y$ is a nonzero square. Here $-1$ is a square, so this relation is indeed symmetric, because $y-x = -(x-y)$. It is strongly regular with parameters $(q, (q-1)/2, (q-5)/4, (q-1)/4)$. Proof is analogous to what we did for Paley-Hadamard. First few examples:

(*) That is, every four-element subset of $V$ includes two adjacent points and also two disjoint points; see PS5 problem 5. According to the Wikipedia entry, $P(101)$ is the largest known graph with the corresponding property for six-element subsets.

As with 2-designs, we aim to describe, at least in some cases, which parameters $(n,k,\lambda,\mu)$ are feasible, and to describe the graphs and automorphisms for the feasible cases. We start with an easy algebraic condition:

Proposition 2.6 (p.33): If a strongly regular graph has parameters $(n,k,\lambda,\mu)$ then $$ k (k-\lambda-1) = (n-k-1) \mu. $$ Proof: Naturally we double count. Fix $x \in V$ and count edges $\{y,z\}$ where $y$ is a neighbor of $x$ but $z$ isn’t (nor does $z=x$). There are $k$ choices for $y$; each of them has $k$ neighbors, of which one is $x$ and $\lambda$ are the common neighbors of $x$ and $y$, so there are $k-\lambda-1$ choices for $z$. On the other hand there are $n-k-1$ choices for $z$, and for each of them there are $\mu$ choices for $y$, QED.

You should verify that the parameters of $\bar G$ satisfy this necessary condition iff those of $G$ do, and that the parameters of the graphs we’ve seen so far (triangular, square lattice, $r.K_m$, and Paley) all satisfy $k (k-\lambda-1) = (n-k-1) \mu$ as well.

Feb. 27 – the adjacency matrix and the integrality condition (p.36-38)

Let $G=(V,E)$ be a graph with n vertices. Choose a labeling $v_1,...,v_n$ of the vertices. A basic construction for much of graph theory: The adjacency matrix $A=A(G)$ is the matrix whose $(i,j)$ entry $a_{i,j}$ is the number of edges from $v_i$ to $v_j$. As with the incidence matrix of a design, this will turn out to be much more than mere bookkeeping.

Strictly speaking $A$ depends not just on $G$ but on the labeling, but different labelings yield similar matrices $P^{-1} \! A P$ for some permutation matrix $P$ (more on this below). Also:

Since we use $\{1,2,...,n\}$ to index both vertices and rows/columns, it’s useful to think of the $(i,j)$ entry $a_{i,j}$ as being really the $(x,y)$ entry where $x = v_i$, $y = v_j$. This makes it canonical: $A$ is really a linear operator (self-transformation) on a vector space ${\bf R}^V$ with an orthogonal basis $\{e_x : x \in V\}$, and it takes $e_x$ to $\sum_{y \in G(x)} e_y$, that is, to the sum of $e_y$ over $y$ in the neighborhood of $x$ — morally speaking it takes $x$ to $G(y)$. For example, the degree of $x$ is the inner product of $Ae_x$ with the all-1’s vector ${\bf j}$, which (since $A$ is symmetric) is also the inner product $\langle e_x, A{\bf j}\rangle$. A graph is regular of degree $k$ iff $A{\bf j} = k {\bf j}$, i.e. iff $\bf j$ is a $k$-eigenvector of $A$. The adjacency matrix of the complementary graph $\bar G$ is $J-I-A$ (where $J$ is the all-1\rsquo;s square matrix of order $n$, as in Chapter 1).

So what does $A^2$ do? Well, $A e_x$ to the sum of $e_y$ over neighbors $y$ of $x$, so $A^2 e_x$ is the sum of $A e_y$ over those $y$. Writing $A e_y = \sum_{z \in G(y)} e_z$, we see that the $(x,z)$ entry of $A^2$ is the number of paths $(x,y,z)$ of length $2$ (note that $x=z$ is allowed). Likewise the $(x,x')$ entry of $A^3$ is the number of length-3 paths $(x,y,z,x')$ (again with repeats and backtracks allowed), etc. For that matter the $(x,y)$ entry of $A=A^1$ is the number of length-1 paths, i.e. the number of edges $\{x,y\}$; and the $(x,y)$ entry of $A^0$ is 1 if $x=y$ and 0 else, so it does behave like the number of “length-0 paths”… For an undirected graph, the length-$m$ paths from $x$ to $x'$ are in bijection with the length-$m$ paths from $x'$ to $x$ (just retrace the path backwards), so $A^m$ is symmetric, as it should be since $A$ is.

Now we know from linear algebra that we can analyze $A^k$ via the “spectrum” (i.e. eigenvalues) and eigenspaces of $A$. This is one reason why the adjacency matrix and its spectrum constitute such an important graph invariant. [Note that the spectrum really is a graph invariant: reindexing the vertices changes $A$ to a similar matrix but does not change the spectrum; equivalently, it’s just a different indexing of the coordinates for the linear operator on ${\bf R}^V$.] A further example: the diagonal entries of $A^m$ are the number of closed paths $(x_1,x_2,...,x_m)$ of length $m$ going through a given vertex $x_1$; thus the sum is the number of closed paths with no restriction on the $x_m$ (and with a choice of initial point $x_1$ and direction, e.g. each pentagon counts $10$ times). But the sum of the diagonal entries of a matrix is its trace, and the trace of $A^m$ is the sum of the $m$-th powers of its eigenvalues counted with multiplicity.

This and the integrality condition below (Theorem 2.16) illustrate spectral graph theory, an approach often used in both theoretical and applied graph theory. There’s also a remarkably fruitful analogy with the spectrum of the Laplacian in analysis (the motivation is that the Laplacian can be visualized as taking $x$ to $a(x)-x$, where $a(x)$ is the average of an infinitesimal neighborhood of $x$); this connection has been productive in both directions, with ideas and results in graph theory suggesting analogues in analysis — some proved, some open — and vice versa.

===

Back to the adjacency matrix of a graph and its spectrum. It’s particularly nice when $G$ is a strongly regular graph. We already saw that $G$ is regular of degree $k$ iff $A {\bf j} = k {\bf j}$; as happened in Chapter 1, this is tantamount to $AJ=kJ$, and since here $A$ and $J$ are symmetric we deduce

(p.37, 2.14) $A J = J A = k J$.

If $G$ is strongly regular then we also get to describe $A^2$: its $(x,z)$ entry is the number of paths $(x,y,z)$, which is just the number of common neighbors $y$ of $x$ and $z$. nd we already know what that is:

This means
(p.37, 2.13) $A^2 = k I + \lambda A + \mu (J-I-A)$.

Conversely a graph is strongly regular iff it satisfies (2.13) and (2.14) (and is neither complete nor null). Applying (2.13) to $\bf j$ yields

$k^2 = k + \lambda k + \mu (n-k-1)$, i.e. $k (k-\lambda-1) = \mu (n-k-1)$

as seen already (2.6). Since $\bf j$ is an eigenvector and $A$ is symmetric, $A$ acts on the orthogonal complement of $\bf j$ (recall the usual argument: if $\langle u,{\bf j}\rangle = 0$ then $\langle Au,{\bf j}\rangle = \langle u,A{\bf j}\rangle = 0$ because $A{\bf j}$ is a multiple of $\bf j$). This orthogonal complement is sometimes denoted ${\bf R}^V_0$: it’s the space of vectors in ${\bf R}^V$ whose coordinates sum to zero.

What could be the eigenvalues of this operator on ${\bf R}^V_0$? Since the letter $\lambda$ is already in use, let $\rho$ be an eigenvalue, and $u$ a (nonzero) eigenvector, so $Au = \rho u$. Since ${\bf R}^V_0 = \ker J$ we have $Ju=0$, so (2.13) yields $$ \rho^2 u = (k + \lambda\rho - \mu(\rho+1)) \, u $$ and since $u$ is nonzero this is equivalent to the quadratic equation $$ \rho^2 - (\lambda-\mu) \rho + (\mu-k) = 0. $$ let $r$ and $s$ be the roots. Note that they are always real — as they must be for a self-adjoint transformation — because $r s = \mu - k \lt 0$. Moreover they’re distinct, else $\lambda = \mu = 0$ and we have a null graph. So we may, and always do without loss of generality, order them so $r \gt s$: $$ r = \frac12(\lambda - \mu + \sqrt{D}), \quad s = \frac12(\lambda - \mu - \sqrt{D}) $$ where $D = (\lambda - \mu)^2 + 4 (k - \mu)$, the discriminant of the quadratic equation above.

Let $f$ and $g$ be the multiplicities. We compute them by solving linear equations: since ${\bf R}^V_0$ is the direct sum of the $r$- and $s$-eigenspaces, we compare dimensions to get $f + g = n - 1$; and since $A$ has trace zero, $fr + gs + k = 0$ (remember that $A$ also has the 1-dimensional $k$-eigenspace spanned by $\bf j$). These equations are independent because $r$ and $s$ are distinct, and we find an elementary if somewhat offputting formula (bottom of page 37): $$ f, g = \frac12 \Bigl( n - 1 \pm \frac{(n-1)(\mu-\lambda) - 2k}{\sqrt{D}} \!\Bigr) $$ with $D$ as above. This we have:

Theorem (p.37, 2.16 = integrality condition) $f$ and $g$ are nonnegative integers.

[Check that this is the case in each of our arsenal of examples so far. How are the eigenvalues $r,s$ and multiplicities $f,g$ of the complementary graph related with those of $G$?]

It almost follows that $D$ is a square; that is already a strong condition, and already implies that $f$ and $g$ are rational — which is why Theorem 2.16 is sometimes called the rationality condition. Why “almost”? Because it could be that the numerator vanishes, i.e. that $(n-1)(\mu-\lambda) = 2k$ ! This can indeed happen, and already is the case for the pentagon with $(n,k,\lambda,\mu)=(5,2,0,1)$ (and indeed for all Paley graphs, of which the pentagon is the first example). In that case we say $G$ is of Type I. We then have $$ n = 1 + \frac{2k}{\mu-\lambda}, $$ but also (since $G$ is not complete) $$ n \gt 1 + k, $$ so the denominator $\mu - \lambda$ is positive but less than 2. Hence $\lambda = \mu - 1$ and $n = 2k+1$; the condition $k (k-\lambda-1) = \mu (n-k-1)$ then yields $k = 2\mu$ so

Type I ⇔ $(n,k,l,m) = (4\mu + 1, 2\mu, \mu - 1, \mu)$

(and $D = n = 4\mu+1$,   $f=g=2\mu$), as with Paley. Similar to BRC (Bruck-Ryser-Chowla, Thm. 1.21 on p.7), we then have (but don’t prove) a further condition, here due to Van Lint and Seidel 1966, that forces $n=4\mu+1$ to be a sum of two squares — so for $n \lt 45$ we get $5, 9, 13, 17, 25, 29, 37, 41$, all prime powers so there are Paley graphs, while 21 and 33 are excluded; but 45 is allowed, and — unlike the situation with finite projective planes — is known to be possible (Mathon 1975).

If not Type I then D is a square; such graphs are said to be of Type II. These are not mutually exclusive: already $L_2(3) = P(9)$ has both properties (in general Type I is also Type II iff $n$ is a square).

March 4 – Moore graphs; two more conditions on $(n,k,\lambda,\mu)$: “absolute”and Krein (pages 41-42)

The textbook motivates strongly regular graphs with a classification theorem (2.3, p.30ff.) that leads to strongly regular graphs with parameters $(6u-3,2u,1,u)$ (Prop. 25 on page 33). We highlight a different special case, suggested by the Petersen graph already introduced in the first lecture. In effect we work out Exercise 10 (page 46).

Moore graphs: Suppose $G$ is a regular graph of degree $k$ on $n$ vertices. Then:

(“girth” = minimal cycle length; a graph with no cycles [a.k.a. a “forest” = disjoint union of trees] has infinite girth.)

In the case of equality we say $G$ is a Moore graph of girth 5. Equivalently, a Moore graph of degree 5 s a strongly regular graph with parameters $(k^2+1, k, 0, 1)$..

For higher odd girths $g$, see Exercise 9 on page 45; but it turns out that past $g=5$ the only such graph is a $g$-cycle, as can be shown by generalizing our techniques to study the spectrum of the adjacency matrix of such a graph In PS4 you will find a similar analysis of minimal graphs of degree $d$ and girth 6. For other values of the girth, see e.g. Bollobás’s book Extremal Graph Theory.

We now apply the integrality condition. Type I iff $k=2$; otherwise $D = (0-1)^2 + 4(k-1) = 4k-3$ is a square, necessarily odd so $4k-3=(2m+1)^2$ and $k=m^2+m+1$ for some $m \gt 1$. $m=1$ and $m=2$ yield $k=3$ (Petersen) and $k=7$ (Hoffman-Singleton) respectively.

Any $m$ is allowed by rationality, but integrality limits us to $m=1,2,7$ so $k=3, 7$, or $57$. Here the numerator $n - 1 + ((n-1)(\mu-\lambda) - 2k)$ is $k^2 - 2k = (m^2+m+1)(m^2+m-1)$, while $\sqrt{D} = 2m+1$, so we need $$ 2m + 1 \, \big| \, (m^2+m+1)(m^2+m-1). $$ We’ve already seen the technique: write the RHS as a multiple of the LHS plus a remainder, which is a constant of which $2m+1$ must be a factor. Trouble is the remainder here is $-15/16$ (the value at the root $m=-1/2$. OK, but if we multiply the RHS by 16 then it’s still a multiple, and then $$ 16(m^2+m+1)(m^2+m-1) = (8m^3+12m^2+2m-1) (2m+1) - 15 $$
so $15 \mid 2m+1$, whence $2m+1 \in \{1, 3, 5, 15\}$, so $m$ = [0,] 1, 2, or 7 as claimed. We can check that in each case the division by 16, and later by 2, introduces no denominators — indeed the division by 16 had to work because $2m+1$ is odd — so the integrality criterion allows any of 2, 3, 7, 57; as noted already the first three are known to arise, each uniquely up to automorphisms, and the last is a famous open problem.


Two more conditions on $(n,k,\lambda,\mu)$. The first arises from the observation that we can project the $n$ standard unit vectors to the $r$- and $s$-eigenspaces, obtaining a configuration of vectors of the same length with only two different inner products (other than the squared length), depending on whether the corresponding vertices are adjacent or disjoint. (The text outlines a different construction.) Indeed if we write each $e_x$ as $(1/n) {\bf j} + u_x + v_x$ with $u_x,v_x$ in the $r$- and $s$-eigenspace respectively then we have $$ 1 = \langle e_x,e_x \rangle = 1/n + \langle u_x,u_x \rangle + \langle v_x,v_x \rangle, $$ $$ 0 = \langle e_x,Ae_x \rangle = k/n + r\langle u_x,u_x \rangle + s\langle v_x,v_x \rangle $$
and these simultaneous linear equations in $\langle u_x,u_x \rangle$ and $\langle v_x,v_x \rangle$ have unique solutions; likewise for distinct $x,y$ $$ 0 = \langle e_x,e_y\rangle = 1/n + \langle u_x,u_y\rangle + \langle v_x,v_y\rangle, $$ $$ a_{x,y} = \langle e_x,Ae_y\rangle = k/n + r\langle u_x,u_y\rangle + s\langle v_x,v_y\rangle $$
where the entry $a_{x,y}$ of $A$ is 1 or 0 according as $x,y$ are adjacent or disjoint), and this determines $\langle u_x,u_y \rangle$ and $\langle v_x,v_y \rangle$ in terms of $a_{x,y}$.

Theorem (2.23+, p.14) [Delsarte-Goethals-Seidel 1977]: if there is a set $S$ of unit vectors in some inner product space ${\bf R}^f$ for which $\langle v,v' \rangle$ ($v,v'$ distinct) takes on only two distinct values then $\#S \leq f(f+3)/2.$

Proof: Let $\beta,\gamma$ the two values of $\langle v,v'\rangle$; necessarily $\beta,\gamma\lt 1$. Let $f_v$ be the quadratic polynomial functions on ${\bf R}^f$: $$ f_v(x) = (\langle v,x\rangle - \beta) (\langle v,x\rangle - \gamma) + \beta \gamma (\langle x,x\rangle - 1). $$
Thanks to the last term they live in a space of dimension $f(f+3)/2$ (quadratic polynomials in $f$ variables form a vector space of dimension $1 + f + (f^2+f)/2 = (f+1)(f+2)/2$ but the constant term vanishes). But evaluation on $v' \in S$ shows that they are linearly dependent. Hence their number $\#S$ is at most $f(f+3)/2$ as claimed. QED

Similarly, for m distinct inner products there’s an upper bound on $\#S$ that grows as $f^m/m!$. The best bound of this form requires some finesse with adjustments analogous to “$\! {} + \beta \gamma (\langle x,x \rangle - 1)$” but more involved; we shall say no more about it here.

Yet another variant: suppose we have a set $S$ of $n$ pairs $\{v,-v\}$ of unit vectors in ${\bf R}^f$, with only one pair $\{c,-c\}$ of inner products other than 1 and $-1$. Then $f_v(x) = \langle v,x \rangle^2 - c^2 + c^2 (\langle x,x \rangle - 1)$ is an even quadratic polynomial without constant term (and thus homogeneous) that vanishes on all of $S$ except $\{v,-v\$}, so the number of such pairs is at most the dimension $(f^2+f)/2$ of the space of homogeneous quadratic polynomials in $f$ variables. Here there are more examples of configurations attaining the bound, starting with the $3$ opposite pairs of vertices of a regular hexagon $(f=2)$ or the 6 of a regular icosahedron (with $f=3$). Back to strongly regular graphs…

Corollary (“absolute bound”): in a strongly regular graph, $n \leq \min(f(f+3)/2, g(g+3)/2)$.

Proof: apply 2.23 to the normalized vectors $u_x / |u_x|$ in ${\bf R}^f$ and $v_x / |v_x|$ in ${\bf R}^g$.

Example: the pentagon gives an example of equality with $f=2$ (it really is a regular pentagon or pentagram in ${\bf R}^2$!). The text gives another example where otherwise plausible parameters are excluded. In general equality is rare because $f$ and $g$ are typically both near their average $(n-1)/2$. I think only two further examples are known: $(f,n)=(6,9)$ [the Schläfli graph or its complement, related with the exceptional $E_6$ root system — coming attraction] and $(22,275)$ [the McLaughlin graph or its complement, related with a sporadic simple group in the Conway/Leech family; that’s too exotic for us, but could make for a good senior thesis topic]. The Krein condition or Krein bound (Thm. 2.26, p.42) exploits:

  1. a formula for the projection $E_r: x \mapsto u_x$ as a linear combination of $\{J,I,A\}$, and
  2. a trick involving Kronecker products.

These are individually of interest but I have no special wisdom to impart on the final result, except that it lets us exclude a few more possible parameters as you’re doing for Exercise 3 on page 45 (= problem 2). For the record the bound says $(r+1) (k+r+2rs) \leq (k+r) (s+1)^2$ and the same with $r$ and $s$ switched.

  1. since we have
    • $Je_x = {\bf j}$,
    • $Ie_x = (1/n) {\bf j} + u_x + v_x$,
    • $Ae_x = (k/n) {\bf j} + r u_x + s v_x$,
    and $r,s$ are distinct, we can recoert the map $E_r: x \mapsto u_x$ as an explicit linear combination of $\{J, I, A\}$, namely $$ E_r = \frac1{s-r} \left( \frac{k-s}{n} J + sI - A\right). $$ In particular the $(v,v')$ entry again depends only on whether $v,v'$ are the same, adjacent, or disjoint. But this matrix is positive semidefinite: $\langle v, E_r v \rangle \geq 0$ for all $v$ (with equality iff $v$ in the span of $\bf j$ and the $s$-eigenspace). Now use:

  2. Lemma 2.25 (p.42): if a matrix $M=(m_{ij})$ is positive semidefinite then so is $M \!\!\phantom{.}_{\phantom.^\bigcirc} M = (m_{ij}^2)$.

    Proof: $M \!\!\phantom{.}_{\phantom.^\bigcirc} M$ is symmetric, and the self-adjoint linear transformation associated to it is the restriction to a coordinate subspace of the tensor square $M \!\!\phantom{.}_{\phantom.^\bigcirc} M$ of the transformation $M$, QED.

Now write $M \!\!\phantom{.}_{\phantom.^\bigcirc} M$ as a linear combination of $\{J, I, A\}$ and compute its eigenvalues (it acts on each eigenspace of $A$ by scalar multiplication), obtaining the Krein bound from the condition that these eigenvalues be nonnegative.

Hint / Word to the wise: the decomposition $x = (1/n) {\bf j} + u_x + v_x$ and the associated inner-product computations have other uses; e.g. try the sum of $u_x$ or $v_x$ over $x$ in a (co-)clique.



March 6: Groups plan; uniqueness and automorphism group of $\Pi_2$ (again) and $\Pi_3$

The second part of this course focuses on (usually finite) groups that can be studied using some of the designs or strongly regular graphs we’ve encountered and/or can shed light on them. The groups we will focus on are:

In particular we’ll prove the simplicity of $A_n$ ($n\gt4$) and ${\rm PSL}_n(k)$ ($n\gt2$, or $n=2$ and $\#k\gt3)$, explaining how simplicity fails in small cases like $A_4$ and ${\rm PSL_2(F_3)}$, and also prove the simplicity of each $M_n$ (which will lead us to the corresponding Steiner systems); and use some of our designs and graphs to explain some other remarkable behavior like coincidences among some of these groups (listed below) and the automorphism group of $S_6$ (which, uniquely among all $S_n$, is larger than its group of inner automorphisms). We’ll temporarily leave our textbook, relying on my text and TeX notes and some other sources, except for using some of Chapter 6 (on ${\rm Aut}(S_6)$).

The isomorphisms we’ll address are as follows (listed by increasing group size):
12 $A_4 \cong {\rm PSL_2}({\bf F}_3)$
' 24 $S_4 \cong {\rm PGL_2}({\bf F}_3)$
60 $A_5 \cong {\rm (P)SL_2}({\bf F}_4) \cong {\rm PSL_2}({\bf F}_5)$
' 120 $S_5 \cong {\rm \Sigma L_2}({\bf F}_4) \cong {\rm PGL_2}({\bf F}_5)$
168 ${\rm (P)SL}_3({\bf F}_2) \cong {\rm PSL_2}({\bf F}_7)$
' 336 ${\rm Aut}\bigl({\rm (P)SL}_3({\bf F}_2)\bigr) \cong {\rm PGL_2}({\bf F}_7)$
360 $A_6 \cong {\rm PSL_2}({\bf F}_9)$
20160 $A_8 \cong {\rm (P)SL_4}({\bf F}_2)$

Each ' marks a group $G'$ that is not simple but is ${\rm Aut(G)}$ where $G$ is the simple group in the preceding line, with $[G':G] = 2$. For $G={\rm (P)SL}_3({\bf F}_2)$, an outer automorphism is inverse transpose, corresponding to the action of $G$ on the dual 3-dimensional vector space over ${\bf F}_2$. As we’ll see, ${\rm Aut}(A_6)$ is more complicated, containing $A_6$ with index $4$ rather than $2$; in the ${\bf F}_9$ picture it’s “${\rm P}\Gamma{\rm L}_2({\bf F}_9)$”. The notation “${\rm (P)SL}_n(k)$” means the group is both ${\rm PSL}_n(k)$ and ${\rm SL}_n(k)$, which as we’ll see happens whenever the group of $n$th roots of unity in $k$ (that is, the $n$-torsion group of $k^*$) is trivial, which in turn means $n$ is relatively prime to $\#(k^*) = \#(k)-1$. The full list of exceptional isomorphisms is about thrice as long, but includes orthogonal, unitary, and symplectic groups, which we won’t get to in Math 165. The ATLAS lists them all (though not, I think, in a single place). Wikipedia has an “exceptional isomorphism” entry that does include all of them, and also exceptioal isomorphisms between other mathematical structures.

We’ll introduce our approach by proving the uniqueness up to automorphisms of the projective planes of order at most $5$ and identifying their automorphism groups. In each case we do this by showing that all (hyper)ovals are equivalent, i.e. by showing that a combinatorial projective plane $\Pi$ must have a (hyper)oval $O$ and that $\Pi$ can be reconstructed from $O$.

Today we review $\Pi_2$ and prove the result for $\Pi_3$.

For $\Pi_2$ we have:

Theorem. Let $\Pi$ be a projective plane of order $2$. Then $\Pi$ has $7$ hyperovals, namely the line complements. If $O$ is a hyperoval of $\Pi$, and $O'$ is a hyperoval of some other projective plane $\Pi'$ of order $2$, then any bijection $f: O \to O'$ extends uniquely to an isomorphism $\Pi \to \Pi'$.

Proof: The complement of a line $l$ is a hyperoval because it has the correct size ($7-3 = 4 = 2+2$) and meets every line $l'$ in either 0 or 2 points (the former iff $l'=l$). Conversely any hyperoval $O$ has ${4\choose 2} = 6$ secants, so $7-6=1$ passant line $l$, and since the complement of $l$ has size $4 = |O|$ it must be the same as $O$.

Now given a bijection $f: O \to O'$ we know the images in $\Pi'$ of the four points of $O$ and the six secants; thus the remaining line of $\Pi$, the unique passant of $O$, must go to the unique passant of $O'$. Since we know the images of all lines, the images of all points are determined (since each point is the unique intersection of the 3 lines containing it, or indeed of any 2 of them). This shows that the extension of $f$ to an isomorphism $\Pi\to\Pi'$, if it exists, is unique.

If $O = \{p_1,p_2,p_3,p_4\}$ we then have the following roster of points and lines in $\Pi$: the lines are

and the points are

Moreover, we know the 3 points on each line: $s_{ij}$ contains $s_i$, $s_j$, and an “off $O$” point where $s_{ij}$ intersects another secant; and $l$ contains the three “off $O$” points. These satisfy the axioms of a finite projective plane of order 2, as can be checked either directly or by observing that we already have one example. So we can reconstruct $O'$ from $f(P_1)$, $f(P_2)$, $f(P_3)$, and $f(P_4)$ to complete the isomorphism $f$, QED.

Corollary: i) $\Pi_2$ is the unique projective plane up to isomorphism.
ii) ${\rm Aut}(\Pi_2) = {\rm (P)GL}_3({\bf F}_2) = {\rm (P)SL}_3({\bf F}_2)$, and this group acts transitively on hyperovals of $\Pi_2$.

Proof: (i) is clear: fix a hyperoval $O$ in $\Pi_2$; for any projective plane $\Pi'$ of order $2$ we can choose any of its 7 hyperovals $O'$, and any bijection $f : O \to O'$, and extend $f$ to an isomorphism $\Pi \to \Pi'$.

(ii) Take $\Pi' = \Pi_2$. Then there are 7 choices of $O$ and 4! bijections, whence $\#({\rm Aut}(\Pi_2)) = 4! \cdot 7 = 168$. Since ${\rm Aut}(\Pi_2)$ contains ${\rm (P)GL}_3({\bf F}_2)$, a group of order 168, this must be the full group of automorphisms. By our theorem these include automorphisms taking any oval $O$ to any other oval $O'$. QED

Moreover: Fix a line $l$, let $O$ be its complement, and consider the stabilizer $H$, which is the subgroup of ${\rm Aut}(\Pi_2)$ consisting of automorphisms $g$ such that $g(O)=O$ (equivalently: such that $g(l)=l$). Our Theorem gives an isomorphism from $H$ to the group $S_4$ of permutations of $O$. On the other hand, $H$ acts transitively on $l$; this recovers the group homomorphism $S_4 \to S_3$. Explicitly: if we label the points of $O$ as $p_1, p_2, p_3, p_4$ then each point $p$ of $l$ is on two secants, and thus determines a partition of $O$ into two pairs — and this partition uniquely determines $p$ as the intersection of the corresponding secants. But there are only 3 partitions, so each one arises — and this is tantamount to the usual construction of the homomorphism $S_4 \to S_3$ by the action of $S_4$ on such partitions.

So what happens for $\Pi_3$?

Theorem: Let $\Pi$ be a projective plane of order $3$. Then $\Pi$ has $234$ ovals. If $O$ is a oval of $\Pi$, and $O'$ is a oval of some other projective plane $\Pi'$ of order $3$, then any bijection $f: O \to O'$ extends uniquely to an isomorphism $\Pi \to \Pi'$.

Proof: We count ordered ovals $(p_1,p_2,p_3,p_4)$. There are:

for a total of $13 \cdot 12 \cdot 9 \cdot 4 = 5616$. Since that’s 4! times the number of unordered ovals, the count is $5616 \, / \, 4! = 234$ as claimed.

Now fix an ordered oval $O = {p_1,p_2,p_3,p_4}$. It has four tangents, call them $t_1,t_2,t_3,t_4$, and six secants, call them $s_{i,j}$ for distinct $i,j \in \{1,2,3,4\}$. Given the bijection $f: O \to O'$ we know their images in $\Pi$, call them $$ p_1', p_2', p_3', p_4'; \ t_1', t_2', t_3', t_4'; \ {\rm and} \ s'_{i,j}. $$

There are six pairs of tangents, each determining an intersection point. We claim that these are all distinct. Indeed each point of $\Pi - O$ lies on either $0$ or 2 tangents — not $1$ or $3$ by parity, nor 4 because (aha) $\Pi$ has no hyperovals. So once two tangents meet at a point there are no others. So we’ve located six more points of $\Pi$ (the intersections of tangent pairs), and thus their images in $\Pi'$. The remaining $13-4-6 = 3$ points of $\Pi$ each lie on two secants, so determine a partition of $O$ into two pairs — and as in our analysis of $\Pi_2$ it follows that each of the three partitions arises once. So we’ve determined the images of all 13 points, and this fixes all the line images as well. This proves that $f$, if it exists, is unique.

The remaining 3 lines can be identified as follows. Consider the intersection of $t_1$ and $t_2$. The four lines through it are $t_1$, $t_2$, $s_{34}$, and a passant line $l$. What other points are on $l$? Not the intersection of $t_1$ with $t_3$ or $t_4$ (because we already know where $l$ meets $t_1$), nor the intersection of $t_2$ with $t_3$ or $t_4$ (for the same reason). But there are only three $l$’s to be shared among six such points, so $l$ must also contain the the point $t_3 \cap t_4. Each of the other two points of $l$ lies on two secants. There are 3 such, and we cannot use the intersection of $s_{34}$ and $s_{12}$, because we already know where each of these secants meets $l$. So it must be the other two.

This completes the description of $\Pi_3$ starting from a labeled oval. We could now check directly that it satisfies the axioms of a projective plane, but it is simpler to note that we already have ${\bf P}^2({\bf F}_3)$ so we must have reconstructed it. So we can complete the reconstruction of $f: \Pi \to \Pi'$ knowing $O'$ and the images on $O'$ of $p_1, p_2, p_3, p_4$, QED.

Corollary: i) $\Pi_3$ is the unique projective plane of order 3 up to isomorphism.
ii) ${\rm Aut}(\Pi_3) = {\rm PGL}_3({\bf F}_3) = {\rm PSL}_3({\bf F}_3)$, and this group acts transitively on ovals of $\Pi_3$.

Proof: (i) is clear: fix a hyperoval $O$ in $\Pi_3$; for any projective plane $\Pi'$ of order $3$ we can choose any of its $234$ hyperovals $O'$, and any bijection $f : O \to O'$, and extend $f$ to an isomorphism $\Pi \to \Pi'$.
(ii) Take $\Pi' = \Pi_3$. Then there are $234$ choices of $O$, and $4!$ bijections, whence $\#({\rm Aut}(\Pi_3)) = 13 \cdot 234 = 5616$. But ${\rm Aut}(\Pi_3)$ contains ${\rm PGL}_3({\bf F}_3)$, a group of order $26 \cdot 24 \cdot 18 \, / \, 2 = 5616$. So this must be the full group of automorphisms. By our theorem these include automorphisms taking any oval $O$ to any other oval $O'$. QED

Remark: The fact that every point of $\Pi - O$ is on 0 or 2 tangents (while of course every point of $O$ is on exactly 1 tangent) means that the tangents constitute an oval $O^*$ in the dual projective plane $\Pi'$. The tangents of $O^*$ in turn are the points of $O$, so $O^{**}=O$, as should be the case for a duality. This works for a conic over any field not of characteristic 2 (while we already saw that in characteristic 2 the tangents to an oval all meet at a point).


March 11: Uniqueness and automorphism group of $\Pi_4$, part 1

We just finished showing: for $q=2$ and $q=3$, a projective plane $\Pi$ of order $q$ has respectively $7$ hyperovals or $234$ ovals $O$, so $4! \cdot 7 = 168$ ordered hyperovals or $4! \cdot 234 = 5616$ ordered ovals, and ${\rm Aut}(\Pi)$ acts simply transitively on them; moreover $\Pi$ is unique up to isomorphism: for any other $\Pi'$ and $O'$, any bijection $O \to O'$ extends uniquely to an isomorphism $\Pi \to \Pi'$.

Can we expect this to hold in general? Not if the automorphism group is no larger than ${\rm PGL}_3({\bf F}_q)$: the size of this group is $(q^3-1) (q^3-q) (q^3-q^2) \, / \, (q-1)$ or about $q^8$, which for large $q$ is much smaller than the number $(q+1)!$ or $(q+2)!$ of permutations of an oval, let alone that times the number of ovals. Nevertheless it still works for $q=4$ (with 168 hyperovals), though in a special way. (For $q=5$, you’ll see that it doesn’t quite work, but as many as $120$ of the $6!=720$ possible bijections lift.)

For starters, how many ordered ovals are there? The number of ordered $n$-arcs in a projective plane of order $q$ is:

and for $n=7$ and higher we can’t tell because we don’t know how many coincident triples of secants there are. So this technique cannot work (at least not in this form) past $q=5$.

Also, just how transitively do we expect ${\rm Aut}(\Pi)$ to act? Suppose $\Pi = {\bf P}^2({\bf F}_q)$ and consider ${\rm PGL}_3({\bf F}_q)$. This group acts 2-transitively; not 3-transitively because of collinearity, but that’s the only obstruction even up to $4$ points. Indeed for any $P_1,P_2,P_3,P_4$ no three of which are collinear, and any $Q_1,Q_2,Q_3,Q_4$ no three of which are collinear, there is an automorphism in ${\rm PGL_3}({\bf F}_q)$ taking $P_1$ to $Q_1$, $P_2$ to $Q_2$, $P_3$ to $Q_3$, and $P_4$ to $Q_4$ — and this automorphism is unique! More generally we have (whether or not $k$ is finite)::

Proposition: For any field $k$, the group ${\rm PGL}_n(k)$ acts simply transitively on ordered $(n+1)$-tuples of points in ${\bf P}^{n-1}(k)$ in general linear position.

“Simply transitively” means that for any two such $(n+1)$-tuples $(P_0,P_1,\ldots,P_n)$ and $(Q_0,Q_1,\ldots,Q_n)$ there is a unique group element taking one to the other. “General linear position” means no $3$ points on a line, no $4$ on a plane, …, no $n$ on a hyperplane. This last condition is actually sufficient: if we have $k$ points on a projective subspace of dimension $k-2$ then any $n$ points containing them are on a subspace of dimension $n-2$. Equivalently, general linear position that if we lift $P_0,P_1,\ldots,P_n$ to nonzero vectors $v_0,v_1,\ldots,v_n \in k^n$ then any $n$ of these $n+1$ vectors form a basis for $k^n$.

Proof: It is enough to prove this for one choice of $(P_0,P_1,\ldots,P_n)$. Let $P_0$ be $(1:1:1:\ldots:1)$, and for each $i=1,2,\ldots,n$ let $P_i$ be the (1-dimensional subspace of $k^n$ generated by) the $i$-th unit vector. Let $v_0,\ldots,v_n$ be nonzero vectors in $k^n$ corresponding to $Q_0,\ldots,Q_n$. General linear position $\Leftrightarrow$ any $n$ of these are linearly independent $\Leftrightarrow$ any $n$ of these form a basis. So there’s a linear map $T$ taking each $e_i$ to the corresponding $v_i$. This map is unique, so how do we get $P_0$ to go to $Q_0$? Projectively, $P_1,\ldots,P_n$ go to $Q_1,\ldots,Q_n$ iff each $e_i$ goes to $c_i v_i$ for some nonzero scalar $c_i$, and any choice of $(c_1,\ldots,c_n)$ gives a unique $T$; conversely $T$ determines $(c_1:\ldots:c_n)$ [note the colons: these are projective coordinates]. Now $v_1,\ldots,v_n$ is a basis, so $v_0 = a_1 v_1 + \ldots + a_n v_n$ for some scalars a_1,...,a_n; moreover none of them are zero because then there’d be a linear dependence among $v_0$ and the other $n-1$ vectors $v_i$. So choose $(c_1:\ldots:c_n) = (a_1:\ldots:a_n)$ and we’re done. QED

Note that it should follow that $\#({\rm PGL}_n({\bf F}_q))$ is the number of ordered $(n+1)$-tuples of points in general linear position in ${\bf P}^{n-1}({\bf F}_q)$; this can be checked directly (as we in effect did for $q=n=3$).

OK, so we expect transitivity on $n$-arcs for $n=4$ but not beyond. Yet a hyperoval in $\Pi_4$ has $6$  points. How can this be?

Let’s count: taking q=4 in the above n-arc counts, we find that the number of ordered hyperovals is $$ 21 \cdot 20 \cdot 16 \cdot 9 \cdot 2 \cdot 1 = 6! \cdot 168. $$

in particular, having chosen the first 4 points we have no choice about the last 2 except for their order. We can give explicit coordinates starting from those of our proof above for the first 4 points: each of the remaining two must have nonzero coordinates, all distinct. Scaling the first to 1, there are only two choices: $(1:a:b)$ and $(1:b:a)$, where $a,b$ are the elements of ${\bf F}_4$ other than $0$ and $1$, satisfying $$ ab = a+b = a^3 = b^3 = 1. $$

So we have the hyperoval consisting of the points

$(1 : 0 : 0)$,
$(0 : 1 : 0)$,
$(0 : 0 : 1)$,
$(1 : 1 : 1)$,
$(1 : a : b)$,
$(1 : b : a)$

and now it’s clear that the remaining two points are switched by the automorphism of ${\bf P}^2({\bf F}_4)$ that takes each coordinate to its Galois conjugate: $0,1,a,b$ go to $0,1,b,a$ respectively.

So in fact it is reasonable to expect even for $n=4$ that every bijection of hyperovals lifts to a unique isomorphism of projective planes (provided it’s true that $\Pi_4$ is the unique order-4 projective plane up to automorphism). That is what we shall prove.

Before starting this, recall yet another way that n=4 is special. We know one way of getting hyperovals in an algebraic projective plane ${\bf P}^2(k)$ when $\#k$ is even: use a conic plus its center. If all ordered hyperovals are equivalent then any point can be the center. How can this be? Two conics can meet in at most 4 points (Bézout). But that’s just enough for center-switching to work for $\#(k)=4$. (It also works for $\#(k)=2$, but that’s not as surprising…)

OK: start with any order-4 projective plane $\Pi$ and one of its $6! \cdot 168$ ordered ovals $O = \{p_1,p_2,p_3,p_4,p_5,p_6\}$. This lets us identify $15$ of the lines: the ${6 \choose 2} = 15$ secants $s_{i,j}$. That leaves $15$ points and $6$ passant lines to account for. Each of the $15$ points is on $3$ secants and $2$ passants. This immediately tells us (check this!) that the passant lines form a hyperoval $O^*$ in the dual projective plane $\Pi^*$, whose secants are the $15$ points off $O$. How to label $O^*$? Since we’ll show that any permutation $g$ of $O$ lifts uniquely to ${\rm Aut}(\Pi)$, we have a homomorphism from the permutations of $O$ to those of $O^*$ — and that turns out to be an outer automorphism from $S_6$ to $S_6$ ! We’ll see how this happens next.


Recall: we began with any order-4 projective plane $\Pi$ and one of its $6! \cdot 168$ ordered ovals $O = \{p_1,p_2,p_3,p_4,p_5,p_6\}$. This let us identify 15 of the lines: the 15 secants $s_{i,j}$. That leaves 15 points and 6 passant lines to account for. Each of the 15 points is on 3 secants and 2 passants. This told us that the passant lines form a hyperoval $O^*$ in the dual projective plane $\Pi^*$, whose secants are the 15 points off $O$.

Consider one of these points $q$. It is contained in $6/2=3$ secants of $O$, and thus $5-3=2$ passant lines. Now the secants determine a partition of $O$ into three pairs. A partition of a six-element set into three pairs is called a syntheme in this context. For any $n$, the number of partitions of a $2n$-element set into $n$ pairs is $$ 1 \cdot 3 \cdot 5 \cdot 7 \cdots (2n-1) = \frac{(2n)!}{2^n n!}. $$

[Proofs: i) fix an element: it has $2n-1$ possible partners; for each choice we have to count pairings of the remaining $2(n-1)$ elements, etc. by induction;
ii) standard double-counting: there are ${2n \choose 2}{2n-2 \choose 2}{2n-4 \choose 2} \cdots {2 \choose 2}$ ordered $n$-tuples of disjoint pairs, and this gives each partition $n!$ times; or
iii) all such partitions are equivalent under $S_{2n}$, and the stabilizer has order $2^n n!$ — yes, this is more-or-less equivalent to (ii).]
This count is $1, 3, 15, 105, 945, \ldots$ for $n=1,2,3,4,5\ldots$. [This has the curious notation $(2n-1)!!$, the “
double factorialof $2n-1$, and is the $n$-th term of OEIS sequence #1147.] For $n=2$ this was already seen to give the exceptional homomorphism $S_4 \to S_3$. For $n=3$ it will lead us to the exceptional (outer) homomorphism $S_6 \to S_6$: the count is still small enough to coincide with the number $2n\choose 2$ of pairs.

March 25: Uniqueness and automorphism group of $\Pi_4$ (day 2)

We begin by observing that 15 is just big enough to account for all the points not on O (clearly the syntheme determines the point). So for any syntheme, the corresponding three secants are coincident. This is the last time that can happen: already $105 = 1 \cdot 3 \cdot 5 \cdot 7$ is too large for us to hope that each 1-factor of an oval in some $\Pi_7$ yields four secants that meet at a point. (As you’ll see in the present homework, even for $\Pi_5$ it doesn’t quite work even though there are still only 6 points on the oval.) Getting back to $\Pi_4$: we’ve now accounted for all 21 points, namely the 6 points of the oval and one point for each of the 15 synthemes; we’ve also accounted for 15 of the 21 lines, namely the secants (one of each of the 15 pairs).

So we need only figure out how to group the 15 syntheme points into 6 passant lines, each of which will contain exactly two of them. We’ve already noted that the passant line forms an oval $O^*$ in $\Pi^*$; its secants are the points where two passants intersect — necessarily syntheme points — so we’re in effect seeking to identify synthemes in $O$ with pairs in $O^*$.

Now the key point is that two syntheme points $q,q'$ are on a passant line $\Leftrightarrow$ the unique line through $q$ and $q'$ is not a secant $\Leftrightarrow$ the synthemes associated to $q$ and $q'$ have no pair in common. Well a passant line $l$ contains 5 points that determine 5 synthemes, each of which consists of 3 pairs, and all $3\cdot5=15$ of these pairs are distinct. But there are only ${6 \choose 2} = 15$ pairs to go around. So each of the six passant lines $l$ determines a partition of the pairs into 5 synthemes — what we’ll call a total, short for “synthematic total” (and the text calls a “factorization”, short for “1-factorization”).

Lemma: there are six totals.

Proof: count triples $(t,s,s')$ where $t$ is a total and $s,s'$ two of its synthemes. Given $t$ there are $5 \cdot 4=20$ choices for $(s,s')$. We can check that any two disjoint synthemes lie in a unique total. (There are 4 other synthemes disjoint from both $s$ and $s'$; one has a pair in common with each of the other three, but the other three are pairwise disjoint.) On the other hand, given s there are $4 \cdot 2 = 8$ choices for $s'$, so the number of $(s,s')$ pairs is $15 \cdot 8 = 120$. Hence there are $120/20 = 6$ totals, QED

So we have a unique choice for completing our (re)construction of $\Pi$ from $O$: there’s one passant line for each of the 6 totals. To check that this works, we can either verify the axioms directly, or note that the algebraic $\Pi_4$ is already known to satisfy them and we just showed that starting from any of its 168 hyperovals we get our labeling of all $21+21$ vertices and edges.

We can thus formulate a theorem for projective planes of order 4 analogous to what we’ve done for order 3 and 2: let $\Pi, \Pi'$ be projective planes of order 4, and $O, O'$ hyperovals on them; then every bijection $O \to O'$ extends uniquely to an isomorphism $\Pi \to \Pi'$. Taking $\Pi'=\Pi$ we also find that ${\rm Aut}(\Pi)$ has size $6! \cdot 168 = 120960$. Identifying $\Pi$ with ${\bf P}^2({\bf F}_4)$ — which we can do now that we've shown all these projective planes are isomorphic — we also deduce that every automorphism is either a projective linear transformation (i.e. an element of ${\rm PGL}_3({\bf F}_4))$ or such a transformation composed with the nontrivial automorphism of ${\bf F}_4$.

Now what about $\Pi' = \Pi^*$ and $O' = O^*$? his gives an isomorphism between the symmetric groups of permutations of $O$ and $O^*$, both $S_6$. But it’s not an inner automorphism! For any group $G$ the group of inner automorphisms is $G/Z(G)$ [where $Z(G) = {\rm center\ of\ } G$ (because German Zentrum)]; for $G=S_n$ with $n \gt 2$, the center is trivial and conjugation by $g$ is just relabeling the objects that $S_n$ permutes. In particular an inner automorphism of $S_n$ preserves the cycle structure of each element. But here we can see explicitly that this is not the case. Recall that we have projective coordinates $$ (1 : 0 : 0),\ (0 : 1 : 0),\ (0 : 0 : 1),\ (1 : 1 : 1),\ (1 : a : b),\ (1 : b : a) $$

for the points in $O$. Using these coordinates, $O^*$ consists of the six lines $(y:x:x),\, (x:y:x),\, (x:x:y)$ where $x,y$ are distinct nonzero elements of ${\bf F}_4$. Now a cyclic permutation of the three coordinates has cycle structure 3,1,1,1 on $O$ — to see that it fixes $(1:a:b)$, note that $b=a^2$ and $a^3=1$, so $$ (1:a:b) = (1:a:a^2) = (a:a^2:a^3) = (a:a^2:1) = (a:b:1), $$

and likewise for $(1:b:a)$; but there are no fixed points in $O^*$ so the cycle structure of the action on $O^*$ is 3,3.

We shall see that this is essentially the unique way to get an outer automorphism of any $S_n$, in that $n$ must equal 6 and the outer automorphism must be the one relating $O$ and $O^*$ up to conjugation (= relabeling of $O$, or equivalently of $O^*$). And we shall exploit the strongly regular graph $T(n)$ — recall that this “triangle graph” has vertices = pairs taken from an $n$-element set, and edges = pairs that intersect in one element. We shall prove that ${\rm Aut}(T(n)) = S_n$ for $n \gt 4$, and that for $n \neq 6$ any automorphism of $S_n$ induces an automorphism on $T(n)$ via the action on involutions.

We just constructed a total by starting from two disjoint synthemes, finding the four disjoint from both of them, and showing that one is a dead end while the other three are pairwise disjoint and thus complete the total. Does that ring a bell? We have 15 synthemes, each disjoint from 8 others; form the relevant regular graph with $(n,k) = (15,8)$, and then in effect we’s looking to extend an edge to form a maximal clique of 5, succeeding provided we avoid a wrong turn at the beginning. That’s just what happened with cliques of maximal size $m-1$ on the triangular graphs $T(m)$ for $m \gt 4$; here $m=6$, and indeed $(15,8)$ are the correct parameters. Check: the syntheme graph is indeed strongly regular with the same parameters $(15,8,4,4)$. In fact it’s isomorphic with $T(6)$, and we can use this isomorphism to explain or construct an outer automorphism of $S_6$ if we identify pairs (the vertices of $T(6)$) with simple transpositions and synthemes (the vertices of the syntheme graph) with triple transpositions. Indeed it’s somewhat surprising that our text doesn’t make this connection (see Chapter 6 for its development of ${\rm Aut}(S_n)$ and the hyperoval picture). We give this approach next — it’s basically a standard approach, but “$T(m)$” and “cliques” provide a convenient link with something we’ve thought about already.

What lets us connect these graphs with the structure of $G$ is that the 15 simple transpositions constitute a conjugacy class in $S_6$, and the triple transpositions constitute another one. Recall: Elements $x,y$ of a group $G$ are said to be conjugate iff there exists $g \in G$ such that $y = g^{-1} x g$; this is an equivalence relation (why?), and the equivalence classes are called conjugacy classes. Any automorphism of a group must permute the conjugacy classes — and if a class is finite (as it must be in a finite group) then its image must be a conjugacy class of the same size.

Now in a symmetric group $S_n$ we have

conjugacy class $\longleftrightarrow$ cycle structure $\longleftrightarrow$ partition of $n$.

For example, 6 has 11 partitions; here they are, each together with the parity and size of its conjugacy class:

+   1   111111
-  15   21111
+  45   2211
-  15   222
+  40   3111
- 120   321
+  40   33
-  90   411
+  90   42
+ 144   51
- 120   6

the first column gives the sign (+/- for even/odd) of the class; the second, its size; and the last, the partition. For example, the number of 6-cycles $g$ is $5!$: they are in bijection with permutations $( g(1), g^2(1), ..., g^5(1) )$ of $\{2,3,4,5,6\}$. We shall soon explain how to get such counts for $S_n$ in general; for now we check our arithmetic by verifying that the counts add up to $\#(S_6) = 6! = 720$, with the even and odd classes each adding up to half the total, e.g. $15+15+120+90+120 = (30+90)+120+120 = 360$ as it should be. Last time we saw that our outer automorphism takes cycle structure 3111 to 33; it is comforting that these classes have the same size, 40. We note three further such coincidences: we’ve already seen the count 15 for both 21111 and 222, and there’s also 120 for both 321 and 6 and 90 for both 411 and 42. These all look like candidates for switching: they have not just the same size but also the same exponent, respectively 2, 6, and 4. We next show that the outer automorphism switches 21111 ↔ 222 and 321 ↔ 6 but leaves 411 and 42 alone.

The first of these we already expect: our isomorphism $\Pi \to \Pi^*$ extending a bijection $O \to O^*$ takes secants ($\leftrightarrow$ pairs in $O$, also lines outside $O^*$) to syntheme points (points outside $O$). To prove it, note that for any 3-cycle $c$ there’s a simple transposition commuting with it (switch any two of the three fixed points); so an outer automorphism takes a simple transposition to an involution that commutes with a double 3-cycle, which a simple transposition can’t do (check this). So it must go to a triple transposition as claimed. Alternatively, using our explicit coordinates for $O$ we see an automorphism that fixes the first four points and switches the last two, namely the coordinatewise automorphism of ${\bf F}_4$; we then see that the same automorphism acts on $O^*$ by the triple transposition $$ \bigl( (a:1:1) \, (b:1:1) \bigr) \, \bigl( (1:a:1) \, (1:b:1) \bigr) \, \bigl( (1:1:a) \, (1:1:b) \bigr). $$

(Can you find a double 3-cycle in $S_6$ that commutes with a triple transposition?)

Similarly $c$ commutes with a permutation $g$ with cycle structure 321 (namely the product of $c$ with the simple transposition from the previous paragraph), but a double 3-cycle $c'$ cannot (e.g. if $p$ is the unique fixed point of $g$ then $c'$ moves $p$, so the conjugate of $g$ by $c'$ doesn’t fix the same $p$). So the conjugacy class $[g]$ must go to the 6-cycles, as claimed.

Finally, since a 411 permutation is odd, it is the product of an odd number of simple transpositions (three suffice), so its image under an outer automorphism is an odd product of triple transpositions. But that’s still an odd permutation, so it cannot have cycle structure 42, QED.

So now what about $S_n$ in general? For a partition (or cycle structure) with $a_1$ 1’s, $a_2$ 2’s, $a_3$ 3’s, etc., the conjugacy class has size $$ (\#)\quad\quad \frac{n!}{a_1! \cdot a_2! 2^{a_2} \cdot a_3! 3^{a_3} \cdots} $$

(of course the product is finite because for $k \gt n$ the $k$-th factor is $0! k^0 = 1$). This can be proved by generalizing any of our arguments for the special case that $a2 = n/2$ (and all other $a$’s vanish). Fortunately we don&Rsquot have to prove that for $n \gt 6$ there are no coincidences: it’s enough to work with the simple transpositions. [In fact there are coincidences for every $n \gt 4$, e.g. between cycle structures $n-2,2$ and $n,1,1$.] Indeed we know that these generate $S_n$ (any permutation can be obtained — or if you prefer undone — by a sequence of pairwise switches), so an automorphism of $S_n$ is determined by its action on simple transpositions. For future reference we observe that $S_n$ is even generated by the $n-1$ simple transpositions $(12),\, (13),\, (14),\, \ldots, \, (1n)$: reduce to the previous claim by writing an arbitrary simple transposition $(ij)$ as $(1i)(1j)(1i)$.

In turn we have a bijection from the simple transpositions to the vertices of $T(n)$. I claim that any automorphism that takes this conjugacy class to itself yields an automorphism of $T(n)$, i.e. it takes edges to edges. This follows from the observation that two simple transpositions do not commute if and only if they overlap on exactly 1 letter — that is, iff the corresponding vertices of T(n) are adjacent.

Digression:

i) in general the product of two simple transpositions has order 1 if they coincide, 2 if they commute, and 3 if not; in group-theory lingo this is sometimes expressed by saying that the simple transpositions form a “conjugacy class of 3-transpositions” — quite a few of the groups in the ATLAS have such a description.

ii) the syntheme graph has a similar description, because two triple transpositions don’t commute iff they have no pair in common. (If there’s a common pair we’re basically in the 4-element normal subgroup of $S_4$ consisting of the identity and the three double transpositions.) Here too we see that these 15  involutions as 3-transpositions — which is what we expect since this conjugacy class is equivalent with the simple transpositions under ${\rm Aut}(S_6)$.

Back to $T(n)$: an automorphism of $T(n)$ must permute its maximal cliques. But for $n \gt 4$ there are exactly $n$ of them, and each vertex is contained in exactly 2. This gives us a homomorphism from ${\rm Aut}(T(n))$ to $S_n$ and shows that the kernel is trivial (if we know where each maximal clique goes then we can identify each vertex by intersecting the two cliques it is in). Since the map is clearly surjective, we deduce that ${\rm Aut}(T(n)) = S_n$ for $n \gt 4$. We already saw that for n=4 there are twice as many maximal cliques, and indeed ${\rm Aut}(T(4))$ is twice as large as $S_4$; for $T(3)$ it is easy to see ${\rm Aut}(T(3))=S_3$, while ${\rm Aut}(T(2))$ and ${\rm Aut}(T(1))$ are clearly trivial.

So far we have:

Proposition: an automorphism of $S_n$ takes the conjugacy class of simple transpositions to itself if and only if it is inner.

Proof: given the automorphism $f$, it is enough to find an inner automorphism that agrees with $f$ on the simple transpositions. If $n \neq 4$, use the image in $S_n$ of the automorphism of $T(n)$. If $n=4$, the construction still works because $f$ cannot induce an automorphism of $T(4)$ that switches the two kinds of maximal clique: one gives three elements of $S_4$ that generate all of $S_4$, the other generate only a subgroup $S_3$. This completes the proof.

In particular for $S_6$ every automorphism is either inner or our one outer automorphism followed by an inner one; so ${\rm Aut}(S_6)$ contains $S_6$ as a subgroup of index 2.

Theorem: For $n \neq 6$, every automorphism of $S_n$ is inner.

Proof sketch: For $n=1$ and $n=2$ the automorphism group is trivial. For $n=3,4,5$ there is no other conjugacy class whose size is the same as that of the simple transpositions. For $n \gt 6$ the same is true because the class of simple transpositions is the smallest non-identity class, as one can check from the formula (#) for the size of an arbitrary conjugacy class in $S_n$: it’s almost clear that for large $n$ the size of any other class grows at least as $n^3$, which does it for sufficiently large $n$, and a bit of routine but unedifying work turns “sufficiently large” into “at least 7”.


March 21: the (5,6,12) Steiner system via ${\rm Aut}(S_6)$ (cf. Chapter 6 of the textbook; also p.27, Exercise 13)

Recall that we found in $\Pi_4$ a hyperoval $O$ and a dual oval $O*$ permuted by $S_6$ in different ways, and showed that the resulting outer automorphism of $S_6$ is essentially unique (i.e., unique up to inner automorphism). Such 6-element sets $O,O^*$ with non-conjugate actions of $S_6$ are said to be “dual” to each other. This duality gives:

In the last of these, an “even split” of a six-element set is one of its $\frac12{6 \choose 3} = 10$ partitions into two triples. The bijection is constructed as follows. Fix an even split of $O$, and let $c,c'$ be 3-cycles permuting the two subsets. Then $c$ and $c'$ commute and generate a subgroup $A \equiv ({\bf Z}/3{\bf Z})^2$ of $S_6$ (it’s a 3-Sylow because $3^2 \| 6!$). We saw already that each of $c$ and $c'$ maps to a double 3-cycle of $O^*$. Since these commute but are neither equal nor inverses of each other, those double 3-cycles must come from the same even split of $O^*$, say $c \mapsto c^* {c'}^*$ and $c' \mapsto c^* ({c'}^*)^{-1}$. So we get a well-defined bijection as claimed. Moreover $cc'$ maps to $(c^*)^{-1}$ and $c^{-1}c'$ maps to ${c'}^*$, so if we use the same rule starting from even splits of $O^*$ we get the same bijection (and $A$ goes to a subgroup conjugate to $A$, as it must by Sylow’s theorem).

What’s more, these bijections are related as follows:

$\bullet$ if a pair $p \subset O$ is contained in a syntheme $s$, then the dual syntheme $p^*$ contains the dual pair $s^*$. Proof: Identify $p,s$ with simple and triple transpositions in $S_6$. Then $s$ contains $p$ if and only if the corresponding group elements commute. [NB The three pairs in a given syntheme constitute a maximal coclique in the triangular graph $T(6)$; on the syntheme graph, the corresponding coclique is the three synthemes containing a given pair.]

$\bullet$ an even split $\{c,c'\}$ splits each pair in a syntheme $s$ iff its dual $\{c^*,{c'}^*\}$ does not split the pair dual to $s$. Proof: that’s the only way a triple or simple transposition can commute with a nontrivial element of the 3-Sylow group $A$.

We can use all this to obtain:

Theorem (6.7, page 87-88, extended). There exists a unique $5$-$(12,6,1)$ design $\mathfrak D$ up to isomorphism. Its automorphism group $M_{12}$ is ’simply 5-transitive” (a.k.a. ’sharply 5-transitive”): for every pairwise distinct $x_i$ $(i=1,\ldots,5)$ and pairwise distinct $y_i$ $(i=1,2,3,4,5)$ there exists a unique automorphism of $\mathfrak D$ taking each $x_i$ to the corresponding $y_i$. In particular $\#M_{12} = 12 \cdot 11 \cdot 10 \cdot 9 \cdot 8 = 95040$.

Proof: Such a design has It has ${12 \choose 5} \big/ {6 \choose 5} = 132$ blocks, and the intersection triangle for the six points of a block is

                  132
                66    66
             30    36    30
          12    18    18    12
        4     8    10     8     4
     1     3     5     5     3     1
  1     0     3     2     3     0     1

(this is page 87, Table 6.1 but with a typo corrected: the text has $2,3,2$ instead of $3,2,3$ in the bottom row). In particular the complement of a block is again a block. This suggests the following construction. Fix a block $B$ with complement $\bar B$. Choose an isomorphism between the symmetric groups of permutations of $B$ and $\bar B$ that realizes the outer automorphism (i.e. choose a bijection between $\bar B$ and the totals of $B$). The design then consists of the following blocks $B'$, ordered according to the size of their intersection with $B$:

(6) $B' = B$ itself.
(4) For any four-element subset $S \subset B$, the union of $S$ and any of the 3 pairs in the syntheme of $\bar B$ associated to the complement of $S$.
(3) For any three-element subset $S \subset B$, the union of $S$ with either of the two parts of the even split of $\bar B$ corresponding the even split of B that contains $S$.
(2) For any two-element subset $S \subset B$, the union of $S$ with the complement in $\bar B$ of one of the three pairs of the associated syntheme.
(0) $B' = \bar B$.

This gives $1 + 3 \cdot 15 + 2 \cdot 20 + 3 \cdot 15 + 1 = 132$ blocks $B'$ as needed (and of course consistent with the bottom row of the intersection triangle). The fact that this is actually a $(5,6,12)$ Steiner system — that is, that any 5-element set $P$ is contained in a unique block $B'$ — can be recovered from our above description of the duality of pairs, synthemes, and “even splits” the only tricky case is when the intersection of $P$ with $B$ has size 2 or 3, and we can cut down on the case analysis a bit by proving that every P is contained in at least one $B'$, and then double-count pairs $(P,B')$: the count is at least the number $12 \choose 5$ of $P$’s, but also equal to ${6 \choose 5}$ times the number 132 of $B'$’s — but that equals $12 \choose 5$, so each $P$ must be in a unique $B'$. (Likewise it is enough to prove each $P$ is contained in at most one $B'$.)

Now suppose we have any (5,6,12) Steiner system ${\mathfrak D}_1 = (X_1, {\mathfrak B}_1)$. Fix a block $B_1 \in {\mathfrak B}_1$ and a bijection of that block with $B$. This can be done in $132 \cdot 6! = 12 \cdot 11 \cdot 10 \cdot 9 \cdot 8$ ways. We claim that this extends to a unique isomorphism of Steiner systems. We shall repeatedly use the following fact: any 4-element subset $T$ of the points of a $(5,6,12)$ Steiner system is contained in four blocks, each the union of $T$ with a pair, and these pairs partition the $12-4=8$ points in $\bar T$ (because for each point outside $T$, that point together with $T$ makes 5, which are contained in a unique block.)

First let $T$ be one of the ${6 \choose 4} = 15$ four-element subsets of $B_1$, i.e. the complement of a pair $P$. The blocks containing $T$ are $B_1$ itself and three others which partition $\bar{B}_1$. This gives a map

$f$: {pairs in $B_1$} $\to$ {synthemes in $\bar{B}_1$}
which takes $P$ to the syntheme determined by the complement of $P$ in $B_1$.

Taking complements, it follows that for each pair $P$ the three blocks that intersect $S_1$ in $P$ are obtained from $P \cup \bar{B}_1$ by removing one of the pairs in $f(P)$.

Now we know that either pairs or synthemes can be identified with the vertices of the strongly regular graph $T(6)$. We next show that the bijection preserves the edges.

If pairs $P,P'$ overlap in a point then $f(P)$ and $f(P')$ are disjoint: if $f(P)$ and $f(P')$ share a pair then its complement in $\bar{B}_1$, together with the three points of $B$ not in $P \cup P'$, are five points contained in two blocks.

So, edges in the pair graph go to edges in the syntheme graph. Since the two graphs have the same degree, it follows that non-edges go to non-edges. But we’ve shown that for any $n \gt 4$ every automorphism of $T(n)$ comes from the action of $S_n$. So we’ve identified $\bar B$ with the totals of $B$, and $\bar{B}_1$ with the totals of $B_1$, and thus $\bar{B}_1$ with $\bar B$.

We proceed to collect enough structure about the Steiner systems ${\mathfrak D}, {\mathfrak D}_1$ to show that any bijection from $B_1$ to $B$ and $\bar{B}_1$ to $\bar B$ yields an isomorphism from ${\mathfrak D}_1$ to ${\mathfrak D}$.

Let the four-element set $T \subset X_1$ consist of three points in $B_1$ and one in $\bar B_1$. The resulting partition of $\bar T$ must pair each of the remaining points of $B_1$ with a point in $\bar B_1$: if two points in $B_1$ were paired with each other then they together with $T$ would be a block that meets $B_1$ in exactly 5 points.

Now the intersection triangle tells us that for any 3-element subset $S_1 \subset B_1$ there are exactly 2 blocks that intersect $B_1$ in $S_1$. I claim that they do not intersect in $\bar B_1$. Indeed if they did then they would meet in one point, say $x$, and letting $T = S_1 \cup \{x\}$ we get a contradiction. So, $S_1$ determines an even split of $\bar B_1$. Take the complements of those two blocks (which are also in ${\mathfrak B}_1$), we see that the complement of $S_1$ in $B_1$ determines the same even split. So we get a well-defined bijection

$g$: even splits in $B_1$ $\leftrightarrow$ even splits in $\bar B_1$.

I claim that if $E$ is the even split containing $S_1$, and $S_1$ contains pair $P$, then each pair in the syntheme $f(P)$ is split by $g(E)$. Proof: else two of the pairs in $f(P)$ together contain one of the parts $E'$ of the even split $g(E)$, and their union with $P$ is a block that meets the block $S_1 \cup E'$ in exactly five points.

We next deduce that $f$ is a bijection. Say that an even split $E$ is a “neighbor” of a pair $P$ iff $P$ is contained in one of the parts of $E$, and is a “neighbor” of a syntheme $s$ iff it contains none of pairs that comprise $s$. Then $E$ is a neighbor of 4 pairs and 4 synthemes. Exercise: these $15+15 = 30$ neighborhoods constitute a $(3,4,10)$ Steiner system. In particular (and this is much easier, and all we’ll need): different pairs have different neighborhoods. But we saw in effect that $g$ maps the neighborhood of $P$ to a neighborhood of $f(P)$. Since $g$ is a bijection, it follows that $g$ is a bijection as well. (Whew.)

Thus $f$ is an isomorphism from the triangular graph $T(6)$ of pairs in $B_1$ to the syntheme graph of $\bar B_1$. We saw already that such an isomorphism is tantamount to an identification of $\bar B_1$ with the totals of $B_1$. (A point $p \in B_1$ goes to the total consisting of all synthemes $f(P)$ for pairs $P \ni p$.) That identification identifies $\bar B_1$ with $\bar B$, the complement of $B$ in the $(5,6,12)$ Steiner system we designed previously. Using the properties of $f$ and $g$ that we’ve developed, we now quickly confirm that this bijection from $\bar B_1$ to $\bar B$, together with the given bijection from $B_1$ to $B$, yield an isomorphism from ${\mathfrak D}_1$ to our known $\mathfrak D$, and that this isomorphism is unique.

As noted already, this means there are $132 \cdot 6!$ isomorphisms. Moreover, any bijection from a 5-point set $F_1 \subset X_1$ to a 5-point set $F \subset X$ 8xtends uniquely to a bijection from the unique block containing $F_1$ to the unique block containing $F$, and thus to a unique isomorphism from ${\mathfrak D}_1$ to $\mathfrak D$. In particular, taking ${\mathfrak D}_1 = \mathfrak D$ we see that ${\rm Aut}({\mathfrak D})$ is simply 5-transitive, QED.

It also follows that, given $\mathfrak D$ and $B$, we can recover ${\rm Aut}(S_6)$ as the index-66 subgroup of ${\rm Aut}({\mathfrak D})$ consisting of automorphisms that fix the partition $X = B \cup \bar B$ (possibly switching $B$ with $\bar B$). This may seem circular, but there are other constructions of $\mathfrak D$ (e.g. from the affine plane over the 3-element field, or a Hadamard matrix of order 12, or the projective line over the 11-element field), and each of those can give us an alternative route to ${\rm Aut}(S_6)$.

The derived design ${\mathfrak D}_p$ of $\mathfrak D$ is a Steiner (4,5,11) system. (We can say “the” derived design because ${\rm Aut}({\mathfrak D})$ acts transitively on the points $p \in X$.) Conversely, any Steiner (4,5,11) system ${\mathfrak D}' = (X',{\mathfrak B}')$ is extendable, and thus isomorphic with ${\mathfrak D}_p$. Indeed if we set $X = X' \cup \{p\}$ then the blocks in $X$ must be $B' \cap \{p\}$ and the complement $\bar {B'}$ of each of the 66 blocks $B' \in {\mathfrak B}')$; and we show that these indeed form a (5,6,12) Steiner system using the last problem on the third problem set (= CvL p.25 \#5). Thus there is a unique (4,5,11) Steiner system, and its automorphism group is the point stabilizer in $M_{12} = {\rm Aut}({\mathfrak D})$, and acts simply 4-transitively on the 11 points in $X'$. This point stabilizer is called $M_{11}$ and is the smallest of Mathieu’s five sporadic simple groups. (We have not yet proven that either $M_{11}$ or $M_{12}$ is in fact simple; we shall do this soon.)


April 10: extending $\Pi_4$ leads us to simplicity of $A_n$

[See page 22 of the textbook; a reference for practically all the group theory we shall need, and much more, is Joseph J. Rotman’s An Introduction to the Theory of Groups (Springer 1995), which you can view (and download for personal use only) from hollis.harvard.edu on a Harvard computer.]

We saw(*) that if a projective plane $\Pi$ of order $n$ is extendable then either $n=2$ or $n=4$, and in both cases $\Pi$ is algebraic. For $n=2$, the extension is the Hadamard 3-(8,4,1) design. For $n=4$ it must be a 3-(22,6,1) Steiner system. We investigate this case next; it also leads us to the 4-(23,7,1) and 5-(24,8,1) systems and the Mathieu groups, via a construction that the text attributes to Lüneburg (page 22).
(*) assuming the nonexistence of a projective plane of order 10, or at any rate of an extendable one.

A 3-(22,6,1) design $\mathfrak D = {\mathfrak D}_{22}$ has ${22 \choose 3} \big/ {6 \choose 3} = 77$ blocks. Choosing a point p and identifying the derived design ${\mathfrak D}_p$ with $\Pi_4$, we see that the blocks are:
21: $p$ + line
56: hyperoval
[BTW the CvL text uses $\infty_1$ for $p$, and likewise $\infty_2,\infty_3$ for the points that extend $\mathfrak D$ further to 4-(23,7,1) and 5-(24,8,1); other sources use I,II,III.] The intersection triangle is as follows (see Table 1.1 on page 21, and ignore for now the leftward diagonals that start 759 and 253):

	    77 
	  56  21
	40  16   5
      28  12   4   1
    20   8   4   0   1
  16   4   4   0   0   1
16   0   4   0   0   0   1

In particular any two of the 56 hyperovals intersect in 0 or 2 points. It turns out that this is an equivalence relation on the 168 hyperovals (that is: if we say $O \sim O'$ if $O,O'$ are hyperovals that are equal or intersect in 0 or 2 points, then $\sim$ is an equivalence relation), with three equivalence classes. We could check this combinatorially, but what does it “mean”?

Recall that all hyperovals are equivalent under ${\rm Aut}(\Pi_4)$, and every element of ${\rm Aut}(\Pi_4)$ is either in ${\rm PGL}_3({\bf F}_4)$ or is the Galois conjugate of an element of ${\rm PGL}_3({\bf F}_4)$. So, say we have two hyperovals $O,O'$. We can get from $O$ to $O'$ by some $g_0 \in {\rm Aut}(\Pi_4)$. Then $g \in {\rm Aut}(\Pi_4)$ takes $O$ to $O'$ iff $g = g_0 h$ with $h(O)=O$, i.e. with $h$ in the stabilizer ${\rm Stab}(O)$. We have already identified ${\rm Stab}(O)$ with the group $S_6$ of permutations of $O$; and we’ve also seen that a permutation $\pi$ of $O$ comes from ${\rm PGL}_3({\bf F}_4)$ [without a Galois conjugation] iff $\pi$ is even. So, composing with an odd permutation if necessary, we can assume that $g_0 \in {\rm PGL}_3({\bf F}_4)$, and then $g \in {\rm PGL}_3({\bf F}_4)$ takes $O$ to $O'$ iff $g = g_0 h$ for some $h$ in the subgroup $A_6$ of ${\rm Stab}(O)$.

Now I claim:

   1) There exists a well-defined surjective homomorphism $\det: {\rm PGL_3}({\bf F}_4) \to {\bf F}_4^*$;

   2) $A_6$ is in the kernel ${\rm PSL_3}({\bf F}_4)$ of $\det$;

whence it follows that

   3) All $g$ taking $O$ to $O'$ have the same determinant.

Thus we can define an equivalence relation on the 168 hyperovals by defining

   $O \sim O'$ iff there exists $g \in {\rm PSL_3}({\bf F}_4)$ taking $O$ to $O'$.

This equivalence relation partitions the hyperovals into 3 equivalence classes, each of size $168/3=56$.

It turns out that this is the equivalence relation we need to extend $\Pi_4$.

Proof of (1): we know from linear algebra that there is a surjective homomorphism $\det: {\rm GL}_3({\bf F}_4) \to {\bf F}_4^*$. The point is that this homomorphism takes every scalar $3 \times 3$ matrix to 1, and thus “factors through the quotient”: for any nonzero scalar $c$ and any $3 \cdot 3$ matrix $g$ we have $\det(cg) = c^3 \det(g) = \det g$, so the determinant can be defined even when we identify $cg$ with $g$.

Proof of (2) (and thus of (3)): The kernel of $\det: A_6 \to {\bf F}_4^*$ is normal and of index 1 or 3. But $A_6$ is simple, and of order $\gt 3$ Therefore $\ker(\det)$ is all of $A_6$, as claimed.

Having partitioned the 168 orbits into three equivalence classes each of size 56, we next show that any of these equivalence classes yields a 3-(22,6,1) design. Fix any three points. We claim they’re in a unique block. If one of the three is $p$, the other two lie on a unique line and we’re done. If not, but they’re collinear, we’re still done (they can’t be on a hyperoval because no hyperoval meets a line in as many as three points). Finally, if three non-collinear points, put them at the three unit vectors, and then exactly one of the three hyperovals through them works. Indeed if $O$ is any oval then ${\rm diag}(1,1,a)$ and ${\rm diag}(1,1,b)$ takes $O$ to some hyperovals $O', O''$ in the other two equivalence classes, so exactly one of $O, O', O''$ is in the class we chose. So there exists a 3-(22,6,1) design ${\mathfrak D}_{22},$ as desired.

We next show that the three equivalence classes are the only sets $\mathfrak S$ of 56 hyperovals such that $\#(O\cap O')$ is even for all $O,O' \in {\mathfrak S}$. [This $\mathfrak S$ is a fraktur S.] To this end we prove that our two equivalence relations on hyperovals are the same: hyperovals $O,O'$ are in the same ${\rm PSL_3}({\bf F}_4)$ orbit iff $\#(O\cap O')$ is even. The forward direction follows from the fact that these hyperovals are blocks of a 3-(22,6,1) design and we already used the intersection triangle to show that the intersection always has size 0, 2, or 6. Thus the reverse direction is equivalent to the claim that given a hyperoval $O$ there are $112$ hyperovals $O'$ such that $\#(O\cap O')$ is odd. (Note that $112 = 168 - 56 = (2/3) \cdot 168$.) We prove this using another intersection triangle. While the hyperovals do not constitute a 3-(21,6,\lambda) design, they do satisfy the hypothesis that for $S \subseteq O$ the number of $O'$ such that $S \subseteq O \cap O'$ depends only on $s := \#S$. For $s \leq 4$ this follows from transitivity of the action of ${\rm PGL_3}({\bf F}_4)$ on $s$-tuples of points on general linear position; for $s \geq 4$ it follows from the fact that $O$ is the only hyperoval containing $S$. The counts $\nu_{s,s}$ are: $$ \nu_{0,0} = 168, \ \nu_{1,1} = \frac6{21} 168 = 48, \ \nu_{2,2} = \frac5{20} 48 = 12, \ \nu_{3,3} = \frac4{16} 12 = 3, \ \nu_{4,4} = \nu_{5,5} = \nu_{6,6} = 1. $$

The resulting intersection triangle is

	    168
	  120 48
	84  36  12
      57  27   9   3
    37  20   7   2   1
  22  15   5   2   0   1
10  12   3   2   0   0   1

As a “sanity check”, the total number of hyperovals should be $\sum_{j=0}^6 {6 \choose j} \nu_{6,j}$ which indeed comes to $$ 10 + 6\cdot 12 + 15 \cdot 3 + 20\cdot 2 + 1 = 10 + 72 + 45 + 40 + 1 = 168. $$

The even $j$ contribute $10 + 45 + 1 = 56$ to the sum, and the odd $j$ contribute $72 + 40 = 112$ as claimed.

This means that $\mathfrak D$ is the unique 3-(22,6,1) design up to isomorphism, and indeed for any 3-(22,6,1) design ${\mathfrak D}' = (X',{\mathfrak B}')$ and any point $p' \in X'$ we may choose an isomorphism ${\mathfrak D}' \to {\mathfrak D}_{22}$ taking $p'$ to $p$. Taking ${\mathfrak D}' = {\mathfrak D}_{22}$ we conclude that ${\rm Aut}({\mathfrak D}_{22})$ is 3-transitive on points (and thus transitive on blocks).