Projective {groups, spaces, geometry}
We introduce this with the following observation.
Let $f,F$ be “fractional linear transformations”,
that is, functions of the form
$f(x) = (ax+b)/(cx+d)$, $F(x) = (Ax+B)/(Cx+D)$
(with coefficients $a,b,c,d$ and $A,B,C,D$ in some given
field $K$). Then the composition $F \circ f$
is of the same form. Explicitly,
$$
F(f(x)) = \frac{A f(x) + B}{C f(x) + D}
= \frac{A \frac{ax+b}{cx+d} + B}{C \frac{ax+b}{cx+d} + D}
= \frac{(Aa + Bc)\,x + (Ab + Bd)}{(Ca + Dc)\,x + (Cb + Dd)}.
$$
The pattern of the coefficients
$({Aa + Bc \; Ab + Bd \atop Aa + Dc \; Cb + Dd})$
should look familiar: it exactly matches the entries of the
product of $2 \times 2$ matrices
$({A \; B \atop C \; D}) \, ({a \; b \atop c \; d}).$
😮
What’s going on here, and why?
We shall first give a more precise account of “what”,
using some of the group-theoretic terminology that we have developed
during the past two months, and then — after a scenic detour to
construct an outer automorphism of $S_6$ and note a Rubik’s Cube
curiosity — explain “why”.
We already have some terminology to describe the situation.
For starters, we claim that the nonconstant fractional linear transformations
form a group, call it $\Gamma.$
(That’s a capital Greek letter Gamma; LaTeX: \Gamma.)
As we know, this means not only that $\Gamma$ is closed under composition
but also that it has an identity element and inverses. Well,
the identity map ${\rm id}(x) = x$ is in $\Gamma$ because
$x = (1 \cdot x + 0) \, / \, (0\cdot x + 1).$ As for inverses,
if $f(x) = (ax+b) \, / \, (cx+d)$ then $f^{-1}(x)$ is the solution of
$f(y)=x;$ solving for $y$ yields $y = (dx-b) / (-ax+c)$ which is again
a fractional linear transformation, so indeed $f^{-1} \in \Gamma.$
To go further it will be convenient to introduce the following notation:
to any $2 \times 2$ matrix
$M = ({a_{11} \; a_{12} \atop a_{21} \; a_{22}})$
associate the fractional linear transformation $\Phi_M$
(we’ve already seen $\Phi$ = Phi = \Phi),
defined by
$$
\Phi_M(x) = F(x) = \frac{a_{11} x + a_{12}}{a_{21} x + a_{22}}.
$$
We have observed that $\Phi_M(\Phi_{M'}(x)) = \Phi_{MM'}(x),$
i.e. $\Phi_M \Phi_{M'} = \Phi_{MM'}.$ This suggests that the map
taking $M$ to $\Phi_M$ is a homomorphism to $\Gamma.$
But we need to be careful here:
what are the domain and codomain of this homomorphism?
Recall that $M_2(K)$ does not quite form a group under matrix multiplication:
we must remove the singular matrices (matrices
$M = ({a_{11} \; a_{12} \atop a_{21} \; a_{22}})$
with $a_{11} a_{22} = a_{12} a_{21})$ to get the group ${\rm GL}_2(K).$
Thankfully the nonsingularity condition $a_{11} a_{22} \neq a_{12} a_{21}$
is exactly what we need to assure that the transformation $\Phi_M$
does not simplify to a constant (as in $M = ({2 \; 4 \atop 3 \; 6}),$
$\Phi_M(x) = (2x+4)/(3x+6) = 2/3$)
or worse (try $M=0$, i.e. $M=({0 \; 0 \atop 0 \; 0})$ ).
Removing singular matrices from $M_2(K),$
and removing constant transformations from $\Gamma$,
we get a surjective homomorphism ${\rm GL}_2(K) \to \Gamma.$
Moreover, the kernel of this homomorphism consists exactly of
the scalar matrices $k I_2 = ({k \; 0 \atop 0 \; k})$ for $k \in K^\times$:
if $M$ is such a matrix then $\Phi_M(x) = kx/k = x$ so
$\Phi_M$ is the identity transformation,
and we readily check that there are no other $M$ in the kernel.
So (by our “first isomorphism theorem”) we have identified
$\Gamma$ with the projective linear group ${\rm PGL}_2(K)$.
Indeed if we multiply a matrix $({a\;b\atop c\;d}) \in {\rm GL}_2(K)$
by $k I_2$ then $\Phi_M$ is replaced by the transformation $\Phi_{kM}$
that takes $x$ to $(kax+kb) \, / \, (kcx + kd),$
which simplifies to $(ax+b) \, / \, (cx + d) = \Phi_M(x)$ as expected.
Now since $\Gamma$ is a group of functions under composition,
the map $(g,x) \mapsto g(x)$ must be a group action.
Again some care is needed: $g$ is in $\Gamma$, and $x$ should be
in the set $\Omega$ that $\Gamma$ acts on; but what exactly is
this $\Omega$? It cannot be simply $K$ because
the denominator in $cx+d$ might be zero.
For once the answer suggested by elementary calculus is correct:
if $g\in\Gamma$ acts by $g(x) = (ax+b)/(cx+d)$ and $cx+d=0$
then we say $g(x) = \infty.$ That is, $\Omega$ consists of
all $x \in K$ together with an extra element called $\infty$.
For that to work, we need to also define $g(\infty)$ and check that
the group-action axiom $g(h(x)) = (gh)(x)$ still holds if
one or more of $x,h(x),g(h(x))$ is $\infty$.
Again you won't be surprised that we define $g(\infty) = a/c$
which (at least for $K=\bf R$) is the limit of $g(x)$
as $x \to \infty.$ If $c=0$ then naturally we interpret $a/c$ as $\infty$
so $g(\infty) = \infty.$ You might worry about what happens if also $a=0$
— but we cannot have both $a=0$ and $c=0$ because then
$g(x)$ is the constant $b/d$ (and indeed the determinant $ad-bc$ is zero).
It is then routine, albeit tedious, to check that $g(h(x)) = (gh)(x)$
still works in all cases; we’ll be able to give a better
(or at least more structural) explanation
once we have reached the “why” section below.
For now, we remark on the transitivity of the action of
$\Gamma$ on $\Omega$. Certainly all points of $\Omega$
are in the same orbit: it is easy to get from any $x \in K$ to any other,
or to $\infty$. The stabilizer $\Gamma_\infty$ of $\infty$
is just the group of linear substitutions $f(x) = ax+b,$
corresponding to scalar multiples of the matrix
$({a\;b\atop 0\;1})$ (any $a \in K^\times$ and $b \in K$),
which we have already encountered as a subgroup of ${\rm GL}_2(K)$.
[What is $\Gamma_0?$ Can you check that it is the conjugate of
$\Gamma_\infty$ by a group element such as $g(x) = 1/x$
which switches $0$ and $\infty$?]
The action of that group $\Gamma_\infty$ on $K = \Omega - \{\infty\}$
is in turn transitive, and indeed “2-transitive”
(a.k.a. “doubly transitive”):
for any $x,y,x',y' \in K$ with $x\neq y$ and $x'\neq y'$ we can find
$a\in K^\times$ and $b\in K$ such that $x'=ax+b$ and $y'=ay+b.$
In fact the choice is unique
(“two points determine a unique line”),
so this action of $\Gamma_\infty$ on $K$ is
“sharply 2-transitive”.
This makes the action of $\Gamma$ on $\Omega$
sharply 3-transitive: for all pairwise distinct $x,y,z$
and pairwise distinct $x',y',z'$ in $\Omega$,
there exists a unique $f \in \Gamma$ such that
$f(x)=x',$ $f(y)=y',$ and $f(z)=z'.$
In particular if $K$ is a finite field of $q$ elements
then $|\Gamma| = (q+1) q (q-1) = q^3-q.$ Happily this agrees with
our formulas $|{\rm PGL}_2(K)| = (q^2-1)(q^2-q)$ and
$|{\rm GL}_2(K) : {\rm PGL}_2(K)| = |K^\times| = q-1$.
For example, if $q=2$ or $q=3$ then $|\Gamma| = (q+1)!$
so the homomorphism $\Omega \to S_\Omega$ is an isomorphism
(it’s easy to see that even for finite $K$
the kernel is trivial — you might remember that
this is equivalent to the action being “faithful”).
For $q = 4$ we have $|\Gamma| = 60 = \frac12 5!$ so
the image is an index-2 subgroup of $S_5;$
as we know, such a subgroup must be normal and thus equal to $A_5.$
We next consider the most interesting case, $q=5.$
(There is no field of $6$ elements because $6$ is not a prime power;
for $q=7$ and larger, $|\Gamma|$ soon becomes very small compared with
$|S_{q+1}|.$)
When $K$ is the field ${\bf Z} / 5 {\bf Z}$ of $5$ elements,
the action $\Gamma \to S_\Omega$ makes ${\rm PGL}_2(K)$
a subgroup of $S_6$ of index $6! / 120 = 6.$
So $\Gamma$ has 6 cosets, which $S_6$ permutes as usual,
giving a homomorphism $h: S_6 \to S_{|S_6:\Gamma|} = S_6.$
We readily see that $h$ is injective (since we already now that
$\ker h$ is a normal subgroup, and that the only normal subgroups of $S_6$
are $\{1\},$ $A_6,$ and $S_6$ itself).
This makes $h$ an automorphism of $S_6$.
But it cannot be an inner automorphism: the stabilizer of $\Gamma$
(considered as a coset of itself) is $\Gamma,$ which we’ve seen
is transitive (indeed triply transitive); while if $h$ were
an inner automorphism the stabilizer would be the point-stabilizing
$S_5 \lt S_6.$ So we have constructed an outer automorphism
of $S_6$ — and also found an isomorphism
between $\Gamma = {\rm PGL}_2 ({\bf Z} / 5{\bf Z})$ and $S_5$.
This 3-transitive copy of $S_5$ in $S_6$ also happens to arise
in Rubik’s Cube, as the corner permutations accessible by
rotations of two adjacent faces. Here is the relevant part of
the diagram from Problem 5 in the
Eighth problem set,
with the six corners now labeled by
$\Omega = ({\bf Z} / 5{\bf Z}) \cup \{\infty\}$:
$$
\begin{array}{|cc||cc|}
\hline
{\bf \bar 3} & {\bf \bar 1} & \bar 1 & \bar 0
\cr
{\bf \bar 4} & {\bf \bar 2} & \bar 2 & \!\!\infty\!\!
\cr \hline \end{array}
$$
With this labeling, the clockwise rotation $s$ of the left face
is readily seen to act by $s(x) = 2x$ (NB this also takes
$\bar 0$ to $\bar 0$ and $\infty$ to $\infty$).
The clockwise rotation $t$ of the right face turns out to be
$t(x) = (2x-2)/x$ (again also fixing the two points off that face,
here $\bar 3$ and $\bar 4$). Thus the subgroup
$\langle s,t \rangle$ of $S_6$ accessible by just rotating
those two faces is contained in ${\rm PGL}_2({\bf Z} / 5{\bf Z}).$
With a bit more work we can show that in fact $\langle s,t \rangle$
is all of ${\rm PGL}_2({\bf Z} / 5{\bf Z}).$
For example, find elements of order $3$ and $5$
to deduce (as on that midterm problem …) that
$\left| \langle s,t \rangle \right|$ is a multiple of
$3 \cdot 4 \cdot 5 = 60$ so is either all of
${\rm PGL}_2({\bf Z} / 5{\bf Z})$ or a normal subgroup of index 2;
but since ${\rm PGL}_2({\bf Z} / 5{\bf Z}) \cong S_5$
an index-2 subgroup would have to be $\cong A_5$
which is impossible because $A_5$ contains no elements of order 4.
(Using the fact that $S_5$ is sharply 3-transitive
we then deduce that we cannot use just $s$ and $t$ to switch
two corner cubes, with the others ending up where they began.)
Finally, to explain why ${\rm PGL}_2(K)$ has the same structure as
fractional linear transformations. In general, we know that
${\rm GL}_n(K)$ acts on the vector space $K^n,$ and the action is
almost transitive: the orbits are $\{\vec 0\}$ and everything else.
The quotient group ${\rm PGL}_n(K) = {\rm GL}_n(K) / (K^\times I_n)$
does not act on $K^n,$ but it does act on the orbits
of $K^\times I_n$. We remove the uninformative $\vec 0;$
the remaining orbits are the nonzero vectors on lines through the origin
(a.k.a. one-dimensional subspaces of $K^n$).
The set of such lines is called projective space
of dimension $n-1$ over $K$, and denoted by
${\bf P}^{n-1}(K).$ NB the dimension is $n-1$ rather than $n$
because we lose one dimension by identifying proportional vectors.
In fact if a vector $\vec x = (x_1,\ldots,x_n)^* \in K^n$
[using the star to remind that we’re working with column vectors]
has $x_n \neq 0$ then the line through $\vec 0$ and $\vec x$ contains
a unique vector whose last coordinate is $1$
(namely $x_n^{-1} \vec x$);
this identifies “most” of ${\bf P}^{n-1}(K)$ with $K^{n-1},$
leaving only a “hyperplane at infinity” where $x_n = 0,$
which is a projective space of dimension one less.
Multiplication of $n \times n$ matrices by column vectors
of length $n$ yields a well-defined action of
${\rm PGL}_n(K)$ on ${\bf P}^{n-1}(K).$
In particular, if $n=2$ then we get a projective line that
consists of $K$ together with a “point at infinity” $\infty,$
with the line through the nonzero vector $(x_1,x_2)^*$ identified with
$x = x_1/x_2$ if $x_2 \neq 0,$ and with $\infty$ if $x_2 = 0.$
Any $({a\;b\atop c\;d}) \in {\rm GL}_2(K)$
takes $(x_1,x_2)^*$ to $(ax_1+bx_2, cx_1+dx_2)^*.$ The ratio
$(ax_1+bx_2) / (cx_1+dx_2)$ must depend only on $x=x_1/x_2,$
and indeed we compute that if $x_2 \neq 0$ then
$$
\frac{ax_1+bx_2}{cx_1+dx_2}
= \frac{(ax_1+bx_2) / x_2}{(cx_1+dx_2) / x_2}
= \frac{a \frac{x_1}{x_2} + b}{c \frac{x_1}{x_2} + d}
= \frac{ax + b}{cx + d}
$$
which explains the isomorphism beteween $\Gamma$ and ${\rm PGL}_2(K)$:
the action of $\Gamma$ on $\Omega$ is just the quotient by $K^\times$
of the action of ${\rm GL}_2(K)$ on $K^2 - \{0\}.$
(We still need to check that our identification also works in
the special case $x_2 = 0$ and $x = \infty,$ but that is routine.)
Once $n \gt 2$ the action of ${\rm PGL}_n(K)$ on ${\bf P}^{n-1}(K)$
is still doubly transitive, but not triply transitive.
That is because the action must preserve collinearity:
three points in ${\bf P}^{n-1}(K)$ are collinear if
they come from vectors in $K^n$ that are linearly dependent
(so span a vector subspace of dimension 2 rather than 3).
Collinearity in ${\bf P}^{n-1}(K)$ (and coplanarity for $n \gt 3$ etc.)
gives projective space a geometric structure that was not so easy to see
for $n=2.$ For example, the projective plane (i.e. ${\bf P}^{n-1}(K)$
with $n-1=2$, so $n=3)$ has the nice property that any two lines
meet in a unique point. Indeed the projective plane is a copy of
the ordinary plane $K^2$ together with a projective
“line at infinity” $\ell_\infty;$ lines in ${\bf P}^2(K)$
other than $\ell_\infty$ are ordinary lines in $K^2$ together with
a “point at infinity“ on $\ell_\infty,$
and if they are parallel in $K^2$ then they have the same point at infinity
so they meet there.
Historically ${\bf P}^2(K)$ was first discovered (in the case
$K = {\bf R}$) in the Renaissance visual arts,
where it provides the mathematical setting for the geometry of perspective.
A painting or drawing can reproduce your view of a three-dimensional scene
by projecting it from your eye to the plane $\Pi$ of the canvas or paper
— hence the term “projective“ for ${\rm PGL}_n$ and
${\bf P}^{n-1}.$ Changing $\Pi$ to some other $\Pi'$ requires
a projection from $\Pi$ to $\Pi'$ which (unless $\Pi,\Pi'$ are parallel)
misbehaves by sending a line on $\Pi$ to infinity and missing a line
of $\Pi'$. Completing $\Pi$ and $\Pi'$ to projective planes
cures this misbehavior. You are familiar with this line at infinity as
the “horizon” where parallel train tracks seem to meet.