Math 263x is a new “topics class” concentrating on some of the computational tools and techniques that can complement theoretical research in number theory, algebraic geometry, and related fields. We meet Mondays and Wednesdays from 1:00 PM to 2:00 PM in Room 222 of the Science Center.
If you find a mistake, omission, etc., please let me know by e-mail.
September 5:
Introduction
September 10:
Belyi maps and some of their uses
September 12:
Start on computation of Belyi functions
September 17:
More on Belyi polynomials etc.
September 19:
Resultants
September 24:
A cube minus a square
September 28 [sic]:
Interlude on sorting and searching
October 1:
Using multivariate (and usually p-adic) Newton's method
October 3:
Wednesday, Oct. 3: Multivariate p-adic Newton, cont'd
[October 8: No class: University holiday (Columbus Day)]
October 10:
positive- [usually 1-]dimensional families
October 15:
Curves of genus 0 through 5
October 17:
Hyperelliptic curves; curves of genus 1;
a Weil-Belyi function on an elliptic curve
(and parametrizing
October 22:
Overview of complex reflection groups and their invariant rings
(which give rise to highly symmetric curves)
October 24:
The Hilbert-Molien series of a group representation;
Oct. 29: NO CLASS:
Harvard cancelled all classes due to
anticipated severe weather
October 31:
Covariants of subgroups of
November 5:
Covariants of subgroups of
November 7:
Hurwitz quaternions, the W(D4) lattice,
and the W(F4) invariants;
Belyi functions parametrizing trinomials with interesting
Galois groups
Nov. 12 and 14: NO CLASS: I'll be in
Lausanne, Switzerland
November 19:
Introduction to (classical (elliptic)) modular curves
[November 21 is the start of Thanksgiving break]
November 26:
Fricke and Atkin-Lehner involutions of
November 28:
Non-Atkin-Lehner elements of the normalizer of
After outlining the general purpose and spirit of the class, we give an example that illustrates some of our concerns in a context that does not require most of the background that will be freely assumed later in the semester. The example is Fermat's celebrated two-squares theorem: a prime p can be written as a sum of two distinct squares if and only if p≡1 mod 4. The representation is unique up to switching the two summands. So take say p = 31415926535897932384626433832795028841. Fermat promises an essentially unique solution to the Diophantine equation p = x2 + y2.
HOW TO ACTUALLY FIND THIS SOLUTION?
Trying all x < p1/2 works in finite time, but not “finite enough” even with the computer (and if/when the computers catch up I can double the number of digits in p…). One proof of the theorem almost yields an efficient algorithm, using an idea attributed to Cornacchia (1908): x/y is a square root of −1 mod p, and conversely given such a root we recover (x,y) in time ≪logcp by lattice reduction (which in two dimensions is basically the Euclidean algorithm). All the ingredients we used are already implemented in packages such as gp, so the resulting algorithm can be expressed by a one-liner such as
[Victor Miller 1992, transcribed some time later into the new gp syntax]. So for instance
fermat(p) = qflll([lift(sqrt(Mod(-1,p))),p;1,0])[1,]
#
fermat(31415926535897932384626433832795028841)
returns [4223562448517994405, -3684758713859920604] in about 0 ms. (and this would even be feasible, if arduous, to do by hand).
Why did we write that this analysis “almost yields an efficient algorithm”? Well, how do we find the square root mod p? An embarrassment: it's easy to evaluate the Legendre symbol, but if it's +1 we generally don't know how to get a square root in deterministic polynomial time unless we assume the extended Riemann hypothesis for the Legendre character mod p — though we can do it in “random polynomial time”. However, modular square roots of small numbers can be evaluated in polynomial (albeit not practical) time by using the arithmetic of elliptic curves mod p ! That was the application Schoof gave for his algorithm [ = René Schoof: Elliptic Curves over Finite Fields and the Computation of Square Roots mod p, Math. of Computation 44, pages 483–494 (1985)] for counting rational points on an elliptic curve mod p. In our case we count points on the curve Y2 = X3 − X, which is relevant because it has complex multiplication by a square root of −1: the count is p+1±2x or p+1±2y, from which we recover the two-square representation in determinstic polynomial time.
See Serre's
Topics in Galois Theory (Boston: Jones & Bartlett 1992)
for the application to the inverse Galois problem (perhaps the
best-known arithmetic application) and other results concerning
Belyi functions. In algebraic geometry, such functions might be
most famous for the equality case in the
Hurwitz bound of
Such functions appear surprisingly often in other contexts; one of these years I might write an article on the ubiquity of Belyi functions. For now, I give references and/or links to some of the places where I've run across Belyi functions over the years:
• ABC implies Mordell,
International Math. Research Notices 1991 #7, 99–109
[bound with Duke Math. J. 64 (1991)].
• The Klein Quartic in Number Theory (1998, in
the MSRI volume The Eightfold Way on Klein's quartic curve
• “slides” from a 1999 talk at MSRI on
“Other Arithmetic Manifestations of Branched Covers”
• Shimura curve computations (1998)
[especially the curves associated to groups commensurate with
arithmetic triangle groups]
• Rational points near curves
and small nonzero |x3−y2|
via lattice reduction (2000) [see the start of Section 4,
pages 22–25; some of the other material here will figure
later in the course]
• Trinomials
ax7+bx+c and
ax8+bx+c
with Galois Groups of Order 168 and
• My HCMR
article on “The ABC’s of Number Theory” (2007)
A bit more detail on the topology:
a Belyi map
Start on computing explicit examples. We'll start with some
of the simpler cases, where C is rational
Next we might make g0 the product of three cycles,
of lengths a1, a2, and
a3, so we have
In general if g0 is the product of m cycles
and g1 is an
POSTSCRIPT to the three-cycle analysis last time:
one could also see directly that such a polynomial cannot exist
over R, because its logarithmic derivative
Before proceeding to the case that g1
is a double transposition, consider the generalization where
Suppose first that the ai are distinct.
Then the roots of P are in the field of coefficients
of P. As before, once
Recall that we postponed till later the case that
g0 is the product of only three cycles, so
In this setting the critical points
Consider for example the case that n=6 and
Indeed we find that
Exercises:
POSTSCRIPT on some Belyi polynomials of degree at most 7
for which
Motivation, definition, and properties of
resultants of univariate polynomials,
which we'll use to eliminate one of two variables when we've
brought one of these calculations down to solving two
simultaneous nonlinear equations.
The Sylvester matrix of P and Q
has corank equal the degree of
Exercise: Use this to find the Belyi polynomials
of degree 11 with cycle structures
POSTSCRIPT re degree-11 Belyi polynomials: for
We begin our consideration of the important special case where
(*) Recall that any cubic polynomial
[There was no class Wednesday the 26th due to the Yom Kippur fast,
so I offered to lecture Friday instead; lacking a quorum to proceed with
Newton's method, I substituted a tangential lecture on some applications
of sort-and-search in computational number theory]
A basic problem: given a list
xi (1<i<N),
find all coincidences
Besides its application to this kind of wordplay,
fast sorting finds use in some computational problems in number theory.
A typical example is the search for elliptic curves
For a paradigm we shall use Hall's identity, already exhibited
at the end of last Monday's notes:
For any m, we can normalize
P and Q to be monic, and require
the Laurent expansion of
We encountered such a problem before (see the end of the Sep.12 notes),
where the number of solutions was given by Bézout's
product formula. But here such a formula would grossly overestimate
the number of solutions because there's an
We'll exploit our foreknowledge that the solutions
must be rational numbers. (We shall soon see how to adapt
this method to solutions that are not necessarily rational
but still algebraic of small degree.) We do not know
in advance how complicated these rational numbers must be
(in the present case we at least pretend not to know…),
but we hope that they're simple enough that we can approximate
them closely enough to guess the numerator and denominator,
and then confirm the guess by checking that we have an
exact solution of our system of equations. The complexity
of a rational number c is measured by its height
If we have an approximation that's close but not good enough
for this purpose, we can repeatedly improve it using
Newton's method. Suppose we want to find a zero
In practice we won't actually compute
But we still don't know how to find a close enough initial approximation
Warning: even though P had quite small coefficients
(absolute value at most 12), the weighted projective coordinates
As promised, here are the polynomials for
Q: what if the coefficients we solve for weren't rational?
Remember we've seen a few examples over fields like
A0: If we know the quadratic imaginary field in advance, just
work over C rather than R,
and use continued fractions to approximate the real and imaginary parts
separately. If the field is unknown, but still quadratic imaginary,
we can recover the real part and the square of the imaginary
part from a close enough approximation, and then we're done.
BUT
that doesn't help us for real quadratic or more complicated
irrationalities…
(Added Oct.10)
A1: OK, so find all the algebraic conjugates —
which in effect is what we already did in the imaginary quadratic case
(when an approximation to c automatically gives us
an approximation to its complex conjugate). Then their
elementary symmetric functions are rational numbers, and
we've reduced to a previous problem. This is useful in other contexts
too, as for computing “CM points on modular curves”,
such as
BUT we're actually going to work
We need a more powerful technique.
Overview of
lattice basis reduction,
accomplished in polynomial time by the
Lenstra-Lenstra-Lovász algorithm,
which for us is a tool for
finding integer relations
and similar Diophantine-approximation tasks. For us, we need it
to recognize an algebraic number c of degree
at most d from a good approximation,
which is tantamount to finding an integer relation in
Lattice reduction in dimension >2 can give rise to
useful practical improvements even for problems that we already
know how to solve using
Finally (and you may have been wondering about this for a while now),
how do we know that our solution will have even one conjugate
defined
So far we've developed some techniques for finding (maps described by)
isolated identities. But there's also considerable interest in
identities that vary in positive-dimensional families. Our techniques
adapt at least to low- (preferably 1-)dimensional families;
some of the techniques we'll develop later this term also
lend themselves to families of higher dimension.
Two examples of problems that naturally lead to a one-dimensional family
of identities: covers of
Now when we have an isolated identity we can try to find it by
using Newton to bootstrap from an approximate solution. That
doesn't work in a positive-dimensional family: it might actually
be easier to find an initial approximation, but with fewer equations
than unknowns it doesn't make sense to speak of to define the
exact solution that it approximates. Even if we do know a special
exact solution (e.g. the
[...]
So far we've made sure that all our Belyi covers are rational curves;
but that's not always the case, nor the only interesting case.
Before giving some examples of Belyi maps on non-rational curves,
we need to give (or recall) enough of a description of such curves
to understand the form of explicit equations defining the curves and
rational functions on them. We'll stop at genus 5,
which is the last case that a generic curve is a complete intersection
in projective space, namely the zero-locus of a three-dimensional
space of quadrics (homogeneous polynomials of degree 2)
genus 0: over C it's just the
projective line (a.k.a. Riemann sphere), but over a field that's
not algebraically closed (nor finite) even genus-zero curves
needn't be trivial. Such a curve is always a smooth conic in
genus 1: again, over an algebraically closed field such as
C we have a familiar picture, this time an
elliptic curve C, since there must be a rational point
But what if there is no divisor of degree 1?
Any
genus 2: Once g≥1, the curve C
is of general type (the canonical divisor is positive).
The space of holomorphic differentials gives a map,
the canonical map,
Again we give an example of a modular curve, this time
genus 3 and higher: Here if C is not hyperelliptic
then the holomorphic differentials (= sections of the canonical divisor)
embed C as a curve of degree
A curve C of genus g>1 is hyperelliptic
if and only if the canonical map
Given just C and the holomorphic differentials,
we can recognize the hyperelliptic curves as those for which
the differentials satisfy too many quadratic relations,
For a general hyperelliptic curve C of genus 3,
the genus-zero quotient curve
[elliptic normal curves and their Jacobians]
Here's an example of some of the new considerations that arise
when we deal with Belyi functions on curves of positive genus.
We'll find the unique such function
Curiously we can also predict the simple preimage P
of the third branch point 1: it must be
The next step is to parametrize pairs
Now it's easy to describe, for small
Next step is to find a Weil function w. Since w is
a section of
We are finally ready to find the value of a,
and thus the curve E, for which
The standard model of E has coordinates
[motivation: Klein, Fermat etc., and Bring curves]
Recall that a complex reflection is a linear transformation
g of a finite-dimensional projective space V
such that g is of finite order and fixes
a subspace of dimension
A subgroup G of
The overlap between the two lists of examples is no coincidence:
Theorem (Shephard-Todd 1954): A finite subgroup
G of
Part of the original proof (G.C. Shephard and J.A. Todd,
“Finite unitary reflection groups”,
There are three infinite families of irreducible complex reflection groups,
plus 34 exceptional cases, in dimensions ranging from 2 to 8.
Most (19) of the 34 exceptional complex reflection groups
are in dimension 2; each is a variation of one of the
three special groups
We have encountered already most of the infinite families.
The simplest one (though it happens to be listed third in the
Shephard-Todd list) consists of the cyclic groups
The next series (and the first in the Shephard-Todd table)
is the symmetric group
The final infinite series generalizes the group that arises in the
symmetries of Fermat curves. For any
The real (a.k.a. Euclidean) reflection groups in these
families are:
It is not quite correct to say the three infinite families
plus the 34 exceptional cases fully account for the
irreducible complex reflection groups.
There's also the missing group
[In positive characteristic, it is known that a representation of a
finite group that has polynomial invariant ring must still be
generated by g such that
Recall that if U is a graded vector space with degrees
We next work out in some detail the identities related with exceptional
finite subgroups of
For any finite subgroup
A nonzero polynomial P is covariant
We next give explicit formulas in each case, using
N=3:
We put the four vertices of the tetrahedron at
G0 clearly contains the 3-cycle
Besides the A4 cover
The
N=4:
We have two natural choices here. One is to note that
the edge-centers of a regular tetrahedron are the vertices of
a regular octahedron, while the vertices of the tetrahedron
and its dual constitute the eight vertices of the
octahedron's dual cube. We may thus use the above C and
It is often more convenient to start from
Here the symmetry group is generated by
N=5:
Review of the complex-analytic picture of the modular curves Y(1) and X(1)
(affine and projective
The covers X(N) → X(1) are Galois.
For small N these curves are well known: rational for
We're not up to computing with
As usual there's a PGL2 choice for the rational coordinate
h on a rational curve, which we exploit (and reduce) by putting
a convenient point at infinity. Here we require that the pole
of h be at the cusp
For N=2 we easily construct such h as
For prime N>2, we can likewise construct the rational function
is a modular cusp form of
We can already see some patterns and arithmetical artifacts here,
some of which we'll soon be able to explain. For starters, note that
the pair of simple roots of j
(when N is
When N−1 is a factor of 24 but N is not prime,
we can still use the formula
If you carried out this calculation you may have noticed that
many of the coefficients of the
A key feature of the curves
Recall that to any isogeny
By the Riemann-Hurwitz formula, the genus
For the seven values of N>1 with
[Added after class: each of the pairs
The Fricke involution wN
Monday, Sep. 17: More on Belyi polynomials etc.
i) Since we just got a sextic cover with Galois group
S5, there must also be a Belyi map of degree 5
giving the same Galois closure. The cycle structures are
ii) What happens for the Belyi polynomials of degree 7 for which
Wednesday, Sep. 19: Resultants
As for degrees 6 and 7…
• In degree 6, besides the cases with cycle structures
• 7 / 331 / 22111:
here G is necessarily the 168-element group;
the polynomial is defined
over
• 7 / 421 / 22111: here
END POSTSCRIPT
Monday, Sep. 24: A cube minus a square
END POSTSCRIPT
Friday, Sep. 28: Interlude on sorting and searching
Monday, Oct. 1: Using multivariate (and usually p-adic)
Newton's method
•
For
•
For
•
Finally, for
Wednesday, Oct. 3: Multivariate p-adic Newton, cont'd
Wednesday, Oct. 10:
positive- [usually 1-]dimensional families
Monday, Oct. 15: Curves of genus 0 through 5
]
ModularForms(11,prec=14).echelon_basis()
to get the
[
1 + 12*q^2 + 12*q^3 + 12*q^4 + 12*q^5 + 24*q^6 + 24*q^7 + 36*q^8 + 36*q^9 + 48*q^10 + 72*q^12 + 24*q^13 + O(q^14),
q - 2*q^2 - q^3 + 2*q^4 + q^5 + 2*q^6 - 2*q^7 - 2*q^9 - 2*q^10 + q^11 - 2*q^12 + 4*q^13 + O(q^14)
]
in which the second generator, call it
Z=subst(z^2,q,serreverse(1/x)); subst(truncate(Z),q,1/X)
in gp) then gives us the equation
CuspForms(Gamma1(13),prec=14).echelon_basis()
to get
[
q - 4*q^3 - q^4 + 3*q^5 + 6*q^6 - 3*q^8 + q^9 - 6*q^10 - 2*q^12 + 2*q^13 + O(q^14),
q^2 - 2*q^3 - q^4 + 2*q^5 + 2*q^6 - 2*q^8 + q^9 - 3*q^10 + 3*q^13 + O(q^14)
]
We call these φ1 and
φ2, and set
Wednesday, Oct. 17:
Hyperelliptic curves; curves of genus 1;
a Weil-Belyi function on an elliptic curve
(and parametrizing
[
q + q^4 - q^5 - 2*q^6 + 2*q^7 - 2*q^8 - 3*q^10 - 2*q^12 + 2*q^14 + 2*q^15 + 3*q^16 - 2*q^17 + 3*q^18 + 2*q^19 + O(q^20),
q^2 - 2*q^4 - q^5 + 3*q^8 + q^9 + q^10 - 2*q^11 - 2*q^12 + 2*q^13 + 2*q^14 - 4*q^16 - 2*q^18 + 2*q^19 + O(q^20),
q^3 - 2*q^4 + q^6 - q^7 + 2*q^8 + 2*q^10 - 3*q^11 - q^12 + 2*q^13 - q^14 - 2*q^15 - 2*q^18 + 3*q^19 + O(q^20)
]
(you may have surmised by now that
As before, this octic polynomial has distinct roots modulo
all primes other than 2 and factors of the level (here 41:
the discriminant is
Let T be any point other than the group-law
origin O, and translate x and y
to put T
Monday, Oct. 22:
Overview of complex reflection groups and their invariant rings
(which give rise to highly symmetric curves)
Wednesday, Oct. 24:
The Hilbert-Molien series of a group representation;
Now the Euler relation
N
G0
polyhedron
|G0|
V
F
E
3
A4
tetrahedron
12
4
4
6
4
S4
octahedron
24
6
8
12
5
A5
icosahedron
60
12
20
30
Monday, Oct. 31:
Covariants of subgroups of
Monday, Nov. 5:
Covariants of subgroups of
Wednesday, Nov. 7:
Hurwitz quaternions, the W(D4) lattice,
and the W(F4) invariants;
Belyi functions parametrizing trinomials with interesting
Galois groups
Monday, Nov. 19:
Introduction to (classical (elliptic)) modular curves
• N=3:
• N=4:
j
=
(h2 + 28h + 212)3
/ ((h + 16) h4)
= 1728 +
(h + 32)2
(h2 − 29h − 213)2
/ ((h + 16) h4);
Monday, Nov. 26:
Fricke and Atkin-Lehner involutions of
Wednesday, Nov. 28:
Non-Atkin-Lehner elements of the normalizer of