A Blueprint for Fermat’s Last Theorem

3 Elliptic curves, and the Frey Curve

3.1 Overview

In the last chapter we explained how, given a counterexample to Fermat’s Last Theorem, we could construct a Frey package and thus a Frey curve, which is an elliptic curve with some interesting properties. In this chapter we start with an overview of parts of the theory of the arithmetic of elliptic curves. Following this we sketch proofs of the two main results of this chapter: firstly that the \(p\)-torsion \(\rho \) in the Frey curve is “hardly ramified”, and secondly that Mazur’s result on the possible torsion of elliptic curves implies that \(\rho \) must be irreducible. Everything here follows from standard results about elliptic curves, however almost none of these results are in mathlib as I am writing this, so there is plenty to be done here.

3.2 The arithmetic of elliptic curves

We give an overview of the results we need, citing the literature for proofs. Everything here is standard, and most of it dates back to the 1970s or before.

Theorem 3.1
#

Let \(n\) be a positive integer, let \(F\) be a separably closed field with \(n\) nonzero in \(F\), and let \(E\) be an elliptic curve over \(F\). Then the \(n\)-torsion \(E(K)[n]\) in the \(F\)-points of \(E\) is a finite group of size \(n^2\).

Proof

There are several proofs in the textbooks. The proof being worked on uses the theory of division polynomials; the formalisation is ongoing work of David Angdinata, and it will be part of his PhD thesis.

This theorem actually tells us the structure of the \(n\)-torsion, because of the following purely group-theoretic result:

Lemma 3.2
#

Say \(n\) is a positive integer, \(r\) is a natural, and \(A\) is an abelian group. Assume that for all \(d\mid n\), the \(d\)-torsion \(A[d]\) of \(A\) has size \(d^r\). Then \(A[n]\cong (\mathbb {Z}/n\mathbb {Z})^r\).

Proof

The result is obvious if \(n=1\), so we may assume \(n{\gt}1\). One proof would be to write \(A\) as \(\prod _{i=1}^t(\mathbb {Z}/a_i\mathbb {Z})\) with \(a_i\mid a_{i+1}\) (this is possible by the structure theorem for finite abelian groups), and then to apply our hypothesis firstly with \(d=a_1\) to deduce \(t=r\) and then with \(d=a_t\) to deduce \(a_1=a_t\).

Corollary 3.3
#

Let \(n\) be a positive integer, let \(F\) be a separably closed field with \(n\) nonzero in \(F\), and let \(E\) be an elliptic curve over \(F\). Then the \(n\)-torsion \(E(F)[n]\) in the \(F\)-points of \(E\) is a finite group isomorphic to \((\mathbb {Z}/n\mathbb {Z})^2\).

Proof

This follows from the previous group-theoretic lemma 3.2 and theorem 3.1.

We saw in section 2.4 that if \(E\) is an elliptic curve over a field \(k\) and if \(k^{\operatorname{sep}}\) is a separable closure of \(k\), then the group \(\operatorname{Gal}(k^{\operatorname{sep}}/k)\) acts on \(E(k^{\operatorname{sep}})[n]\). Now let \(n\) be a positive integer which is nonzero in \(k\). We have just seen that \(E(k^{\operatorname{sep}})[n]\) is isomorphic to \((\mathbb {Z}/n\mathbb {Z})^2\), and it inherits an action of \(\operatorname{Gal}(k^{\operatorname{sep}}/k)\). If we fix an isomorphism \(E(k^{\operatorname{sep}})[n]\cong (\mathbb {Z}/n\mathbb {Z})^2\) then we get a representation \(\operatorname{Gal}(k^{\operatorname{sep}}/k)\to \operatorname{GL}_2(\mathbb {Z}/n\mathbb {Z})\). A fundamental fact about this Galois representation is that its determinant is the cyclotomic character.

Theorem 3.4

If \(E\) is an elliptic curve over a field \(k\), and \(n\) is a positive integer which is nonzero in \(k\), then the determinant of the 2-dimensional representation of \(\operatorname{Gal}(k^{\operatorname{sep}}/k)\) on \(E(k^{\operatorname{sep}})[n]\) is the mod \(n\) cyclotomic character.

Proof

This presumably should be done via the Weil pairing. I have not yet put any thought into a feasible way to formalise this.

3.3 Good reduction

We give a brief overview of the theory of good and multiplicative reduction of elliptic curves. For more details one can consult the standard sources such as  [ 14 ] . We stick with the low-level approach, thinking of elliptic curves as plane cubics; whilst we cannot do this forever, it will suffice for these initial results.

Definition 3.5
#

Let \(E\) be an elliptic curve over the field of fractions \(K\) of a valuation ring \(R\) with maximal ideal \(\mathfrak {m}\). We say \(E\) has good reduction over \(R\) if \(E\) has a model with coefficients in \(R\) and the reduction mod \(\mathfrak {m}\) is still non-singular. If \(E\) is an elliptic curve over a number field \(N\) and \(P\) is a maximal ideal of its integer ring \(\mathcal{O}_N\), then one says that \(E\) has good reduction at \(P\) if \(E\) has good reduction over the \(\mathcal{O}_{N,P}\), the localisation of \(\mathcal{O}_N\) at \(P\).

Remark 3.6
#

From this point on, our Frey curves and Frey packages will use notation \((a,b,c,\ell )\), with \(\ell \geq 5\) a prime number, rather than \(p\). This frees up \(p\) for use as another prime.

If \(E\) is the Frey curve \(Y^2=X(X-a^\ell )(X+b^\ell )\) associated to a Frey package \((a,b,c,\ell )\), and if \(p\) is a prime not dividing \(abc\) (and in particular if \(p{\gt}2\)), then \(E\) has good reduction at \(p\).

Proof

The reduction mod \(p\) of the equation defining the Frey curve is still a smooth plane cubic, because the three roots \(0\), \(a^\ell \) and \(-b^\ell \) are distinct modulo \(p\) (note that the difference between \(a^\ell \) and \(-b^\ell \) is \(c^\ell \)).

If \(E\) is an elliptic curve over a number field \(N\) and if \(\rho \) is the representation of \(\operatorname{Gal}(\overline{N}/N)\) on the \(n\)-torsion of \(E\) then \(\rho \) is continuous and its image is finite, so by the fundamental theorem of (infinite) Galois theory the representation factors through an injection \(\operatorname{Gal}(L/N)\to \operatorname{GL}_2(\mathbb {Z}/n\mathbb {Z})\) where \(L/N\) is a finite Galois extension of number fields. One says that \(\rho \) is unramified at a maximal ideal \(P\) of \(\mathcal{O}_N\) if the extension \(L/N\) is unramified at \(P\) (or in other words, if the factorization of \(P\mathcal{O}_L\) into prime ideals is squarefree).

At some point we will need a theory of finite flat group schemes over an affine base. Here is a working definition.

Definition 3.8
#

If \(R\) is a commutative ring, then a finite flat group scheme over \(R\) is the spectrum of a commutative Hopf algebra \(H/R\) which is finite and flat as an \(R\)-module.

(Probably this is not the correct definition in the non-Noetherian case; one should instead ask for locally free, which is equivalent in the Noetherian case and enables you to reduce to the Noetherian case in general)

Some facts we will need are:

Theorem 3.9

If \(E\) is an elliptic curve over a number field \(N\) and \(E\) has good reduction at a maximal ideal \(P\) of \(\mathcal{O}_N\), and if furthermore \(n\not\in P\), then the Galois representation on the \(n\)-torsion of \(E\) is unramified.

Proof

One approach would be by showing that the \(n\)-torsion in the integral model of \(E\) over \(\mathcal{O}_{N,P}\) is an etale finite flat group scheme. There might be simpler approaches however. It’s worth looking to see what Silverman does.

Theorem 3.10

If \(E\) is an elliptic curve over a number field \(N\) and \(E\) has good reduction at a maximal ideal \(P\) of \(\mathcal{O}_N\) containing the prime number \(p\), then the Galois representation on the \(p\)-torsion of \(E\) comes from a finite flat group scheme over the localisation \(\mathcal{O}_{N,P}\).

Proof

Indeed, the kernel of the \(p\)-torsion on a good integral model is finite and flat. Checking this claim formally will probably involve a fair amount of work.

3.4 Multiplicative reduction

Definition 3.11
#

Let \(E\) be an elliptic curve over the field of fractions \(K\) of a valuation ring \(R\) with maximal ideal \(\mathfrak {m}\). We say \(E\) has multiplicative reduction over \(R\) if \(E\) has a model with coefficients in \(R\) and which reduces mod \(R/\mathfrak {m}\) to a plane cubic with one singularity, which is an ordinary double point. We say that the reduction is split if the two tangent lines at the ordinary double point are both defined over \(R/\mathfrak {m}\), and non-split otherwise.

If \(E\) is an elliptic curve over a number field \(N\) and \(P\) is a maximal ideal of its integer ring \(\mathcal{O}_N\), then one says that \(E\) has multiplicative reduction at \(P\) if \(E\) has multiplicative reduction over the \(\mathcal{O}_{N,P}\), the localisation of \(\mathcal{O}_N\) at \(P\).

Lemma 3.12

If \(E\) is the Frey curve \(Y^2=X(X-a^\ell )(X+b^\ell )\) associated to a Frey package \((a,b,c,\ell )\), and if \(p\) is an odd prime which divides \(abc\), then \(E\) has multiplicative reduction at \(p\).

Proof

The hypothesis \(p\mid abc\) implies that precisely two of the three roots \(0\), \(a^\ell \) and \(-b^\ell \) of the cubic are equal mod \(p\). Call \(x\in \mathbb {Z}/p\mathbb {Z}\) this common value. Then the reduction mod \(p\) of the curve is smooth away from the point \((x,0)\), and has an ordinary double point at \((x,0)\). Hence the Frey curve has multiplicative reduction at \(p\).

Remark 3.13
#

If the third root reduces mod \(p\) to \(y\not=x\), then the reduction is split multiplicative iff \(x-y\) is a square mod \(p\). We shall not need this fact.

Lemma 3.14

If \(E\) is the Frey curve \(Y^2=X(X-a^\ell )(X+b^\ell )\) associated to a Frey package \((a,b,c,\ell )\) then \(E\) has multiplicative reduction at 2.

Proof

Indeed, the change of variables \(X=4X'\) and \(Y=8Y'+4X'\) transforms the equation to \(64Y'^2+64X'Y'=64X'^3+16X'^2(b^\ell -a^\ell -1)-4X'a^\ell b^\ell \) and, because \(\ell \geq 5\), \(b\) is even and \(a=3\) mod 4, we see that the 64s cancel, giving an equation over \(\mathbb {Z}\) which reduces mod 2 to \(Y'^2+X'Y'=X'^3+cX'^2\) for some \(c\in \{ 0,1\} \). This cubic is smooth away from an ordinary double point at \((0,0)\). Hence the Frey curve has multiplicative reduction at 2.

Remark 3.15
#

Note that \(E\) has split multiplicative reduction iff \(c=0\), which happens iff \(a^\ell =7\) mod \(8\). We shall not need this fact.

In particular, the Frey curve associated to a Frey package is semistable – it has good or multiplicative reduction at all primes.

The main thing we need about elliptic curves with multiplicative reduction over nonarchimedean local fields is the uniformisation theorem, originally due to Tate.

Theorem 3.16

If \(E\) is an elliptic curve over a field complete with respect to a nontrivial nonarchimedean (real-valued) norm \(K\) and if \(E\) has split multiplicative reduction, then there is a Galois-equivariant injection \((K^{\operatorname{sep}})^\times /q^{\mathbb {Z}}\to E(K^{\operatorname{sep}})\), where \(q\in K^\times \) satisfies \(|q|=|j(E)|^{-1}\).

Proof

See  [ 13 ] , Theorems V.3.1, Remark V.3.1.2 (we don’t need surjectivity), and Theorem V.5.3. This is a lot of work and is a good target for breaking down into many smaller lemmas.

Corollary 3.17

If \(E\) is an elliptic curve over a field \(K\) complete with respect to a nontrivial nonarchimedean (real-valued) norm and with perfect residue field, and if \(E\) has multiplicative reduction, then there’s an unramified character \(\chi \) of \(\operatorname{Gal}(K^{\operatorname{sep}}/K)\) whose square is 1, such that for all positive integers \(n\) with \(n\not=0\) in \(K\), the \(n\)-torsion \(E(K^{\operatorname{sep}})[n]\) is an extension of \(\chi \) by \(\epsilon \chi \), where \(\epsilon \) is the cyclotomic character. Furthermore, the element of \(K^\times /(K^\times )^\ell \) corresponding to this extension is given by the \(q\)-invariant of the curve.

Proof

After a quadratic twist we may assume that \(E\) has split multiplicative reduction. The result then follows from the uniformisation theorem and an explicit computation. Note that even if we do not prove surjectivity of Tate’s uniformisation, we still know that it’s surjective on the \(n\)-torsion, because we know that there are \(n^2\) points in the \(n\)-torsion of \(E\) over \(K^{\operatorname{sep}}\), and they are all accounted for by the \(n\)-torsion in \((K^{\operatorname{sep}})^\times /q^{\mathbb {Z}}\).

3.5 Hardly ramified representations

We make the following definition; this is not in the literature but it is a useful concept for us.

Definition 3.18
#

Let \(\ell \geq 5\) be a prime and let \(V\) be a 2-dimensional vector space over \(\mathbb {Z}/\ell \mathbb {Z}\). A representation \(\rho : \operatorname{Gal}(\overline{\mathbb {Q}}/\mathbb {Q})\to \operatorname{GL}(V)\) is said to be hardly ramified if it satisfies the following four axioms:

  1. \(\det (\rho )\) is the mod \(\ell \) cyclotomic character;

  2. \(\rho \) is unramified outside \(2\ell \);

  3. The semisimplification of the restriction of \(\rho \) to \(\operatorname{Gal}(\overline{\mathbb {Q}}_2/\mathbb {Q}_2)\) is unramified;

  4. The restriction of \(\rho \) to \(\operatorname{Gal}(\overline{\mathbb {Q}}_\ell /\mathbb {Q}_\ell )\) comes from a finite flat group scheme.

We are interested in hardly ramified representations for several reasons. One is that by using some deep theorems, we will be able to prove that all hardly ramified representations are potentially automorphic, which will give us our first foothold into the world of modular forms. We shall come back to these ideas later. In the next section we shall be concerned with the following rather simpler result, namely that the \(\ell \)-torsion in the Frey curve associated to a Frey package \((a,b,c,\ell )\) is hardly ramified. The proof is standard; for another reference, see Theorem 2.15 of  [ 5 ] .

3.6 The \(\ell \)-torsion in the Frey curve is hardly ramified.

Let \((a,b,c,\ell )\) be a Frey package, with associated Frey curve \(E\) and mod \(\ell \) Galois representation \(\rho =E[\ell ]\). We now work through a proof that \(\rho \) is hardly ramified.

Theorem 3.19

If \(p\not=\ell \) is a prime not dividing \(abc\) then \(\rho \) is unramified at \(p\).

Proof

Indeed, \(E\) has good reduction at \(p\), and hence \(\rho \) is unramified at \(p\) by 3.9.

If however \(p\) divides \(abc\) then \(E\) has multiplicative reduction at \(p\), and we can use the theory of the Tate curve to analyse \(\rho \) at \(p\).

Theorem 3.20
#

If \((a,b,c,\ell )\) is a Frey package then the \(j\)-invariant of the corresponding Frey curve is \(2^8(C^2-AB)^3/A^2B^2C^2\), where \(A=a^\ell \), \(B=b^\ell \) and \(C=c^\ell \).

Proof

Apply the explicit formula (presumably already in mathlib)

Corollary 3.21

If \((a,b,c,\ell )\) is a Frey package and the \(j\)-invariant of the corresponding Frey curve is \(j\), and if \(2{\lt}p\mid abc\), then the \(p\)-adic valuation \(v_p(j)\) of \(j\) is a multiple of \(\ell \).

Proof

Indeed \(p\) does not divide \(2^8\) as \(p{\gt}2\), and (using the notation of the previous theorem) \(p\) does not divide \(C^2-AB\) either, because it divides precisely one of \(A\), \(B\) and \(C\). Hence \(v_p(j)=-2v_p(a^\ell b^\ell c^\ell )=-2\ell v_p(abc)\) is a multiple of \(\ell \).

Corollary 3.22

If \((a,b,c,\ell )\) is a Frey package, if \(2{\lt}p\mid abc\) is a prime with \(p\not=\ell \), then the \(\ell \)-torsion in the Frey curve is unramified at \(p\).

Proof

After an unramified quadratic twist we may assume the curve is split at \(p\). The theory of the Tate curve tells us that the extension of \(\mathbb {Q}_p\) cut out by the \(\ell \)-torsion of the Frey curve is \(\mathbb {Q}_p(\mu _\ell ,\sqrt[\ell ]{q})\), with \(\mu _\ell \) the set of \(\ell \)th roots of unity in \(\overline{\mathbb {Q}}_p\). Because \(\ell \not=p\) the extension \(\mathbb {Q}_p(\mu _\ell )\) is unramified at \(p\). And because \(p\not=2\) divides \(abc\), theorem 3.21 shows us that the \(j\)-invariant of the Frey curve has \(p\)-adic valuation a multiple of \(\ell \). Thus the extension can be written \(\mathbb {Q}_p(\mu _\ell ,\sqrt[\ell ]{u})\), where \(u\in \mathbb {Q}_p^\times \) is a unit. The extension is hence unramified (because, for example, Hensel’s Lemma shows that the \(\ell \)th root of \(u\) is in the maximal unramified extension of \(\mathbb {Q}_p\)).

Corollary 3.23

If \((a,b,c,\ell )\) is a Frey package, then the \(\ell \)-torsion in the Frey curve is unramified at all primes \(p\not=2,\ell \).

Proof

Follows from 3.19 and 3.22.

This analysis needs to be slightly modified if \(p=2\), because the \(j\)-invariant of the Frey curve may not have 2-adic valuation a multiple of \(\ell \). We obtain the following weaker result.

Corollary 3.24

If \((a,b,c,\ell )\) is a Frey package, then the semisimplification of the restriction of the \(\ell \)-torsion \(\rho \) in the associated Frey curve to \(\operatorname{Gal}(\overline{\mathbb {Q}}_2/\mathbb {Q}_2)\) is unramified.

Proof

After a quadratic twist to make the curve have split multiplicative reduction, the theory of the Tate curve shows us that \(\rho \) is an extension of the trivial character by the cyclotomic character. Hence the semisimplification of this representation is the direct sum of two unramified characters and is hence unramified.

Let \(\rho \) be the \(\ell \)-torsion in the Frey curve associated to a Frey package \((a,b,c,\ell )\). Then the restriction of \(\rho \) to \(\operatorname{Gal}(\overline{\mathbb {Q}}_\ell /\mathbb {Q}_\ell )\) comes from a finite flat group scheme.

Proof

The Frey curve either has good reduction at \(\ell \) (case 1 of FLT) or multiplicative reduction at \(\ell \) (case 2 of FLT). In the first case the \(\ell \)-torsion is finite and flat at \(\ell \) by theorem 3.10. In the second case the theory of the Tate curve shows that the \(\ell \)-torsion is (up to quadratic twist) an extension of the trivial character by the cyclotomic character corresponding (via Hilbert 90) to the \(\ell \)th power of an \(\ell \)-adic unit. This extension is known to be finite and flat; see for example Proposition 8.2 of  [ 6 ] . Note that the proof in  [ 6 ] uses fppf cohomology, although one can write down a much more elementary proof of this using arguments in  [ 8 ] .

We have now proved the first main result of this chapter.

Let \(\rho \) be the Galois representation on the \(\ell \)-torsion of the Frey curve coming from a Frey package \((a,b,c,\ell )\). Then \(\rho \) is hardly ramified.

Proof

This follows from the results above. The fact that \(\ell \geq 5\) follows from the definition of a Frey package. The first condition is theorem 3.4, and the second is theorem 3.23. The third condition is theorem 3.24, and the fourth is theorem 3.25.

3.7 The \(\ell \)-torsion in the Frey curve is irreducible.

We finish this chapter by showing that Mazur’s theorem implies that the \(\ell \)-torsion in the Frey curve is irreducible. We start by stating Mazur’s theorem.

Theorem 3.27

Let \(E\) be an elliptic curve over \(\mathbb {Q}\). Then the torsion subgroup of \(E\) has size at most 16.

Proof

This is the main theorem of  [ 9 ] . Formalising this result will be a highly non-trivial project; note that this theorem is used in all known proofs of FLT, so there seems to be no way around it.

Let \((a,b,c,\ell )\) be a Frey package, with associated Frey curve \(E\) and mod \(\ell \) Galois representation \(\rho =E[\ell ]\). We know that \(\rho \) is 2-dimensional; let’s suppose for a contradiction that that it is reducible, so in particular its semisimplification is the direct sum of two characters \(\alpha \) and \(\beta \).

The next two results are Lemme 6 on p307 of  [ 11 ] .

Theorem 3.28

With notation as above, the characters \(\alpha \) and \(\beta \) are unramified at \(p\) for all primes \(p\not=\ell \).

Proof

We have seen in theorem 3.23 that \(\rho \) is unramified at all primes \(p\not=2,\ell \), so the characters \(\alpha \) and \(\beta \) are unramified at all such primes. If \(p=2\) then the semisimplification of the restriction of \(\rho \) to \(\operatorname{Gal}(\overline{\mathbb {Q}}_2/\mathbb {Q}_2)\) is unramified by corollary 3.24, so \(\alpha \) and \(\beta \) are unramified at 2.

Remark 3.29
#

Does this innocuous-looking proof above use some form of the Brauer-Nesbitt theorem?

Theorem 3.30

One of \(\alpha \) and \(\beta \) is unramified at \(\ell \).

Proof

In the multiplicative case this follows immediately from the theory of the Tate curve. In the good reduction case, the \(\ell \)-torsion is finite and flat at \(\ell \) by theorem 3.25, so we now need to understand what such representations look like. If the reduction is supersingular, then \(\rho \) is necessarily irreducible, contradicting our assumption. If however the reduction is ordinary, then the theory of the canonical subgroup shows that the \(\ell \)-torsion is an extension of an unramified character by an unramified twist of the cyclotomic character (see Proposition 11 on p273 of  [ 11 ] ).

Corollary 3.31

One of \(\alpha \) and \(\beta \) is trivial.

Proof

The previous two theorems show that one of \(\alpha \) and \(\beta \) is a character unramified at all primes, and hence cuts out an extension unramified at all primes, so by Minkowski’s theorem this character is trivial.

To summarise, we have shown the following.

Theorem 3.32

If \(\rho \) is reducible, then either \(\rho \) has a trivial 1-dimensional submodule or a trivial 1-dimensional quotient (here “trivial” means that the Galois group \(\operatorname{Gal}(\overline{\mathbb {Q}}/\mathbb {Q})\) acts trivially).

Proof

Follows from the above.

We now split into two cases, depending on whether \(\rho \) has a trivial submodule or a trivial quotient.

Lemma 3.33

If \(\rho \) has a trivial 1-dimensional submodule then the Frey curve has a non-trivial point of order \(\ell \).

Proof

Indeed, the trivial 1-dimensional submodule is a Galois-invariant subgroup of \(E[\ell ]\), so it corresponds to a Galois-stable point of order \(\ell \).

Corollary 3.34

\(\rho \) cannot have a trivial 1-dimensional submodule.

Proof

We have just seen that in this case, the Frey curve has a point of order \(\ell \). It also has three points of order 2, meaning that its torsion subgroup has order at least \(4\ell \geq 20\), contradicting Mazur’s theorem 3.27.

It remains to rule out the case where \(\rho \) is reducible and has a trivial quotient. To do this, we need to quotient out \(\rho \) by its 1-dimensional Galois-stable submodule.

Theorem 3.35

If \(p\) is a prime and if \(E\) is an elliptic curve over a field \(K\) of characteristic not equal to \(p\), and if \(C\subseteq E(K^{\operatorname{sep}})[p]\) is a Galois-stable subgroup of order \(p\), then there’s an elliptic curve \(E':=\)“\(E/C\)” over \(K\) and an isogeny of elliptic curves \(E\to E'\) over \(K\) inducing a Galois-equivariant surjection \(E(K^{\operatorname{sep}})\to E'(K^{\operatorname{sep}})\) with kernel precisely \(C\).

Proof

Brian Conrad suggested the following approach, applicable as well for abelian schemes \(A\to S\) over a base. Let \(G\) be a finite locally free \(S\)-subgroup of \(A\), say \(G\) with constant rank \(n {\gt} 0\) by working locally on the base, so \(G\) is contained in \(A[n]\). Then \(n: A \to A\) is the fppf quotient of the source by \(A[n]\), so it expresses \(A\) as an \(A[n]\)-torsor over itself. The problem of building \(A/G\) as an abelian scheme is then seen to be the “same” as that of constructing the quotient of this \(A[n]\)-torsor by the G-action.

In other words, the problem them becomes one having nothing specific to do with abelian schemes, at the cost of working over a base (such as the original target \(A\)) even when \(S\) was the spectrum of a field in the application. The question is now: for a finite locally free commutative \(S\)-group \(H\) and a closed locally free \(S\)-subgroup \(G\), build a reasonable quotient \(H/G\). One approach is to look at the Cartier dual \(H^\vee \to G^\vee \), show that it’s faithfully flat, and then deduce that the Cartier dual of the kernel of this map does the job. Note that one input for this proof is the claim that inclusions of Hopf algebras over fields are flat, proved nicely in Waterhouse’s book.

I suspect that the proof above is a no-go right now; there will presumably be a much easier proof of this result in Silverman though. Note also that this approach will not give us a plane cubic, but rather a smooth proper group scheme so we would need Riemann-Roch to turn it into a plane cubic, although it’s unlikely that one will be able to prove Mazur’s theorem without developing all of this machinery and much more.

Corollary 3.36

\(\rho \) cannot have a trivial 1-dimensional quotient.

Proof

\(\rho \) has a Galois-stable submodule \(C\). The quotient curve \(E/C\) now has a trivial 1-dimensional submodule, and also three points of order 2 (the images of the three 2-torsion points in \(E\)). Hence the torsion subgroup of \(E/C\) has order at least \(4\ell \geq 20\), again contradicting Mazur’s theorem.

Theorem 3.37

The \(\ell \)-torsion in the Frey curve associated to a Frey package \((a,b,c,\ell )\) is irreducible.

Proof

Follows from theorem 3.32, corollary 3.34 and corollary 3.36.