Skip to content

R = T – The Heart of the Proof

Summary

The core of Wiles' proof is the equation \(R = T\): the universal deformation ring \(R\) is isomorphic to the Hecke algebra \(T\). This means that every admissible deformation of the residual representation is modular. The proof uses a numerical criterion of Wiles and Lenstra that reduces the question to a comparison of Selmer groups and congruence ideals.

Prerequisites

  • Deformation Theory – universal deformation ring \(R\), Hecke algebra \(T\), surjection \(R \twoheadrightarrow T\)

1. The Setup

Recap: The surjection

From the previous article we know: there is a natural surjective map

\[ \varphi: R \twoheadrightarrow T \]

from the universal deformation ring \(R\) (all admissible deformations) to the Hecke algebra \(T\) (modular deformations). Our goal is:

\[ \ker(\varphi) = 0, \qquad \text{i.e.} \quad R \xrightarrow{\sim} T. \]

Complete intersection rings

Wiles shows not only \(R = T\), but also that \(T\) (and hence \(R\)) is a complete intersection:

\[ T \cong \mathbb{Z}_p[[x_1, \ldots, x_r]] / (f_1, \ldots, f_r). \]

Note: the number of generators \(r\) equals the number of relations – the ring is as "small as possible" for its dimension. This property is crucial for the numerical criterion.


2. The Numerical Criterion

The idea

Wiles developed (together with Hendrik Lenstra, who simplified the proof) a purely algebraic criterion that turns a surjection \(R \twoheadrightarrow T\) into an isomorphism:

Numerical Criterion (Wiles–Lenstra)

Let \(\varphi: R \twoheadrightarrow T\) be a surjection of complete local Noetherian \(\mathbb{Z}_p\)-algebras with \(T\) finite and free over \(\mathbb{Z}_p\). Let \(\pi: T \to \mathbb{Z}_p\) be an augmentation homomorphism. Then: $$ \varphi \text{ is an isomorphism and } T \text{ is a complete intersection} $$ if and only if $$ |\Phi_R / \Phi_R^2| \leq |\eta_T / \mathbb{Z}_p|, $$ where: - \(\Phi_R = \ker(R \xrightarrow{\pi \circ \varphi} \mathbb{Z}_p) / \ker(R \xrightarrow{\pi \circ \varphi} \mathbb{Z}_p)^2\) is the cotangent space of \(R\), - \(\eta_T = \pi(\text{Ann}_T(\ker \pi))\) is the congruence ideal of \(T\).

What the criterion says

In words: \(R = T\) holds if the deformation space is no larger than what the Hecke algebra allows in congruences. The left-hand side measures "how many" deformations there are (upper bound); the right-hand side measures "how many" congruences between modular forms exist (lower bound).

Why it works

The intuition is: if there were "too many" deformations (more than modular ones), then \(\ker(\varphi) \neq 0\), and the cotangent space of \(R\) would be strictly larger than that of \(T\). The criterion formalises this intuition precisely.


3. Selmer Groups and the Cotangent Space

The tangent space of \(R\)

The cotangent space \(\Phi_R / \Phi_R^2\) has a cohomological interpretation. Infinitesimal deformations of \(\bar{\rho}\) – that is, lifts to \(\mathbb{F}_p[\varepsilon]/(\varepsilon^2)\) – are classified by Galois cohomology:

\[ \text{Def}(\bar{\rho}, \mathbb{F}_p[\varepsilon]) \cong H^1(G_{\mathbb{Q}}, \text{Ad}^0(\bar{\rho})), \]

where \(\text{Ad}^0(\bar{\rho})\) denotes the trace-zero endomorphisms of the representation (the adjoint representation with trace 0).

The Selmer group

The local deformation conditions (flat, ordinary, Steinberg, etc.) cut out a Selmer group from the global cohomology:

\[ H^1_{\mathcal{D}}(G_{\mathbb{Q}}, \text{Ad}^0(\bar{\rho})) \subset H^1(G_{\mathbb{Q}}, \text{Ad}^0(\bar{\rho})). \]

This Selmer group is precisely the cotangent space of \(R_{\mathcal{D}}\):

\[ \Phi_R / \Phi_R^2 \cong H^1_{\mathcal{D}}(G_{\mathbb{Q}}, \text{Ad}^0(\bar{\rho}))^\vee. \]

The size of the Selmer group thus determines how many deformation parameters \(R\) has – the smaller the Selmer group, the more "rigid" the deformation space.

The dual Selmer group

By Poitou–Tate duality, there is an exact sequence connecting the Selmer group \(H^1_{\mathcal{D}}\) with a dual Selmer group \(H^1_{\mathcal{D}^\perp}\):

\[ 0 \to H^1_{\mathcal{D}} \to H^1 \to \bigoplus_v H^1 / H^1_v \to (H^1_{\mathcal{D}^\perp})^\vee \to H^2 \to \cdots \]

Controlling both Selmer groups is essential for the upper bound.


4. Upper Bounds for Selmer Groups

The goal

For the numerical criterion we need:

\[ |H^1_{\mathcal{D}}(G_{\mathbb{Q}}, \text{Ad}^0(\bar{\rho}))| \leq |\eta_T|. \]

That is: the Selmer group must not be "too large".

Wiles' original approach (1993): Euler systems

In his first version of the proof (presented at Cambridge, June 1993), Wiles tried to obtain the upper bound using an Euler system – a family of cohomology classes that, via Kolyvagin's methods, yield upper bounds for Selmer groups.

This approach failed: the construction of the required Euler system could not be completed.

The final approach (1994): Patching

The successful proof of the upper bound uses the Taylor–Wiles trick (patching argument), which is presented in detail in the next article. The basic idea: instead of estimating the Selmer group directly, one constructs a family of auxiliary situations in which the bound holds "in the limit" – and transfers it to the original case.


5. Lower Bounds: Congruence Ideals

What are congruences between modular forms?

Two newforms \(f\) and \(g\) of weight 2 and level \(N\) are called congruent modulo \(p\) if their Fourier coefficients agree modulo \(p\):

\[ b_n(f) \equiv b_n(g) \pmod{p} \quad \text{for all } n. \]

The congruence ideal

The congruence ideal \(\eta_T\) measures how many such congruences exist. More precisely: let \(\pi: T \to \mathbb{Z}_p\) be the homomorphism corresponding to the newform \(f_0\) (associated to our elliptic curve). Then

\[ \eta_T = \pi(\text{Ann}_T(\ker \pi)) \subset \mathbb{Z}_p \]

is an ideal in \(\mathbb{Z}_p\), hence of the form \(\eta_T = (p^m)\) for some \(m \geq 0\).

  • \(m = 0\) (i.e., \(\eta_T = \mathbb{Z}_p\)): there are no non-trivial congruences with \(f_0\) – the "isolated" case.
  • \(m > 0\): there are congruences, and \(p^m\) measures their "depth".

The connection to \(L\)-values

A deep formula connects the congruence ideal with special values of the \(L\)-series. For the minimal case (after Hida and others):

\[ |\eta_T| = |L_{\text{alg}}(f_0, 1)|_p^{-1}, \]

where \(L_{\text{alg}}(f_0, 1)\) is the "algebraic part" of the special \(L\)-value at \(s = 1\).


6. The Proof: Minimal Case

What "minimal" means

The representation \(\bar{\rho}\) is called minimal if at every prime \(q \neq p\) it is ramified as little as possible – more precisely: if the local deformation conditions are chosen as restrictively as possible (no additional ramification allowed).

Why the minimal case is easier

In the minimal case, the Selmer groups are as small as possible, and the congruence ideal can be best controlled. The two sides of the numerical criterion become comparable.

The proof

In the minimal case, Wiles shows (using Taylor–Wiles patching):

  1. Upper bound: \(|H^1_{\mathcal{D}}| \leq |\mathbb{Z}_p / \eta_T|\) (the Selmer group is small enough).
  2. Numerical criterion: The inequality \(|\Phi_R / \Phi_R^2| \leq |\eta_T / \mathbb{Z}_p|\) is satisfied.
  3. Conclusion: \(R = T\) and \(T\) is a complete intersection.

The minimal case is thus settled – and it forms the foundation for the general case.


7. From the Minimal to the General Case

The problem

Not every semistable elliptic curve yields a minimal representation. In the general case, \(\bar{\rho}\) may be "additionally ramified" at some primes \(q\) – the local conditions are less restrictive, and the Selmer groups are larger.

The reduction

Wiles shows that the general case can be reduced to the minimal case. The idea:

  1. Choose the minimal deformation conditions \(\mathcal{D}_{\min}\).
  2. For the actual (more general) conditions \(\mathcal{D}\), we have \(R_{\mathcal{D}_{\min}} \twoheadrightarrow R_{\mathcal{D}}\).
  3. Since \(R_{\mathcal{D}_{\min}} = T_{\mathcal{D}_{\min}}\) (minimal case), the isomorphism transfers to the general case through careful analysis of the additional ramification.

The result

Theorem (Wiles, 1995)

Let \(E/\mathbb{Q}\) be a semistable elliptic curve and \(p \in \{3, 5\}\). Then \(R = T\) for the associated residual representation \(\bar{\rho}_{E,p}\) (under suitable deformation conditions).

This proves the modularity of semistable elliptic curves – and Fermat's Last Theorem follows.

The complete proof chain

\[ \bar{\rho}_{E,p} \text{ modular} \xrightarrow[\text{Wiles–Lenstra}]{\text{Num. crit. + TW trick}} R = T \implies \rho_{E,p} \text{ modular} \implies E \text{ modular} \implies \text{FLT} \]

Outlook

The proof of \(R = T\) in the minimal case uses the Taylor–Wiles trick – a revolutionary patching argument that closed the gap in Wiles' first proof attempt:

Article Topic
06 – The Taylor–Wiles Trick The patching argument and the story of the gap
07 – The 3-5 Switch How Langlands–Tunnell provides the starting point

Sources

  • Andrew Wiles: Modular elliptic curves and Fermat's Last Theorem, Annals of Mathematics 141 (1995), §2–3
  • Hendrik Lenstra: Appendix to Wiles' paper – simplification of the numerical criterion
  • Nigel Boston: The Proof of Fermat's Last Theorem (2003), Chapter 12 – \(R = T\) and the numerical criterion
  • Gary Cornell, Joseph Silverman, Glenn Stevens (eds.): Modular Forms and Fermat's Last Theorem, Springer (1997) – Comprehensive exposition of all proof steps