Lectures on the Hardy-Littlewood Singular Serieswright/HLnotes.pdf · Lectures on the...

Lectures on the Hardy-Littlewood Singular Series

Takashi Ono

September 3, 2008

0 Introduction

The story begins almost ninety years ago with Hardy and Little-wood, who examined the integer solutions for the equation

xδ1 + xδ2 + ...+ xδn = ν, ν ∈ Z

for an integer δ ≥ 2. Now, instead of finding solutions for x1, x2, ...xnin Zn, it is an equally valid question to ask whether there are solu-tions in Onk for a global field k, or an A-field in the sense of Weil’sBasic Number Theory (1967). Here, by Ok we mean the ring ofintegers of k.

To put this another way, if f is a polynomial function such thatf : kn → k where ν ∈ k, we wish to study the set f−1(ν) (i.e. thefiber of f which maps down to ν). Since we are interested in thenumber of solutions for the Hardy-Littlewood equation, we will de-fine

Nt(ν) = #(f−1(ν)⋂Kt) = #x ∈ Kt : f(x) = ν

such that⋃tKt = knA,

where the Kt is a family of compact sets in knA. The parame-ter t itself will be later explained, but for now it suffices to say thatwe will be interested in the growth of Nt as t likewise grows. Morespecifically, we will consider

1

limt→∞Nt(ν)t∗

,

where the denominator denotes that t is raised to some power. Themeaning of ”t→∞” here is ill-defined but will be properly definedin 2.4.

This type of quantity is known as the singular series, denotedS(ν). Hardy and Littlewood made extensive use of the singularseries and a method which they called the circle method in provingtheir results about the Hardy-Littlewood equation.

Upcoming sections will use these ideas to develop an adelizedversion of the Hardy-Littlewood Theorem.

0.1 Nt(ν)

Now, we will define our terms more formally. Let k be an A-field,f : kn → km a polynomial map, where ν ∈ Omk . We will define

f−1(ν) = γ ∈ kn : f(γ) = ν

to be the fiber of f which maps down to ν. Then

k ⊂ kA := ring of adeles of K ⊃ Kt.

Note that k is discrete in kA, kA is locally compact, and Kt is com-pact. For a given set Kt, we have

Nt(ν) = #(Kt

⋂f−1(ν)).

0.2 Integral Representation of Nt(ν)

In this section, we wish to find an integral which will accuratelydescribe Nt(ν). Let ψt be the characteristic function of Kt. It isobvious, then, that

Nt(ν) =∑

γ∈kn,f(γ)=ν ψt(γ).

We also define the inner product in the usual way:

2

<,>: knA × knA → kA,< x, y >=

∑xiyi.

We let χ be a basic character of k+A which is trivial on k but non-

trivial on k+A . This character is unique up to k×, i.e. if χ′ is another

such character then

χ′(x) = χ(αx), α ∈ k×.

We can use this to define a function Γfψt which maps kmA to C:

(Γfψt)(ξ) =∑

γ∈km ψt(γ)χ(< f(γ), ξ >).

This is a function on kmA /km.

For ν ∈ km, the ν-th Fourier coefficient of Γfψt is∫kmA /k

m Γfψt(ξ)χ(< ξ, ν >)dξ

=∫kmA /k

m Γfψt(ξ)χ(< ξ,−ν >)dξ

=∑

γ ψt(γ)∫kmA /k

m χ(< f(γ)− ν, ξ >)dξ.

Note that < f(γ)− ν, ξ > is a character on ξ. In fact,

< f(γ)− ν, ξ >=

1 if f(γ) = ν,

0 otherwise.

So∑γ ψt(γ)

∫kmA /k

m χ(< f(γ)− ν, ξ >)dξ

=∑

γ∈kn,f(γ)=ν ψt(γ)

= Nt(ν).

Thus, we have reduced the initial question to one about (Γfψt)(ξ).

0.3 Singular Series

Instead of Γ, we will move to G:

(Gfψt)(ξ) =∫knAψt(x)χ(< f(x), ξ >)dx.

3

We assume Gfψt ∈ L1(kmA ). We let

St(ν) = Gfψt(−ν) =∫kmAGfψt(ξ)χ(< ξ, ν >)dξ.

In upcoming sections, we will compare St(ν) and Nt(ν).

0.4 Pair of Representations of an Algebraic Group

We start with a group G and a pair of representations ρ1 and ρ2

Gρ2

##GGGGGGGGρ1 // GLn

GLm

which are commutative and equivariant. In fact, for any s ∈ G,we get the commutative diagram

knf−−−→ kmyρ1(s)

yρ2(s)

knf−−−→ km

,

where the f is the one described in our generalization of the Hardy-Littlewood Theorem such that

f(ρ1(s)x) = ρ2(s)f(x) ∀ x ∈ kn, ∀ s ∈ G.

Next, we also need to define our Ktt∈GA , where each Kt ⊂ knAis a compact set. So, for a place v which determines a valuation | · |v,

Bv = x ∈ kv : |xi|v ≤ 1, 1 ≤ i ≤ n.

If v = ∞ then Bv is an n-dimensional box. If v 6= ∞ then Bv =Onv ⊂ knv , where Onv is the standard lattice points. We relate theseB’s to the language of adeles by taking

K1 = ΠvBv

and taking t = 1 (the 1 adele) to be the origin. This is a standard

4

compact set by Tychonof’s Theorem. Now, we can define the Kt’s as

Kt = ρ1(t)Kt ∀ t ∈ GA.

Recalling our definition of ψt as a characteristic function on Kt,we note that

ψt(x) = ψ1(ρ1(t)−1x).

If we define the adelic valuation

|x|A = Πv|xv|v

then, recalling from the previous section our definition for Gfψt andSt(ν), we can rewrite the definition of Gfψt as

Gfψt(ξ) = |detρ1(t)|Agfψ1((ρ2(t))T ξ),

where |detρ1(t)|A ∈ R×. Similarly, for St(ν),

St(ν) = |detρ1(t)|A|detρ2(t)|A

S1(ρ2(t)−1ν).

Now, we can more formally define ”t→∞” to mean |detρ1(t)|A →∞.

Armed with these definitions, we hope, as Hardy and Littlewooddid previously, that

Nt(ν)|detρ1(t)|A|detρ2(t)|A

= S1(ρ2(t)−1ν) +O(|detρ1(t)|−εA ).

Hardy and Littlewood then went on to find more explicit resultsfor ε.

0.5 Hardy-Littlewood Theorem

Having defined the previous notation, we can describe in this no-tation Hardy and Littlewood’s work from their 1920 paper, A NewSolution To Waring’s Problem

f : Qn → Q,

5

f : x 7→ xδ1 + xδ2 + ...+ xδn for 1 ≤ δ ∈ Z,

and

k = Q,G = Q× = GL1,ρ1(t) = tIn ∈ GLn,ρ2(t) = tδ ∈ GL1.

Plugging these in as our definitions gives us the following infor-mation:

det(ρ1(t)) = tn, det(p2(t)) = tδ,f(ρ1(t)x) = ρ2(t)f(x).

Moreover, in defining our Kt’s, we find in this case that

K1 =

v =∞ B∞ = x ∈ Rn : |xi| ≤ 1, 1 ≤ i ≤ n,v = p Bp = Zn

p ,

K = B∞ × ΠpZnp ,

where, as always, Zp is defined as

Zp = x ∈ Qp : |x|p ≤ 1.

We can even describe t in terms of an element in the group GA =Q×A = A× of ideles in the ring of adeles A by letting t = (τ, 1, 1, 1, ...) ∈GA, where τ ∈ Q∞ = R, τ > 0. Then

Kt = ρ1(t)K1 = B∞(τ)× ΠpZnp .

Additionally, for ν ∈ Z, our choice of t means that

Nt(ν) = #γ ∈ Qn : f(γ) = ν, γ ∈ Kt= #γ ∈ Zn : |γi| ≤ τ, 1 ≤ i ≤ n such that f(γ) = ν,

and St(ν) can be written as

6

St(ν) = |t|n−δA J(τ−δν)S(ν),

where |t|n−δA = τn−δ, J(τ−δν) is a Gamma factor, and S(ν) is thesingular series.

Hardy and Littlewood’s theorem can then be written as

Theorem (Hardy-Littlewood): ∃ ε such that

Nt(ν)τn−δ

= J(τ−δν)S(ν) +O(τ−ε).

We give an outline of the proof:

“Proof”: Since m = 1 (i.e. ν ∈ Q1), we found in 2.2 that

Nt(ν) =∫

QA/Q(Γfψt)(ξ)χ(ξν)dξ.

We can express Γfψt(ξ) as

Γfψt(ξ) =∑

γ∈Qn ψt(γ)χ(f(γ)ξ)

=∑

γ∈Zn,|γi|≤τ χ((γδ1 + γδ2 + ...+ γδn))

= (∑

γ∈Z,|γ|≤τ χ(γδξ))n.

So

Nt(ν) =∫

QA/Q[∑

γ∈Z,|γ|≤τ χ(γδξ)]nχ(ξν)dξ.

Let us now define our character χ on QA. Let x = (xv) ∈ QA.For v = p, xp ∈ Qp implies that ∃ l > 0 such that plxp ≡ z (mod pl)for some z ∈ Z. So let

χv(xv) =

e−2πix∞ if v =∞,e

2πi

plz

if v = p.

Then a basic character χ : QA/Q→ T is given by

χ(x) = Πvχv(xv).

In fact, χ∞ yields the exact sequence

7

0→ Z→ R χ∞→ T→ 0.

Analogously, χp yields the sequence

0→ Zp → Qpχp→ T,

where Zp is the kernel of χp.Next, since Q is discrete in QA, we can discuss the fundamental

domain F of QA/Q

F = [0, 1)× ΠpZp.

Of course, the volume of this fundamental domain is 1.Now, we can rewrite Nt as

Nt(ν) =∫F(∑

γ∈Z,|γ|≤τ χ(γδξ))nχ(ξν)dξ.

We note that for any non-archimedean place, ξ ∈ F is integral,and hence γδξ is also integral. So

χ(γδξ) = Πvχv(γδξ) = χ∞(γδξ∞)Πpχp(γ

δξp) = χ∞(γδξ∞),

and similarly

χ(ξν) = χ∞(ξ∞ν).

Then our expression for Nt becomes

Nt(ν) =∫ 1

0(∑

γ∈Z,|γ|≤τ χ∞(γδξ∞))nχ∞(ξ∞ν)dξ∞.

Put α = ξ∞ and

Tτ (α) =∑

γ∈Z,|γ|≤τ χ∞(γδα).

Then

Nt(ν) =∫ 1

0(Tτ (α))nχ∞(αν)dα.

The finite sum Tτ is counted using the circle method. Let x ∈

8

[0, 1)⋂

Q, x = am

, (a,m) = 1. Moreover, let ε and τ > 0. For ourpurposes, ε will be small, while τ will be larger. Define

S(ε, τ) = x ∈ [0, 1)⋃

Q : m ≤ τ ε,Mx = α ∈ [0, 1) : |α− x| < 1

τδ−ε.

Then for x, x′ ∈ S(ε, τ), x 6= x′ implies that Mx

⋃Mx′ = Ø.

Finally, we split the integral into two parts and evaluate eachone. Let

M =⋃x∈S(ε,τ) Mx,

M∗ = [0, 1)−M.

M and M∗ are referred to as the major and minor arcs, respec-tively. We split the interval [0, 1) into the two arcs:∫ 1

0=∫M+

∫M∗ .

For the minor arc, if n ≥ 2δ + 1 then∫M∗ |Tτ (α)|ndα = O(τn−δ−ε

′),

where ε′ > 0 is a function of our chosen ε. For the major arc,the integral was computed explicitly to be∫

M |Tτ (α)|ndα = Nt(ν)tn−δ

= J(τ−δν)S(ν) +O(τ−ε).

1 Finite Fields1

1.1 Gauss Sums

Let Fq be the extension of Fp such that q = pf (i.e. [Fq : Fp] = f)and let T to be the trace in Fq/Fp. Then for x ∈ Fq, we can definethe function

1For details, see A. Weil, The Number of Solutions of Equations in Finite Fields, Bull.A.M.S. 55 (1949)

9

ε(x) = e2πipT (x).

Clearly, ε ∈ F+q . So if ϕ ∈ C(Fq) is a complex valued function

then

ϕ(ξ) =∑

x∈Fq ϕ(x)ε(xξ)

is the Fourier transform of ϕ. Of course, the field Fq has two op-erations, so we must also consider multiplicative characters. For

χ ∈ F×q , we extend it to get χ ∈ C(Fq) by

χ(0) =

1 if χ = χ0 (i.e. χ is trivial),

0 otherwise.

If χ 6= χ0, we set

χ(ξ) =∑

x∈F×q χ(x)ε(xξ) = Gξ(χ).

This Gξ(χ) is called a Gauss sum. The Gauss sum has three basicproperties:

(1) If ξ 6= 0 then χ(ξ) = χ(ξ)−1χ(1).(2)∑

ξ 6=0 χ(ξ) = 0.

(3) If ξ 6= 0 then |χ(ξ)| = √q.

The first two properties are to be expected, but the third is rathersurprising.

1.2 N (q)(ν)

Let f be a function

f : Fnq → Fq,

where ν ∈ Fq. Define

N (q)(ν) = #f−1(ν).

Put

10

Γf (ξ) =∑

x∈Fnq ε(f(x)ξ).

So Γf (ξ) ∈ C(Fq). Then, as before, we can relate N to the ν-thFourier coefficient of Γf :∑

ξ∈Fq Γf (ξ)ε(ξν) =∑

x

∑ξ ε((f(x)−ν)ξ) =

∑f(x)=ν q = qN (q)(ν).

This means that

N (q)(ν) = qn−1 + 1q

∑ξ∈F×q ε(ξν)

∑x∈Fnq ε(f(x)ξ).

This process can be repeated for the more general set of functionsf : Fnq → Fmq by simply replacing the products with inner products(e.g. f(x)ξ would be replaced by < f(x), ξ >).

Let us take the particular case of

f(x) = a1xδ11 + ...+ anx

δnn .

Note that

ε(f(x)ξ) = Πni=1ε(aix

δii ξ).

So

N (q)(ν) = qn−1 + 1q

∑ξ∈F×q ε(ξν)Πn

i=1(∑

xi∈Fq ε(aixδii ξ)).

As in 1.1, we define Gξ(χ) as

Gξ(χ) = χ(ξ) =∑

F×q χ(x)ε(xξ).

This is a generalization of the classical Gauss sum, which wouldbe the case where δ’s are all 2.

Next, if d = (δ, q − 1) then χα ∈ F×p can be defined as

χα(y) = e2πidL(y)α for y ∈ F×q .

Here, L(y) is defined as follows. Choose a generator ω ∈ F×q . Then

y = ωl(y) for some l(y) in Z/(q − 1)Z. Let j be the reduction map

11

from Z/(q − 1)Z to Z/dZ. This gives us the maps

F×ql

→ Z/(q − 1)Z j→ Z/dZ,

where l is an isomorphism and j is surjective. Then L = j l.Using our generalized version of the Gauss sum, we wish to show

the following:

Claim:∑

x∈Fq ε(cxδ) =

∑α∈Z/dZ,α 6=0Gc(χα).

Let us begin with the left side, which we can rewrite as∑x∈Fq ε(cx

δ) =∑

y∈Fq µ(y)ε(cy) = 1 +∑

y∈F×q µ(y)ε(cy),

where

µ(y) = #x ∈ Fq : y = xδ,

meaning that µ(0) = 1. Now, by basic modular arithmetic, weknow that if ax ≡ b (mod m) has a solution then the number ofsolutions is equal to d = (m, a). So

µ(y) =

d if d|l(y)⇔ L(y) = 0,

0 if d - l(y)⇔ L(y) 6= 0.

So

1 +∑

y∈F×q µ(y)ε(cy) = 1 + d∑

L(y)=0 ε(cy).

Additionally, let ed(∗) = e2πid

(∗). So

∑α∈Z/dZ χα(y) =

∑α ed(L(y)α) =

0 if L(y) 6= 0,

d if L(y) = 0.

So

1 + d∑

L(y)=0 ε(cy) = 1 +∑

y∈F×q∑

α∈Z/dZ χα(y)ε(cy)

= 1 +∑

α∈Z/dZ∑

y∈F×q χα(y)ε(cy)

= 1 +∑

y∈F×q ,α=0 ε(cy) +∑

α∈Z/dZ,α 6=0

∑y∈F×q χα(y)ε(cy)

12

= 1 + 0 +∑

α∈Z/dZ,α 6=0

∑y∈F×q χα(y)ε(cy)

=∑

α∈Z/dZ χα(c)

=∑

α∈Z/dZGc(χα).

This proves the claim.Now, we can use this to give a bound for |N (q) − qn−1|. As an

aside, the n = 2 case is essentially the Riemann Hypothesis for func-tion fields, as proven by Deligne in 1974. For a δi, let di = (δi, q−1).By the claim, it follows that∑

xi∈Fq ε(aixiξ) =∑

αi∈Z/diZ,α 6=0 χαi(aiξ).

We can plug this into our expression for N (q) to find

N (q)(ν) = qn−1+1q

∑ξ∈F×q ε(ξν)Πn

i=1(∑

xi∈Fq∑

αi∈Z/diZ,α 6=0 χαi(aiξ)).

Note that |ε(ξν)| = 1 and |χαi(aiξ))| =√q by property (3) of Gauss

sums. So, the above implies that

|N (q)(ν)− qn−1| ≤ 1q(q − 1)Πn

i=1((di − 1)√q)

= qn2−1(q − 1)Πn

i=1(di − 1) < Mqn2

with M = Πni=1(di − 1) independent of q.

2 Weyl Sums

2.1 Weyl Sums over Finite Abelian Groups

Let A be a finite abelian group of order N , and let G be a locally

compact abelian group. Let χ ∈ G = Hom(G,T), the dual of G,and let ϕ : A→ G be any map. We define the Weyl sum

W (ϕ) =∑

x∈A χ(ϕ(x)).

Additionally, let x, h, h1, h2 ∈ A. Then we define

(∆hϕ)(x) = ϕ(x+ h)− ϕ(x),(∆h1h2ϕ) = (∆h2(∆h1ϕ)).

13

More generally, we will write

∆h1h2..hlϕ = ∆hl(∆h1...hl−1ϕ).

We claim that this ∆ action commutes

Lemma 2.1.1: ∆h1h2 = ∆h2h1 .

Proof : Let us denote ψ = ∆h1ϕ. Then

∆h1h2(x) = ∆h2(∆h1ϕ)(x) = ∆h2(ψ(x)) = ψ(x+ h2)− ψ(x)= (∆h1ϕ)(x+ h2)− (∆h1ϕ)(x)= ϕ(x+ h2 + h1)− ϕ(x+ h2)− ϕ(x+ h1) + ϕ(x).

Note that this would still be the case if h1 and h2 were switched. Sothe lemma holds.

Proposition 2.1.1: |W (ϕ)|2 =∑

h∈AW (∆hϕ).

Proof : Since W (ϕ) ∈ C,

|W (ϕ)|2 = W (ϕ)W (ϕ)

=∑

y χ(ϕ(y))∑

x χ(ϕ(x))

=∑

x,y χ(ϕ(y)− ϕ(x)).

Let y = x+ h. Then∑x,y χ(ϕ(y)− ϕ(x)) =

∑x,h χ(ϕ(x+ h)− ϕ(x))

=∑

h

∑x χ(∆hϕ(x))

=∑

hW (∆hϕ).

Next, we know that for x, y ∈ Cn,

| < x, y > | ≤ |x||y|

by the Cauchy-Schwartz inequality, where x = (x1, ..., xn), y =(y1, ..., yn), < x, y >=

∑i xiyi is the usual Hermitian, and |x| =√

|xi|2 is the usual norm. Then

14

|∑

i xiyi|2 ≤ (∑

i |xi|2)(∑

i |yi|2).

If x = (1, ..., 1) ∈ CN then the above yields

(1) |∑

i yi|2 ≤ N∑

i |yi|2.

We can apply this to our ∆ by the following. Let

yhl−1=∑

h1,h2,...,hl−2W (∆h1h2,...hl−2hl−1

ϕ).

Then

|∑

h1,...,hl−1W (∆h1...hl−1

ϕ)|2 = |∑

hl−1

∑h1,h2,...,hl−2

W (∆h1...,hl−2hl−1ϕ)|2

≤ N∑

hl−1|∑

h1,h2,...,hl−2W (∆h1...,hl−2hl−1

ϕ)|2

by (1).

Theorem 2.1.1: If l ≥ 1 then

|W (ϕ)|2l ≤ N2l−l−1∑

h1...hlW (∆h1...hlϕ).

Proof : If l = 1 then this is Proposition 2.1.1 (with equality). Theproof is then an induction on l, and is left as an exercise to thereader.

Now, we amend the previous setup such that R is a locally com-

pact ring, χ ∈ R+, A ⊂ R+ is a finite subgroup of order N , andϕ : A→ R is any mapping from A to R. In particular, we will takeϕ(x) = xδξ, ξ ∈ R, δ > 0. So

W (ϕ) =∑

x∈A χ(xδξ).

Note that if δ = 1 then

W (ϕ) =∑

x∈A χ(xξ) =

N if χ(xξ) = 1 ∀x ∈ A,0 otherwise.

So we will assume that δ > 1. This means that

15

∆hϕ(x) = ϕ(x+ h)− ϕ(x) = (x+ h)δξ − xδξ= δhxδ−1ξ+(lower order terms in x).

So

∆h1...hδ−1ϕ(x) = δ!ξh1...hδ−1x+ c

for some c ∈ R. If we put l = δ − 1 then, by Theorem 2.1.1,

• |W (ϕ)|2δ−1 ≤ N2δ−1−δ∑h1...hδ−1

|∑

x∈A χ(δ!ξh1...hδ−1x)|,

since |χ(c)| = 1.

2.2 Riemann-Roch Theorem

Let k be a function field defined over Fq, and let kA be its ring ofadeles. We let χ : kA → T be a basic character of k.

Now, we get the following diagram.kA

γ

000000000000000 0

k k#

0 kA,

where

k# = χ ∈ kA : χ(k) = 1

is the annihilator of k. We know that an isomorphism γ existsbecause of the self-duality of kA. More specifically, γ must be of theform

γ(x)(a) = χ(ax)

for a ∈ k×A . For another character χ′ ∈ kA,

16

χ′(x) = χ(ax)

for some a ∈ k×. The character χ can be expressed as

χ(x) = Πvχv(xv),

where x = (xv) ∈ kA. We define ordv χv ∈ Z by

P−ord χvv = x ∈ kv : χv(xy) = 1 ∀ y ∈ Ov,

where Pv = (πv) is the prime ideal of Ov. Let

π(χ) = (π−ord χvv ) ∈ k×A .

Additionally, for t ∈ k×A , define

t∗ = π(χ)t−1.

It is obvious that t∗∗ = t. Next, let

P (t) = Πv(tvOv) ⊂ kA,L(t) = k

⋂P (t) ⊂ kA.

Note that P (t) is compact, while k is discrete in kA. So L(t) isa finite abelian group. Note that P (t) is defined equivalently by

P (t) = x ∈ kA : |x|v ≤ |t|v,∀ v.

P (t) contains Fq, which can be defined by

Fq = x ∈ k : |x|v ≤ 1.

Finally, since L(t) can be viewed as a vector space over Fq, let

l(t) = dimFqL(t).

Hence |L(t)| = ql(t).Now, we come to an important theorem in algebraic geometry

and complex analysis.

17

Riemann-Roch Theorem: Let t ∈ k×A and let g be the genus ofk. Then

|t|A = ql(t)−l(t∗)+(g−1).

Proof : To start, we consider the following diagrams:

kA

γ

""EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE 0

k + P (t)

IIIIIIIII

xxxxxxxxxx(k + P (t))#

MMMMMMMMMM

ssssssssss

k

FFFFFFFFFF P (t)

uuuuuuuuuk#

KKKKKKKKKK P (t)#

qqqqqqqqqq

L(t) k# + P (t)#

0 kA.

We wish to prove four properties about the elements of the abovediagrams:

(1) (k + P (t))# = k#⋂P (t)#.

(2) (P (t))# = γ(P (t∗)).(3) (k + P (t))# = γ(L(t∗)).(4) (L(t))# = γ(k + P (t∗)) = k# + P (t)#.

The proofs are as follows:

(2) We know that γ : kA → kA is an isomorphism. So

γ(x) ∈ P (t)# ⇔ γ(x)(P (t)) = 1⇔ χ(xP (t)) = 1⇔ χ(x · ty) = 1 ∀ y ∈ P (1) = ΠvOv⇔ ∀ v, χv(xvtvyv) = 1 ∀ yv ∈ Ov⇔ ∀ v, xvtv ∈ P−ord χvv

18

⇔ ∀ v, xv ∈ π−ord χvv t−1v Ov

⇔ x ∈ P (t∗).

(3) This is proven by the following string of equalities

(k + P (t))# = k#⋂P (t)#

= γ(k)⋂γ(P (t∗))

= γ(k⋂P (t∗))

= γ(L(t∗)).

(4) First, we note that

γ(k + P (t∗)) = γ(k) + γ(P (t∗)) = k# + P (t)#.

This shows the latter of the two equalities. Now, kA/k is compact,which means that kA/(k + P (t)) is compact (and, in fact, finite).Moreover,

kA/(k + P (t)) ≈ (kA/(k + P (t))) = (k + P (t))# = γ(L(t∗)),

where we define the notation A to be the same as A. Now weprove the former of the two equalities by showing inclusion in eachdirection:

(⊃) L(t)# = (k⋂P (t))# ⊃ k# + P (t)# = γ(k + P (t∗)).

(⊂) Consider the exact sequence

(#) kA/γ(k + P (t∗))→ kA/L(t)# → 0.

Note that

L(t) ≈ L(t) ≈ kA/L(t)#,

where the former is because L is a finite group, and the latter isby Pontrjagin duality. So

[kA/L(t)#] = ql(t).

19

On the other hand,

[kA : γ(k + P (t∗))] = [kA : k + P (t∗)] = ql(t∗∗) = ql(t).

Since l(t∗∗) = l(t), (#) is an isomorphism. So L(t)# = γ(k+P (t∗)).

Now, let µ be the Haar measure of the additive group kA such thatµ(πvOv) = µ(P (1)) = 1. So

µ(kA/k) = ql(t∗)µ(k + P (t)/k)

= ql(t∗)µ(P (t)/L(t))

= ql(t∗)−l(t)µ(P (t)),

kA

ql(t∗)

k + P (t)

JJJJJJJJJ

wwwwwwwwww

k

GGGGGGGGGG P (t)

ttttttttt

L(t)

ql(t)

0.

But µ(P (t)) = |t|A, and, since L(1) = Fq,

µ(kA/k) = qgµ(k + P (1)/k) = qgµ(P (1)/L(1)) = qg−1.

We combine these two expressions for µ(kA/k) to get

|t|A = ql(t)−l(t∗)+(g−1).

Q.E.D.

Corollary 2.2.1: |π(χ)|A = q2g−2, l(π(χ)) = g.

Proof : From the Riemann-Roch Theorem,

20

|t∗|A = ql(t∗)−l(t)+(g−1).

So

|tt∗|A = q2(g−1).

The corollary then follows from the definition

t∗ = π(χ)t−1.

Now, put t = 1. We know l(1) = 1 since L(1) = Fq. Then wehave the diagram

kA

Fgq

k + P (1)

PPPPPPPPPPPP

vvvvvvvvvv

k

HHHHHHHHHH P (1) = ΠvOv

nnnnnnnnnnnn

L(1) = Fq

0.

Then

1 = q1−l(1∗)+g−1.

Since 1∗ = π(χ), this means that

q0 = q−l(π(χ))+g,

i.e. g = l(π(χ)).

Corollary 2.2.2: If |t|A < 1 then l(t) = 0.

21

Proof : Recall that l(t) = dimFqL(t) by definition. If L(t) 3 x 6= 0then |x|v ≤ |tv|v ∀ v, and hence, by the product formula in k,

1 = Πv|x|v ≤ Πv|tv|v < 1,

a contradiction.

Corollary 2.2.3: Let |t|A > g2g−2. Then l(t∗) = 0. Moreover,|t|A = ql(t)+(g−1) and kA = k + P (t).

Proof : First, if |t|A > g2g−2 then

|t∗|A = |π(χ)t−1|A < 1.

So l(t∗) = 0 by Corollary 2.2.2. By the Riemann-Roch Theorem,this means that |t|A = ql(t)+(g−1). Additionally, the space kA/k+P (t)has dimension l(t∗), so kA = k + P (t).

Finally, we will consider the set of valuations v. First, we notethat in the completion of k by v, Fq extends to Fqv = Ov/Pv, wherePv is the maximal prime ideal. We can define the degree of v by

qdeg v = qv.

Moreover, let us define a divisor A as

A =∑

v a(v)v,

where a(v) ∈ Z and all but finitely many a(v) = 0. Similarly,we define

deg A =∑

v a(v)deg v.

The group of divisors will be denoted D(k), and the group of degreezero divisors will be denoted D0(k). So we have the following exactsequence

0→ D0(k)→ D(k)deg→ Z→ 0.

22

To t = (tv) ∈ k×A , we associate a divisor

At =∑

v ordv(t−1v )v.

This yields another exact sequence

0→ ΠvO×v → k×A → D(k)→ 0.

Additionally, it can be shown that |t|A = qdegAt (an exercise tothe reader).

Let

t = π(χ) = (π−ord χvv ).

So

ord χv = ord(t−1v ).

and hence

At = Aπ(x) =∑

v(ord χv)v.

In general, Aπ(x) will be called the canonical divisor and denotedC.

Proposition 2.2.1: At∗ = C− At.

Proof : Follows from t∗ = π(χ)t−1.

Proposition 2.2.2 (The Classical Form of the Riemann-RochTheorem): For a divisor A =

∑v a(v)v, let

Λ(A) = k⋂

ΠvP−a(v)v ⊂ k

and

λ(A) = dimFqΛ(A).

Then

23

degA = λ(A)− λ(C− A) + (g − 1).

Proof : Note first that

Λ(At) = L(t),λ(At) = l(t).

Then by the Riemann-Roch Theorem

qAt = |t|A = ql(t)−l(t∗)+(g−1) = qλ(At)−λ(At∗ )+(g−1).

But any divisor can be written as At for some t. So the Propo-sition follows from Proposition 2.2.1.

2.3 Weyl Sums Over L(t)

Let k be a function field and χ be a basic additive (nontrivial) char-acter on kA which is trivial on k. Again, we will define f : kA → k by

f(x) = a1xδ11 + ...+ anx

δnn .

Let δ = l.c.m.(δi). We let

ρ1(t) =

tδδ1 .... 0...

. . ....

0 ... tδδn

, ρ2(t) = tδ, t ∈ k×A .

In this case,

K1 = ΠvOnv ,

Kt = ρ1(t)K1 = Πni=1P (t

δδi ) ∈ knA,

Kt

⋂kn = Πn

i=1L(tδδi ) ⊂ kn.

In this case, using the notation in 2.1

ΓfΨt(ξ) =∑

α∈Kt⋂kn χ(f(α)ξ)

24

= Πni=1(∑

αi∈L(tδ/δi ) χ(aiαδii ξ)), ξ ∈ kA.

We define

W(a,δ)t (ξ) =

∑α∈L(t) χ(aαδξ).

This is a Weyl sum with R = kA and A = L(t). Recalling theintegral representation in 2.2 for Nt(ν) for ν ∈ k, we can write

Nt(ν) =∫kA/k

(Πni=1W

(ai,δi)

tδ/δi(ξ))χ(ξν)dξ.

Now, if |t|A > q2g−2, then |t|A = ql(t)+(g−1), which means thatl(t) > (g − 1). This gives us the following diagrams

kA = k + P (t)

NNNNNNNNNNN

rrrrrrrrrrrrkA

ql(t)

k

LLLLLLLLLLLL P (t)

ppppppppppppk + P (t∗)

LLLLLLLLLL

vvvvvvvvvv

L(t)

ql(t)

k

HHHHHHHHHH P (t∗)

rrrrrrrrrr

0 L(t∗) = 0.

Note that k + P (t∗)/k ∼= P (t∗)/L(t∗) since L(t∗) = 0. So∫kA/k

= q−l(t)∫P (t)

.

Plugging this into our definition for Nt(ν) gives

Nt(ν) = q−l(t)∫P (t)

(Πni=1W

(ai,δi)

tδ/δi(ξ))χ(ξν)dξ.

By the inequality (•) at the end of 2.1,

|W (a,r)(ξ)t |2r−1 = (ql(t))2r−1−r∑

h1,..,hr∈L(t) |∑

χ∈L(t) χ(r!aξh1...hr−1)|=∑

x Ψh1...hr−1(ξ)(x)

=

ql(t) if Ψh1...hr−1(ξ) = 1,

0 otherwise.

25

From now on, we assume that p = char(k) > r. So r! 6= 0 ink. Note that

Ψh1...hr−1(ξ) = 1 ⇔ γ(r!aξh1...hr−1)(x) = 1 ∀ x ∈ L(t)⇔ γ(r!aξh1...hr−1)(x) ∈ L(t)# = γ(k + P (t∗)⇔ r!aξh1...hr−1 ∈ k + P (t∗)⇔ aξh1...hr−1 ∈ k + P (t∗).

By the Riemann-Roch Theorem, ql(t) = |t|Aq1−g. So

∑x Ψh1...hr−1(ξ)(x) =

|t|Aq1−g if Ψh1...hr−1(ξ) = 1,

0 otherwise.

So

♥ |W (a,r)t(ξ)|2r−1 ≤ (ql(t))2r−1−r+1#(h1, ..., hr−1) ∈ L(t)r−1 :ξah1...hr−1 ∈ k + P (t∗).

Note that

ξah1...hr−1 ∈ k + P (t∗) ⇔ ξh1...hr−1 ∈ k + a−1P (t∗)

and

k + a−1P (t∗) = k + P (a−1t∗) = k + P ((at)∗).

This additional a can be manipulated via the following:

L(at) = aL(t),l(at) = l(t),l((at)∗) = l(a−1t∗) = l(t∗) = 0.

Now, if we define

N = #(h1, ..., hr−1) ∈ L(t)r−1 : ξh1...hr−1 ∈ k + P ((at)∗),N0 = #(h1, ..., hr−1) ∈ L(t)r−1 : ∃ i such that hi = 0

and note that L(t)× = L(t)⋂k×, then ♥ can be rewritten as

26

|W (a,r)(ξ)t |2r−1 ≤ (ql(t))2r−1−r+1N

and

N0 = #(Lr−1)−#(L(t)×)r−1

= (ql(t))r−1 − (ql(t) − 1)r−1

= ((ql(t)) − (ql(t) − 1))((ql(t))r−2 − (ql(t))r−3(ql(t) − 1) + ... +(ql(t) − 1)r−2)

= (1)((ql(t))r−2 − (ql(t))r−3(ql(t) − 1) + ...+ (ql(t) − 1)r−2)≤ (r − 1)ql(t)(r−2).

So

N0 ≤ (r − 1)ql(t)(r−2).

Next, let us define

N1 = #(h1, ..., hr−1) ∈ (L(t)×)r−1 : ξh1...hr−1 ∈ k + P ((at)∗).

Then

N ≤ (r − 1)ql(t)(r−2) +N1.

Putting all this together with ♥ gives

♣ |W (a,r)(ξ)t |2r−1 ≤ (ql(t))2r−1−r+1N ≤ (r−1)(ql(t))2r−1+N1(ql(t))2r−1−r+1.

Now we define Φ : L(t)r−1 → L(tr−1) by

Φ(h1, ..., hr−1) = h1...hr−1.

Then we can express N1 in terms of Φ:

N1 =∑

x∈L(tr−1)×, ξx∈k+P ((at)∗) #Φ−1(x).

Given this expression for N1, we have two goals

Problem I: For x ∈ L(tr−1)×, estimate #Φ−1(x).

27

Problem II: Estimate #x ∈ L(tr−1)× : ξk ∈ k + P ((at)∗).

To begin with, we state a lemma which is an analogue to Diophan-tine approximation in kA:

Lemma 2.3.1: Let ξ ∈ kA, t ∈ k×A , a ∈ k×, |t|A > q2g, and r ≥ 3.Then ∃ x, y ∈ k, x 6= 0 such that x ∈ L(tr−1) and ξx−y ∈ P ((at)∗).

Proof : We start again with a diagram

kA

ql(at)=ql(t)

k + P ((at)∗)

OOOOOOOOOOO

ttttttttttt

k

KKKKKKKKKKKKK P ((at)∗)

nnnnnnnnnnnnn

0.

Now, we know that #L(tr−1) = ql(tr−1). Moreover,

|t|A = ql(t)+(g−1) > q2g

⇒ l(t) > g + 1 ≥ 1⇒ l(t) ≥ 2.

Additionally,

|tr−1|A = ql(tr−1)+(g−1) = ql(t)(r−1)+(g−1)(r−1),

which means that

l(tr−1) = l(t)(r − 1) + (g − 1)(r − 2) > l(t).

Now, denote the elements of L(tr−1) by xi, indexed for 1 ≤ i ≤ql(t

r−1) and consider ξx1, ξx2, ..., ξxql(tr−1) . We know that

ql(tr−1) > ql(t) = [kA : k + P ((at)∗)].

28

So by pigeonhole principle, ∃ i 6= j such that

ξxi − ξxj ∈ k + P ((at)∗)

or, equivalently,

ξ(xi − xj) ∈ k + P ((at)∗).

Since xi 6= xj, we can take x = xi − xj. So ∃ y ∈ k such that

ξx− y ∈ P ((at)∗).

Q.E.D.

Next, let

x = Φ(h1, h2, ..., hr−1) = h1h2...hr−1.

We fix a valuation v. Then

|h1|v...|hr−1|v = |x|v,|hi|v ≤ |t|v, 1 ≤ i ≤ r − 1

since hi ∈ L(t). Then

|x|v ≤ |h1|v|t|r−2v ,

which means that

|x|v|t−1|r−2v ≤ |h1|v ≤ |t|v.

By the definition of | · |v, we know that |h1|v = q−ordvh1v , qv = qdeg(v).

So there exists an e such that

(∗) |x|v|t−1|r−2v ≤ q−ev ≤ |t|v.

We define

29

Tv(x) = #q−ev such that (∗) holds.

We let α : k× → ΠvR×>0 be defined by

α(h) = (|h|v).

Clearly, ker α = F×q . Then

#Ψ−1(x) ≤ (q − 1)r−1(ΠvTv(x))r−1.

So we must study Tv more carefully. Let us write (∗) in termsof qv.

q−ordvxv q(r−2)ordvtv ≤ q−ev ≤ q−ordvtv ,

which means that

ordvx− (r − 2)ordvt ≥ e ≥ ordvt.

The number of possible e’s that satisfy this are

#e’s= ordvx− (r − 1)ordvt+ 1 ≥ 1.

If we let

λv = ordvx− (r − 1)ordvt

then

#e’s= λv + 1 = Tv(x).

Now, we can compare ΠvTv(x) to |t|εA for ε > 0. Recall that, forx ∈ k×, |x|A = 1. So

|tr−1|A = |tr−1|A|x−1|A= Πvq

ordvx−(r−1)ordvt = Πvqλv .

Then

30

ΠvTv(x)

|t|ε(r−1)A

= Πvλv+1

qλvεv.

We determine what this means for two cases:

(i) qεv > 2,(ii) qεv ≤ 2.

Note that the number of v which satisfy case (ii) is finite.Case (i): First,

qλvεv ≥ 2λv ≥ λv + 1

which means that

λv+1

qελvv≤ 1.

So

Πvλv+1

qελvv≤ Πv

λv+12ελv≤ C(ε)

for some constant C(ε) which does not depend upon λ, since, forany λ, ∃ a constant B(ε) such that

λ+12ελ≤ B(ε).

So

ΠvTv(x) ≤ C(ε)|t|ε(r−1)A .

Using this in our bound for #Φ−1(x) gives

#Φ−1(x) ≤ (q − 1)r−1C(ε)r−1|t|ε(r−1)2

A .

Let us write ε for ε(r − 1)2. Then

#Φ−1(x) ≤ (q − 1)r−1C(ε)r−1|t|εA ∀ ε > 0.

Then

31

N1 = #(h1, ..., hr−1) ∈ (L(t)×)r−1 : ξh1...hr−1 ∈ k + P ((at)∗)=∑

x∈L(tr−1)×,ξx∈k+P ((at)∗) #Φ−1(x)≤ C(ε)|t|εA#x ∈ L(tr−1)× : ξx ∈ k + P ((at)∗).

Finally, define

∆(a,r)t (ξ) = #x ∈ L(tr−1) : ξx ∈ k + P ((at)∗).

Using ♣ and the fact that ql(t) = |t|Aq1−g, we have the followingresult for Problem 1:

Problem 1: Let char(k) > r ≥ 3, a ∈ k×, ξ ∈ P (t), and assumethat |t|A > q2g. Then

|W (a,r)t (ξ)|2r−1 ≤ |t|2r−1−r

A (c1|t|r−1A + c2|t|1+ε

A ∆(a,r)t (ξ)),

where c1 and c2 are independent of t.

Having dealt with Problem 1, we turn our attention to Problem 2:

Problem 2: Let ξ ∈ P (t), a ∈ k×, r ≥ 3, |t|A > 22g, and let

∆(a,r)t (ξ) ⊂ k ⊂ kA as above. Define ϕξ : L(tr−1) → kA/(k +

P ((at)∗)) by

ϕξ(x) = ξx (mod k + P ((at)∗)).

Note that ϕ = ϕξ is an additive homomorphism. Then

∆(a,r)t (ξ) = #Ker ϕ− 0.

Recall that under the above hypotheses, we have l(tr−1) > l(t)(Lemma 2.3.1). Note that

L(tr−1)/Ker ϕ → kA/(k + P ((at)∗)).

Since the cardinality of the former group is ql(tr−1)/#Ker ϕ and the

cardinality of the latter is ql(t), the first divides the second and hence

ql(tr−1)−l(t)|#Ker ϕ.

32

Let A = L(tr−1) and B = kA/(k + P ((at)∗)). So #A = ql(tr−1)

and #B = ql(t). This yields the exact sequence

0→ Ker ϕ→ A→ B → Cok ϕ→ 0,

which implies that

#Ker ϕ ·#B = #A ·#Cok ϕ.

So

#Ker ϕ = #Cok ϕ · (ql(tr−1)−l(t)).

Now, recall from page 28 that

l(tr−1) = l(t)(r − 1) + (g − 1)(r − 2).

So

ql(tr−1)−l(t) = q(l(t)+(g−1))(r−2) = |t|r−2

A ,

where the last step is by Riemann-Roch. Then

#Ker ϕξ = |t|r−2A ·#Cok ϕξ.

Let 0 < η < 1 such that #Cok ϕξ ≤ |t|1−ηA . So

#Ker ϕξ ≤ |t|r−1A ,

which means that

∆(a,r)t (ξ) = #Ker ϕξ − 1 ≤ |t|r−1−η

A .

This gives us an upper bound for Problem 1. Let us substitutethis upper bound into the equation from Problem 1:

|W (a,r)t (ξ)|2r−1 ≤ |t|2r−1−r

A (c1|t|r−1A + c2|t|1+ε

A ∆(a,r)t (ξ))

≤ c3|t|2r−1−(η−ε)

A ,

33

where 0 < η − ε < η < 1. If we redesignate ε as ε2r−1 , we can

then rewrite the above as

|W (a,r)t (ξ)| ≤ c4|t|

1− η

2r−1 +ε

A .

As before, we let

f(x) = a1xδ11 + ...+ anx

δnn .

We relate this to the findings above by letting r = δi, t = tδδi , and

|tδδi |A > 22g. Then we define ϕ

(i)ξ : L(t

δ− δδi ) → kA/(k + P ((ait

δδi )∗))

in the obvious way. Moreover, we choose 0 < ηi < 1 in such a way

that #Cok ϕ(i)ξ ≤ |t|

1−ηiA . So

|W (a,δi)

tδδi

(ξ)| ≤ c(i)4 |t

δδi |

1− ηi

2δi−1 +ε

A .

Then

Πni=1|W

(a,δi)

tδδi

(ξ)| ≤ c5Πni=1|t

δδi |

1− ηi

2δi−1 +ε

A .

Examining the exponent in the above expression,∑ni=1

δδi

(1− ηi2δi−1 + ε) =

∑ni=1

δδi− δ

∑ni=1( ηi

δi2δi−1 ) + ε∑n

i=1δδi

Let us define ε0 = ε∑n

i=1δδi

. Then the above is rewritten as∑ni=1

δδi− δ

∑ni=1( ηi

δi2δi−1 ) + ε0

=∑n

i=1δδi− δ + δ − δ

∑ni=1( ηi

δi2δi−1 ) + ε0= δ((

∑ni=1

1δi

)− 1)− (δ(∑n

i=1ηi

δi2δi−1 − 1)− ε0).

Define the latter expression by ε′, i.e. let

ε′ = δ(∑n

i=1ηi

δi2δi−1 − 1)− ε0).

Then

34

Πni=1|W

(a,δi)

tδδi

(ξ)| ≤ c5|t|δ((

∑ni=1

1δi

)−1)−ε′

A .

We will choose ηi in a slightly different manner. Let 0 < ηi < 1be such that∑n

i=1ηi

δi2δi−1 > 1.

Then we define

Et(ηi) = ξ mod k ∈ kA/k : #Cok ϕ(i)ξ ≤ |t|

1−ηiA .

So ∫Etηi |Π

ni=1W

(ai,δi)

tδδi

(ξ)|dξ ≤ c|t|δ((

∑ni=1

1δi

)−1)−ε′

A .

Using this, we have the following proposition

Proposition 2.3.1:∫Etηi |Π

ni=1W

(ai,δi)

tδδi

(ξ)|dξ ≤ c |det ρ1(t)|A|det ρ2(t)|A

|t|−ε′A ,

where

ρ1(t) =

tδδ1 0 ... 0

0 tδδ2 ... 0

0 0 ... tδδn

,

ρ2(t) = tδ,

and hence

det ρ1(t) = tδ

∑ 1δi ,

det ρ2(t) = tδ.

2.4 k = Fq(T )

We now examine the special case where k = Fq(t) is a rational func-tion field. We specify that

f(x) = xδ1 + ...+ xδn,

35

where δ ≥ 2. So

G = k×,

ρ1(t) =

tδδ1 0 ... 0

0 tδδ2 ... 0

0 0 ... tδδn

,

ρ2(t) = tδ,ν ∈ k,t = (τ, 1, 1, ...) ∈ k×A , τ ∈ k×∞,

As before, we let

Nt(ν) =∫kA/k

(∑

α∈L(t) χ(αδξ))nχ(ξν)dξ.

We wish to consider the various valuations on k. Note first that

O = Fq[T ] ⊂ k.

Now, the completion of k at the valuation v associated to a primepolynomial π ∈ O is

kπ = x =∑

ν≥ν0 cνπν , cν ∈ O, deg cν < deg π,

where cν0 6= 0. So we can say that

|x|π = q−ν0π , qπ = qdeg π.

For the place v =∞,

k∞ = x =∑

ν≥ν0 cνT−ν , cν ∈ Fq ⊃ O = Fq[T ],

where the inclusion is analogous to the case where R ⊃ Z. So

|x|∞ = q−ν0 ,

and

|x|∞ = qdeg x.

36

when x ∈ O. For any valuation v, we get the inclusion

kv ⊃ Ov ⊃ Pv = x ∈ Ov : |x|v < 1,

where

Ov = x ∈ kv : |x|v ≤ 1.

Note that for v =∞,

O∞ = x ∈ k∞ :∑

ν≥0 cνT−ν,

P∞ = x ∈ k∞ :∑

ν≥1 cνT−ν.

We also have the relation that⋂v(k ∩ Ov) = Fq,⋂v 6=∞(k ∩ Ov) = O.

Now, in the case of k = Fq(t), we have the diagram

kA

QQQQQQQQQQQQQQ

k

>>>>>>>> k∞ × ΠπOπ = kA(∞)

nnnnnnnnnnnnnn

O,where the top right and the bottom left are isomorphic as com-pact abelian groups, and the notation kA(S) means that for a finiteset of primes S,

kA(S) = Πv∈SkvΠv 6∈SOv.

For x ∈ k∞ with the expansion

x =∑

ν≥ν0 cνt−ν ,

37

we put

c1 = Res x,

the residue of x. Note that an element x ∈ kA(∞) is of the formx = (xv), where x∞ ∈ k∞ and xπ ∈ Oπ.

Now, we will discuss characters on kA, beginning with the char-acter χ1:

χ1(x) = ep(trFq/Fp(Res x∞)),

where

ep(z) = e−2πizp .

If x ∈ O then Res x∞ = 0 and hence χ1(x) = 1. Hence, χ1 be-comes a character of kA(∞)/O ∼= kA/k.

Denote by χ the basic character of kA thus obtained. The char-acter can also be restricted to v; in our case of x ∈ k(∞), χv = χ|kvis defined by

χ∞(x∞) = ep(trFq/Fp(Res x∞)),χπ(Oπ) = 1,

where the latter equality comes from the fact that χπ(O) = 1 andχπ is continuous.

Next, let

k(π) = aπn, n ≥ 0, a ∈ O.

Then we have the following diagram

38

kπ

AAAAAAAA

k(π)

CCCCCCCC Oπ

||||||||

O

0.

Oπ is the integral part of kπ and k(π) is the fractional part. Weextend the character χπ to k(π) via the following claim:

Claim: Let ξ = aπn∈ k(π). Then

χπ(ξ) = ep(tr Res (ξ)).

Proof : Note that ξ ∈ Oπ′ ∀ π′ 6= π. Since ξ ∈ k, χ(ξ) = 1, andχ(x) = Πvχv(x). So

1 = χ(ξ) = χ∞(ξ)Ππ′ 6=π(χπ′(ξ))χπ(ξ)= ep(trFq/Fp(Res ξ))(Ππ′ 6=π1)χπ(ξ)

and the claim follows.

Given a character χ, we can now determine ord χv, where

P−ord χvv = x ∈ kv : χv(xy) = 1 ∀ y ∈ Ov.

We prove the following

Lemma 2.4.1: ord χπ = 0 and ord χ∞ = −2.

Proof : We show first that ord χπ = 0. Note that χπ(Oπ) = 1. So

Oπ ⊂ P−ord χππ .

For x 6∈ Oπ, ∃ l > 0 such that πlx ∈ O×π . Choose a constantc ∈ Fq such that tr c 6= 0 and let δ =deg π. Then we define

y = cTlδ−1

πlx.

39

Note that

χπ(xy) = ep(tr c) 6= 1.

Thus, x 6∈ Oπ ⇒ x 6∈ P−ord χππ .Next, we show that χ∞ = −2, or, equivalently, that

P2∞ = x ∈ k∞ : χ∞(xy) = 1 ∀ y ∈ O∞.

We show containment in both directions:

(⊂) We know that x ∈ P2∞ implies

x = c2T 2 + c3

T 3 + ....

If y ∈ O∞ then

y = b0 + b1T

+ b2T 2 + ....

Multiplying these expressions together, it is clear that Res xy = 0and hence

χ∞(xy) = 1.

(⊃) Assume that χ∞(xy) = 1 ∀ y ∈ O∞. Let

x = c−ν0Tν0 + ...+ c−1T + c0 + c1

1T

+ ....

We claim that

c−ν0 = ... = c−1 = c0 = c1 = 0.

Then x ∈ P2∞ as required. In fact, let

y = λT ν0+1 ∈ O∞,

where λ ∈ Fq. Then

40

xy =λc−ν0T

+ λc−ν0+1 + ...

By assumption on x,

1 = χ∞(xy) = ep(tr (λc−ν0)).

So

tr (λc−ν0)) = 0 ∀ λ ∈ Fq,

and hence c−ν0 = 0. Similarly, we get

c−ν0+1 = ... = c−1 = c0 = c1 = 0.

Q.E.D.

Recall that we defined

π(χ) = (π−ord χvv ) ∈ kA,

where

πv =

π if v = π,

T−1 if v =∞.

Then

q2g−2 = |π(χ)|A = Πv|πv|−ord χvv = |T−1|−ord χ∞∞= |T |−ord χ∞∞ = (qdeg T )−2 = q−2.

Thus, g = 0, which corresponds with the standard definition ofgenus of a rational function field.

Now, let

t0 = (T−1, 1, 1, ....) ∈ k×A .

Then

|t0|A = |T−1|∞ = q−1,

41

P (t0) = P∞ × ΠπOπ ⊂ kA,L(t0) = k

⋂P (t0) = 0.

Note that

t∗0 = π(χ)t−10 = (T−2, 1, 1, ...)(T, 1, 1, ...) = (T−1, 1, 1, ...) = t0,

l(t∗0) = l(t0) = 0.

Because this last line is true, our diagram from p.20 is shrunk asshown:

kA

LLLLLLLLLLL

uuuuuuuuuuu

k

HHHHHHHHHH P (t0)

rrrrrrrrrr

L(t0) = 0.

Assume that∫kA/k

dx = 1.

For each v, we denote dxv the measure given by∫Ov dxv = 1.

For a fixed character χ, we denote by dχvxv the Haar measure on kvwhich is consistent with the self-duality kv ∼= kv. Then we have

dχvxv = q−ord χv

2 dxv,dx = Πvd

χvxv = qΠvdxv,

where the last equality comes from Lemma 2.4.1. (For more de-tails, see Weil, Basic Number Theory p.108).

We can use these for the case of

t = (T d, 1, 1, ....),

where d ≥ 1. Accordingly, we have

42

|t|A = qd > q2g = 1,P (t) = x ∈ kA : |x|v ≤ |t|v ∀ v

= P−d∞ × ΠπOπ,

L(t) = k ∩ P (t) = x ∈ O = Fq[T ], |x|∞ ≤ qd= x ∈ O = Fq[T ], deg x ≤ d.

Note that the last expression indicates that L(t) is simply the poly-nomials of degree less than d. So we can adjust the notation toreflect this fact. Define

M(d) = x ∈ O = Fq[T ], deg x ≤ d = L(t).

Then

l(t) = dimFqM(d) = d+ 1.

If we let

ξ ∈ P (t0) = P∞ × ΠπOπ

then

Wt(ξ) =∑

x∈L(t) χ(xδξ).

Of course, L(t) = M(d) ⊂ O. Note that for any non-infinite place,

πδξπ ∈ Oπ, χπ(xδξπ) = 1.

So

Wt(ξ) =∑

x∈M(d) χ∞(xδξ∞).

Let us denote

Wd(ξ∞) =∑

x∈M(d) χ∞(xδξ∞).

Recalling the definition for Nt(ν),

43

Nt(ν) =∫kA/k

(∑

α∈L(t) χ(αδξ))nχ(ξν)dξ

= q∫kA/k

(Wd(ξ∞))nχ∞(ξ∞ν)dξ∞,

where∫O∞ dξ∞ = 1 and

∫P∞ qdξ∞ = 1.

From the previous section, we saw that

|Wt(ξ)|2δ−1 ≤ c1|t|2

δ−1−1A + c2|t|2

δ−1−δ+1+εA ∆t(ξ),

where

∆t = #Ker ϕξ − 1

and ϕξ is described in the previous section. We let

Ed(η) = ξ ∈ P (t0) : #Cok ϕξ ≤ |t|1−ηA .

This is essentially our definition of Et(η) from before. We will take

1 > η > δ2δ−1

n. Then∫

Ed(η)Wd(ξ)

nχ(νξ)dξ = O(|t|n−δ−ε′A ),

where ε′ = ε′(ε) > 0. This integral is analogous to the minor arcestimation in the initial work of Hardy and Littlewood.

Since we have t = (T d, 1, 1, ...)

t∗ = π(χ)t−1 = (T−d−2, 1, 1, 1, ...),

which gives us the diagram

kA

IIIIIIIII

rrrrrrrrrrr

k + P (t∗)

KKKKKKKKKK

vvvvvvvvvvP (t0),

vvvvvvvvv

k

IIIIIIIIIII P (t∗)

rrrrrrrrrrr

0

44

where

P (t0) = P∞ × ΠπOπ,P (t∗) = Pd+2

∞ × ΠπOπ.

Then

P (t0)/P (t∗) = P∞/Pd+2∞ ,

with

|P∞/Pd+2∞ | = qd+1.

Note that

kA/(k + P (t∗)) ∼= P (t0)/P (t∗).

We can describe α ∈ kA in terms of an integral part [α] ∈ k and afractional part α ∈ P (t). Moreover, if α∞ ∈ P∞ is defined inthe obvious manner then we can project from α ∈ kA to P∞/Pd+2

∞ by

α 7→ α∞ (mod Pd+2∞ ).

With this mapping, we can create a commutative diagram. Letξ ∈ P (t0). Then

L(tδ−1) = M(d(δ − 1))ϕξ //

++VVVVVVVVVVVVVVVVVVVVkA/(k + P (t∗))∫

P∞/Pd+2

∞

.

It is a trivial exercise to show that

ξx∞ = ξ∞x,

which shows that the diagram commutes.Next, let

Ed(η)∞ = α ∈ P∞ : #Cok ϕα ≤ qd(1−η).

45

Then

Ed(η) = Ed(η)∞ × ΠπOπ.

So in our integration over Ed(η), we can ignore every place exceptthe infinite one. Then

(♠)∫Ed(η)∞

Wd(ξ)nχ(νξ)dξ = O(|t|n−δ−ε′A ).

Now, for x ∈ k∞, we note that

k∞ = O + P∞

and hence we can again break x into an integral part [x] ∈ O and afractional part x.

For α ∈ P∞, we can define ϕα : M(d(δ − 1))→ P∞/Pd+2∞ by

ϕα(x) = αx (mod Pd+2∞ ).

This is the simply the previous definition for ϕξ combined with theisomorphism from kA/(k + P (t∗)) to P∞/Pd+2

∞ . Clearly,

x ∈ ker ϕα ⇔ |αx|∞ ≤ 1qd+2 .

This gives us the following approximation theorem:

Lemma 2.4.2: ∀ λ > 0 and ∀ α ∈ k∞, ∃ (x, y) ∈ O2 where x 6= 0such that |x|∞ ≤ qλ (i.e. deg x ≤ λ) and |αx− y|∞ < 1

qλ.

Proof : Let N = [λ]. So N ≤ λ ≤ N + 1. Then we defineψ : M(N)→ P∞/PN+1

∞ by

ψ(x) = x∞ (mod PN+1∞ ).

Clearly,

M(N)/Ker ψ → P∞/PN+1∞ .

46

We know that #M(N) = qN+1 and #(P∞/PN+1∞ ) = qN . So Ker

ψ 6= 0. Let x ∈Ker ψ such that x 6= 0. Then

αx− [αx] = αx ∈ PN+1∞ .

Let y = [αx] ∈ O. Then

|x|∞ ≤ qN ≤ qλ

by assumption and

|αx− y|∞ = |αx− [αx]|∞ ≤ 1qN+1 <

1qλ

.

Corollary 2.4.1: Let α ∈ P∞ and λ > 0. Then ∃ (m, a) ∈ O2 withm 6= 0, (m, a) = 1, and deg a <deg m such that |m|∞ ≤ qλ and|mα− a| < 1

qλ.

Proof : Take x, y as in Lemma 2.4.2. Then we can find a,m suchthat (a,m) = 1 by letting a

mbe the reduced fraction for y

x.

Now, we can finally ascertain bounds for the kernel and cokernel ofϕα. Recall that

#Ker ϕα = |t|δ−2A ·#Cok ϕα = qd(δ−2)#Cok ϕα.

Let µ = deg m ≤ δ. Then

Proposition 2.4.1: #Ker ϕα ≤

qµ−(d+1) if µ− 1 ≥ d(δ − 1),

qdδ−2d if d(δ − 1) > µ− 1 > d,

qdδ−d−µ+1 if µ− 1 ≤ d.

Proposition 2.4.2: #Cok ϕα ≤

qµ−1−d(δ−1) if µ− 1 ≥ d(δ − 1).

1 if d(δ − 1) > µ− 1 > d.

qd−(µ−1) if µ− 1 ≤ d.

Proof : We only prove the first case of Proposition 2.4.1; the otherproofs are similar. So let µ−1 ≥ d(δ−1). SoM(d(δ−1)) < M(µ−1).

47

Then

#Ker ϕα ≤ #x ∈M(µ− 1) : αx ∈ Pd+2∞ .

Using Corollary 2.4.1, we note that x ∈M(µ− 1) means that

αx ≡ amx (mod Pd+2

∞ ).

So the above bound can be rewritten as

#Ker ϕα ≤ #x ∈M(µ− 1) : amx ∈ P d+2

∞ .

The above is a set of representatives for O/mO (i.e. ax ≡ u (modm)). So

#Ker ϕα ≤ #u ∈M(µ− 1) : um ∈ P d+2

∞ .

But

um ∈ P d+2

∞ ⇒ | um|∞ ≤ 1

qd+2

⇒ qdeg u

qµ≤ 1

qd+2

⇒ deg u ≤ µ− (d+ 2)⇒ # Ker ϕα ≤ qµ−(d+1).

Q.E.D.To clarify the separation into major and minor arcs, we define

0 < θ ≤ 12,

d ≥ 2θ.

Later, it will suffice to consider θ = 12. Define Ed(η) as before,

and let a,m be such that am∈ k

⋂P∞, where

a,m ∈ O = Fq[t],deg a <deg m,(a,m) = 1,µ =deg m ≤ dθ.

48

With these restrictions, we can define

M am

= M am

(d, θ) = α ∈ P∞ : |mα− a|∞ < 1qd(δ−θ)

.

We require the following proposition:

Proposition 2.4.3: If am6= a′

m′then M a

m

⋂M a′

m′= ∅.

Proof : Let α ∈M am

⋂M a′

m′. Since | · |∞ is non-archimedean,

| am− a′

m′|∞ ≤ max(| a

m− α|∞, |α− a′

m′|∞) < 1

qd(δ−θ).

Note that

| am− a′

m′|∞ = |am′−a′m

mm′|∞

= qdeg (am′−a′m)

qdeg m+deg m′

≥ 1q2dθ

.

because deg m ≤ dθ. So

d(δ − θ) < 2dθ⇒ δ − θ < 2θ⇒ δ < 3θ ≤ 3

2.

But δ ≥ 2, which is a contradiction.

Let

M(d, θ) =⋃

am

M am

(d, θ).

This is the major arc on P∞. The minor arc is then

M∗(d, θ) = P∞ −M(d, θ).

Proposition 2.4.4: If n > δ2δ

θthen M∗(d, θ) ⊂ Ed(

θ2)∞.

As an immediate corollary to the Proposition, we get

49

Corollary 2.4.2: Let θ = 12, n > δ2δ+1. Then, for ν ∈ O,∫

M∗(d,θ)Wd(α)nχ∞(να)dα = O(|t|n−δ−εA ).

This corollary follows from Proposition 2.4.4 and the equation de-noted (♠).

Proof of Proposition 2.4.4: Let α ∈M∗(d, θ). Putλ = d(δ− θ) > 0. Then ∃ a

m∈ k

⋂P∞ such that µ =deg m < λ and

|mα− a|∞ < 1qd(δ−θ)

.

Then µ > dθ (since µ ≤ dθ implies α ∈M am

). So dθ < µ ≤ d(δ−θ).Now, recall that we had three cases for the cokernel of ϕα:

#Cok ϕα ≤

qµ−1−d(δ−1) if µ− 1 ≥ d(δ − 1),

1 if d(δ − 1) > µ− 1 > d,

qd−(µ−1) µ− 1 ≤ d.

Recall that n > δ2δ

θ, and let η = θ

2. Then 1 > η > δ2δ−1

n. In

this case, we have

Ed(θ2)∞ = α ∈ P∞ : #Cok ϕα ≤ qd(1−η).

We will prove the first case; the others are similar and are left as anexercise to the reader.

Case 1 : #Cok ϕα ≤ qµ−1−d(δ−1).

But

µ− 1− d(δ − 1) = µ− 1− dδ + d≤ d(δ − θ)− 1− dδ + d= d(1− θ − 1

d)

≤ d(1− θ)≤ d(1− θ

2)

= d(1− η).

Now, to garner a result which parallels that of Hardy and Little-

50

wood, we define analogues for some of the concepts used in theoriginal proof. Let

S am

=∑

x∈M(µ−1) χ∞( amxδ).

S am

serves as an analogue to the Gauss sum. Additionally, let

β = α− am

,

and let

I(β) =∫|ξ|∞≤qd χ∞(βξδ)dξ.

I(β) will help us to define our Gamma factor. First, however, weshow how it relates to Weyl sums:

Proposition 2.4.5: Let S am

and I(β) be as above. Then

Wd(α) =∑

x∈M(d) χ∞(αxδ) = 1qµ−1S a

mI(β).

Proof : First, we know that

µ ≤ dθ < d2< d.

So we can divide polynomials in M(d) by m to get a quotient yand remainder z, i.e.

M(d) = m(M(d− µ)) +M(µ− 1)

such that, for x ∈M(d), ∃ y ∈M(d− µ), z ∈M(µ− 1) where

x = my + z.

We use the above, along with the definition of β, to find that

αxδ = (β + am

)(my + z)δ

≡ β(my + z)δ + amzδ (mod O).

So

51

Wd(α) =∑

x∈M(d) χ∞(αxδ)

=∑

y∈M(d−µ)

∑z∈M(µ−1) χ∞(β(my + z)δ)χ∞( a

mzδ)

=∑

z∈M(µ−1) χ∞( amzδ)∑

y∈M(d−µ) χ∞(β(my + z)δ).

Let us fix z. Then we define

J(β) =∫|η|∞≤qd−µ χ∞(β(mη + z)δ)dη.

Note that

η ∈ k∞ : |η|∞ ≤ qd−µ = M(d− µ) + P∞,

where the sum on the right is actually a direct sum. This meansthat, for η in the set above, ∃! y ∈ M(d − µ), u ∈ P∞ such thatη = y + u. Then

J(β) =∑

y∈M(d−µ)

∫y+P∞ χ∞(β(my +mu+ z)δ)du.

We claim that

χ∞(β(my + z)δ−ν(mu)ν) = 1

for ν ≥ 1. In view of ord(χ∞) = −2, this is equivalent to show-ing that

|β(my + z)δ−ν(mu)ν |∞ ≤ 1q2

,

or, equivalently,

|β(my + z)δ−ν(mu)ν |∞ < 1q.

Since | · |∞ is non-archimedean,

|my + z|∞ ≤ maxqµ+d−µ, qµ−1 = qd.

Moreover, u ∈ P∞ implies that |mu|∞ ≤ qµ−1. So

|β(my + z)δ−ν(mu)ν |∞ ≤ ( 1qd(δ−θ)

)(qd(δ−ν))(qν(µ−1))

52

= q−dδ+dθ+d(δ−ν)+ν(µ−1)

= qdθ−dν+ν(µ−1)

= qdθ−ν(d−(µ−1))

≤ qdθ−(d−(µ−1))

≤ qd2−d+µ−1

< q−1,

which proves the claim. This claim tells us that

J(β) =∑

y∈M(d−µ) χ∞(β(my + z)δ)∫y+P∞ du

= q−1∑

y∈M(d−µ) χ∞(β(my + z)δ).

Then

Wd(α) =∑

z∈M(µ−1) χ∞( amzδ)∑

y∈M(d−µ) χ∞(β(my + z)δ)

= q∑

z∈M(µ−1) χ∞( amzδ)J(β).

Now, in the original expression for J(β), we can change variablesfrom η to ξ by letting

ξ = mη + z.

Then

|η|∞ ≤ qd−η ⇔ |ξ|∞ ≤ qd.

Note that, for the measures defined by dξ and dη, we have therelationship

dξ = |m|∞dη = qµdη.

So

J(β) =∫|η|≤qd−µ χ∞(β(mη + z)δ)dη

= |m|−1∞∫|ξ|≤qd χ∞(β(ξ)δ)dξ

= |m|−1∞ I(β).

Q.E.D.

53

Finally, we can state our analogue to the Hardy-Littlewood resultfor Nt(ν). Define

Sd,θ =∑

m∈M([dθ])

∑a∈M(µ−1),(a,m)=1(|m|−1

∞ S am

)nχ∞(ν am

)

and

Γd,θ(νT−dδ) =

∫|γ|∞<qdθ(

∫|ζ|∞≤1

χ∞(γζδ)dζ)nχ∞(νT−dδγ)dγ

Applying proposition 2.4.5 to the expression for Nt(ν) on page 44:

Nt(ν) = q∑

m∈M([dθ])

∑a∈M(µ−1),(a,m)=1

∫M a

m (d,θ)Wd(α)nχ∞(να)dα

+O(qd(n−δ−ε′))= qqn(1−µ)

∑m∈M([dθ])

∑a∈M(µ−1),(a,m)=1

∫M a

m(d,θ)

SnamI(β)nχ∞(να)dα

+O(qd(n−δ−ε′)).

If we let

βT dδ = γξT−d = ζ

then the associated changes in measure will be

qdδdβ = dγq−ddξ = dζ.

Let us define (∗) to be

(∗) =∫|β|∞< 1

qd(δ−θ)(I(β))nχ∞(νβ)dβ

= q−dδ∫|γ|∞<qdθ(I(T−dδγ))nχ∞(νT−dδγ)dγ

= qd(n−δ) ∫|γ|∞<qdθ(

∫|ζ|∞≤1

χ∞(γζδ)dζ)nχ∞(νT−dδγ)dγ.

From this equality, we see that

Nt(ν) = qn+1qd(n−δ)Sd,θ(ν)Γd,θ(νT−dδ) +O(qd(n−δ−ε′)).

54

If we let

Γd,θ =∫|γ|∞<qdθ(

∫|ζ|∞≤1

χ∞(γζδ)dζ)ndγ

then, using the above expression for Nt(ν), we obtain the follow-ing theorem:

Theorem 2.4.1: Let 0 < θ ≤ 12, ch(k) > δ, d > deg ν

δ−1, d ≥ 2

θ, and

n > δ2δ

θ. Then

Nt(ν)

|t|n−δA= qn+1Sd,θ(ν)Γd,θ +O(q−dε(θ)).

We wish to define the limits

S(ν) = limd→∞Sd,θ(ν),Γ = limd→∞ Γd,θ

where the latter will be postponed until the discussion of Kelvin’sprinciple in the next section. We are aided by the fact that, for ν,we can find d such that νT−dδγ ∈ P2

∞ ∀ γ which satisfy |γ|∞ < qdθ;this means that χ∞(νT−dδγ) = 1. So we can define Γ as∫

k∞(∫O∞ χ∞(γξδ)dξ)ndγ.

We require a proposition:

Proposition 2.4.6: ∃ a c independent of m such that

|S am| ≤ c|m|

1− 1

2δ−1 +ε∞ ∀ ε > 0.

Proof : First, we will let

t = (T µ−1, 1, 1, 1, ...),ξ = ( a

m, 1, 1, 1, ...).

Note that, by our definitions,

S am

= Wµ−1( am

).

55

So

|S( am

)| = |Wµ−1( am

)| = |Wt(ξ)| ≤ c|t|1− η

2δ−1 +ε

A

for some 0 < η < 1, and η > ε is such that

#Cok ϕξ ≤ |t|1−ηA ,

or, equivalently,

#Cok ϕ am≤ (qµ−1)1−η.

We recall from Proposition 2.4.2 that

#Cok ϕα ≤

qµ−1−d(δ−1) if µ− 1 ≥ d(δ − 1),

1 if d(δ − 1) > µ− 1 > d,

qd−(µ−1) µ− 1 ≤ d.

Since d = µ− 1, we use the third case. So

#Cok ϕξ ≤ qd−(µ−1) = 1,

which means that 1− η = ε′ > 0 can be arbitrarily small. Then

|S am| = |Wt(ξ)| ≤ c|t|

1− 1−ε′

2δ−1 +ε

A .

Let

ε′′ = ε+ ε′

2δ−1 .

Then

|S am| = |Wt(ξ)| ≤ c|t|

1− 1

2δ−1 +ε′′

A

= c(qµ−1)1− 1

2δ−1 +ε′′

< cq−1+ 1

2δ−1 |m|1− 1

2δ−1 +ε′′

∞

= c′|m|1− 1

2δ−1 +ε′′

∞

56

where c′ = cq−1+ 1

2δ−1 .

Q.E.D.

Let

S(ν) =∑

m∈Fq [t], monic∑

a∈M(µ−1),(a,m)=1(|m|−1∞ S a

m)nχ∞(ν a

m).

We wish to show two things: that S(ν) converges, and that it islimd→∞Sd,θ(ν).

Claim: S(ν) converges.

Proof : From Proposition 2.4.6,

(|m|−1∞ |S a

m|)n ≤ c|m|−

n

2δ−1 +ε

for some ε > 0. So∑a∈M(µ−1),(a,m)=1(|m|−1

∞ S am

)n ≤ c1|m|1− n

2δ−1 +ε

A

since the number of possible a’s is bounded by qµ−1 < |m|∞. Werecall that

n > δ2δ

θ> δ2δ ≥ 2δ+1.

So

− n2δ−1 < −2− 1

2δ−1 .

Then∑a∈M(µ−1),(a,m)=1(|m|−1

∞ S am

)n ≤ c1|m|−1− 1

2δ−1 +ε∞

= c1

|m|1+ε′∞,

and∑m

1

|m|1+ε′∞< +∞,

57

thus proving the claim.

Claim: limd→∞Sd,θ(ν) = S(ν).

Proof : By definition,

|S(ν)−Sd,θ(ν)| ≤∑

deg m>dθ

∑a∈M(µ−1),(a,m)=1(|m|−1

∞ |S am|)n

≤ c∑

deg m>dθqµ

qµ(1+ε′)

= c∑

deg m>dθ1

qµ(ε′)

≤ c2q−ε′′d.

Thus, S(ν) is as required.

2.5 Kelvin’s Principle in P-adic fields

For this section, we let K be a P-adic field, O be the unique max-imal compact subring of K, and P the unique maximal ideal in O,where O/P = Fq. Also, define

dx = the basic Haar measure on k, where∫O dx = 1,

P = (π),|π|K = q−1,χ = a basic character of K+,S(Kn) = ϕ : Kn → C : ϕ is locally constant and compactly

supported.

S(Kn) is known as the Schwartz space, and ϕ ∈ S(Kn) is trueif and only if ∃ lattices L,M ⊂ Kn such that L ⊃ M , supp ϕ ⊂ L,and ϕ is constant on each coset in L/M .

For ϕ ∈ S(Kn), define Gfϕ by

Gfϕ(ξ) =∫Kn ϕ(x)χ(< f(x), ξ >)dχx,

where

dχx = q−n2ord χdx,

dx = dx1dx2...dxn.

58

For simplicity, we write dx for dχx most of the time. For a finitefield, this is a kind of Gauss sum, so we can say that this is theGauss sum generalized to the adeles. We will let f : Kn → Km bea polynomial map composed of a series of fi : Kn → K such that

f(x) = (f1(x), f2(x), ..., fn(x)).

We will let dfx denote the Jacobian matrix for f at x. Additionally,we let

Cf = x ∈ Kn : rank dfx < m,

the critical set for f . A natural question arises as to when Gfϕ ∈S(Km). To answer this question, we use the following version ofKelvin’s Principle:

Theorem 2.5.1 (Kelvin’s Principle): If Cf⋂supp ϕ = ∅ then

Gfϕ ∈ S(Km).

Proof : First, if n < m then Cf = Kn. So supp ϕ = ∅, which meansthat ϕ = 0, and hence Gfϕ = 0. Hence, it suffices to consider thecase n ≥ m.

Now, we can assume that ϕ is a characteristic function of a com-pact open set C such that

(i) Cf⋂C = ∅.

(ii) Let Mλ be an m ×m minor of dfx determined by λ, whereDλ is its determinant. Then |Dλ(x)|K = b 6= 0 on C for some λ.

(iii) Let h : Kn → Kn be defined by

h(x) = (f1(x), ...fm(x), xm+1, ..., xn).

Then h induces an analytic homeomorphism between C and N =a+ παOn for some a ∈ Kn.

Let

59

(παOn)′ = ξ ∈ Kn : χ(< t, ξ >) = 1 ∀ t ∈ παOn.

Then it is enough to show that ξ 6∈ (παOm)′ implies Gfϕ(ξ) = 0. Solet ξ 6∈ (παOm)′, and let

t = (t1, ..., tm),u = (um+1, ..., un).

So

Gfϕ(ξ) =∫cχ(< f(x), ξ >)dx

=∫Nχ(< t, ξ >)|Dλ(x)|−1

K dtdu= b−1

∫a′′+παOn−m

∫a′+παOm χ(< t, ξ >)dt,

where b is from (ii) and a = (a′, a′′) ∈ Kn. Then∫a′+παOm χ(< t, ξ >)dt = χ < a′, ξ >

∫παOm χ(< t, ξ >)dt = 0

since ξ 6∈ (παOm)′. Q.E.D.

We can use this in the proof of the following theorem

Theorem 2.5.2: Let f : Kn → Km be be a homogeneous polyno-mial map of degree δ such that Cf = 0, and let |ξ| = max1≤i≤m|ξi|K .Then

Gfϕ(ξ) = O(|ξ|−nδ ),

for all ϕ ∈ S(Kn).

Remark: This theorem can be applied when m = 1, f(x) =xδ1 + ... + xδn, and p - δ, δ ≥ 2, where p is the characteristic ofK. In this case,

Cf = x : ∂f∂xi

(x) = 0 ∀ i = x : δxδ−1i = 0 ∀ i = 0.

If we let ϕ0 be the characteristic function of On ⊂ Kn then

< f(x), ξ >= (xδ1 + ...+ xδn)ξ,

60

which means that

Gfϕ0(ξ) =∫On χ(ξ(xδ1 + ...+ xδn))dx

= (∫O χ(ξxδ)dx)n

= O(|ξ|−nδ ).

Proof of Theorem 2.5.2: Let

ψ(x) = ϕ(x)− ϕ(0)ϕ0(x),

where ϕ0 is the characteristic function of On. Then

ψ(0) = ϕ(0)− ϕ(0)ϕ0(0) = 0.

Now, since ψ ∈ S(Km) by Theorem 2.5.1, it is enough to showthat

Gfϕ0(ξ) =∫On χ(< f(x), ξ >)dx = O(|ξ|−nδ ).

Let ψ0 be the characteristic function of On−πOn. Then ψ0(0) = 0.So Gfψ0 ∈ S(Km) again by Theorem 2.5.1. Hence it has a compactsupport. So

0 = Gfψ0(ξ) =∫On−πOn χ(< f(x), ξ >)dx

for |ξ| = ql ≥ ql0 for some l0. So∫On =

∫πOn . Inductively, we

can make this smaller and smaller if l is sufficiently greater than l0.To be more precise, choose N so that

l −Nδ ≥ l0 ≥ l − (N + 1)δ,

and let

x = πy,

which means that

61

Exercise: Let m = 1. Then

|Gfϕ(ξ)| ≤ c 1

|ξ|nδ

for |ξ| > qν0 for some ν0 ≥ 0. Hence, if we define

Jν =∫|ξ|>qν

dξ

|ξ|nδ

then

Jν = c1(q1−nδ )ν

2.6 k = Fq[T ] (continued)

Let us return to the function field k = Fq(T ), and let us put K = k∞as in section 2.5. Then we have the following:

Theorem 2.6.1: Let 0 < θ ≤ 12, ch(k) > δ, d > deg ν

δ−1, d ≥ 2

θ, and

n > δ2δ

θ. Then

Nt(ν)

|t|n−δA= qn+1S(ν)Γ +O(q−dε

′).

For this theorem, we have finally eliminated the dependence of Sand Γ on d and θ.

Proof : We have already shown that S is the limit of Sd,θ as d→∞.Now, we must do the same for Γ. We can see that

Γ =∫k∞

∫On∞

χ∞(γ(ζδ1 + ...+ ζδn))dζdγ

=∫k∞

(∫O∞ χ∞(γ(ζδ))dζ)ndγ

=∫k∞Gfϕ0(γ)dγ.

where ϕ0 is the characteristic function on On∞. Similarly, by ourdefinition of Γd,θ,


∫O∞ χ∞(γζδ)dζ)ndγ

=∫|γ|∞<qdθ Gfϕ0(γ)dγ,

From this and from the exercise above, it is obvious that limd→∞ Γd,θ =

63

Γ.

Q.E.D.

3 Singular Series

3.1 N(ν) and S(ν)

Let k be an A-field, and let kA be the ring of adeles over k. We fix abasic character χ of k. Let f : kn → km be a polynomial map overk. We use the same notation f for maps obtained from f over kv,kA in the natural way.

Next, let

ϕ : knA → C

be a function in S(knA), the Schwartz space (this space will be dis-cussed in more detail later). Then we define

(∗) N(ν) =∑

γ∈kn,f(γ)=ν ϕ(γ).

This sum can be thought of as Nf,ϕ(ν); this sum converges as aresult of the fact that ϕ is a Schwartz function.

Now, we will limit our study by introducing the following axioms:

(A.1)∑

γ∈kn |ϕ(γ)| < +∞, i.e. ϕ|kn ∈ L1(kn).

(A.1)A ϕ ∈ L1(knA).

Again, we let

(∗∗) (Γϕ)(ξ) =∑

γ∈kn ϕ(γ)χ(< f(γ), ξ >).

This is clearly integrable, since |χ(< f(γ), ξ >)| = 1. Moreover,because f(γ) ∈ km, it is clear that Γϕ is a function modulo km, i.e.

(Γϕ)(ξ + a) = (Γϕ)(ξ), ∀ a ∈ km

Recall also that, for ν ∈ km, the ”ν-th Fourier coefficient of (Γϕ)(ξ)”

64

is ∫kmA /k

m(Γϕ)(ξ)χ(< ξ, ν >)dξ

=∑

γ∈kn ϕ(γ)∫kmA /k

m χ(< f(γ)− ν, ξ >)dξ

Let

ψ(ξ) = χ(< f(γ)− ν, ξ >).

Note that ψ ∈ kmA /km. Hence we have

∫kmA /k

m ψ(ξ)dξ =

1 if ψ = 1,

0 otherwise.

So this integral is 1 if and only if f(γ) = ν. Thus,∑γ∈kn ϕ(γ)

∫kmA /k

m χ(< f(γ)− ν, ξ >)dξ

=∑

γ∈kn,f(γ)=ν ϕ(γ)

= N(ν).

This means that (∗) is the ”ν-th Fourier coefficient of (∗∗)”.Now, if we move from the sum over kn to the integral over knA,

we require the adelized axiom (A.1)A. For ξ ∈ kmA , we let

Gϕ(ξ) =∫knAϕ(x)χ(< f(x), ξ >)dx.

By (A.1)A, this integral converges. We let

S(ν) =∫kmAGϕ(ξ)χ(< ξ, ν >)dξ.

Clearly, S(ν) = Gϕ(ν). In order for this to be useful, we requireanother axiom:

(A.2) Gϕ ∈ L1(kmA ).

This allows the integral S(ν) to converge.

65

3.2 Nt(ν) and St(ν)

Let G be an algebraic group over k; i.e, let G be a group which is analgebraic variety over k such that there exists a multiplication mapG×G→ G and an inverse map G→ G which are both morphismsover k. We take a universal field Ω that contains k, and we let

GLn = GLn(Ω) = s = (sij), sij ∈ Ω : det(s) 6= 0.

Define ρ to be a representation over k of G:

ρ : G→ GLnt 7→ ρ(t).

Hence ρ(st) = ρ(s)ρ(t). We let Gk be the k-rational points of G. Asusual, one can define

ρk : Gk → GLn(k)ρA : GA → GLn(A).

When the context is clear, we may use ρ interchangeably with ρk orρA. If we define a representation of G in GLm by

ω : G→ GLm,

then it is clear that G acts both on Ωn and Ωm, since, for x ∈ Ωn

and y ∈ Ωm, we can define the action on each by

tx = ρ(t)xty = ω(t)y.

As before, f : kn → km is a polynomial map. We will assumethat

f(tx) = tf(x)

or, equivalently,

f(ρ(t)x) = ω(t)f(x).

66

Next, for a function ϕ : knA → C, we define

ϕt(x) = ϕ(t−1x).

Note that

ϕst(x) = (ϕt)s(x) = ϕt(s−1x) = ϕ(t−1s−1x) = ϕ((st)−1x).

Hence the left action of G on ϕ is well-defined.We state three propositions and leave their proofs as exercises:

Proposition 3.2.1: If a ∈ Gk and t ∈ GA then Nat = Nt(a−1ν)

and Sat(ν) = St(a−1ν).

For the second proposition, let

∆ρ(t) = |det ρ(t)|A∆ω(t) = |det ω(t)|A

Proposition 3.2.2: St(ν) = ∆ρ(t)∆ω(t)−1S(ω(t)−1ν).

Proposition 3.2.3: Let GA(ϕ) = u ∈ GA : ϕu = ϕ. Then, fort ∈ GA and u ∈ GA(ϕ)

Ntu(ν) = Nt(ν)Stu(ν) = St(ν).

Using the definition from Proposition 3.2.2, we can define our thirdand final axiom:

(A.3) ∆ρ(t) : t ∈ GA ⊂ R×>0 is unbounded.

Now, we concatenate these five pieces of information (the group andthe various functions and representations) into a single symbol A:

A = (G, ρ, ω, f, ϕ).

Definition: We say that A is admissible if (A.1), (A.1)A, (A.2),

67

and (A.3) hold and

f(ρ(t)x) = ω(t)f(x).

Additionally, we say that A enjoys an asymptotic formula if ∃ r > 0such that

Nt(ν)∆ρ(t)∆ω(t)−1 = S(ω(t)−1ν) +O(∆ρ(t)

−r).

3.3 Poisson Summation Formula

For ϕ ∈ S = S(knA), its Fourier transform is

ϕ(ξ) =∫knAϕ(x)χ(< x, ξ >)dx ∈ S

and then we get

ϕ(x) =∫knAϕ(ξ)χ(< x, ξ >)dξ ∈ S,

i.e. ϕ(x) = ˆϕ(−x), the inversion formula.

Here, we state two versions of the Poisson summation formula:

Theorem 3.3.0: Let ϕ ∈ S. Then∑γ∈kn ϕ(γ) =

∑γ∈kn ϕ(γ).

Note that

ϕ(0) =∫knAϕ(x)dx.

A generalization of this is the following:

Theorem 3.3.1: Let ϕ ∈ S. Then∑γ∈kn ϕ(x+ γ) =

∑γ∈kn ϕ(γ)χ(< x, γ >).

Remark: Using this, we can actually compute the difference be-tween St(ν) and Nt(ν):

68

St(ν)−Nt(ν) =∫kmA /k

m Ψt(ξ)χ(< ξ, ν >)dξ

where

Ψt(ξ) =∑

γ∈km Gϕt(ξ + γ)−∑

γ∈km ϕt(γ)χ(< f(γ), ξ >).

Proof of Theorem 3.3.1: First, let us write

α(x)(ξ) = χ(< x, ξ >)

and

Φ(x) =∑

γ∈kn ϕ(x+ γ).

Φ is a function on knA/kn, since adding an element in kn to x leaves

Φ unchanged. We can consider the ”γ-th” Fourier coefficient Φ(γ)to be

Φ(γ) =∫knA/k

n Φ(x)α(x)(ξ)dx

=∫knA/k

n(∑

δ∈kn ϕ(x+ δ))α(x)(ξ)dx

=∫knA/k

n(∑

δ∈kn ϕ(x+ δ))α(x+ δ)(ξ)dx

=∫knAϕ(x)α(x)(ξ)dx

= ϕ(γ)

where the third line comes from the fact that, for δ, γ ∈ kn,

α(x+ δ)(ξ) = α(x)(ξ)α(a)(ξ) = α(x)(ξ),

and the fourth line is Fubini’s theorem applied to the exact sequence

0→ kn → knA → knA/k → 0.

Note that ϕ(γ) is then the ”γ-th Fourier coefficient”. Using thisto construct the Fourier series for Φ,

Φ(x) =∑

γ∈kn ϕ(γ)χ(< x, γ >).

69

But

Φ(x) =∑

γ∈kn ϕ(γ + α)

from which the theorem follows.

Corollary 3.3.1:∑

γ∈kn ϕ(x+ γ) =∑

γ∈kn ϕ(γ)χ(< x, γ >).

Proof : Follows from taking the Fourier transform of both sidesin Theorem 3.3.1.

3.4 f = x21 + ...+ x2

n over R

We elucidate the process of how one calculates S(ν) with an exam-ple. Let us take R as our field k and let

f = x21 + ...+ x2

n

be our polynomial over R. Moreover, let

χ(x) = e−2πix = e(x).

Additionally, the Schwartz space S(R) is defined by

S(R) = ϕ : R→ C : ϕ is infinitely differentiable,supx∈R|xβ( d

dx)αϕ| <∞ ∀ α, β ∈ Z+.

In the simplest version, we see what happens when n = 1; hence,f(x) = x2. In this example, we will let

ϕ(x) = e−πx2.

It is clear that ϕ ∈ S(R). One knows that

ϕ(x) = ϕ(x) = e−πx2.

Now, we write explicitly Gϕ:

70

Gϕ(ξ) =∫

R ϕ(x)χ(f(x)ξ)dx

=∫∞−∞ e

−πx2(1+2ξi)dx

= 2∫∞

0e−πx

2(1+2ξi)dx.

Let x′ = x2. Then

Gϕ(ξ) =∫∞

0e−πx

′(1+2ξi)(x′)12−1dx′.

Note the similarity of the above to the Gamma function Γ(s). Infact, let us express the integral above as∫∞

0e−(p+qi)xxs−1dx.

Then

p = π,q = 2πξ,s = 1

2.

To compute this integral, then, we integrate over the complex plane.For ease of notation, let

r =√p2 + qi

and let

g(z) = e−rzzs−1.

We will define our contour of integration using

tanϕ = qp

where 0 ≤ ϕ ≤ π2

(i.e. ϕ is the principal branch of tan−1( qp)).

Then, integrating over the contour in Figure 1, if we let C denotethe path which goes through ABB′A′A then, by Cauchy’s IntegralFormula,

0 =∫Cg(z)dz = e−iϕπΓ(s)

rs.

71

Figure 1: Contour of integration

72

where the latter step follows from much non-trivial computation.Thus,

Gϕ(ξ) = e−πx2(1+2ξi)dx =

e−iϕ2 Γ( 1

2)

√π(1+4ξ2)

14

= e−iϕ2

(1+4ξ2)14

.

Recall that assumption (A.2) required that Gϕ ∈ L1(kmA ). Clearly,in order for this to be true, we must have that Gϕ ∈ L1(R) (wherem = 1). But, for sufficiently large ξ,

|Gϕ(ξ)| = 1

(1+4ξ2)14> 1

1+ξ

which means that∫∞c|Gϕ(ξ)|dξ >

∫∞cdξ 1

1+ξ= log |1 + ξ|]∞c .

So the integral diverges.However, let us define

f(x) = x21 + ...+ x2

n.

We can use the steps above to determine how large n must be inorder for Gϕ(ξ) ∈ L1(R). First, we must define the Schwartz spacefor Rn:

f ∈ S(Rn) = f ∈ C∞(R) : supx∈Rn|xβDα(f(x))| < ∞ ∀α ∈ Zn

+, β ∈ Z+

where

α = (α1, ..., αn)

and

Dαf = ∂α1+...+αn

∂xα11 ...∂xαnn

.

As before, we let

ϕ(x) = e−π|x|2.

73

Note that, in this case,

e−π|x|2

= Πnj=1e

−πx2j = e−πf(x).

Clearly, ϕ(x) ∈ S(Rn).

So

Gϕ(ξ) =∫

Rn ϕ(x)χ(f(x)ξ)dx

=∫

Rn e−πf(x)e(f(x)ξ)dx

= (∫

R e−πx2−2πix2ξdx)n

= e−ni2

tan−1(2ξ) 1

(1+4ξ2)n4

.

where the last step comes from our earlier computation that∫R e−πx2−2πix2ξdx

= e−iϕ2

(1+4ξ2)14

= e−i tan−1(2ξ)

(1+4ξ2)14

.

It is easy to see from this computation that

Gϕ(ξ) ∈ L1(R) ⇔ n ≥ 3.

Thus, for the remainder of the computation, we will assume thatn ≥ 3. This will allow us to make frequent use Fubini’s theorem.Recall that

S(ν) =∫

R Gϕ(ξ)χ(ξν)dξ.

Plugging in the definitions chosen for our example,

S(ν) =∫

R

∫Rn e

−π|x|2−2πi|x|2ξe2πiξνdxdξ

=∫

Rn e−π|x|2 ∫

R e2πiξν−2πi|x|2ξdξdx.

Let us define

Jε =∫

R e2πiξν−2πi|x|2ξ−π(εξν)2dξdx,

74

Iε =∫

Rn e−π|x|2 · Jεdx.

Then

S(ν) = limε→0 Iε.

We examine Jε. Set

u = εξνdu = ενdξ.

Then

Jε = 1εν

∫R e

πu2+ 2πiuεν

(|x|2−ν)du.

Now, by our choice of ϕ, we recall that ϕ = ϕ, i.e.∫R e−πX2−2πiXY = e−πY

2.

In our case, we let

X = uY = 1

εν(|x|2 − ν).

Then

Jε = 1ενe−πY

2= 1

ενe−π (|x|2−ν)2

(εν)2 .

So

Iε = 1εν

∫Rn e

−π|x|2e−π (|x|2−ν)2

(εν)2 dx.

This can be more easily evaluated by changing to hypersphericalcoordinates. Set

x1 = t cos(φ1),x2 = t sin(φ1) cos(φ2),x3 = t sin(φ1) sin(φ2) cos(φ3),

75

· · ·xn−1 = t sin(φ1) · · · sin(φn−2) cos(φn−1),xn = t sin(φ1) · · · sin(φn−2) sin(φn−1),

where the φ’s range over the (n− 1)-dimensional unit sphere, and tranges from 0 to ∞. Then

dx = tn−1 sinn−2(φ1) sinn−3(φ2) · · · sin(φn−2), dt, dφ1, dφ2 · · · dφn−1

= tn−1dtdω,

where∫Sn−1 dω = π

n2

Γ(n2

+1),

the volume of the n− 1-dimensional sphere. So

Iε = 1εν

∫Sn−1

∫∞0e−πt

2e−π (t2−ν)2

(εν)2 tn−1dtdw.

We change variables once more, letting

y = t2−νεν

,ενdy = 2tdt.

SoIε =

∫Sn−1 dw

∫∞− 1εe−π(yε+ν)e−πy

2 tn−2

2dy

= 12

∫Sn−1 dw

∫∞− 1εe−π(yε+ν)−πy2(yε+ ν)

n−22 dy

= πn2

2Γ(n2

+1)

∫∞1εe−π(yε+ν)−πy2(yε+ ν)

n−22 dy.

If we take the limit as ε→ 0, we get

S(ν) = limε→0 Iε = πn2

2Γ(n2

+1)

∫∞−∞ e

−π(ν)−πy2(ν)n−2

2 dy

= πn2

2Γ(n2

+1)e−πν(ν)

n−22

∫∞−∞ e

−πy2dy.

But∫∞−∞ e

−πy2dy = 1

76

is the Gauss distribution integrated over all R. Thus,

S(ν) = πn2

2Γ(n2

+1)e−πνν

n−22 .

3.5 f(x) = x21 + ...+ x2

n over Qp

Next, we study the same computation over the p-adic fields. In thiscase, we will define ϕ(x) to be the characteristic function of Zn

p . Asbefore, this allows ϕ to be in S(Qn

p ), the Schwartz space over Qnp .

Additionally, a character of Qp is defined as follows:

χp(x) = e2πix

where x is the fractional part of the p-adic expansion of x ∈ Q+p .

Notationally, this means that ∃ l, r such that l ≥ 0, 0 ≤ r ≤ pl − 1,and

x = rpl

.

Throughout this section, we will use χp and χ interchangeably ifthe context is clear.

This definition of character allows us to define our various terms.We begin with Gϕ;

Gϕ(ξ) =∫

Qnpϕ(x)χ(f(x)ξ)dx

=∫

Znpϕ(x)χ(f(x)ξ)dx, ξ ∈ Qp.

This means that the p-adic singular series, denoted Sp(ν), is de-fined as

Sp(ν) =∫

Qp Gϕ(ξ)χ(νξ)dξ.

Now, we first determine for which n we have Gϕ(ξ) ∈ L1(Qp). Todo this, let us define

Il = 1pl

Zp,

Jl = plZp.

There are three important properties about these sets. First, we

77

note that the inclusion of the sets is given by

... ⊂ Jl ⊂ ...J1 ⊂ J0 = Zp = I0 ⊂ I1 ⊂ ... ⊂ Qp.

Second,

Il/Zp∼= Zp/Jl = Zp/p

lZp = Z/plZ,

Finally, we can write Il as

Il =⋃pl−1r=0 (Zp + r

pl)

where the above union is disjoint. From the first and third proper-ties, we can write:∫

Qp Gϕ(ξ)dξ = liml→∞∫IlGϕ(ξ)dξ

= liml→∞∑pl−1

r=0

∫Zp+ r

plGϕ(ξ)dξ

= liml→∞∑pl−1

r=0

∫Zp+ r

plGϕ( r

pl)dξ.

From this, it follows that∫Qp |Gϕ(ξ)|dξ = liml→∞

∑pl−1r=0

∫Zp+ r

pl|Gϕ( r

pl)|dξ.

Therefore, it remains to evaluate Gϕ( rpl

). From the second property

above, we can write the representatives of the coset group Znp/J

nl as

a = (a1, ..., an), 0 ≤ ai ≤ pl − 1,

which means that

Znp =

⋃a(J

ml + a).

Then

Gϕ( rpl

) =∫

Znpχ(f(x) r

pl)dx

=∑

a∈Znp/Jnl

∫a+Jnl

χ(f(x) rpl

)dx.

Now, since elements of Jnl can be written as ply for some y ∈ Znp , it

78

follows that, for x ∈ a+ Jnl ,

f(x) = f(a+ ply) = f(a) + plz ≡ f(a) (mod Jl).

for some z ∈ Znp . So∑

a∈Znp/Jnl

∫a+Jnl

χ(f(x) rpl

)dx =∑

a∈Znp/Jnl

∫a+Jnl

χ(f(a) rpl

)χ(rz)dx

=∑

a∈Znp/Jnlχ(f(a) r

pl)∫Jnldx

= p−nl∑

a∈Znp/Jnlχ(f(a) r

pl)

= p−nl∑

a∈Zn/Jnle

2πif(a) rpl .

Now, let us denote

cl =∫IpGϕ(ξ)dξ =

∑pl−1r=0 Gϕ(ξ)

= p−nl∑pl−1

r=0

∑a∈Znp/Jnl

χ(2πif(a) rpl

)

= p−nl∑

a∈Znp/Jnl

∑pl−1r=0 χ(2πif(a) r

pl).

If we define

ψa(r) = χ(f(a) rpl

)

then

∑pl−1r=0 ψa(r) =

pl if ψa = 1,

0 otherwise.

So

p−nl∑

a∈Znp/Jnl

∑pl−1r=0 χ(2πif(a) r

pl)

= p−nl∑

a∈Znp/Jnl ,f(a)≡0 (mod p) pl

= p−ln+l∑

a∈Znp/Jnl ,f(a)≡0 (mod pl) 1.

Let us put

Apl(f) = #a ∈ (Zp/plZp)

n : f(a) ≡ 0 (mod pl).

Then

79

cl =Apl

(f)

p(n−1)l .

Now, we note that

Gϕ( rpl

) = p−ln∑

a e2πi

plf(a)r

= (p−l∑

a e2πi

pla2r

)n.

Let us define

(3.5.1) G( rpl

) = p−l∑

a e2πi

pla2r

= p−l∑pl−1

a=0 e2πi

pla2r

.

Note that G is the classical Gauss sum. So

Gϕ( rpl

) = G( rpl

)n.

Now, we evaluate G( rpl

):

|G( rpl

)|2 = G( rpl

)G( rpl

)

= p−2l∑pl−1

a=0

∑pl−1b=0 e

2πia2−b2

plr

= p−2l∑pl−1

a=0

∑pl−1b=0 e

2πi(a−b)(a+b)

plr.

If we substitute c = a− b then the above is equal to

p−2l∑pl−1

c=0

∑pl−1b=0 e

2πic(c+2b)

plr

= p−2l∑pl−1

c=0 e2πi c

2

plr∑pl−1

b=0 e2πi

c(2b)

plr.

Let

ψ(b) = e2πi

c(2b)

plr.

Since ψ(b) is a character mod pl, it follows that

∑pl−1b=0 ψ(b) =

pl if ψ(b) = 1 ∀ b,0 otherwise.

Now,

80

ψ(b) = 1 ∀ b ⇔ 2crb ≡ 0 (mod pl) ∀ b⇔ 2cr ≡ 0 (mod pl).

If p - r and p 6= 2 then this is equivalent to

c ≡ 0 (mod pl),

which means that

|G( rpl

)|2 = p−2lpl = p−l

and hence

(3.5.2) |G( rpl

)| = p−l2 .

If p = 2 and 2 - r then

c ≡ 0 (mod 2l) or c ≡ 0 (mod 2l−1).

So

|G( r2l

)|2 = 2−l∑

c≡0 (mod 2l−1)e2πi c

2

2lr

= 2−l(1 + e2πi 22l−2

2lr)

= 2−l(1 + e2πi(2l−2)r).

Note that

e2πi(2l−2)r =

1 if l ≥ 2,

−1 if l = 1.

So

(3.5.3) |G( r2l

)| =

2−

l−12 if l ≥ 2,

0 if l = 1.

Thus, we arrive at the final expression for |G( rpl

)|. For 1 ≤ r ≤ pl, let

v(r) = maxk ∈ Z : pk|r.

81

Then

|G( rpl

)| =

p−

l−v(r)2 if p 6= 2,

2−l−v(r)−1

2 if l − v(r) ≥ 2, p = 2

0 if l − v(r) = 1, p = 2,

1 if l = v(r), p = 2.

Now, we recall that |Gϕ( rpl

)| = |G( rpl

)|n and

Gϕ ∈ L1(Qp) ⇔ liml→∞∑pl

r=1 |Gϕ( rpl

)| <∞.

Case 1: Assume p 6= 2. Then let

sl =∑pl

r=1 |Gϕ( rpl

)| =∑pl

r=1 p−n(

l−v(r)2

) = p−nl2

∑lµ=0 aµp

nµ2 ,

where

aµ = #r, 1 ≤ r ≤ pl : v(r) = µ.

Clearly,aµ = φ(pl−µ) = pl−µ−1(p− 1),

if µ < l, and a = 1 when µ = l; here, φ is the Euler phi func-tion. So

sl = p−nl2

∑lµ=0 aµp

nµ2 = 1 + p−

nl2

+l−1(p− 1)∑l−1

µ=0 p(n2−1)µ

= 1 + (1− 1p)(1−p−(n2−1)l

pn2−1 ).

From this, we conclude that

liml→∞ sl =

∞ if n = 1, 2,pn2 −1

pn2 −p

if n ≥ 3.

Case 2: Let p = 2. The work, which is analogous to the previouscase, is left as an exercise. Instead, we merely state the results

82

liml→∞ sl =

∞ if n = 1, 2,

2n2 −1

2n2−1−1

if n ≥ 3.

Thus, from the work above, we see that∫Qp Gϕ(ξ)dξ = liml→∞ sl,

which implies that L1(Qp) for n ≥ 3.

3.6 Determination of G( rpl

)

The determination of G( rpl

) is a calculation which has a long mathe-

matical history; as was noted in the previous chapter, its beginningscan be traced back to Gauss. However, we will use the Siegel nota-tion, where, for a, b ∈ N, we define

G(a, b) = 1√b

∑bh=1 e

πiabh2+πiah

(cf. p. 50 C.L Siegel, Analytische Zahlentheorie II, Gottingen1963/64). Recall that, in our case, we had defined

G( rpl

) = p−l∑pl

x=1 e2πi

plx2r

.

This is nothing more than the special case that a = 2r, b = pl,i.e.

G(2r, pl) = pl2G( r

pl).

More generally, though, we will use only the assumption that, fora, b ∈ N, λ ∈ Q, we have the relation that

ab+ 2aλ ≡ 0 (mod 2).

Next, we let z ∈ C and define

f(z) = eπiab(z+λ)2

e2πiz−1.

Note that the numerator is never zero, and the denominator hasa pole at every integer h (and at no other points). We show that

83

the poles are all of order 1:

Resz=hf(z) = limz→h(z − h)f(z)

= eπiab

(h+λ)2 limz→h(z−h)e2πiz−1

= eπiab

(h+λ)2 limz→h1

2πie2πiz

= eπiab

(h+λ)2 12πi

where the penultimate step is by L’Hospital’s rule. Thus, if wehave a contour C which contains the integers 1, 2, ..., b (and no oth-ers) then∫

Cf(z)dz = 2πi

∑bh=1Resz=hf(z) = 2πi

∑bh=1Resz=he

πiab

(z+λ)2 .

In particular, we will take C to be the contour in Figure 2, whereL is the line which goes through the real axis at a 45 angle. Notethat as the horizontal lines go toward infinity, the integrals overthese lines go to zero, since, if we let z = x+ iy, the real part of theexpression above is determined by i(iy2) = −y2, which goes to zeroas y →∞. So∫

Cf(z)dz =

∫L(f(z + b)− f(z))dz

=∫Leπi

ab(z+λ)2

e2πiz−1[eπi

ab

(2b(z+λ)+b2) − 1]dz.

Now,

ab(2b(z + λ) + b2) = 2aλ+ ab+ 2az ≡ 2az (mod 2)

by our assumption earlier, which means that

eπiab

(2b(z+λ)+b2) = e2πiaz.

Then∫Cf(z)dz =

∫Leπi

ab

(z+λ)2 [ eπi ab(2b(z+λ)+b2)−1e2πiz−1

]dz

=∫Leπi

ab

(z+λ)2 [∑a−1

h=0 e2πihz]dz

=∑a−1

h=0

∫Leπi

ab

(z+λ)2+2πihzdz

=∑a−1

h=0 eπi bah2−2πihλ

∫Leπi

ab

(z+λ+ bah)2dz.

84

Figure 2: Contour of integration

85

Here, the first step comes from the expansion

xa−1x−1

=∑a−1

h=0 xh.

Let us define

Z = z + λ+ bah.

Then the line L instead shifts to another parallel line which wecan denote L0, i.e.∑a−1

h=0 eπi bah2−2πihλ

∫L0eπi

abZ2dZ.

Let L1 be the line parallel to L0 which goes through the origin.Since the integrand above has no poles, it follows from Cauchy’sTheorem that we can shift the contour of integration from L0 to L1.The integral in the above expression is then∫

L1eπi

abZ2dZ.

We make the change of variables

Z =√

ibat

dZ = −√

ibadt.

Note that the integral over L1, like the integrals over L and L0,goes downward along the line. So the change of measure allows usto change the orientation of the integral, i.e.∫

L1eπi

abZ2dZ =

√iba

∫∞−∞ e

−πt2dZ =√

iba

.

Thus, in summary,∫Cf(z)dz =

∑bh=1 e

πiab

(h+λ)2

=√

iba

∑a−1h=0 e

−πi bah2−2πihλ

=√

iba

∑ah=1 e

−πi bah2−2πihλ

86

where the shift in parameter in the last step is allowed because

ab+ 2aλ ≡ 0 (mod 2).

We arrive, then, at the theorem

Theorem 3.6.1:√

1b

∑bx=1 e

πiab

(x+λ)2 =√

ia

∑ax=1 e

πi bax2−2πixλ.

As a corollary, we can prove the special case of the sum which Gaussconsidered:

Corollary 3.6.1: Let a = 2, λ = 0, b = n ∈ N. Then

1√n

∑nx=1 e

2πix2 1n =

1 + i if n ≡ 0 (mod 4),

1 if n ≡ 1 (mod 4),

0 if n ≡ 2 (mod 4),

i if n ≡ 3 (mod 4).

Proof : We know that

√i = e

iπ4 = 1+i√

2.

Then

1√n

∑nx=1 e

2πix2 1n = 1√

2

(1+i)√2

(1 + e−πin2 ).

Naturally, e−πin2 depends on n. In particular,

1 + e−πin2 =

2 if n ≡ 0 (mod 4),

1− i if n ≡ 1 (mod 4),

0 if n ≡ 2 (mod 4),

1 + i if n ≡ 3 (mod 4).

from which the corollary follows.

Now, we can derive a useful theorem about Legendre characters.

For an odd prime p, let us choose a ζ ∈ F+p such that

87

F+p =< ζ >.

We define

S(ζ) =∑

x∈Fp ζx2

= 1 +∑

x∈F×p ζx2

= 1 +∑

x<0 ζx2

+∑

x>0 ζx2

,

where an element in F×p can be described as being > 0 if it is be-

tween 0 and p−12

or < 0 otherwise. Then the above is

= 1 + 2∑

x>0 ζx2

= 1 + 2∑

y∈(F×p )2 ζy.

If we define

0 = T (ζ) =∑

y∈Fp ζy

= 1 +∑

y∈F×p ζy

= 1 +∑

y∈(F×p )2 ζy +

∑y∈F×p ,y 6∈(F×p )2 ζ

y

then we can express S(ζ) as

S(ζ) = S(ζ)− T (ζ) =∑

y∈(F×p )2 ζy −

∑y∈F×p ,y 6∈(F×p )2 ζ

y

=∑

y∈Fp(yp)ζy,

where (yp) is the Legendre symbol with (y

p) = 0 if p|y. Thus, we

arrive at the following:

Theorem 3.6.2:∑

y∈Fp(yp)ζy =

∑p−1x=0 ζ

x2.

Now, if p - r then we can replace ζ by ζr. Then, by the theo-rem,∑p−1

x=0 ζrx2

=∑

x∈Fp(xp)ζrx

= ( rp)∑

x∈Fp(rxp

)ζrx

= ( rp)∑p−1

x=0 ζx2

.

88

If we let

ζ = e2πip

then the above string of equalities tells us that

G( rp) = ( r

p)G(1

p)

where

G(1p) =

√p p ≡ 1 (mod 4),

i√p p ≡ 3 (mod 4).

3.7 Gauss Sums (Classical)

Gauss sums have been studied by many mathematicians throughouthistory. Here, we give a brief history of the notation used by thevarious mathematicians who studied Gauss sums. We begin withDirichlet, who defined the following sum:

φ(h, n) =∑n

x=1 e2πix2 h

n ,

where h ∈ Z and n ∈ N. His student, Paul Bachmann, deviseda notation which could facetiously be referred to as a generalizationof his teacher’s:

ϕ(m,n) =∑n

x=1 eπix2m

n

where m ∈ Z and n ∈ N. Note, then, that

ϕ(2h, n) = φ(h, n).

A variant of this notation was also used by Hardy; he let

Sh,n = φ(h, n).

On the other hand, Gauss (and later Siegel) used the notation

89

G(a, b) = 1√b

∑bx=1 e

πiabx2

.

It is worth noting that Siegel, not Gauss, chose the letter G, asGauss did not name the sums after himself. So

G(2r, n) = 1√nφ(r, n) = 1√

nϕ(2r, n).

In particular,

G(2r, pl) = p−l2φ(r, pl) = p

l2G( r

pl) = p−l

∑pl

x=1 e2πix2r

pl .

where 1 ≤ r ≤ pl.Let us stay with Bachmann’s notation for ϕ. Then we state the

following fifteen properties, most of whose proofs are left as an ex-ercise to the reader.

(1) If m ≡ n ≡ 1 (mod 2) then ϕ(m,n) = 1.

For the remaining properties, we will assume that mn ≡ 0 (mod2).

(2) ϕ(m, 1) = 1 ∀ m ∈ Z.

(3) ϕ(m, 2) = 1 + im ∀ m ∈ Z.

(4) ϕ(m′, n) = ϕ(m,n) if m′ ≡ m (mod 2n).

(5) Let m = 2µ and (k, n) = 1. Then ϕ(k2m,n) = ϕ(m,n).

(6) Let m = 2µ and (n′, n) = 1. Then ϕ(mn′, n)ϕ(mn, n′) =ϕ(m,nn′).

(7) Let h ∈ N. Then ϕ(hm, hn) = hϕ(m,n).

The next several properties are in the specific case where n is apower of a prime, often 2.

(8) Assume µ is odd and α ≥ 2. Then ϕ(2µ, 2α) = 2∑2α−1−1

x=0 eπix22 µ

2α .

90

(9) With notations as above and α ≥ 4, ϕ(2µ, 2α) = 2ϕ(2µ, 2α−2).

(10) If α = 2 then ϕ(2µ, 22) = 2(1 + iµ).

(11) If α = 3 then ϕ(2µ, 23) = 4πi4 µ.

(12) For given n and p, let αp be such that pαp ||n and P = npαp

.

Then ϕ(2µ, n) = Πp|nϕ(2µP, pαp).

(13) If p 6= 2 then

ϕ(2µ, 2α) =

pα2 if α ≡ 0 (mod 2),

pα−1

2 ϕ(2µ, p) if α ≡ 1 (mod 2).

(14) If µ is odd then

ϕ(2µ, 2α) =

2α2 (1 + iµ) if α ≡ 0 (mod 2),

2α+1

2 iµ2 if α ≡ 1 (mod 2).

Now, we apply this to our specific case of

Sp(ν) =∫

Qp Gϕ(ξ)χp(ξν)dξ

= liml→∞∫IlGϕ(ξ)χp(ξν)dξ

= liml→∞∑pl

r=1G( rpl

)ne−2πi r

plν.

We can use Dirichlet notation to write

G( rpl

) = p−lϕ(2r, pl)

We can let r = ptr′, p - r′. Then, if we use (7), letting m = 2r′,n = pl′ where l′ = l − t, and h = pt, we find

ϕ(2r, pl) = ptϕ(2r′, pl′)

and thus

G( rpl

) = G( r′

pl′).

Thus, we can assume that p - r.

91

Now, we apply (13) to Gϕ. In our case, we have µ = r and α = l.For p 6= 2,

G( rpl

) = p−l ·

pl2 if l ≡ 0 (mod 2),

pl−12 ϕ(2r, p) if l ≡ 1 (mod 2).

Of course, we know that

ϕ(2r, p) =

( rp)p

12 if p ≡ 1 (mod 4),

i( rp)p

12 if p ≡ 3 (mod 4).

Combining this information,

G( rpl

) =

p−

l2 if l ≡ 0 (mod 2),

( rp)p−

l2 if l ≡ 1 (mod 2), p ≡ 1 (mod 4),

i( rp)p−

l2 if l ≡ 1 (mod 2), p ≡ 3 (mod 4).

We note that this is easily evaluated in the case of 4|n:

(15) (p 6= 2) If n ≡ 0 (mod 4) then G( rpl

)n = p−ln2 .

Next, we consider when p = 2. So we will apply (14). Since

G( r2l

) = 2−lϕ(2r, 2l)

and

ϕ(2r, 2l) =

2l2 (1 + ir) if l ≡ 0 (mod 2),

2l+12 i

r2 if l ≡ 1 (mod 2),

it follows that

G( r2l

) =

2−

l2 (1 + ir) if l ≡ 0 (mod 2)

2−l−12 i

r2 if l ≡ 1 (mod 2)

=

2−

l−12

1+ir√2

if l ≡ 0 (mod 2),

2−l−12 i

r2 if l ≡ 1 (mod 2).

This gives us a proposition similar to (15) above:

(15) (p = 2) If n ≡ 0 (mod 8) then G( r2l

)n = 2−( l−12

)n.

92

Now, recall that

Sp(ν) = liml→∞∑pl

r=1 G( rpl

)ne−2πi r

plν.

We can expand

r = atpt + ...+ al−1p

l−1 = ptu,

with 0 ≤ ai ≤ p− 1, at 6= 0, and

u = at + ...+ al−1pl−1−t.

and thus p - u. So we can rewrite Sp as

Sp(ν) = liml→∞∑l

t=0

∑0≤u<pl−t,p-uG(p

tupl

)ne−2πi u

pl−tν.

If, as before, we let Sl be the double sum above (where Sp =liml→∞ Sl) then, by (15), we have

Sl =

∑lt=0 p

− l−t2n∑

0≤u<pl−t,p-u e−2πi u

pl−tν

if p 6= 2,∑lt=0 2−

l−t−12

n∑

0≤u<2l−t,2-u e−2πi u

2l−tν if p = 2.

To better understand Sl, we review the theory of Ramanujan sums.Let us define the ring

< = f : N→ C

with operations (+, ∗), where ∗ is Dirichlet convolution, which isdefined by

f ∗ g(n) =∑

d|n f(d)g(nd).

< is then a commutative ring with identity e = 1< defined by

e(n) =

1 if n = 1,

0 if n ≥ 2.

So

93

f ∗ e = e ∗ f = f .

Moreover, we can define

f ′(n) = f(n) log n

which gives us

(f ∗ g)′ = (f ′ ∗ g) + (f ∗ g′).

Note also that this is a unique factorization domain. Additionally,we define in the natural way

<× = f : ∃ g such that f ∗ g = e.

We claim that

<× = f : f(1) 6= 0.

To prove this claim, we note that if f ∈ <×, ∃ g such that

f ∗ g(n) =∑

d|n f(d)g(nd) =

1 if n = 1,

0 if n ≥ 2.

If n = 1, f(1)g(1) = 1. If n > 1,

0 =∑

d|n f(nd)g(d) = f(1)g(n) +

∑d|n,d<n f(n

d)g(d).

So

g(n) = − 1f(1)

∑d|n,d<n f(n

d)g(d).

Thus, f(1) 6= 0 if and only if such a g exists.Next, let us define the functions η, ε ∈ < be such that

η(n) = 1 ∀ n ∈ N.ε(n) = n.

94

Additionally, we define ϕ and µ to be the classical Euler and Mobiusfunctions, respectively:

ϕ(n) = #(Z/nZ)×,

µ(n) =

1 if n = 1,

0 if ∃ p such thatp2|n,(−1)k if n = p1p2...pk is square-free.

With these definitions, we introduce the following properties:

(1)∑

d|n µ(d) = e.

To prove this, note that∑d|n µ(d)

= µ(1) + µ(p1) + ...µ(pk) + µ(p1p2) + ...+ µ(pk−1pk)+...+ µ(p1....pk)

=∑k

i=1(−1)i(ki

)= (1− 1)k

= 0.

(2) µ ∗ η = e.

This is true because µ ∗ η(n) =∑

d|n µ(d).

(3) ϕ ∗ η = ε.

This follows from the fact that∑d|n ϕ(d) = n = ε(n).

Note that (2) and (3) imply that µ−1 = η and

ϕ = ε ∗ η−1 = ε ∗ µ.

(4) ∀ f, g ∈ <, f = g ∗ µ−1 ⇔ g = f ∗ µ.

In addition to the above properties, it is clear that ε, η, ϕ, µ ∈ <×. So

95

< ε, η, ϕ, µ >⊂ <×.

By the relations from (2) and (3), we note that

< ε, η, ϕ, µ >=< ϕ, µ >.

The above information can be related to our singular series usingthe aforementioned Ramanujan sum ck, which is defined as

ck(n) =∑

m (mod k),(m,k)=1 e2πimn/k.

We will prove the following theorem about these sums

Theorem 3.7.1: ck(n) =∑

d|(n,k) dµ(kd).

As an obvious consequence of the theorem, we have the followingcorollaries:

Corollary 3.7.1: ck(1) = µ(k).

Corollary 3.7.2: If k|n then

ck(n) =∑

d|k dµ(kd) = ε ∗ µ(k) = ϕ(k).

We note that ck, k ≥ 1, k square-free generates a subgroup of<×. Additionally, any ck has period k, i.e. ck(k + n) = ck(n) ∀ k.In fact, we have the following theorem about periodic functions:

Theorem 3.7.2: If f ∈ < has period k then ∃ g with period k suchthat

f(m) =∑k−1

n=0 g(n)e(mnk

),

where

g(n) = 1k

∑k−1m=0 f(m)e(mn

k).

This theorem can be easily verified and lends itself to a more im-

96

portant theorem:

Theorem 3.7.3: Let f, g ∈ < and

Sk(n) =∑

d|(n,k) f(d)g(kd),

where Sk is periodic with period k. Then

Sk(n) =∑

m (mod k)ak(m)(mn

k),

with

ak(m) = 1k

∑n (mod k)

Sk(n)e(mnk

)

= 1k

∑kn=1

∑d|n,d|k f(d)g(k

d)e(mn

k).

Note that, for given d, we can write n = cd, where 1 ≤ c ≤ kd.

Additionally, we can make the substitution d→ kd. So∑k

n=1

∑d|n,d|k f(d)g(k

d)e(mn

k) =

∑d|k f(k

d)g(d)

∑dc=1 e( cm

d)

=∑

d|k,d|m f(kd)g(d)d.

where the last line comes from the fact that

∑dc=1 e( cm

d) =

d if d|m,0 otherwise.

Thus, to prove Theorem 3.7.1, we take the special case of f(k) = k,g(k) = µ(k). So

Sk(n) =∑

d|(n,k) f(d)g(kd)

=∑

m (mod k ak(m)e(mnk

).

where

ak(m) = 1k

∑d|(m,k)

kdµ(d)d

=∑

d|(m,k) µ

=

1 if (m, k) = 1,

0 otherwise..

97

Now, recalling that

Sl =

∑lt=0 p

− l−t2n∑

0≤u<pl−t,p-u e−2πi u

pl−tν

if p 6= 2,∑lt=0 2−

l−t−12

n∑

0≤u<2l−t,2-u e−2πi u

2l−tν if p = 2,

we can use the fact that

ck(n) =∑

m (mod k),(m,k)=1e2πimn/k =

∑d|(n,k) dµ(k

d).

to rewrite Sl as

Sl =

1 +

∑lλ=0 p

−λn2

∑d|(ν,pl) dµ(p

λ

d) if p 6= 2,∑l

λ=0 2−(λ−1)n

2

∑d|(ν,2λ) dµ(2λ

d) if p = 2.

Now, we can examine the specific case of ν = 1. In this case, d = 1.So

∑d|(ν,pλ) dµ(2λ

d) =

−1 if λ = 1,

0 if λ ≥ 2,

and hence

Sl =

1− p−n2 if p 6= 2,

2−n2 − 1 if p = 2.

Note that Sl is independent of l. So

Sp(1) = liml→∞ Sl = Sl.

Thus, we have

Πp6=∞Sp(1) = 2n2

ζ(n2

),

S∞(1) = πn2

Γ(n2

+1)· 1

2e−π.

Thus, for 8|n, we have

S(1) = ΠvSv(1) = 2n2−1π

n2 e−π

Γ(n2

+1)ζ(n2

).

Recalling that

98

ζ(2k) = 22k−1Bk(2k)!

π2k

where Bk is the k-th Bernoulli number, the above is then

S(1) = e−πn!Γ(n

2+1)Bn

4

.

In the case where n = 8,

S(1) = e−π8!Γ(5)B2

= e−π 8!·304!

= 50400e−π.

3.8 Estimate of Gauss Sums (Local Fields)

The goal of this section is to show that the definition from this chap-ter for S(t) gives the same estimate for N(t) as does the definitionfrom the previous chapter. As such, we return to the original nota-tion; let K be a local field, where

K ⊃ O ⊃ P = (π) = Oπ

are as before. Additionally, let

κ = O/P ∼= Fq

for

q = pe.

We require a measure dx such that∫O dx = 1.

Let χ be a basic character of K. Recalling the definition of ordχ, given by

P−ord χ = x ∈ K : χ(xy) = 1 ∀ y ∈ O,

we know from before that ord χ = 0.

99

Now, for a ∈ O, define

a ≡ a (mod P).

Then a ∈ κ. Let

χ(a)def= χ( a

π).

We check that χ is well-defined and non-trivial. First, to provethat this is well defined, we show that a = b implies χ(a) = χ(b).But

a = b⇔ a ≡ b (mod P)⇔ a− b = πc for some c ∈ O⇔ a

π− b

π∈ O

⇔ χ( aπ− b

π) = 1

⇔ χ( aπ) = χ(− b

π)−1 = χ( b

π).

where the penultimate step is because ord χ = 0.To show that χ is non-trivial, note that χ(a) = 1 ∀ a means that

χ( aπ) = 1 ∀ a ∈ O. Since ord χ = 0, this implies that 1

π∈ O, which

is a contradiction.Next, define

Gxδ(β) = q−1∑

α∈κ χ(αδβ)

where p - δ. If we let Gxδ denote Gfϕ where f(x) = xδ and ϕ is thecharacteristic function on O then we have the following relationship:

Proposition 3.8.1: Gxδ( ξπ ) = Gxδ(ξ).

Proof : To begin, let us denote A = a ∼= O/P to be a set ofrepresentatives of the cosets of O/P . Then

Gxδ( ξπ ) =∫O χ(xδ ξ

π)dx

=∑

a∈A∫a+P χ(xδ ξ

π)dx

=∑

a∈A χ(aδ ξπ)∫a+P dx

100

= q−1∑

a∈A χ(aδ ξ)

= q−1∑

α∈κ χ(αδ ξ)

= Gxδ(ξ).

Proposition 3.8.2: If β 6= 0 then

|Gxδ(β)| ≤√

2δq−12 =

√2qδ.

Proof : Put

σ =∑

η∈κ |Gxδ(η)|2

=∑

η∈κGxδ(η)Gxδ(η)

= q−2∑

α∈κ∑

γ∈κ∑

η∈κ χ((α2 − γ2)η)

= q−1#α, γ ∈ κ× κ : αδ ≡ γδ≤ q−1(q)δ= δ,

where the antepenultimate line comes from the fact that the sumover η is zero if the character is non-trivial and q otherwise. Soσ ≤ δ.

Next, for λ ∈ κ×, we define the action of λ on κ+ by

λ η = λδη.

We show that this action does not change G:

Lemma 3.8.1: G(λ η) = G(η).

Proof : G(λ η) = q−1∑

α∈κ χ(αδλδη)= q−1

∑α∈κ χ((αλ)δη)

= q−1∑

α∈κ χ(αδη)= G(η)

where the second-to-last step is because λ ∈ κ×. So let us denotean orbit of β under this action by λ β. Then

δ ≥ σ ≥∑

η∈λβ |G(η)|2 = |G(β)|2#λ β.

101

Now,

#λ β = |κ×|#λ∈κ×:λδβ=β = q−1

(δ,q−1).

So

δ ≥ |G(β)|2 · q−1(δ,q−1)

≥ |G(β)|2 · q−1δ

≥ |G(β)|2 · q2δ

since q ≥ 2. So

|G(β)|2 ≤ 2q−1δ2,

and hence

|G(β)| ≤√

2q−12 δ,

which proves the proposition.With this relation between G and Gf , we can use the bound on

the former to give us information about the latter. To do this, westart by letting θ ∈ S(K) be the characteristic function of O − Pand f(x) = xδ. Note that if f ′(x) = δxδ−1 = 0 then x = 0. Since0 6∈ supp θ, we see that Cxδ

⋂supp θ = ∅. Thus,

Gxδθ ∈ S(K).

This allows us to apply our knowledge of Kelvin’s principle. Weknow that ∃ lattices L,M ⊂ K such that L ⊃ M , supp Gxδθ ⊂ L,and Gxδθ is constant on each coset in L/M . We claim that

(∗) L = π−1O = P−1 ⊃M = O ⊃ P .

Now,

Gxδθ(ξ) =∫O−P χ(xδξ)dx.

To show (∗), it suffices to show that Gxδθ(ξ) = 0 for ξ outside L. So

102

Gxδθ(η) = 0 if η 6∈ P−1

⇔ η = πνξ for ν ≥ 2, ξ ∈ O − πO.

Now, let

A = (O −P)/Pν ,B = (O −P)/Pν−1,C = O/P .

So

A = a = b+ πν−1c, b ∈ B, c ∈ C.

Then

Gxδθ(π−νξ) =∫O−P χ(xδπ−νξ)dx

=∑

a∈A∫a+Pν χ(xδπ−νξ)dx

= q−ν∑

a∈A χ(xδπ−νξ).

But

aδ = (b+ πν−1c)δ

= bδ + δbδ−1πν−1c+

(δ2

)c2π2(ν−1)bδ−2 + ....

≡ bδ + δbδ−1πν−1c (mod Pν)

since ν ≥ 2 implies that 2(ν − 1) = 2ν − 2 ≥ ν. So

π−νaδξ ≡ π−νbδξ + δbδ−1π−1cξ (mod O),

which means that

χ(π−νaδξ) = χ(π−νbδξ)χ(δbδ−1π−1cξ).

Plugging this in for our expression above Gxδθ(π−νξ) above gives

Gxδθ(π−νξ) =∑

b∈B∑

c∈C χ(π−νbδξ)χ(δbδ−1π−1cξ)=∑

b∈B χ(π−νbδξ)∑

c∈C χ(δbδ−1π−1cξ)

103

=∑

b∈B χ(π−νbδξ)∑

γ∈κ χ(δbδ−1ξγ).

Now, p - δ, b 6∈ P , and ξ ∈ O − πO. So δbδ−1ξ ∈ O − P , whichmeans that∑

γ∈κ χ(δbδ−1ξγ) = 0,

which shows that Gxδθ(π−νξ) = 0 when ν ≥ 2.

We can combine this information about Gxδθ with that of Gxδ togive a bound for |Gxδθ|. To this end, we present the following theo-rem

Theorem 3.8.1: |Gxδθ(ξ)| ≤√

2δ|ξ|−1δ

K ∀ ξ ∈ K.

Proof : Let ϕ be the characteristic function of O. So

Gxδϕ(ξ) =∫O χ(xδξ)dx

=∫O−P χ(xδξ)dx+

∫P χ(xδξ)dx

= Gxδθ(ξ) +∫P χ(xδξ)dx.

Let

x = πy ∈ P

for y ∈ O. Hence

dx = q−1dy.

Plugging into the above gives

Gxδϕ(ξ) = Gxδθ(ξ) + q−1∫O χ(yδπδξ)dy

= Gxδθ(ξ) + q−1Gxδϕ(πδξ).

For notational expediency, let

G = Gxδϕ,G0 = Gxδθ.

104

So

G(ξ) = G0(ξ) + q−1G(πδξ)= G0(ξ) + q−1G0(πδξ) + q−2G0(π2δξ) + ...+ q−(N−1)G0(π(N−1)δξ)+q−NG(πNδξ).

Recall that χ(O) = 1 since ord χ = 0. So if ξ ∈ O then χ(ξ) = 1.Thus, we assume that ξ 6∈ O, and we let N ≥ 1 be the smallest Nsuch that πNδξ ∈ O. So

πNδξ = πν0ε, ε ∈ O×

for some 0 ≤ ν0 < δ. Then

π(N−1)δξ = πν0−δε,

and for r ≤ N − 1,

πrδξ = π−(N−r)δπNδξ= π−(N−r)δπν0ε= π−((N−r)δ−ν0)ε.

Case 1 : 0 ≤ r ≤ N − 2.

Then

(N − r)δ − ν0 > 2δ − δ = δ ≥ 2.

So

πrδξ = π−((N−r)δ−ν0)ε 6∈ P−1

⇒ G0(πrδξ) = 0.

Case 2 : r = N − 1.

Then

πrδξ = π−((N−r)δ−ν0)ε = π−(δ−ν0)ε.

105

Subcase 2.1 : δ − ν0 > 1.

In this case,

G0(π(N−1)δxi) = G0(π−(δ−ν0)ε) = 0.

Subcase 2.2 : δ − ν0 = 1.

Here,

G0(π(N−1)δξ) = G0(π−1ε)=∫O−P χ(xδ ε

π)dx

=∫O χ(xδ ε

π)dx−

∫P χ(xδ ε

π)dx

= G(π−1ε)−∫P χ(xδ ε

π)dx.

Now, if x ∈ P then xδ επ∈ O. So∫

P χ(xδ επ)dx = q−1.

Thus,

G0(π(N−1)δξ) = G(π−1ε)− q−1

= Gxδ(ε)− q−1.

So

G(ξ) =

q−NG(πNδξ) if δ − ν0 ≥ 2,

q−(N−1)(Gxδ(ε)− q−1) + q−NG(πNδξ) if δ − ν0 = 1,

=

q−N if δ − ν0 ≥ 2,

q−(N−1)Gxδ(ε) if δ − ν0 = 1.

since G(πNδξ) = 1. Now, we know that

|Gxδ(ε)| ≤√

2δq−12 .

So

106

|G(ξ)| ≤

q−N if δ − ν0 ≥ 2,

q−(N− 12

)√

2δ if δ − ν0 = 1.

Recall that πNδξ ∈ O. So

q−Nδ|ξ|K ≤ 1,

or

q−N ≤ |ξ|−1δ

K .

which proves the theorem for δ − ν0 ≥ 2. In the case of δ − ν0 = 1,we have

πNδξ = πν0ε.

Thus

q−(Nδ−ν0) ≤ |ξ|−1K

⇔ q−(N− ν0δ

) ≤ |ξ|−1δ

K .

But

N − ν0δ≤ N − δ−1

δ≤ N − 1

2.

So

q−(N− 12

) ≤ q−(N− ν0δ

) ≤ |ξ|−1δ

K .

if ξ 6∈ O. This proves the theorem.

3.9 Back to Fq(t)

Let us return now to the case where k = Fq(t) is a rational functionfield. Let χ be a basic character of k+

A such that

χ(x) = Πvχv(xv),

where x = (xv), xv ∈ kv, xv ∈ Okv ∀′ v, and χv ∈ k+v . As be-

107

fore, the order of χv is given by

P−ord χvv = x ∈ kv : χv(xy) = 1 ∀ y ∈ Ov.

Moreover, we put O = Fq[t], and for v = π 6=∞,

kv = kπ = x =∑

ν≥ν0 cνπν : cν ∈ O, cν0 6= 0 for x 6= 0,

deg cν < deg π, ν0 ∈ Z,|x|π = q−ν0π

where qπ = qdeg π. Additionally, for v =∞,

kv = k∞ = x =∑

ν≥ν0 cνt−ν : cν ∈ Fq, ν0 ∈ Z,

|x|∞ = q−ν0 .

Recall that the character at the infinite place was

χ∞(x∞) = ep(Tr(Res x∞))

where

ep(α) = e2πipα.

For non-infinite places, we had

k(π) = aπn, n ≥ 0, a ∈ O

χπ( aπn

) = ep(Tr(Res ( aπn

)))

with the diagram

kπ

AAAAAAAA

k(π)

CCCCCCCC Oπ

||||||||

O

0

108

Now,

ord χπ = 0ord χ∞ = −2.

Hence, if we use the Haar measure dx with∫kA/k

dx = 1

then

dχπxπ = dxπ,dχ∞x∞ = qdx∞,

and thus

dx = qΠvdxv.

We define f : kn → k such that

f(x) = xδ1 + xδ2 + ...+ xδn.

Using the same polynomial, one can define fv : knv → kv andfA : knA → kA. We will use fA and f interchangeably when thecontext is clear.

Now, in order to compute the singular series, we require a defi-nition for ϕ. Let

ϕ(x) = Πvϕv(xv),ϕv =the characteristic function of Onv .

In order for this ϕ to be admissible, we must ensure that the variousconditions for admissibility are satisfied; specifically, we check that

(A.1)A ϕ ∈ L1(knA),(A.2)A Gfϕ ∈ L1(kA)

hold, where

109

Gfϕ(ξ) =∫knAϕ(x)χ(f(x)ξ)dx.

For the first one,∫knAϕ(x) = Πv

∫knvϕv(xv)d

χvxv

=∫kn∞ϕv(x∞)dx∞Ππ

∫knπϕv(xπ)dχπxπ

=∫kn∞ϕv(x∞)qndx∞Ππ

∫knπϕv(xπ)dχπxπ

= qn.

For the second one, we can write

Gfϕ(ξ) = ΠvGfvϕv(ξv).

Now, it will aid in our computation to note that, for ξv ∈ Ov,

Gfvϕv(ξv) =∫Onπχπ(f(xπ)ξπ)dxπ = 1.

More generally, we can see that

Gfvϕv(ξv) =∫Onvχv(f(xv)ξv)dxv

= (∫Ov χv(x

δvξv)dxv)

n

= (Gxδ(ξv))n.

From Theorem 3.8.1, we know that for all but finitely many v,

|Gxδθ(ξv)| ≤√

2δ|ξv|− 1δ

v ,

or, equivalently

|Gfvϕv(ξv)| ≤ 2n2 δn|ξv|

−nδ

v .

We denote S to be the finite set of primes for which the theoremwas not proven. By definition, putting Gf = Gfϕv for simplicity,∫

kA|Gf (ξ)|dξ = Πv∈S

∫kv|Gf (ξv)|dξvΠv 6∈S

∫kv|Gf (ξv)|dξv.

Since S is a finite set, we see that the integral over kA is conver-gent by showing that the latter product on the right side converges

110

(cf. Theorem 2.5.2 (Kelvin’s Principle)). For v 6∈ S,∫kv|Gf (ξv)|dξv =

∫Ov |Gf (ξv)|dξv +

∫kv−Ov |Gf (ξv)|dξv

= 1 +∫kv−Ov |Gf (ξv)|dξv.

This means that

Πv 6∈S∫kv|Gf (ξv)|dξv = Πv 6∈S(1 +

∫kv−Ov |Gf (ξv)|dξv).

We will use the fact that, when an ≥ 0, Π(1 + an) converges ifand only if

∑an does as well. So convergence of the above is equiv-

alent to convergence of the sum∑v 6∈S∫kv−Ov |Gf (ξv)|dξv.

Now, we note from the theorem that∫kv−Ov |Gf (ξv)|dξv ≤ c

∫kv−Ov |ξv|

−nδ dξv.

We rewrite kv −Ov with

kv−Ov = (P−1v −Ov) + (P−2

v −P−1v ) + ...+ (P−νv −P

−(ν+1)v ) + ....

So ∫kv−Ov |ξv|

−nδ dξv =

∑∞ν=1 q

−nδν

v

∫P−νv −P

−(ν+1)v

dξv

=∑∞

ν=1 q−nδν

v (q−νv − q−(ν+1)v )

= (1− 1qv

)∑∞

ν=1 q(1−n

δ)ν

v

= (1− 1qv

) 1

qnδ−1

v

.

Let us absorb the 1− 1qv

’s into a constant c1. Then∑v 6∈S∫kv−Ov |Gf (ξv)|dξv ≤ c1

∑v 6∈S

1

qnδ−1

v

.

This converges if nδ− 1 > 1, which occurs if n > 2δ. This re-

striction on n is slightly weaker than the one we found in applyingthe classical circle method to rational function fields.

Now, the goal of this chapter is to show that the quantity

111

S(ν) = Gfϕ(ν) =∫kAGfϕ(ξ)χ(ξν)dξ,

derived from the methods of this chapter, is similar to the quan-tity S(ν) (p. 55), the classical singular series. Let

t = (T d, 1, 1, 1, 1, ....) ∈ k×A

where the first coordinate is the ”infinite” place and the others arethe finite ones. Additionally, we let

ρ(t) = tIn,ω(t) = tδ

where In denotes the n× n identity matrix. Then

∆(ρ(t)) = |t|nA,∆(ω(t)) = |t|δA.

With these definitions,

St(ν) =∫kAGϕt(ξ)χ(ξν)dξ

where

ϕt(x) = ϕ(t−1x),

and hence

Gϕt(ξ) =∫knAϕt(x)χ(f(x)ξ)dx.

Earlier, we showed that

St(ν) = ∆ρ(t)∆ω(t)−1S(ω(t)−1ν)= |t|n−δA S(t−δν).

In our case, |t|A = qd. Now,

P (t) = x ∈ kA : |x|v ≤ |t|v ∀ v

112

= P−d∞ × ΠπOπ,

and

L(t) = k⋂P (t) = x ∈ O = Fq[T ] : |x|∞ ≤ qd def= M(d)

where

l(t) = dimFqL(t) = d+ 1.

Let

η = t−δν = (T−δdν, ν, ν, ...).

So

η∞ = T−δdν,ηπ = ν, ∀ π.

We evaluate

S(η) = ΠvS(ηv).

In the case where v =∞,

S(η∞) =∫k∞

(∫On∞

χ∞(f(x∞)ξ∞)dχ∞x∞)χ∞(ξ∞t−dδν)dχ∞ξ∞

= qn+1∫k∞

(∫O∞ χ∞(ζγ)dζ)nχ∞(γt−dδν)dγ

= qn+1Γ0.

which means that

S(η∞) = qn+1Γ0.

Let

Γ =∫k∞

(∫O∞ χ∞(γζδ)dζ)ndγ

=∫k∞

(Gϕ∞(γ))ndγ,

and let

113


∫O∞ χ∞(γζδ)dζ)ndγ

for 0 < θ ≤ 12. From Theorem 2.6.1, if ch(k) > δ and n > δ2δ

θthen

|Γ− Γd,θ| ≤ c1q−ε′d.

Similarly,

|Γ0 − Γ0,d,θ| ≤ c2q−ε′′d.

So

|Γ− Γ0| ≤ |Γ− Γd,θ|+ |Γd,θ − Γ0,d,θ|+ |Γ0,d,θ − Γ0|.

The first and the third terms on the right side go to zero. Wecheck to see that the second term does as well. Evaluating this term,

Γd,θ − Γ0,d,θ =∫|γ|∞≤qdθ Gfϕ(γ)(1− χ∞(γt−dδν))dγ.

Since d >> 1, we know that γt−dδν → 0. So the integral goesto zero. Thus,

|Γ− Γ0| ≤ cq−εd,

i.e.

S(η∞)− qn+1Γ = O(q−εd).

Next, for the cases where v = π 6=∞, we claim that

(#) ΠπS(ν)π = S(ν).

We show what this claim implies. First, evaluating S(ηπ), we have

S(ηπ) =∫kπGfϕπ(ξπ)χ∞(ξπν)dξπ.

Since we assume that n > 2δ2δ > 2δ, we know not only that

114

Gfϕπ ∈ L1(kπ)

but more generally that

Gfϕ ∈ L1(kA).

Recall that in Section 2.4, we defined

S(ν) =∑

m monic

∑a∈M(µ−1),(a,m)=1(|m|−1

∞ S am

)nχ∞(ν am

)

where µ = deg m and

S am

=∑

x∈M(µ−1) χ∞( amxδ)

= Wµ−1( am

).

Additionally, we defined

Sd,θ(ν) =∑

m monic,µ≤dθ∑

a∈M(µ−1),(a,m)=1(|m|−1∞ S a

m)nχ∞(ν a

m).

We know that

|S(ν)−Sd,θ(ν)| ≤ c3q−ε′′′d.

In the case of the rational function field, the classical Hardy-LittlewoodTheorem states that, given our assumptions from before, ∃ ε suchthat

Nt(ν)

|t|n−δA= qn+1S(ν)Γ +O(q−dε)

for some ε > 0. If (#) is true,

qn+1S(ν)Γ = qn+1ΠπS(ν)πΓ∼ S(η∞)ΠπS(ν)π

if we ignore the error term of size O(q−εd). But

S(η∞)ΠπS(ν)π = S(T−dδν)∞ΠπS(ν)π = S(t−δν).

115

So the classical Hardy-Littlewood Theorem can be restated as

Nt(ν)

|t|n−δA= S(t−δν) +O(q−dε).

All that remains, then, is for us to check whether (#) is true.To test this, we have

S(ν)π =∫kπGf (ξπ)χπ(ξπν)dξπ

with

Gf (ξπ) =∫Onπχ(f(xπ)ξπ)dxπ.

Now, since

Oπ ⊂ P−1π ⊂ ... ⊂ P−lπ ⊂ ... ⊂ kπ,

we can rewrite S as

S(ν)π = liml→∞∫P−lπ Gf (ξπ)χπ(ξπν)dξπ.

Note that,

P−lπ /Oπ ≈ Oπ/P lπ = O/P l = M(deg(πl)− 1),

where the last equality follows from

P−lπ =⋃a∈M(deg(πl)−1)(Oπ + a

πl).

So

S(ν)π = liml→∞∑

a∈M(deg(πl)−1)

∫Oπ+ a

πlGf (ξπ)χπ(ξπν)dξπ

= liml→∞∑

a∈M(deg(πl)−1) Gf (aπl

)χπ( aπlν).

We see that

Gf ( aπl ) = (∫Oπ χπ(xδ a

πl)dx)n.

By the equivalence between Oπ/P lπ and M(deg(πl)− 1),

116

Oπ =⋃γ∈M(deg(πl)−1)(γ + P lπ).

So we change variables, letting y ∈ Oπ be such that

x = γ + πly,dx = |π|lπdy = q−lπ dy.

This means that∫Oπ χπ(xδ a

πl)dx =

∑γ∈M(deg(πl)−1)

∫γ+Plπ

χπ(xδ aπl

)dx

=∑

γ∈M(deg(πl)−1) |πl|−1π χπ(γδ a

πl).

Thus,∑a∈M(deg(πl)−1) Gf (

aπl

)χπ( aπlν) =

∑a∈M(deg(πl)−1)(

∑γ∈M(deg(πl)−1) |πl|−1

π χπ(γδ aπl

))nχπ( aπlν).

On the flip side, let us examine S. Define

S am

=∑

γ∈M(µ−1) χ∞(γδ am

),

G am

= |m|−1∞ S a

m.

Then

S(ν) =∑

m monic

∑a∈M(µ−1),(a,m)=1 G

namχ∞(ν a

m).

If we let

Sm(ν) =∑

a∈M(µ−1),(a,m)=1Gnamχ∞(ν a

m)

then

S(ν) =∑

m monic Sm(ν).

Note that

Smm′(ν) = Sm(ν)Sm′(ν).

So, for m = Πππlπ ,

117

Sm(ν) = ΠπSπlπ (ν).

Therefore

S(ν) = Ππ(1 + Sπ(ν) + ...+ Sπl(ν) + ...).

But

S(ν)π = liml→∞∫P−lπ Gf (ξπ)χπ(ξπν)dξπ

= liml→∞∑

a∈M(deg(πl)−1)(G a

πl)nχπ( a

πlν)

= 1 + Sπ(ν) + ...+ Sπl(ν) + ....

Thus,

S(ν) = ΠπS(ν)π.

Q.E.D.

118

Lectures on the Hardy-Littlewood Singular Serieswright/HLnotes.pdf · Lectures on the...

Documents

Transcript of Lectures on the Hardy-Littlewood Singular Serieswright/HLnotes.pdf · Lectures on the...