Upcoming events!

There will be quite a few number-theoretic activities that I will be involved-in this year. In chronological order:

(1) In March, on Friday the 15th and Saturday the 16th, there will first be the 10th edition of the ETH-EPFL Number Theory Days, this time in Lausanne. The web page is currently not very informative (except for links to the previous editions…), but the speakers this year will be L. Berger (ENS Lyon), E. Lindenstrauss (Hebrew University), H. Oh (Brown University), N. Templier (Princeton University) and J. Wolf (École Polytechnique). This is organized by Ph. Michel and myself.

(2) Immediately following, there will be a conference on “Equidistribution in number theory and dynamics” at the Forschungsinstitut für Mathematik of ETH, organized jointly by M. Einsiedler, E. Lindenstrauss, Ph. Michel and myself, from March 18 to 22. There is a web page with the current list of speakers. We would especially like to invite young mathematicians to apply here for financial support if they wish to attend this conference (the deadline indicated is January 15th, but a few more days should not hurt).

(3) During the first week of June, again at FIM, G. Wustholz and myself are organizing a conference to celebrate the 25th anniversary of the Number Theory Seminar at ETH. Additional details will appear soon….

(4) Finally, from June 17 to 21, in sunny Marseille, R. de la Bretèche, Ph. Michel, J. Rivat and myself are co-organizing a conference on Analytic Number Theory in honor of É. Fouvry’s 60th birthday. This will be held at CIRM, which is a also a very nice place indeed to do mathematics. The web page for the conference is here; registration to the conference should be done on the CIRM website (the registration form is not there yet).

Three new papers

In this post, I will just describe briefly three papers (one long, two short) that É. Fouvry, Ph. Michel and myself have finished in recent weeks and days concerning the properties of trace functions. The last one should be on arXiv tomorrow, the others are there already. I will probably say more about some (or all) of these papers later, but here are quick summaries of what we do…

(1) “Counting sheaves with spherical codes”

This is a fairly short note, where we use the quasi-orthogonality of trace functions (in the geometrically irreducible case), which encapsulates Deligne’s general form of the Riemann Hypothesis over finite fields, in order to derive upper-bounds for the number of such functions with bounded conductor over a given finite field. As it turns out, the same quasi-orthogonality implies that we do something more geometrically interesting: for “small enough” conductor, the trace function essentially determines the sheaf, and so we are counting sheaves.

In spirit, this is therefore close to many counting problems of number theory: we have a countable set, a measure of complexity which allows us to write it as an increasing union of finite sets, and we want to know how many elements there are in these finite subsets.

A difference with many more classical problems, however, is that it seems rather difficult to get asymptotics when counting trace functions. If we use a Langlands correspondance, we are trying to count automorphic representations on $latex \mathrm{GL}_n(\mathbf{F}_p[T])$ with some bounds on ramification. We realized only rather late the existence of very striking conjectures and results of Drinfeld and Deligne (among others; see this excellent account of Deligne’s work by Esnault and Kerz) for the precise counting in the vertical direction (fixing a base field, and extending it), which should — under suitable conditions — take a form very similar to a Lefschetz Trace Formula. Our bounds do not really contribute to this question, since they are (probably far off) upper-bounds only, but they are completely explicit and they work in the “horizontal” direction (bounding the conductor and letting $latex p$ go to infinity.)

As to the spherical codes of the title, they arise because the quasi-orthogonality shows that, as vectors in the $latex 2p$-dimensional real vector space of complex-valued functions on $latex \mathbf{F}_p$, the (normalized) trace functions with conductor $latex \leq M$ have a strong angular-separation property, and subsets of unit spheres with this property are precisely called “spherical codes”. The question of giving upper-bounds for the cardinality of spherical codes with given angular separation is quite important, but interestingly we did not find the range given by the Riemann Hypothesis in the literature (this has the effect of making the cardinality grow polynomially as a function of $latex p$ for a fixed bound on the conductor; a polynomial growth is the right answer for our problem, though finding the right exponent is a rather delicate question). We tweaked the ideas of Kabatjanski and Levenshtein (who have the best-known results in general) for our purpose, which involves some fun estimates depending on the location of the first zero of the Airy function…

(2) “An inverse theorem for Gowers norm of trace functions over prime fields”

This is again a fairly short paper, which does pretty much what the title suggests: for a trace function $latex K$ over $latex \mathbf{F}_p$, and an integer $latex d\geq 1$, we find an estimate for the $latex d$-th Gowers norm $latex \|K\|_{d}$ of $latex K$ (see Section 3 of this part of T. Tao’s notes on higher-order Fourier analysis for an introduction to these norms). This takes the form
$latex \|K\|_{d}^{2^d}\ll p^{-1},$
where the implied constant depends only (completely explicitly) on the conductor of $latex K$ and on $latex d$, except (this is the inverse part in the title) when the sheaf $latex \mathcal{F}$ which gives rise to $latex K$ contains at least one Jordan-Hölder factor with trace function of the type
$latex e\Bigl(\frac{P(x)}{p}\Bigr),$
for some polynomial $latex P$ of degree $latex \leq d-1$. These functions are the natural obstructions to having small Gowers norms (as already emphasized by Gowers), but one doesn’t usually get to have such strong structural statements as those we get: if $latex K$ is geometrically irreducible, then the only possibility is that it is exactly proportional to a function of the type above (for all $latex x$).

None of the three of us can be said to be a great expert on the Gowers norms (which have been studied much more deeply by others, most spectacularly by Gowers and recently by Green, Tao and Ziegler), and this note is basically our attempt at seeing if the (fairly algebraic) definition could be studied using the sheaf formalism and the Riemann Hypothesis. But the final estimate is interesting in that, as far as the dependency on $latex p$ is concerned, it is the same as one would get for a “random” function (in the model where we consider a function $latex \varphi$ modulo $latex p$ such that the $latex \varphi(x)$ are independent uniformly bounded random variables with mean zero; we found this statement in the book of Tao and Vu), and it seems that no “deterministic” examples of such functions had been written down before. From our result, one can see for instance that
$latex \|x\mapsto \chi(f(x))\|_d^{2^d}\ll p^{-1},$
for any fixed non-constant polynomial $latex f\in \mathbf{Z}[T]$, $latex \chi$ being the Legendre character modulo $latex p$, with the implied constant depending only on $latex \deg(f)$ and $latex d$ (again, completely explicitly).

(3) “Algebraic trace functions over the primes”

The longest and deepest of our three papers continues the study of orthogonality of trace functions against other natural arithmetic sequences. After dealing with Fourier coefficients of modular forms in the first paper, we consider sums over primes, and sums against the Möbius function. Precisely, let $latex K$ be a trace function modulo $latex p$. Say that $latex K$ is $latex p$-exceptional if it is proportional to
$latex K_{\chi,a}(x)=\chi(x)e(ax/p),$
for some Dirichlet character $latex \chi$ modulo $latex p$ and some $latex a\in\mathbf{F}_p$ (allowing trivial $latex \chi$ and/or $latex a=0$.) Then, if $latex K$ is not $latex p$-exceptional we have
$latex \sum_{n\leq X}\Lambda(n)K(n)\ll X\Bigl(1+\frac{p}{X}\Bigr)^{1/12}p^{-1/48+\varepsilon}$
for any $latex \varepsilon>0$, where the implied constant depends only on the conductor of $latex K$ and on $latex \varepsilon$. The “critical” case is when $latex X=p$ or is a bit smaller, in which case we therefore get cancellation with a power saving. It is well-known that one expects such a bound for a $latex p$-exceptional $latex K$ also, but that this is essentially equivalent to proving the existence of a zero-free strip for some Dirichlet $latex L$-function, so that the restriction is natural in the current state of knowledge.

Similarly, we have
$latex \sum_{n\leq X}\mu(n)K(n)\ll X\Bigl(1+\frac{p}{X}\Bigr)^{1/12}p^{-1/48+\varepsilon}$
with the same conditions.

These estimates are rather sweeping: we can take any of the examples of trace functions explained in the previous post (making sure they are not exceptional, but for instance any irreducible sheaf of rank at least $latex 2$ is not exceptional, as is any rank $latex 1$ sheaf with a singularity not at $latex 0$ or $latex \infty$…). Although some specializations to specific trace functions had been already studied (sometimes with stronger exponents), we find the generality to be a really remarkable example of the power of the structural features coming from Deligne’s work. and from the formalism of algebraic geometry, which we use again extensively. Indeed, we need not only all the work of the previous paper on twists of Fourier coefficients of modular forms (applied to Eisenstein series), but we also had to establish some additional sheaf-theoretic properties.

To give an example, we get immediately that if $latex \chi$ is a Dirichlet character of order $latex h\geq 2$, and $latex f\in \mathbf{Z}[T]$ is a polynomial which is not proportional to an $latex h$-th power times a monomial (e.g., if $latex f$ is squarefree) we have
$latex \sum_{n\leq X}\Lambda(n)\chi(f(n))\ll X\Bigl(1+\frac{p}{X}\Bigr)^{1/12}p^{-1/48+\varepsilon}$
where the implied constant depends only on $latex \deg(f)$ and on $latex \varepsilon$. As far as we know, the only case previously treated (going back to Karatsuba) is when $latex f(x)=aX+b$, with $latex b\not=0$, is linear…

Among a number of applications, which can be found in the paper (and before we find others…), the following is also fairly nice: given $latex f\in\mathbf{Z}[T]$ squarefree and non-constant, we have
$latex \sum_{0\leq a<p-1} E(X,p,f(a))\ll X\Bigl(1+\frac{p}{X}\Bigr)^{1/12}p^{-1/48+\varepsilon}$
where $latex E(X,q,a)$ denotes in general the error term in the prime number theorem in arithmetic progressions
$latex \sum_{p\leq X,\ p\equiv a\bmod q}1=\frac{\pi(X)}{\varphi(q)}+E(X;q,a)$
and the implied constant depends only on $latex \deg(f)$ and on $latex \varepsilon$. In fact, with a whiff of extra formalism, we can replace the sum over residue classes of the form $latex f(a)$ taken with the multiplicity of representation to the corresponding sum over the the residues of this form, without multiplicity.

Orthogonality of columns of integral unitary operators: a challenge

Given a unitary matrix $latex A=(a_{i,j})$ of finite size, it is a tautology that the column vectors of $latex A$ are orthonormal, and in particular that
$latex \sum_{i} a_{i,j} \overline{a_{i,k}} =0$
for any $j\not=k$. This has an immediate analogue for a unitary operator $latex U\,:\, H\rightarrow H$, if $latex H$ is a separable Hilbert space: given any orthonormal basis $latex (e_n)_{n\geq 1}$ of $latex H$, we can define the “matrix” $latex (a_{i,j})_{i,j\geq 1}$ representing $latex U$ by
$latex U(e_j)=\sum_{i\geq 1}a_{i,j}e_i,$
and the “column vectors” $latex (a_{i,j})_{i\geq 1}$, for distinct indices $latex j$, are orthogonal in the $latex \ell_2$-sense: we have
$latex 0=\langle e_j,e_k\rangle = \langle U(e_j),U(e_k)\rangle=\sum_{i}a_{i,j}\overline{a_{i,k}}$
if $latex j\not=k$.

Now assume that $latex H$ is some $latex L^2$ space, say $latex H=L^2(X,\mu)$, and $latex U$ is an integral operator on $latex H$ given by a kernel $latex k\,:\, X\times X\rightarrow \mathbf{C}$, so that
$latex U(\varphi)(x)=\int_{X}\varphi(y)k(x,y)d\mu(y)$
for $latex \varphi \in L^2(X,\mu).$
Intuitively, the values $latex k(x,y)$ of the kernel form a kind of “continuous matrix” representing $latex U$. The question is: are its columns orthogonal? In other words, given $latex y\not=z$ in $latex X$, do we have
$latex \int_{X}k(x,y)\overline{k(x,z)}d\mu(x)=0?$

If one remembers the fact that “nice” kernels define trace class integral operators in such a way that the trace can be recovered as the integral
$latex \int_{X}k(x,x)d\mu(x)$
over the diagonal (the basis of the trace formula for automorphic forms…), this sounds rather reasonable. There is however a difficulty: it is not so easy to write kernels $latex k(x,y)$ which both define a unitary operator, and are such that the integrals
$latex (\star)\quad\quad\quad\quad \int_{X}k(x,y)\overline{k(x,z)}d\mu(x)$
are well-defined in the usual sense! For instance, the most important unitary integral operator is certainly the Fourier transform, defined on $latex L^2(\mathbf{R},dx)$, and its kernel is
$latex k(x,y)=e^{2i\pi xy},$
for which the integrals above are all undefined in the Lebesgue sense. This is natural: if the kernel $latex k(x,y)$ were square integrable on $latex X\times X$, for instance, the corresponding integral operator on $latex L^2(X,\mu)$ would be compact, and its spectrum could not be contained in the unit circle (excluding the degenerate case of a finite-dimensional $latex L^2$-space.)

This probably explains why this question of orthogonality of column vectors is not to be found in standard textbooks. There are some examples however where things do work.

We consider the space $latex H=L^2(\mathbf{R}^*,|x|^{-1}dx)$, and as in the previous post, we look at the unitary operator
$latex T=\rho\Bigl(\begin{pmatrix}0&-1\\1&0\end{pmatrix}\Bigr),$
where $latex \rho$ is the principal series representation with eigenvalue $latex 1/4$ of $latex \mathrm{PGL}_2(\mathbf{R})$. The result of Cogdell and Piatetski-Shapiro already mentioned there shows that $latex T$ is, indeed, a unitary operator given by a smooth kernel $latex k(x,y)=j(xy)$ for some function $latex j$ on $latex \mathbf{R}^*$. This function is explicit, and (as expected) not very integrable: we have
$latex j(x)=\begin{cases}-2\pi \sqrt{x}Y_0(4\pi\sqrt{x})\text{ for } x>0,\\4\sqrt{|x|}K_0(4\pi\sqrt{|x|})\text{ for } x<0.\end{cases}.$

Since it is classical that $latex Y_0(x)\approx x^{-1/2}$ for $latex x\rightarrow +\infty$, this function is neither integrable nor square-integrable. But, the function $latex K_0$ on $latex [0,+\infty[$ decays exponentially at infinity! This means that the integrals $latex (\star)$, which are given by
$latex \int_{\mathbf{R}^*}j(xy)\overline{j(xz)}\frac{dx}{|x|},$
make perfect sense when $latex y$ and $latex z$ have opposite sign (this requires also knowing that there is no problem at $latex 0$, but that is indeed the case, because the Bessel functions here have just a logarithmic singularity there, and the factors $latex \sqrt{|x|}$ eliminate the $latex |x|^{-1}$ in the integral.)

It should not be a surprise then that we have
$latex \int_{\mathbf{R}^*}j(xy)\overline{j(xz)}\frac{dx}{|x|}=0$
for $latex yz<0$. This boils down to an identity for integrals of Bessel functions that can be found in (combinations of) standard tables, or it can be proved more conceptually by viewing
$latex j(xy)=k(x,y)$
as limit of
$latex \frac{1}{2\epsilon}\int_{|u-y|<\epsilon} k(x,u)du,$
which is $latex T(f_{y,\epsilon})$ for the function $latex f_{y,\epsilon}$ which is the normalized characteristic function of the interval of radius $latex \epsilon$ around $latex y$, and similarly for $latex z$. Since
$latex \langle f_{y,\epsilon},f_{z,\epsilon}\rangle =0$
when $latex \epsilon$ is small enough, the unitarity gives
$latex \int_{\mathbf{R}^*} Tf_{y,\epsilon}(x)\overline{Tf_{z,\epsilon}(x)}\frac{dx}{|x|}=0,$
and one must take the limit $latex \epsilon\rightarrow 0$, which is made relatively easy by the exponential decay of $latex K_0$ at infinity…

This is nice, but here comes a challenge: if one spells out this identity in terms of Bessel functions, what needs to be done is equivalent to showing that the function
$latex K(a, b)=\int_{0}^{+\infty}{Y_0(ax)K_0(bx)xdx}$
defined for $latex a,b>0$, is antisymmetric: we have
$latex K(a,b)=-K(b,a).$
Now, this fact is an “elementary” property of classical functions. Can one prove it directly? (By which I mean, without using the operator interpretation, but also without using an explicit formula for the integral…) For the moment, I have not succeeded…

I’ll conclude by correcting a mistake in my previous post (it should not be a surprise to anyone that if I attempt to be as clever as Euler, I may stumble rather badly, and the correction is in some sense rather small compared with one might expect)… There I claimed that the integral transform $latex w\mapsto W$ appearing in the Voronoi formula for the divisor function is given by
$latex |y|^{1/2}W(y)=T(|x|^{1/2}w(|x|)).$
But this is not the case: the proper formula is
$latex |y|^{1/2}W(y)=T(|x|^{1/2}\tilde{w}(x)),$
where $latex \tilde{w}(x)=w(x)$ if $latex x>0$, but $latex \tilde{w}(x)=0$ if $latex x<0$. This affects the final formula: we have
$latex \|W\|^2=\|w\|^2,$
instead of the claimed
$latex \|W\|^2=2\|w\|^2$
(the "proof" using the Fourier transform has the same mistake of using $latex w(|xy|)$ instead of $latex \tilde{w}(xy)$, so there is no contradiction between the informal argument and the rigorous one.)

Trace functions, II: Examples

Continuing after my last post, this one will be a list of examples of trace functions modulo some prime number $latex p$. For each of the examples, I will give a bound for its conductor, which I recall is the main numerical invariant that allows us to measure the complexity of the trace function $latex K(n)$ (formally, the conductor is attached to the object $latex \mathcal{F}$ that gives rise to $latex K$, but we can define the conductor of a trace function to be the minimal conductor of such a $latex \mathcal{F}$.) These objects $latex \mathcal{F}$ will be called sheaves, since this is the language used in the paper(s) of Fouvry, Michel and myself, but one doesn’t need to know anything about sheaves to understand the examples.

I will start with a list of concrete functions which are trace functions, and then explain some of the basic operations one can perform on known trace functions to obtain new ones. All these examples will be (I hope) very natural, but it is usually a deep theorem that the functions come from sheaves.

Throughout, $latex p$ is a fixed prime number. Generically, $latex \psi$ denotes a non-trivial additive character modulo $latex p$, for instance
$latex \psi(x)=e^{2i\pi x/p},$
(which may also be viewed casually as an $latex \ell$-adic character), and $latex \chi$ denotes a multiplicative character modulo $latex p$ (non-trivial, unless specified otherwise.)

(1) Characters and mixed characters

Let $latex f$ and $latex g$ be non-zero rational functions in $latex \mathbf{F}_p(T)$. Let
$latex K(x)=\psi(f(x))\chi(g(x)),$
for $latex x$ which is not a pole of $latex f$, or a zero or pole of $latex g$, and $latex K(x)=0$ in that case. Then $latex K$ is a trace weight. The (or an) associated sheaf is of rank $latex 1$, and its conductor is bounded by the sum of degrees of numerators and denominators of $latex f$ and $latex g$. However, the size of the conductor arises for different reasons for $latex f$ and $latex g$: for the “additive” component $latex f$, singularities are poles of $latex f$, and the contribution of each pole $latex x_0$ comes from the Swan conductor, which is bounded by the order of the pole at $latex x_0$; for the “multiplicative” component $latex g$, the singularities are zeros and poles of $latex g$, and each only contributes $latex 1$ to the conductor: the Swan conductors for $latex K_g=\chi(g(x))$ are all zero.

For analytic applications, the main point is that, by fixing $latex f$ and $latex g$ over $latex \mathbf{Q}$, one obtains for each $latex p$ large enough (so that the reduction modulo $latex p$ makes sense), and each choice of characters $latex \psi$ and $latex \chi$, a trace weight associated to $latex f$ and $latex g$ which has conductor uniformly bounded (depending on $latex f$ and $latex g$ only). Thus any estimates valid for all primes with implied constants depending only on the conductor of the trace functions involved will become an interesting estimate concerning $latex f$ and $latex g$. This applies to the main theorem of my paper with Fouvry and Michel concerning orthogonality of Fourier coefficients of modular forms and trace functions…

These examples are the most classical, and are very useful. Even the simple case $latex g=1$ and $latex f(X)=X^{-1}$ is full of surprises.

(2) Fiber-counting functions

Another very useful example comes from a fixed non-constant rational function $latex f\in \mathbf{F}_p(T)$, which is viewed as defining a morphism
$latex f\,:\, \mathbf{P}^1\rightarrow \mathbf{P}^1.$
Consider then
$latex K(x)=|\{y\in \mathbf{P}^1\,\mid\, f(y)=x\}|.$
This is a trace weight, associated to the direct image sheaf
$latex \mathcal{F}=f_*\bar{\mathbf{Q}}_{\ell},$
which in representation theoretic terms is an induced representation from a finite-index subgroup, so that it remains relatively simple.
Here the rank $latex r$ of the sheaf is the degree $latex \deg(f)$ of $latex f$ as a morphism (i.e., the generic number of pre-images of a point $latex x$); the singularities are the finitely many $latex x$ in $latex \mathbf{P}^1$ such that the equation
$latex f(y)=x$
has fewer than $latex r$ solutions (in $latex \mathbf{P}^1(\bar{\mathbf{F}}_p)$) and, at least if $latex p>\deg(f)$, the Swan conductors vanish everywhere, so that the conductor is bounded in terms of the degrees of the numerator and denominator of $latex f$ only. In particular, if $latex f$ is defined over $latex \mathbf{Q}$, varying $latex p$ (large enough) will provide a family of trace functions modulo primes with uniformly bounded conductor, similar to the characters of the previous example with fixed rational functions as arguments.

The main reason this function is useful is that, for any other (arbitrary) function $latex \varphi$ on $latex \mathbf{P}^1(\mathbf{F}_p)$, we have tautologically
$latex \sum_{y}{\varphi(f(y))}=\sum_{x}{K(x)\varphi(x)}$
(in other words, it is maybe better to interpret $latex K$ as the image measure of the uniform measure on the finite set $latex \mathbf{P}^1(\mathbf{F}_p)$ under $latex f$, and this formula is the classical “integration” formula for an image measure…)

One also often takes the function
$latex \tilde{K}(x)=K(x)-1,$
where $latex 1$ is the average of $latex K$ over $latex \mathbf{F}_p$. This is also a trace function (the sheaf corresponding to $latex K$ contains a trivial quotient, and this is the trace function of the kernel of the map to this trivial quotient). We now have
$latex \sum_{x}{\tilde{K}(x)\varphi(x)}=\sum_{y}{\varphi(f(y))}-\sum_{x}{\varphi(x)}.$

(3) Number of points on families of algebraic varieties

More generally, we can count points on one-parameter families of algebraic varieties of dimension $latex d\geq 1$. For instance, families of elliptic curves or of more general curves are quite common. To be concrete, one may have a polynomial $latex f\in \mathbf{F}_p[T,Y,Z]$, where $latex T$ is seen as the parameter, and consider the curves
$latex C_t\,:\, f(t,X,Y)=0.$
Usually, it is not so much the number of points as the correction term that is most interesting. For instance, if the curves are generically geometrically irreducible, and have a single point at infinity, the size of $latex C_t(\mathbf{F}_p)$ is (for all but finitely many $latex t$) of the form
$latex |C_t(\mathbf{F}_p)|=p-a(C_t),$
where $latex a_(C_t)$ satisfies the Weil bound
$latex |a(C_t)|\leq 2g(C_t)\sqrt{p},$
in terms of the genus of $latex C_t$. In fact, once one ensures that the family of curves is such that the genus of the curves is the same $latex g\geq 0$ (for all but finitely many $latex t$), the function
$latex K(t)=a(C_t)$
is a trace function on the corresponding dense open set of $latex \mathbf{A}^1$, for some sheaf which has rank $latex 2g$. For the other values of $latex t$, the trace function of the corresppnding middle-extension sheaf might differ from the value $latex a(C_t)$ defined as above using the number of points, but since the number of those singularities is bounded by the conductor, one can usually (analytically at least) not worry too much about this. Similarly, in many cases the sheaf is tamely ramified everywhere (i.e., all Swan conductors vanish), and so the conductor is well-controlled.

In contrast with the first two examples, the construction of a sheaf with this trace function is not elementary: it is an example of the so-called “higher direct image sheaves” (with compact support). Since, for every “good” $latex t$, the Riemann Hypothesis for curves shows that
$latex a_p(C_t)=\sqrt{p}(\theta_{1,t}+\cdots+\theta_{2g,t}),$
where the $latex \theta_{i,t}$ are complex numbers of modulus $latex 1$, we can interpret the existence of this sheaf as saying that the algebraic variation of the “eigenvalues” $latex \theta_{i,t}$ is itself controlled by an algebraic object. This is one of the main insights that algebraic geometry (and étale cohomology in particular) brings to analytic number theory.

The family of elliptic curves
$latex x+x^{-1}+y+y^{-1}+t=0$
in my bijective challenge is of this type.

(4) Families of Kloosterman sums

One of the great examples, for analytic number theory, is given by families of Kloosterman sums: for an integer $latex m\geq 1$, and a non-zero $latex a\in\mathbf{F}_p$, we let
$latex Kl_m(a)=\frac{(-1)^{m-1}}{p^{(m-1)/2}}\sum_{x_1\cdots x_m=a}e\Bigl(\frac{x_1+\cdots +x_m}{p}\Bigr).$
The Weil bound for $latex m=2$, and the even deeper work of Deligne for larger $latex m$, prove that
$latex |Kl_m(a)|\leq m$
for all $latex a$ invertible modulo $latex p$. Further work, relying once more on the powerful formalism of étale sheaves and higher direct images in particular, shows that the function
$latex K(a)=Kl_m(a),$
is (the restriction to invertible $latex a$ of) a trace function for an irreducible sheaf, with conductor bounded in terms of $latex m$ only.

(5) The Fourier transform

If we have a function $latex K(x)$ modulo $latex p$, we define its Fourier transform by
$latex \hat{K}(t)=\frac{1}{\sqrt{p}}\sum_{x\in \mathbf{F}_p}{K(x)e\Bigl(\frac{xt}{p}\Bigr)}$
for $latex t\in\mathbf{F}_p$ (the normalization here is convenient, as I will explain). It is now a very deep fact that, if $\latex K$ comes from a sheaf, then so does $latex -\hat{K}$ (the minus sign is natural, but this has to do with rather deep algebraic geometry…) More precisely, one has to be careful because of the fact that the Fourier transform of an additive character (as a function) is a multiple of a delta function. The latter does fit nicely in the framework of étale sheaves, but not as a middle-extension sheaf or Galois representation (because it is zero on a dense open set, so it would have to be zero to be a middle-extension sheaf or to come from a Galois representation). There is a geometric solution to this issue, but it involves speaking of perverse sheaves and related machinery, which we have barely started to understand: the Fourier transform works perfectly well at the level of perverse sheaves, and one can use their trace functions just as well as those of Galois representations. Since, in our current applications, we can always deal separately with additive characters (or delta functions), we have avoided having to deal with perverse sheaves (up to now…)

The existence of the $latex \ell$-adic Fourier transform of sheaves was first proved by Deligne, but the theory of the sheaf-theoretic Fourier transform was largely built by Laumon (with further contributions, in particular, from Brylinski and Katz). To illustrate how powerful it is, consider
$latex K(x)=e\Bigl(\frac{x^{-1}}{p}\Bigr),$
a relatively simple case of Example (1). We then have
$latex \hat{K}(x)=Kl_2(x),$
so that the existence of the Fourier transform at the level of sheaves implies the existence of the Kloosterman sheaf parameterizing classical Kloosterman sums as in the previous example.

Other examples that arise from our previous examples are many families of exponential sums, for instance
$latex K(t)=\frac{1}{\sqrt{p}}\sum_{x\in\mathbf{F}_p}{\psi(f(x)+tx)\chi(g(x))},$
(arising from Example (1); one must assume either that $latex f(x)$ is not a polynomial of degree $latex \leq 1$ or that $latex \chi$ is non-trivial to have a well-defined sheaf), or
$latex K(t)=\frac{1}{\sqrt{p}}\sum_{x}{e\Bigl(\frac{tf(x)}{p}\Bigr)},$
for $latex t\not=0$ with $latex K(0)$ equal to the number of poles of $latex f$ (the sum over $latex x$ is over values where the rational function $latex f$ is defined), that arises from Example (2) (applied with the function $latex \tilde{K}$).

This operation of Fourier transform has one last crucial feature for applications to the analysis of trace functions: the conductor of $latex \hat{K}$ is bounded in terms of that of $latex K$ only. This is something we prove in our paper using Laumon’s analysis of the singularities of the Fourier transform, and in fact we show that if the conductor of $latex K$ is at most $latex M\geq 1$, then the conductor of $latex \hat{K}$ is at most $latex 10M^2$. Hence the examples above, if the rational functions $latex f$ (and/or $latex g$) are fixed in $latex \mathbf{Q}(T)$ and then reduced modulo various primes, always have conductor bounded uniformly for all $latex p$.

(6) Change of variable

Given a non-constant rational function $latex f\in\mathbf{F}_p(T)$ seen as a morphism
$latex \mathbf{P}^1\rightarrow \mathbf{P}^1,$
and a trace function $latex K(x)$, one can form the function
$latex f^*K(x)=K(f(x)).$
This is again, essentially, a trace function: as in Example (3), one may have to tweak the values of $latex f^*K$ at some singularities (because pull-back of middle-extension sheaves do not always remain so), but this is fairly easily controlled. Moreover, one can also control the conductor of $latex f^*K$ in terms of that of $latex K$, taking into account the degree of latex f$. A specially simple case of great importance is when $latex f$ is an homography
$latex f(x)=\frac{ax+b}{cx+d},\quad\quad\quad ad-bc\not=0,$
(an automorphism of $latex \mathbf{P}^1$) in which case no tweaking is necessary to defined $latex f^*K$, and the conductor is the same as that of $latex K$ (which certainly seems natural!)

We can now compose these various operations. One construction is the following (a finite-field Bessel transform): start with $latex K$, apply the Fourier transform, change the variable $latex t$ to $latex t^{-1}$, apply again the Fourier transform. If we call $latex \check{K}$ the resulting function, the examples above show that if $latex K$ is a trace function with conductor $latex \leq M$, then $latex \check{K}$ will also be one, and its conductor will be bounded solely in terms of $latex M$ (in fact, it will be $latex \leq 100M^4$ by the bound discussed in Example (5)).


Trailer! In the next post in this series, I will discuss the Riemann Hypothesis for trace functions and its applications. But probably before I will discuss the more recent works of Fouvry, Michel and myself, since we now have three further papers in our series — two small, and one big.

On Weyl groups and gaussians

Am I the last person to notice that for $latex k\geq 0$, the even moment
$latex m_{2k}=\frac{(2k)!}{2^kk!}$
of a standard gaussian random variable (with expectation zero and variance one) is the same as the index of the Weyl group of $latex \mathrm{Sp}_{2k}$ inside the Weyl group of $latex \mathrm{GL}_{2k}$ (in other words, the index of the groups of permutations of $latex 2k$ elements commuting with a fixed-point free involution among all permutations)?

If “Yes”, what else have I been missing in the same spirit?