\documentstyle{amsart} \newtheorem{thm}{Theorem} \newtheorem{lemma}{Lemma} \newtheorem{cor}{Corollary} \theoremstyle{definition} \begin{document} \title[Diophantine approximation and deformation] {Diophantine approximation and deformation (ROUGH DRAFT)} \author{Minhyong Kim, Dinesh S. Thakur and Jos\'e Felipe Voloch} \address{} \keywords{} \subjclass{} \thanks{} \maketitle \vskip .2truein \abstractname{: It is well-known that while the analogue of Liouville's theorem on diophantine approximation holds in finite characteristic, the analogue of Roth's theorem fails quite badly. We associate certain curves over function fields to given algebraic power series and show that bounds on the rank of Kodaira-Spencer map of this curves imply bounds on the exponents of the power series, with more `generic' curves (in the deformation sense) giving lower exponents. If we transport Vojta's conjecture on height inequality to finite characteristic by modifying it by adding suitable deformation theoretic condition, then we see that the numbers giving rise to general curves approach Roth's bound. We also prove a hierarchy of exponent bounds for approximation by algebraic quantities of bounded degree. } \vskip .2truein 0.0 Let $F$ be a finite field of characteristic $p$. For $\beta$ an element of $F((t^{-1}))$ algebraic irrational over $F(t)$ (an algebraic irrational real number, respectively), define its diophantine approximation exponent $E(\beta)$ by $$E(\beta):=\lim\sup (-\frac{\log |\beta-P/Q|}{\log |Q|})$$ where $P$ and $Q$ run over polynomials in $F[t]$ (integers, respectively), the absolute value is the usual one in each case and the limit is taken as $|Q|$ grows. 0.1 The well-known theorems of Dirichlet and Liouville and their analogues for function fields [M] show that $2\leq E(\beta)\leq d(\beta)$, where $d(\beta)$ is the algebraic degree of $\beta$ defined as $[F(t,\beta):F(t)]$ ($[{\bold Q}(\beta):{\bold Q}]$, respectively). That the diophantine approximation results and in particular, the improvement on the Liouville bound of $d(\beta)$ have interesting implications for the study of related diophantine equations is well-known since the work of Thue, Siegel etc. For the real number case, the well-known theorem of Roth shows that $E(\beta)=2$, but Mahler showed [M] that $E(\beta)=d(\beta)=q$ for $\beta=\sum t^{-q^i}$, (so that $\beta^q-\beta-t^{-1}=0$) as a straightforward estimate of approximation by truncation of this series shows. (Here and in what follows $q$ is a power of $p$). Osgood [O2] and Baum and Sweet [BS] gave many examples in various degrees. See [T] for the references to other examples. 0.2 For given $d=d(\beta)$, $E(\beta)$'s form a countable subset of interval $2\leq x\leq d$. What is it? Does it contain any irrational number? Does it contain all the rationals in the range? In [Sc2] and [T], the following result was proved. \begin{thm} Given any rational $\mu$ between $2$ and $q+1$, we can find a family of $\beta$'s (given by explicit equations and explicit continued fractions), with $E(\beta)=\mu$ and $d(\beta) \leq q+1$. \end{thm} The question of exact degree of $\beta$ is easily addressed for explicit families. (For more, see [T]). 1.0 Osgood [O2] proved that the Liouville/Mahler bound can be improved to (even effectively) $E(\beta)\leq \lfloor (d(\beta)+3)/2\rfloor$ (or rather $\lfloor d(\beta)/2\rfloor +1$, see [Sc1] or [LdM]) for $\beta$ not satisfying the generalized Riccati differential equation $d\beta/dt=a\beta^2+b\beta +c$, with $a, b, c\in F(t)$. Most known (see e.g., . [L] for exceptions) examples $\beta$, whose continued fraction is known, do satisfy Riccati equation and indeed $\beta$ is an integral linear fractional transformation (of determinant $\pm 1$ for the first family) of $\beta^q$. What is the range of exponents for $\beta$ not of this form? 1.1 In [V2], [V3] there is an observation (for the lack of a better reference, we provide a proof of this in the appendix) that the Riccati condition is equivalent to the vanishing of Kodaira-Spencer (we write KS in short-form) class of projective line minus conjugates of $\beta$. Hence, it may not be too wild to speculate that it might be possible to successively improve on Osgood's bound, if we throw out some further classes of differential equations coming from the conditions that some corresponding Kodaira-Spencer map (or say the vector space generated by derivatives of the cross-ratios of conjugates of $\beta$) has rank not more than some integer. It should be also noted that even though the KS connection holds in characteristic zero, analogue of Roth's theorem holds in the complex function field case. The Osgood bound still holds (conjectured in [V1], proved in [LdM]) by throwing out only a subclass given by `Frobenius' equation $\beta^q=(a\beta+b)/(c\beta +d)$. This might be the best one can get. Similarly, the differential equation hierarchy suggested above might have some corresponding more refined hierarchy. 2.0 Though we have not succeeded yet in improving the Osgood bound unconditionally, by throwing out more classes of numbers, we prove existence of hierarchies, given by deformation theoretic conditions, of bounds, using results of [K], which we now recall: Let $X$ be a smooth projective surface over the perfect field $k$. Assume that $X$ admits a map $f: X\rightarrow S$ to a smooth projective curve $S$ defined over $k$, with function field $L$ in such a way that the fibers of $f$ are geometrically connected curves and the generic fiber $X_L$ is smooth of genus $g\geq 2$. Consider algebraic points $P: T\rightarrow X$ of $X_L$, where $T$ is a smooth projective curve mapping to $S$ (such that the triangle commutes). Define the canonical height of $P$ to be $h(P) :=\deg P^*\omega/[T:S]=\langle P(T).\omega\rangle/[K(P(T)):L]$, where $\omega=\omega_X:=K_X\otimes f^*K_S^{-1}$ denotes the relative dualising sheaf for $X\rightarrow S$. This is a representative for the class of height functions on $X_L(\overline{L})$ associated to the canonical sheaf $K_{X_L}$. Define the relative discriminant to be $d(P):=(2g(T)-2)/[T:S]=(2g(P(T))-2)/[K(P(T)):L]$. The Kodaira-Spencer map is constructed on any open set $U\subset S$ over which $f$ is smooth from the exact sequence $0\rightarrow f^*\Omega_U^1 \rightarrow \Omega_{X_U}^1\rightarrow \Omega_{X_U/U}^1\rightarrow 0$, by taking the coboundary map $KS: f_*(\Omega_{X_U/U}^1)\rightarrow \Omega_U^1\otimes R^1f_*( {\cal O}_{X_U})$. \begin{thm} {\mbox [K]} (1) Suppose the KS-map of $X/S$ (defined on some open subset of $S$) is non-zero. Then $h(P)\leq (2g-2)d(P)+O(h(P)^{1/2})$ if $g>2$. If g=2, then $h(P)\leq (2+\epsilon)d(P)+O(1)$. (2) Suppose the KS map of $X/S$ has maximal rank, then $h(P)\leq (2+\epsilon)d(P)+O(1)$. \end{thm} 2.1 The inequality in (2) was proved [Voj2] in the characteristic $0$ function field analogue, without any hypothesis, by Vojta, who also conjectured [Voj1] the stronger inequality with $2$ replaced by $1$ in the number field (and presumably also in the characteristic $0$ function field) case. 2.2 From the proof in [K] of Theorem 2, it follows, in fact, that there is an hierarchy of bounds: If the rank of the kernel of the KS map is $\leq i$, then we have $h(P)\leq (2g-2)d(P)/(g-i) + O(1)$, for $01$. Now the naive height is $\deg (z)$ ($\deg(x)$ differs by an additive constant, so it does not matter which is bigger). Hence the $h(P)+ O(h(P)^{1/2})$ is $((d-1)(k-1)-2))/k$ times that. %Note that %in the asymptotic, it does not matter that we computed the height %on this model, singular at infinity. Now zeros of $m$ and $z$ can ramify totally, so that Riemann-Hurwitz (as $p$ does not divide $k$) gives (for approximation approaching the exponent bound) $2g(P)-2\leq -2k+(1+d-e)(k-1)\deg (z)$, which is $(1+d-e)(k-1)\deg(z)$ asymptotically. Hence under the maximal rank hypothesis, the Theorem gives $(d-1)(k-1)-2\leq 2(1+d-e)(k-1)$, which simplifies to $e\leq (d+3)/2 +1/(k-1)$. This is again worse than, but asymptotic to, Osgood bound. 3.2.2 In this case, if we assume Vojta's bound under the maximal rank hypothesis, then we get $e\leq 2+2/(k-1)$ again approaching Roth bound, this time with $k$ approaching infinity. So we can say that $e=2$ under the maximal rank hypothesis and assuming the corresponding modification of Vojta's conjecture. 4.0 Similar ideas can be used to study approximation of $\beta$ by algebraic functions of lower degree in the spirit of Wirsing's theorem [Sc3]. The setting is as follows: Let $\beta$ as before of degree $d$ over $F(t)$. Now we want to see how close $\beta$ can be to $\alpha$ of degree $r < d$ over $F(t)$. Let $\alpha$ have height $H$ and be such that $-\log|\beta-\alpha|/H$ is close to $e$. We use the curves from 3.2 $y^k = f(x)$ and the point $P = (\alpha, f(\alpha)^{1/k})$ and $[F(t)(P):F(t)] = kr$ now. Then $h(P) = ((d-1)(k-1)-2)H/kr$ by the same calculation as before. Also the ramification of $K(P)$ over $K(\alpha)$ (where $K =F(t)$) is bounded by $(1+d-e)(k-1)H$. So by Riemann-Hurwitz $2g(P)-2 < k(2g(\alpha) -2) + (1+d-e)(k-1)H$, where $g(\alpha)$ is the genus of $K(\alpha)/F$. To bound $g(\alpha)$ apply the Castelnuovo inequality to $K(\alpha)$ viewed as the compositum of $F(t)$ and $F(\alpha)$ both function fields of genus 0 together with $[K(\alpha):F(t)] = r, [K(\alpha):F(\alpha)] = H$. So $g(\alpha) < Hr$ and $2g(P)-2 < 2krH + (1+d-e)(k-1)H$. Finally, we can apply Theorem 2, provided the Kodaira-Spencer map is of maximal rank and get $$((d-1)(k-1)-2)H/kr < (2+\epsilon)(2krH + (1+d-e)(k-1)H)/kr + O(1).$$ Now divide by $H$ and make $H$ big and $\epsilon$ small, obtaining $$((d-1)(k-1)-2)/kr \le 2(2kr + (1+d-e)(k-1))/kr.$$ The last inequality gives a bound for $e$ in terms of $d,r$ and $k$. If we can take $k$ arbitrarily large it gives $e \le (d+3+4r)/2$. Of course this is only interesting when $r < (d-3)/4$ but it seems that other methods that yield improvements on Liouville's inequality, such as Osgood's, do not give anything in this setting. 4.0.2 In this case, our finite characteristic version of Vojta's conjecture gives (under the maximal rank condition) $e\leq 2r+2$, for $r>1$. 5.0 Now we turn to calculation of Kodaira-Spencer matrix and explicit formulae for the rank conditions. 5.1 For $X$ as in 3.1, the basis for holomorphic differentials is $\omega_{(a, b)}=x^ay^b (dx/F_y)$, with $a, b \geq 0$ and $a+b\leq n-3$: Differentiating $F=1$ (treating $t$ as a constant for now), we get $F_xdx+F_ydy=0$, hence $dx/F_y=-dy/F_x$. Since $F_x$ and $F_y$ are not simultaneously zero, one has only to look at poles at infinity. Since $dx$ has order two pole and $F_y$ has order $n-1$ pole there, the claim follows. Since $F_xdx+F_ydy+F_tdt=0$, the relative differential $dx/F_y$ has good liftings: $dx/F_y$ in the open set $U_1$ where $F_y\neq 0$ and $-dy/F_x$ in the open set $U_2$ where $F_x\neq 0$. By our assumptions $U_1$ and $U_2$ cover $X$ and Cech cohomology computation shows that the map $KS: H^0(\Omega_{X/S}^1)\rightarrow H^1({\cal O}_X)\otimes \Omega_S^1$ sends $dx/F_y$ to $F_tdt/(F_xF_y)$. Hence $x^ay^bdx/F_y$ is sent to $x^ay^bF_tdt/(F_xF_y)$. We calculate the $g \times g$ matrix $M=(m_{ij})$ of the Kodaira-Spencer map in the basis above: If $P_k$'s are zeros of $F_y$, i.e points in the complement of $U_1$, then since the Serre duality sends a differential on $U_1\cap U_2$ to the sum of its residues at $P_k$'s (if we use $U_2$ instead, we get negative of this: it is well-defined only up to a sign), we have $$m_{(a, b)(c, d)}=\sum_k Res_{P_k}(\frac{x^{a+c}y^{b+d}F_tdx} {F_y^2F_x})dt \in \Omega_S^1$$ 5.2 In the situation of 3.2, now $x$ has degree $k$ and $y$ has degree $d$, so that $x^idx/y^j$ is holomorphic, as long as $02}a_m (\frac{\beta^m-\beta_1^m}{\beta-\beta_1}+\frac{\beta_3^m-\beta_2^m}{\beta_3-\beta_2} -\frac{\beta^m-\beta_2^m}{\beta-\beta_2}-\frac{\beta_3^m-\beta_1^m}{\beta_3-\beta_1})=0$$ Now $(\beta^m-\beta_1^m)/(\beta-\beta_1)=\sum_{i+j=m-1} \beta^i\beta_1^j$ and $\beta^j\beta_1^k+\beta_3^j\beta_2^k-\beta^j\beta_2^k-\beta_3^j\beta_1^k=(\beta-\beta_3)[\beta_1^k (\beta^j-\beta_3^j)/(\beta-\beta_3)-\beta_2^k(\beta^j-\beta_3^j)/(\beta-\beta_3)]$, so that taking out the non-zero factor $(\beta-\beta_3)(\beta_1-\beta_2)$ we get $$0=\sum_{m>2}a_m[\sum_{j, k>0, j+k=m-1} \frac{\beta^j-\beta_3^k}{\beta-\beta_3}\frac{\beta_1^k-\beta_2^k}{\beta_1-\beta_2}]$$ Now the quantity between the square brackets is just $\sum \beta_{i_1}\cdots \beta_{i_{m-3}}$, where each $\beta_i$ is one of the four conjugates. Hence we get $0=a_n\sum \beta_{i_1}\cdots \beta_{i_{n-3}}+a_{n-1} \sum \beta_{i_1}\cdots \beta_{i_{n-4}}+\cdots + a_3$. The coefficient of $a_n$ is of degree $n-3$ in $\beta_i$'s. Subtracting a similar equation that one obtains when $\beta$ is replaced by another conjugate $\overline{\beta}$, and taking out the non-zero factor $\beta-\overline{\beta}$ we get another equation with degrees dropping by one. Continuing in this fashion with other conjugates (there are $d-4\geq n-3$ of them), we get $a_n=0$, a contradiction. {\bf Acknowledgments}: The second author would like to thank Nitin Nitsure and Nicholas Katz for helpful conversations about the deformation theory. \bigskip \begin{thebibliography}{OS1} \bibitem[BS]{} L. Baum and M. Sweet, {\sl Continued fractions of algebraic power series in characteristic 2}, Ann. of Math {\bf 103} (1976), 593-610. \medskip \bibitem[K]{} M. Kim, {\sl Geometric height inequalities and the Kodaira-Spencer map}, Compositio Math. {\bf 105} (1997), 43-54. \medskip \bibitem[LdM]{} A. Lasjaunias and B. de Mathan, {\sl Thue's theorem in positive characteristic}, J. Reine Angew. Math {\bf 473} (1996), 195-206. \medskip \bibitem[M]{} K. Mahler, {\sl On a theorem of Liouville in fields of positive characteristic}, Can. J. Math {\bf 1} (1949), 397-400. \medskip \bibitem[O1]{} C. Osgood, {\sl An effective lower bound on the diophantine approximation of algebraic functions by rational functions}, Mathematika {\bf 20} (1973), 4-15. \medskip \bibitem[O2]{} C. Osgood, {\sl Effective bounds on the diophantine approximation of algebraic functions over fields of arbitrary characteristic and applications to differential equations}, Indag. Math {\bf 37} (1975), 104-119. \medskip \bibitem[Sc1]{} W. Schmidt, {\sl On Osgood's effective Thue theorem for algebraic functions}, Comm. pure applied math {\bf 29} (1976), 759-773. \medskip \bibitem[Sc2]{} W. Schmidt, {\sl On continued fractions and diophantine approximation in power series fields}, Preprint. \medskip \bibitem[Sc3]{} W. Schmidt, {\sl Diophantine approximation } Lecture Notes in Mathematics {\bf 785}, Springer, Berlin 1980. \medskip \bibitem[T]{} D. Thakur, {\sl Diophantine approximation exponents and continued fractions for algebraic power series}, To appear in J. Number Theory. \medskip \bibitem[V1]{} J. F. Voloch, {\sl Diophantine approximation in positive characteristic}, Periodica Math Hungarica {\bf 19} (1988), 217-225. \medskip \bibitem[V2]{} J. F. Voloch, {\sl Diophantine approximation in characteristic $p$}, Monatsh. Math {\bf 119} (1995), 321-325. \medskip \bibitem[V3]{} J. F. Voloch, {\sl Diophantine geometry in characteristic $p$: a survey} in Arithmetic Geometry, F. Catanese, ed., Symposia Mathematica XXXVII, Cambridge Univ. Press, 1997, pp. 260-278. \medskip \bibitem[Voj1]{} P. Vojta, {\sl Diophantine approximations and value distribution theory}, Lecture noes in Mathematics {\bf 1239}, Springer, Berlin 1987. \medskip \bibitem[Voj2]{} P. Vojta, {\sl On algebraic points on curves}, Comp. Math. {\bf 78}, (1991), 29-36. \end{thebibliography} \end{document}