Schwarzian derivative
In mathematics, the Schwarzian derivative, named after the German mathematician Hermann Schwarz, is a certain operator that is invariant under all Möbius transformations. Thus, it occurs in the theory of the complex projective line, and in particular, in the theory of modular forms and hypergeometric functions. It plays an important role in the theory of univalent functions, conformal mapping and Teichmüller spaces.
Definition
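The Schwarzian derivative of a holomorphic function f of one complex variable z is defined by
$$(Sf)(z) = \left(\frac{f''(z)}{f'(z)}\right)' - \frac{1}{2}\left(\frac{f''(z)}{f'(z)}\right)^2 = \frac{f'''(z)}{f'(z)} - \frac{3}{2}\left(\frac{f''(z)}{f'(z)}\right)^2.$$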
The same formula also defines the Schwarzian derivative of a $C^3$ function of one real variable. The alternative notation
$$\{f, z\} = (Sf)(z)$$
is frequently used.
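As a minimal sketch of the definition, the following evaluates $f'''/f' - \tfrac{3}{2}(f''/f')^2$ from the values of the first three derivatives at a single point. The function names `schwarzian` and `schwarzian_of_power` are illustrative, not taken from any library; exact rational arithmetic is used so that results can be compared without rounding.

```python
from fractions import Fraction


def schwarzian(d1, d2, d3):
    """Schwarzian derivative from the values f'(z), f''(z), f'''(z) at one point."""
    if d1 == 0:
        raise ValueError("f'(z) must be non-zero")
    ratio = d2 / d1
    return d3 / d1 - Fraction(3, 2) * ratio ** 2


def schwarzian_of_power(n, z):
    """Schwarzian derivative of f(z) = z**n at a non-zero rational point z."""
    z = Fraction(z)
    d1 = n * z ** (n - 1)
    d2 = n * (n - 1) * z ** (n - 2)
    d3 = n * (n - 1) * (n - 2) * z ** (n - 3)
    return schwarzian(d1, d2, d3)
```

For the power function the formula simplifies by hand to $(1 - n^2)/(2z^2)$, which gives a convenient closed form to check the sketch against.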
Properties
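Let
$$g(z) = \frac{az+b}{cz+d}$$
denote any Möbius transformation. Its derivatives are
$$g'(z) = \frac{\Delta}{(cz+d)^2}, \qquad g''(z) = -\frac{2c\,\Delta}{(cz+d)^3}, \qquad g'''(z) = \frac{6c^2\,\Delta}{(cz+d)^4},$$
where $\Delta = ad - bc \neq 0$. We make use of the third and second derivatives:
$$\frac{g'''(z)}{g'(z)} = \frac{6c^2}{(cz+d)^2}, \qquad \frac{g''(z)}{g'(z)} = -\frac{2c}{cz+d}.$$
With $\frac{3}{2}\left(\frac{g''(z)}{g'(z)}\right)^2 = \frac{6c^2}{(cz+d)^2}$, we find that the Schwarzian derivative maps any Möbius transformation to zero. Conversely, the Möbius transformations are the only functions with this property. As such, the Schwarzian derivative precisely measures the degree to which a function fails to be a Möbius transformation.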
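The vanishing of the Schwarzian derivative on Möbius transformations can be spot-checked in exact arithmetic. The sketch below assumes the standard closed forms $g' = \Delta/(cz+d)^2$, $g'' = -2c\Delta/(cz+d)^3$, $g''' = 6c^2\Delta/(cz+d)^4$ with $\Delta = ad - bc$; the helper name `mobius_schwarzian` is hypothetical.

```python
from fractions import Fraction


def mobius_schwarzian(a, b, c, d, z):
    """Schwarzian derivative of g(z) = (a z + b) / (c z + d) at a rational point z."""
    a, b, c, d, z = (Fraction(x) for x in (a, b, c, d, z))
    delta = a * d - b * c
    if delta == 0:
        raise ValueError("ad - bc must be non-zero")
    w = c * z + d
    if w == 0:
        raise ValueError("z is a pole of g")
    g1 = delta / w ** 2                 # g'(z)
    g2 = -2 * c * delta / w ** 3        # g''(z)
    g3 = 6 * c ** 2 * delta / w ** 4    # g'''(z)
    return g3 / g1 - Fraction(3, 2) * (g2 / g1) ** 2
```

Because the inputs are converted to `Fraction`, the result is exactly zero rather than merely small.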
If g is a Möbius transformation, then the composition g ∘ f has the same Schwarzian derivative as f; on the other hand, the Schwarzian derivative of f ∘ g is given by the chain rule
$$(S(f\circ g))(z) = (Sf)(g(z))\cdot g'(z)^2.$$
More generally, for any sufficiently differentiable functions f and g,
$$S(f\circ g) = \left((Sf)\circ g\right)\cdot (g')^2 + S(g).$$
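This makes the Schwarzian derivative an important tool in one-dimensional dynamics,[1] since it implies that all iterates of a function with negative Schwarzian will also have negative Schwarzian: each term produced by repeated use of the chain rule is a negative Schwarzian multiplied by a squared derivative.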
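The statement about iterates can be illustrated with the logistic map $f(x) = r x (1 - x)$, chosen here only as a familiar example; it is not discussed in the text above. Since $f''' = 0$ and $f'' = -2r$, its Schwarzian derivative is $-6/(1-2x)^2 < 0$. The sketch applies the chain rule $S(f\circ g) = ((Sf)\circ g)\cdot(g')^2 + S(g)$ repeatedly, assuming the orbit never lands exactly on the critical point $x = 1/2$.

```python
def logistic(r, x):
    return r * x * (1 - x)


def logistic_prime(r, x):
    return r * (1 - 2 * x)


def logistic_schwarzian(r, x):
    # f''' = 0 and f'' = -2r, so S(f)(x) = -(3/2) (f''/f')^2 = -6 / (1 - 2x)^2.
    return -6.0 / (1 - 2 * x) ** 2


def iterate_schwarzian(r, x, n):
    """Schwarzian derivative of the n-fold iterate of the logistic map at x."""
    total = 0.0
    scale = 1.0  # squared derivative of the iterates composed so far
    for _ in range(n):
        total += logistic_schwarzian(r, x) * scale
        scale *= logistic_prime(r, x) ** 2
        x = logistic(r, x)
    return total
```

Every summand is non-positive and the first is strictly negative, so the returned value is negative for every n ≥ 1, in line with the claim.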
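Introducing the function of two complex variables[2]
$$F(z,w) = \log\left(\frac{f(z)-f(w)}{z-w}\right),$$
its second mixed partial derivative is given by
$$\frac{\partial^2 F(z,w)}{\partial z\,\partial w} = \frac{f'(z)f'(w)}{(f(z)-f(w))^2} - \frac{1}{(z-w)^2},$$
and the Schwarzian derivative is given by the formula:
$$(Sf)(w) = \left. 6\cdot\frac{\partial^2 F(z,w)}{\partial z\,\partial w}\right|_{z=w}.$$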
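The two-variable description can be checked numerically: the quantity $6\left(\frac{f'(z)f'(w)}{(f(z)-f(w))^2} - \frac{1}{(z-w)^2}\right)$ should approach $(Sf)(w)$ as $z \to w$. The sketch below takes $z = w + h$ for a small step $h$; the step size and the function names are assumptions made for illustration, and the result is only an approximation.

```python
import math


def mixed_partial(f, fprime, z, w):
    """Second mixed partial of F(z, w) = log((f(z) - f(w)) / (z - w)), for z != w."""
    return fprime(z) * fprime(w) / (f(z) - f(w)) ** 2 - 1.0 / (z - w) ** 2


def schwarzian_from_mixed_partial(f, fprime, w, h=1e-2):
    """Approximate (Sf)(w) by evaluating 6 * d^2F/dzdw at z = w + h."""
    return 6.0 * mixed_partial(f, fprime, w + h, w)


# Example: f = exp has f'''/f' = 1 and f''/f' = 1, so (Sf)(w) = 1 - 3/2 = -1/2.
approx = schwarzian_from_mixed_partial(math.exp, math.exp, 0.2)
```

A much smaller step would suffer from cancellation between the two large terms, which is why a moderate `h` is used.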
The Schwarzian derivative has a simple inversion formula, exchanging the dependent and the independent variables. One has
$$(Sw)(v) = -\left(\frac{dw}{dv}\right)^2 (Sv)(w),$$
which follows from the inverse function theorem, namely that $v'(w) = 1/w'$.
Differential equation
The Schwarzian derivative has a fundamental relation with a second-order linear ordinary differential equation in the complex plane.[3] Let $f_1(z)$ and $f_2(z)$ be two linearly independent holomorphic solutions of
$$\frac{d^2 f}{dz^2} + Q(z) f(z) = 0.$$
Then the ratio $g(z) = f_1(z)/f_2(z)$ satisfies
$$(Sg)(z) = 2Q(z)$$
over the domain on which $f_1(z)$ and $f_2(z)$ are defined, and $f_2(z) \neq 0$. The converse is also true: if such a g exists, and it is holomorphic on a simply connected domain, then two solutions $f_1$ and $f_2$ can be found, and furthermore, these are unique up to a common scale factor.
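A simple instance of this relation, chosen here for illustration, is the constant case $Q = 1$: the equation $f'' + f = 0$ has the independent solutions $\sin z$ and $\cos z$, whose ratio is $g = \tan z$, so one expects $(Sg)(z) = 2$ wherever $\cos z \neq 0$. The sketch evaluates the Schwarzian derivative of the tangent from its closed-form derivatives at real points.

```python
import math


def schwarzian_of_tan(z):
    """Schwarzian derivative of g = sin/cos = tan from closed-form derivatives."""
    t = math.tan(z)
    sec2 = 1.0 + t * t                       # g'   = sec^2
    g2 = 2.0 * sec2 * t                      # g''  = 2 sec^2 tan
    g3 = 2.0 * sec2 * (sec2 + 2.0 * t * t)   # g''' = 2 sec^4 + 4 sec^2 tan^2
    return g3 / sec2 - 1.5 * (g2 / sec2) ** 2


values = [schwarzian_of_tan(z) for z in (0.0, 0.3, 1.0, -0.7)]
```

Each entry of `values` agrees with $2Q = 2$ up to floating-point rounding.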
When a linear second-order ordinary differential equation can be brought into the above form, the resulting Q is sometimes called the Q-value of the equation.
Note that the Gaussian hypergeometric differential equation can be brought into the above form, and thus pairs of solutions to the hypergeometric equation are related in this way.
Conditions for univalence
If f is a holomorphic function on the unit disc, D, then W. Kraus (1932) and Nehari (1949) proved that a necessary condition for f to be univalent is[4]
$$|S(f)(z)| \le 6\,(1-|z|^2)^{-2}.$$
Conversely if f(z) is a holomorphic function on D satisfying
$$|S(f)(z)| \le 2\,(1-|z|^2)^{-2},$$
then Nehari proved that f is univalent.[5]
In particular a sufficient condition for univalence is[6]
$$|S(f)| \le 2.$$
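The constant 6 in the necessary condition $|S(f)(z)| \le 6(1-|z|^2)^{-2}$ cannot be lowered. A standard example, not mentioned in the text above, is the Koebe function $k(z) = z/(1-z)^2$, which is univalent on D; a short calculation gives $S(k)(z) = -6/(1-z^2)^2$, so the bound is attained on the real axis. The sketch below checks this at rational points in $(-1, 1)$, using $k'(z) = (1+z)/(1-z)^3$ and the form $S(k) = (k''/k')' - \tfrac12 (k''/k')^2$; the function names are illustrative.

```python
from fractions import Fraction


def koebe_schwarzian(z):
    """Schwarzian derivative of k(z) = z / (1 - z)^2 at a rational z in (-1, 1)."""
    z = Fraction(z)
    # k'(z) = (1 + z) / (1 - z)^3, hence k''/k' = 1/(1 + z) + 3/(1 - z).
    log_deriv = 1 / (1 + z) + 3 / (1 - z)
    log_deriv_prime = -1 / (1 + z) ** 2 + 3 / (1 - z) ** 2
    return log_deriv_prime - Fraction(1, 2) * log_deriv ** 2


def kraus_nehari_bound(z):
    """The bound 6 / (1 - |z|^2)^2 for a real rational z in (-1, 1)."""
    z = Fraction(z)
    return 6 / (1 - z * z) ** 2
```

At $z = 1/2$ both sides equal $32/3$ in absolute value, which shows equality in the necessary condition.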
Conformal mapping of circular arc polygons
The Schwarzian derivative and the associated second-order ordinary differential equation can be used to determine the Riemann mapping between the upper half-plane or unit circle and any bounded polygon in the complex plane, the edges of which are circular arcs or straight lines. For polygons with straight edges, this reduces to the Schwarz–Christoffel mapping, which can be derived directly without using the Schwarzian derivative. The accessory parameters that arise as constants of integration are related to the eigenvalues of the second-order differential equation. Already in 1890 Felix Klein had studied the case of quadrilaterals in terms of the Lamé differential equation.[7][8][9]
Let Δ be a circular arc polygon with angles $\pi\alpha_1, \dots, \pi\alpha_n$ in clockwise order. Let f : H → Δ be a holomorphic map extending continuously to a map between the boundaries. Let the vertices correspond to points $a_1, \dots, a_n$ on the real axis. Then p(x) = S(f)(x) is real-valued for x real and not one of the points. By the Schwarz reflection principle p(x) extends to a rational function on the complex plane with a double pole at each $a_i$:
$$p(z) = \sum_{i=1}^{n} \left( \frac{1-\alpha_i^2}{2(z-a_i)^2} + \frac{\beta_i}{z-a_i} \right).$$
The real numbers $\beta_i$ are called accessory parameters. They are subject to three linear constraints:
$$\sum_i \beta_i = 0, \qquad \sum_i \left( 2a_i\beta_i + (1-\alpha_i^2) \right) = 0, \qquad \sum_i \left( a_i^2\beta_i + a_i(1-\alpha_i^2) \right) = 0,$$
which correspond to the vanishing of the coefficients of $z^{-1}$, $z^{-2}$ and $z^{-3}$ in the expansion of p(z) around z = ∞. The mapping f(z) can then be written as
$$f(z) = \frac{u_1(z)}{u_2(z)},$$
where $u_1(z)$ and $u_2(z)$ are linearly independent holomorphic solutions of the linear second-order ordinary differential equation
$$u''(z) + \tfrac{1}{2}\,p(z)\,u(z) = 0.$$
There are n−3 linearly independent accessory parameters, which can be difficult to determine in practice.
For a triangle, when n = 3, there are no accessory parameters. The ordinary differential equation is equivalent to the hypergeometric differential equation and f(z) is the Schwarz triangle function, which can be written in terms of hypergeometric functions.
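For n = 3 the three linear constraints $\sum \beta_i = 0$, $\sum (2a_i\beta_i + 1 - \alpha_i^2) = 0$ and $\sum (a_i^2\beta_i + a_i(1-\alpha_i^2)) = 0$ already determine $\beta_1, \beta_2, \beta_3$, which is why no free accessory parameter remains. The following sketch solves that 3×3 system by Cramer's rule in exact arithmetic; the function names and the sample vertices and angles are assumptions made for illustration.

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3x3 matrix given as a list of rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def triangle_betas(points, alphas):
    """Solve the three linear constraints for beta_1, beta_2, beta_3."""
    points = [Fraction(x) for x in points]
    k = [1 - Fraction(al) ** 2 for al in alphas]  # k_i = 1 - alpha_i^2
    rows = [[Fraction(1)] * 3, points, [x * x for x in points]]
    rhs = [Fraction(0), -sum(k) / 2, -sum(x * ki for x, ki in zip(points, k))]
    det = det3(rows)  # Vandermonde determinant, non-zero for distinct points
    betas = []
    for col in range(3):
        m = [row[:] for row in rows]
        for r in range(3):
            m[r][col] = rhs[r]
        betas.append(det3(m) / det)
    return betas


def constraint_residuals(points, alphas, betas):
    """Left-hand sides of the three constraints; all should vanish."""
    k = [1 - Fraction(al) ** 2 for al in alphas]
    pts = [Fraction(x) for x in points]
    return [
        sum(betas),
        sum(2 * a * b + ki for a, b, ki in zip(pts, betas, k)),
        sum(a * a * b + a * ki for a, b, ki in zip(pts, betas, k)),
    ]


sample_points = [0, 1, 3]
sample_alphas = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 4)]
sample_betas = triangle_betas(sample_points, sample_alphas)
```

For n > 3 the same three equations leave n − 3 of the $\beta_i$ undetermined, matching the count stated above.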
For a quadrilateral the accessory parameters depend on one independent variable λ. Writing U(z) = q(z)u(z) for a suitable choice of q(z), the ordinary differential equation takes the form
$$a(z)\,U''(z) + b(z)\,U'(z) + (c(z) + \lambda)\,U(z) = 0.$$
Thus $q(z)u_i(z)$ are eigenfunctions of a Sturm–Liouville equation on the interval $[a_i, a_{i+1}]$. By the Sturm separation theorem, the non-vanishing of $u_2(z)$ forces λ to be the lowest eigenvalue.
Complex structure on Teichmüller space
Universal Teichmüller space is defined to be the space of real analytic quasiconformal mappings of the unit disc D, or equivalently the upper half-plane H, onto itself, with two mappings considered to be equivalent if on the boundary one is obtained from the other by composition with a Möbius transformation. Identifying D with the lower hemisphere of the Riemann sphere, any quasiconformal self-map f of the lower hemisphere corresponds naturally to a conformal mapping $\tilde{f}$ of the upper hemisphere onto itself. In fact $\tilde{f}$ is determined as the restriction to the upper hemisphere of the solution of the Beltrami differential equation
$$\frac{\partial F}{\partial \bar{z}} = \mu(z)\,\frac{\partial F}{\partial z},$$
where μ is the bounded measurable function defined by
$$\mu(z) = \frac{\partial f}{\partial \bar{z}} \bigg/ \frac{\partial f}{\partial z}$$
on the lower hemisphere, extended to 0 on the upper hemisphere.
Identifying the upper hemisphere with D, Lipman Bers used the Schwarzian derivative to define a mapping
$$g = S(\tilde{f}),$$
which embeds universal Teichmüller space into an open subset U of the space of bounded holomorphic functions g on D with the uniform norm. Frederick Gehring showed in 1977 that U is the interior of the closed subset of Schwarzian derivatives of univalent functions.[10][11][12]
For a compact Riemann surface S of genus greater than 1, its universal covering space is the unit disc D on which its fundamental group Γ acts by Möbius transformations. The Teichmüller space of S can be identified with the subspace of the universal Teichmüller space invariant under Γ. The holomorphic functions g have the property that
$$g(z)\,dz^2$$
is invariant under Γ, so they determine quadratic differentials on S. In this way, the Teichmüller space of S is realized as an open subspace of the finite-dimensional complex vector space of quadratic differentials on S.
Diffeomorphism group of the circle
Crossed homomorphisms
The transformation property
$$S(f\circ g) = \left(S(f)\circ g\right)\cdot (g')^2 + S(g)$$
allows the Schwarzian derivative to be interpreted as a continuous 1-cocycle or crossed homomorphism of the diffeomorphism group of the circle with coefficients in the module of densities of degree 2 on the circle.[13] Let $F_\lambda(S^1)$ be the space of tensor densities of degree λ on $S^1$. The group of orientation-preserving diffeomorphisms of $S^1$, $\mathrm{Diff}(S^1)$, acts on $F_\lambda(S^1)$ via pushforwards. If f is an element of $\mathrm{Diff}(S^1)$ then consider the mapping
$$f \mapsto S(f^{-1}).$$
In the language of group cohomology the chain-like rule above says that this mapping is a 1-cocycle on $\mathrm{Diff}(S^1)$ with coefficients in $F_2(S^1)$. In fact
$$H^1(\mathrm{Diff}(S^1); F_2) = \mathbf{R}$$
and the 1-cocycle generating the cohomology is $f \mapsto S(f^{-1})$. The computation of 1-cohomology is a particular case of the more general result
$$H^1(\mathrm{Diff}(S^1); F_\lambda) = \mathbf{R} \ \text{ for } \lambda = 0, 1, 2 \ \text{ and } (0) \text{ otherwise.}$$
Note that if G is a group and M a G-module, then the identity defining a crossed homomorphism c of G into M can be expressed in terms of standard homomorphisms of groups: it is encoded in a homomorphism C of G into the semidirect product $M \rtimes G$ such that the composition of C with the projection onto G is the identity map; the correspondence is by the map C(g) = (c(g), g). The crossed homomorphisms form a vector space containing as a subspace the coboundary crossed homomorphisms b(g) = g ⋅ m − m for m in M. A simple averaging argument shows that, if K is a compact group and V a topological vector space on which K acts continuously, then the higher cohomology groups vanish: $H^m(K, V) = (0)$ for m > 0. In particular for 1-cocycles χ with
$$\chi(xy) = \chi(x) + x\cdot\chi(y),$$
averaging over y, using left invariance of the Haar measure on K, gives
$$\chi(x) = m - x\cdot m,$$
with
$$m = \int_K \chi(y)\,dy.$$
Thus by averaging it may be assumed that c satisfies the normalisation condition c(x) = 0 for x in $\mathrm{Rot}(S^1)$. Note that if any element x in G satisfies c(x) = 0 then C(x) = (0,x). But then, since C is a homomorphism, $C(xgx^{-1}) = C(x)C(g)C(x)^{-1}$, so that c satisfies the equivariance condition $c(xgx^{-1}) = x\cdot c(g)$. Thus it may be assumed that the cocycle satisfies these normalisation conditions for $\mathrm{Rot}(S^1)$. The Schwarzian derivative in fact vanishes whenever x is a Möbius transformation corresponding to SU(1,1). The other two 1-cocycles discussed below vanish only on $\mathrm{Rot}(S^1)$ (λ = 0, 1).
There is an infinitesimal version of this result giving a 1-cocycle for $\mathrm{Vect}(S^1)$, the Lie algebra of smooth vector fields, and hence for the Witt algebra, the subalgebra of trigonometric polynomial vector fields. Indeed, when G is a Lie group and the action of G on M is smooth, there is a Lie algebraic version of crossed homomorphism obtained by taking the corresponding homomorphisms of the Lie algebras (the derivatives of the homomorphisms at the identity). This also makes sense for $\mathrm{Diff}(S^1)$ and leads to the 1-cocycle
$$s\left(f\,\frac{d}{d\theta}\right) = \frac{d^3 f}{d\theta^3}\,(d\theta)^2,$$
which satisfies the identity
$$s([X,Y]) = X\cdot s(Y) - Y\cdot s(X).$$
In the Lie algebra case, the coboundary maps have the form b(X) = X ⋅ m for m in M. In both cases the 1-cohomology is defined as the space of crossed homomorphisms modulo coboundaries. The natural correspondence between group homomorphisms and Lie algebra homomorphisms leads to the "van Est inclusion map"
$$H^1(\mathrm{Diff}(S^1); F_\lambda(S^1)) \hookrightarrow H^1(\mathrm{Vect}(S^1); F_\lambda(S^1)).$$
In this way the calculation can be reduced to that of Lie algebra cohomology. By continuity this reduces to the computation of crossed homomorphisms φ of the Witt algebra into $F_\lambda(S^1)$. The normalisation conditions on the group crossed homomorphism imply the following additional conditions for φ:
$$\varphi(\mathrm{Ad}(x)X) = x\cdot\varphi(X), \qquad \varphi(d/d\theta) = 0$$
for x in $\mathrm{Rot}(S^1)$.
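Following the conventions of Kac & Raina (1987), a basis of the Witt algebra is given by
$$d_n = i\,e^{in\theta}\,\frac{d}{d\theta},$$
so that $[d_m, d_n] = (m-n)\,d_{m+n}$. A basis for the complexification of $F_\lambda(S^1)$ is given by
$$v_n = e^{in\theta}\,d\theta^{\lambda},$$
so that
$$d_m\cdot v_n = -(n+\lambda m)\,v_{n+m}, \qquad g_\zeta\cdot v_n = \zeta^n v_n,$$
for $g_\zeta$ in $\mathrm{Rot}(S^1) = \mathbf{T}$. This forces $\varphi(d_n) = a_n\cdot v_n$ for suitable coefficients $a_n$. The crossed homomorphism condition φ([X,Y]) = Xφ(Y) – Yφ(X) gives a recurrence relation for the $a_n$:
$$(m-n)\,a_{m+n} = (m+\lambda n)\,a_m - (n+\lambda m)\,a_n.$$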
The condition φ(d/dθ) = 0 implies that $a_0 = 0$. From this condition and the recurrence relation, it follows that, up to scalar multiples, this has a unique non-zero solution when λ equals 0, 1 or 2 and only the zero solution otherwise. The solution for λ = 1 corresponds to the group 1-cocycle $\varphi_1(f) = (f''/f')\,d\theta$. The solution for λ = 0 corresponds to the group 1-cocycle $\varphi_0(f) = \log f'$. The corresponding Lie algebra 1-cocycles for λ = 0, 1, 2 are given up to a scalar multiple by
$$\varphi_\lambda\left(F\,\frac{d}{d\theta}\right) = \frac{d^{\lambda+1}F}{d\theta^{\lambda+1}}\,(d\theta)^{\lambda}.$$
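The recurrence $(m-n)\,a_{m+n} = (m+\lambda n)\,a_m - (n+\lambda m)\,a_n$ can be tested against the candidate $a_n = n^{\lambda+1}$, which is what differentiating $e^{in\theta}$ a total of $\lambda + 1$ times produces up to a constant factor. The sketch below checks the relation over a finite window of integer indices; the window size and the function names are assumptions, and a finite check is an illustration rather than a proof.

```python
def satisfies_recurrence(lam, a, bound=6):
    """Check (m - n) a(m+n) = (m + lam n) a(m) - (n + lam m) a(n) for |m|, |n| <= bound."""
    for m in range(-bound, bound + 1):
        for n in range(-bound, bound + 1):
            lhs = (m - n) * a(m + n)
            rhs = (m + lam * n) * a(m) - (n + lam * m) * a(n)
            if lhs != rhs:
                return False
    return True


def power_ansatz(lam):
    """Candidate coefficients a_n = n**(lam + 1), which satisfy a_0 = 0."""
    return lambda n: n ** (lam + 1)


results = {lam: satisfies_recurrence(lam, power_ansatz(lam)) for lam in range(5)}
```

The candidate passes for λ = 0, 1, 2 and fails for λ = 3 and λ = 4, consistent with non-zero solutions existing only in the first three cases.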
Central extensions
The crossed homomorphisms in turn give rise to the central extension of $\mathrm{Diff}(S^1)$ and of its Lie algebra $\mathrm{Vect}(S^1)$, the so-called Virasoro algebra.
Coadjoint action
The group $\mathrm{Diff}(S^1)$ and its central extension also appear naturally in the context of Teichmüller theory and string theory.[14] In fact the homeomorphisms of $S^1$ induced by quasiconformal self-maps of D are precisely the quasisymmetric homeomorphisms of $S^1$; these are exactly the homeomorphisms which do not send four points with cross ratio 1/2 to points with cross ratio near 1 or 0. Taking boundary values, universal Teichmüller space can be identified with the quotient of the group of quasisymmetric homeomorphisms $\mathrm{QS}(S^1)$ by the subgroup of Möbius transformations $\mathrm{Moeb}(S^1)$. (It can also be realized naturally as the space of quasicircles in C.) Since
$$\mathrm{Moeb}(S^1) \subset \mathrm{Diff}(S^1) \subset \mathrm{QS}(S^1),$$
the homogeneous space $\mathrm{Diff}(S^1)/\mathrm{Moeb}(S^1)$ is naturally a subspace of universal Teichmüller space. It is also naturally a complex manifold and this and other natural geometric structures are compatible with those on Teichmüller space. The dual of the Lie algebra of $\mathrm{Diff}(S^1)$ can be identified with the space of Hill's operators on $S^1$
$$\frac{d^2}{d\theta^2} + q(\theta),$$
and the coadjoint action of $\mathrm{Diff}(S^1)$ involves the Schwarzian derivative. The inverse of the diffeomorphism f sends the Hill's operator to
$$\frac{d^2}{d\theta^2} + f'(\theta)^2\,q\circ f(\theta) + \frac{1}{2}\,S(f)(\theta).$$
Pseudogroups and connections
The Schwarzian derivative and the other 1-cocycles defined on $\mathrm{Diff}(S^1)$ can be extended to biholomorphic maps between open sets in the complex plane. In this case the local description leads to the theory of analytic pseudogroups, formalizing the theory of infinite-dimensional groups and Lie algebras first studied by Élie Cartan in the 1910s. This is related to affine and projective structures on Riemann surfaces as well as the theory of Schwarzian or projective connections, discussed by Gunning, Schiffer and Hawley.
A holomorphic pseudogroup Γ on C consists of a collection of biholomorphisms f between open sets U and V in C which contains the identity maps for each open U, which is closed under restricting to open subsets, which is closed under composition (when possible), which is closed under taking inverses and such that if a biholomorphism is locally in Γ, then it too is in Γ. The pseudogroup is said to be transitive if, given z and w in C, there is a biholomorphism f in Γ such that f(z) = w. A particular case of transitive pseudogroups are those which are flat, i.e. contain all complex translations $T_b(z) = z + b$. Let G be the group, under composition, of formal power series transformations $F(z) = a_1 z + a_2 z^2 + \cdots$ with $a_1 \neq 0$. A holomorphic pseudogroup Γ defines a subgroup A of G, namely the subgroup defined by the Taylor series expansion about 0 (or "jet") of elements f of Γ with f(0) = 0. Conversely if Γ is flat it is uniquely determined by A: a biholomorphism f on U is contained in Γ if and only if the power series of $T_{-f(a)} \circ f \circ T_a$ lies in A for every a in U: in other words the formal power series for f at a is given by an element of A with z replaced by z − a; or more briefly all the jets of f lie in A.[15]
The group G has a natural homomorphism onto the group $G_k$ of k-jets, obtained by truncating the power series at the term $z^k$. This group acts faithfully on the space of polynomials of degree k (truncating terms of order higher than k). Truncations similarly define homomorphisms of $G_k$ onto $G_{k-1}$; the kernel consists of maps f with $f(z) = z + bz^k$, so is Abelian. Thus the group $G_k$ is solvable, a fact also clear from the fact that it is in triangular form for the basis of monomials.
A flat pseudogroup Γ is said to be "defined by differential equations" if there is a finite integer k such that the homomorphism of A into $G_k$ is faithful and the image is a closed subgroup. The smallest such k is said to be the order of Γ. There is a complete classification of all subgroups A that arise in this way which satisfy the additional assumptions that the image of A in $G_k$ is a complex subgroup and that $G_1$ equals C*: this implies that the pseudogroup also contains the scaling transformations $S_a(z) = az$ for a ≠ 0, i.e. A contains every polynomial az with a ≠ 0.
The only possibilities in this case are that k = 1 and A = {az: a ≠ 0}; or that k = 2 and A = {az/(1−bz) : a ≠ 0}. The former is the pseudogroup defined by the affine subgroup of the complex Möbius group (the az + b transformations fixing ∞); the latter is the pseudogroup defined by the whole complex Möbius group.
This classification can easily be reduced to a Lie algebraic problem since the formal Lie algebra of G consists of formal vector fields F(z) d/dz with F a formal power series. It contains the polynomial vector fields with basis $d_n = z^{n+1}\,d/dz$ (n ≥ 0), which is a subalgebra of the Witt algebra. The Lie brackets are given by $[d_m, d_n] = (n-m)\,d_{m+n}$. Again these act on the space of polynomials of degree ≤ k by differentiation (it can be identified with $\mathbf{C}[[z]]/(z^{k+1})$) and the images of $d_0, \dots, d_{k-1}$ give a basis of the Lie algebra of $G_k$. Note that $\mathrm{Ad}(S_a)\,d_n = a^{-n}\,d_n$. Let $\mathfrak{a}$ denote the Lie algebra of A: it is isomorphic to a subalgebra of the Lie algebra of $G_k$. It contains $d_0$ and is invariant under $\mathrm{Ad}(S_a)$. Since $\mathfrak{a}$ is a Lie subalgebra of the Witt algebra, the only possibility is that it has basis $d_0$ or basis $d_0, d_n$ for some n ≥ 1. There are corresponding group elements of the form $f(z) = z + bz^{n+1} + \cdots$. Composing this with translations yields $T_{-f(\varepsilon)} \circ f \circ T_{\varepsilon}(z) = cz + dz^2 + \cdots$ with c, d ≠ 0. Unless n = 2, this contradicts the form of subgroup A; so n = 2.[16]
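The bracket relation $[d_m, d_n] = (n-m)\,d_{m+n}$ for $d_n = z^{n+1}\,d/dz$ follows from the general rule $[p\,d/dz,\ q\,d/dz] = (p\,q' - q\,p')\,d/dz$ and can be confirmed mechanically. In the sketch below a polynomial vector field is represented by a dictionary mapping exponents to coefficients; this representation and the helper names are choices made for illustration.

```python
def monomial_field(n):
    """The vector field d_n = z^(n+1) d/dz, stored as {exponent: coefficient}."""
    return {n + 1: 1}


def derivative(p):
    """Derivative of a polynomial stored as {exponent: coefficient}."""
    return {k - 1: k * c for k, c in p.items() if k != 0}


def multiply(p, q):
    """Product of two polynomials."""
    out = {}
    for i, a in p.items():
        for j, b in q.items():
            out[i + j] = out.get(i + j, 0) + a * b
    return out


def bracket(p, q):
    """Coefficient polynomial of [p d/dz, q d/dz] = (p q' - q p') d/dz."""
    pq = multiply(p, derivative(q))
    qp = multiply(q, derivative(p))
    keys = set(pq) | set(qp)
    out = {k: pq.get(k, 0) - qp.get(k, 0) for k in keys}
    return {k: c for k, c in out.items() if c != 0}


def expected_bracket(m, n):
    """(n - m) d_{m+n}, with the zero field stored as an empty dictionary."""
    return {m + n + 1: n - m} if n != m else {}
```

The same check also covers $d_{-1} = d/dz$, the translation generator that appears when the index range is extended to n ≥ −1.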
The Schwarzian derivative is related to the pseudogroup for the complex Möbius group. In fact if f is a biholomorphism defined on V then $\varphi_2(f) = S(f)$ is a quadratic differential on V. If g is a biholomorphism defined on U and g(U) ⊆ V, then S(f ∘ g) and S(g) are quadratic differentials on U; moreover S(f) is a quadratic differential on V, so that $g^{*}S(f)$ is also a quadratic differential on U. The identity
$$S(f\circ g) = g^{*}S(f) + S(g)$$
is thus the analogue of a 1-cocycle for the pseudogroup of biholomorphisms with coefficients in holomorphic quadratic differentials. Similarly $\varphi_0(f) = \log f'$ and $\varphi_1(f) = f''/f'$ are 1-cocycles for the same pseudogroup with values in holomorphic functions and holomorphic differentials. In general a 1-cocycle can be defined for holomorphic differentials of any order so that
$$\varphi(f\circ g) = g^{*}\varphi(f) + \varphi(g).$$
Applying the above identity to inclusion maps j, it follows that φ(j) = 0; and hence that if $f_1$ is the restriction of $f_2$, so that $f_2 \circ j = f_1$, then $\varphi(f_1) = \varphi(f_2)$. On the other hand, taking the local holomorphic flow defined by holomorphic vector fields (the exponential of the vector fields), the holomorphic pseudogroup of local biholomorphisms is generated by holomorphic vector fields. If the 1-cocycle φ satisfies suitable continuity or analyticity conditions, it induces a 1-cocycle of holomorphic vector fields, also compatible with restriction. Accordingly, it defines a 1-cocycle on holomorphic vector fields on C:[17]
$$\varphi([X,Y]) = X\varphi(Y) - Y\varphi(X).$$
Restricting to the Lie algebra of polynomial vector fields with basis $d_n = z^{n+1}\,d/dz$ (n ≥ −1), these can be determined using the same methods of Lie algebra cohomology (as in the previous section on crossed homomorphisms). There the calculation was for the whole Witt algebra acting on densities of order k, whereas here it is just for a subalgebra acting on holomorphic (or polynomial) differentials of order k. Again, assuming that φ vanishes on rotations of C, there are non-zero 1-cocycles, unique up to scalar multiples, only for differentials of degree 0, 1 and 2, given by the same derivative formula
$$\varphi_k\left(p(z)\,\frac{d}{dz}\right) = p^{(k+1)}(z)\,(dz)^k,$$
where p(z) is a polynomial.
The 1-cocycles define the three pseudogroups by $\varphi_k(f) = 0$: this gives the scaling group (k = 0); the affine group (k = 1); and the whole complex Möbius group (k = 2). So these 1-cocycles are the special ordinary differential equations defining the pseudogroup. More significantly they can be used to define corresponding affine or projective structures and connections on Riemann surfaces. If Γ is a pseudogroup of smooth mappings on $\mathbf{R}^n$, a topological space M is said to have a Γ-structure if it has a collection of charts $f_i$ that are homeomorphisms from open sets $V_i$ in M to open sets $U_i$ in $\mathbf{R}^n$ such that, for every non-empty intersection, the natural map from $f_i(V_i \cap V_j)$ to $f_j(V_i \cap V_j)$ lies in Γ. This defines the structure of a smooth n-manifold if Γ consists of local diffeomorphisms, and a Riemann surface if n = 2 (so that $\mathbf{R}^2 \equiv \mathbf{C}$) and Γ consists of biholomorphisms. If Γ is the affine pseudogroup, M is said to have an affine structure; and if Γ is the Möbius pseudogroup, M is said to have a projective structure. Thus a genus one surface given as C/Λ for some lattice Λ ⊂ C has an affine structure; and a genus p > 1 surface given as the quotient of the upper half plane or unit disk by a Fuchsian group has a projective structure.[18]
Gunning (1966) describes how this process can be reversed: for genus p > 1, the existence of a projective connection, defined using the Schwarzian derivative φ2 and proved using standard results on cohomology, can be used to identify the universal covering surface with the upper half plane or unit disk (a similar result holds for genus 1, using affine connections and φ1).
Notes
1. Weisstein, Eric W. "Schwarzian Derivative." From MathWorld—A Wolfram Web Resource.
2. Schiffer 1966
3. Hille 1976, pp. 374–401
4. Lehto 1987, p. 60
5. Duren 1983
6. Lehto 1987, p. 90
7. Nehari 1953
8. von Koppenfels & Stallmann 1959
9. Klein 1922
10. Ahlfors 1966
11. Lehto 1987
12. Imayoshi & Taniguchi 1992
13. Ovsienko & Tabachnikov 2005, pp. 21–22
14. Pekonen 1995
15. Sternberg 1983, pp. 421–424
16. Gunning 1978
17. Libermann
18. Gunning 1966
References
- Ahlfors, Lars (1966), Lectures on quasiconformal mappings, Van Nostrand, pp. 117–146, Chapter 6, "Teichmüller Spaces"
- Duren, Peter L. (1983), Univalent functions, Grundlehren der Mathematischen Wissenschaften, 259, Springer-Verlag, pp. 258–265, ISBN 978-0-387-90795-6
- Guieu, Laurent; Roger, Claude (2007), L'algèbre et le groupe de Virasoro, Montreal: CRM, ISBN 978-2-921120-44-9
- Gunning, R. C. (1966), Lectures on Riemann surfaces, Princeton Mathematical Notes, Princeton University Press
- Gunning, R. C. (1978), On uniformization of complex manifolds: the role of connections, Mathematical Notes, 22, Princeton University Press, ISBN 978-0-691-08176-2
- Hille, Einar (1976), Ordinary differential equations in the complex domain, Dover, pp. 374–401, ISBN 978-0-486-69620-1, Chapter 10, "The Schwarzian".
- Imayoshi, Y.; Taniguchi, M. (1992), An introduction to Teichmüller spaces, Springer-Verlag, ISBN 978-4-431-70088-3
- Kac, V. G.; Raina, A. K. (1987), Bombay lectures on highest weight representations of infinite-dimensional Lie algebras, World Scientific, ISBN 978-9971-50-395-6
- von Koppenfels, W.; Stallmann, F. (1959), Praxis der konformen Abbildung, Die Grundlehren der mathematischen Wissenschaften, 100, Springer-Verlag, pp. 114–141, Section 12, "Mapping of polygons with circular arcs".
- Klein, Felix (1922), Collected works, 2, Springer-Verlag, pp. 540–549, "On the theory of generalized Lamé functions".
- Lehto, Otto (1987), Univalent functions and Teichmüller spaces, Springer-Verlag, pp. 50–59, 111–118, 196–205, ISBN 978-0-387-96310-5
- Libermann, Paulette (1959), "Pseudogroupes infinitésimaux attachés aux pseudogroupes de Lie", Bull. Soc. Math. France, 87: 409–425, doi:10.24033/bsmf.1536
- Nehari, Zeev (1949), "The Schwarzian derivative and schlicht functions", Bulletin of the American Mathematical Society, 55 (6): 545–551, doi:10.1090/S0002-9904-1949-09241-8, ISSN 0002-9904, MR 0029999
- Nehari, Zeev (1952), Conformal mapping, Dover, pp. 189–226, ISBN 978-0-486-61137-2
- Ovsienko, V.; Tabachnikov, S. (2005), Projective Differential Geometry Old and New, Cambridge University Press, ISBN 978-0-521-83186-4
- Ovsienko, Valentin; Tabachnikov, Sergei (2009), "What Is . . . the Schwarzian Derivative?" (PDF), AMS Notices, 56 (1): 34–36
- Pekonen, Osmo (1995), "Universal Teichmüller space in geometry and physics", J. Geom. Phys., 15 (3): 227–251, arXiv:hep-th/9310045, Bibcode:1995JGP....15..227P, doi:10.1016/0393-0440(94)00007-Q
- Schiffer, Menahem (1966), "Half-Order Differentials on Riemann Surfaces", SIAM Journal on Applied Mathematics, 14 (4): 922–934, doi:10.1137/0114073, JSTOR 2946143
- Segal, Graeme (1981), "Unitary representations of some infinite-dimensional groups", Comm. Math. Phys., 80 (3): 301–342, Bibcode:1981CMaPh..80..301S, doi:10.1007/bf01208274
- Sternberg, Shlomo (1983), Lectures on differential geometry (Second ed.), Chelsea Publishing, ISBN 978-0-8284-0316-0
- Takhtajan, Leon A.; Teo, Lee-Peng (2006), Weil-Petersson metric on the universal Teichmüller space, Mem. Amer. Math. Soc., 183