A complex function \(\zeta = T(z)\) may be considered geometrically as a transformation which maps a set of points, the domain of the independent variable \(z\) , onto another set, the domain of the function values \(\zeta\) . A most illuminating example is the general linear transformation \[\tag{3.00} \zeta=\frac{az+b}{cz+d}\] where \(a, b, c, d\) are four complex numbers with non-vanishing determinant; \(a d-b c \neq 0\) . The transformation (3.00) provides a one-to-one mapping of the full z-plane 1 onto the full \(\zeta\) -plane, for to each value of \(z\) there corresponds a unique value of \(\zeta\) and for each value of \(\zeta\) , \(z\) is given by the inverse transformation mapping \[\tag{3.01} z = \frac{d\zeta - b}{-c\zeta + a}\] which has the same determinant as (3.00) . We shall show that the mapping is conformal and, later, that the linear transformation is completely characterized by these two properties, namely, that the only one-to-one conformal mapping of the full \(z\) -plane onto itself is given by the general linear transformation.


 

1.3.1 Composition of the General Linear Transformation

The product \(ST\) of two transformations \(w=S(\zeta)\) , \(\zeta=T(z)\) is simply the transformation \(w=S(T(z))\) . The product of two linear transformations is easily seen to be another linear transformation. It will simplify our discussion to consider the transformation (3.00) as composed of several simple transformations:

  1. The translation \[\tag{3.10} \zeta = z + b\] simply represents a rigid displacement of the entire \(z\) -plane by the vector \(b\) . It follows that the image of any geometrical configuration by this transformation is a congruent configuration. The mapping \(\zeta=z\) is called the identity \(I\) since it leaves every point unchanged. Those points which map into themselves are called the fixed points and are of special interest in any transformation. If \(b \neq 0\) the only fixed point of the mapping (3.10) is \(\infty\) .
  2. The transformation \[\tag{3.11} \zeta=a z, \quad a \neq 0\] is best considered in polar coordinates. Set \(z=|z|(\cos \theta+i \sin \theta)\) and \(a=|a|(\cos \alpha+i \sin \alpha)\) . We then have \[\zeta=|a|\cdot|z|\big[\cos (\alpha+\theta)+i \sin (\alpha+\theta)\big]\] whence \[|\zeta|=|a|\cdot |z|\quad \text {and}\quad \operatorname{am}(\zeta) = \operatorname{am}(a) + \operatorname{am}(z).\] We conclude that the transformation represents a stretching by a factor \(|a|\) along the rays from the origin plus a rigid rotation about the origin through the angle \(\alpha= \operatorname{am}(a)\) . Thus (3.11) may be considered as the product of the two transformations \[w=\frac{a}{|a|} z \quad \text { and } \quad \zeta=|a| w\] the first being a rotation and the second a magnification. Since the image of any configuration by either of these transformations is a similar configuration it is clear that the resultant transformation \(\zeta=a z\) also has the same property. Note that \(\infty\) and \(0\) are both fixed points of this mapping.
  3. The function \[\tag{3.12} \zeta=\frac{1}{z}\] transforms the \(z\) -plane in a one-to-one way except for the point \(z=0\) . According to our convention we complete the mapping by writing \(\zeta=\infty\) at \(z=0\) and, inversely, \(z=\infty\) at \(\zeta=0\) .

If, as above, we write \(z=r(\cos \theta+i \sin \theta)\) then (3.12) takes the form \[\zeta=\frac{1}{r(\cos \theta+i \sin \theta)}=\frac{1}{r}\big[\cos (-\theta)+i \sin (-\theta)\big].\] Thus (3.12) may be decomposed into the transformation \(\zeta=\overline{w}\) and \(w=\frac{1}{r}(\cos \theta+i \sin \theta)=\frac{1}{\bar{z}}\)
The first is simply a reflection of the \(z\) -plane through the real axis and hence transforms figures into congruent configurations with angles reversed in sense. The second is an inversion with respect to the unit circle, i.e. \(z\) and \(w\) lie on the same line and \(|w|=\frac{1}{|z|}\) . Two points which lie in this relation are referred to as inverse points.

image

The principal properties of inversion are assumed known from elementary geometry: the transformation carries circles into circles 2 and preserves angles but reverses their sense. It follows that the resultant transformation \(\frac{1}{z}\) is conformal and preserves circles.

The most important property of the general linear transformation \[\tag{3.13} \zeta=\frac{a z+b}{c z+d}, \quad \Delta=a d-b c \neq 0\] is that

Theorem 1.7 . Under the above transformation, circles are transformed into circles. 

Proof. We give two different proofs:

I . The theorem is immediately proved if the general linear transformation can be decomposed into the simple transformations a), b), c) for which the property is known to hold. If \(c \neq 0\) this is dones as follows: \begin{align} \zeta &= w_{3}+\frac{a}{c} \\ w_{3} &= \left(-\frac{\Delta}{c}\right) w_{2} \\ w_{2} &= \frac{1}{w_{1}} \\ w_{1} &= c z+d. \end{align} In the special case \(c=0\) it follows from \(\Delta=a d \neq 0\) that \(d \neq 0\) . The transformation then takes the form \[\zeta=\alpha z+\beta, \quad \alpha \neq 0.\] 

Observe that this method also furnishes a proof of the conformality of the mapping (3.13) .

II . The second proof depends upon another basic property of the linear transformation, namely, that the cross ratio of four points remains invariant under linear transformation. ◻

1.3.2 The Cross-Ratio of Four Points

In analogy with projective geometry we define the cross ratio of four given complex points \(z_{1}, z_{2}, z_{3}, z_{4}\) as \[\tag{3.20} \left(z_{1}, z_{2}, z_{3}, z_{4}\right)=\frac{z_{3}-z_{1}}{z_{3}-z_{2}} \frac{z_{4}-z_{2}}{z_{4}-z_{1}}.\] If \(\zeta_{1}, \zeta_{2}, \zeta_{3}, \zeta_{4}\) are the respective function values for \(z_{1}, z_{2}, z_{3}, z_{4}\) in (3.13) then it may be confirmed by direct calculation that \[\left(\zeta_{1}, \zeta_{2}, \zeta_{3}, \zeta_{4}\right)=\left(z_{1}, z_{2}, z_{3}, z_{4}\right)\] or that the cross ratio of four points remains fixed in any linear transformation .

Theorem 1.8 . A necessary and sufficient condition that four points lie on a circle is that their cross ratio have a real value. 

image

Proof. Let the four given points be \(z_{1}, z_{2}, z_{3}, z_{4}\) and denote the distance between any two points \(z_{j}\) and \(z_{k}\) by \(r_{j k}=\left|z_{j}-z_{k}\right|\) . Setting \(z_{3}-z_{1}=r_{31}e\left(\alpha_{1}\right)\) , 3 and \(z_{3}-z_{2}=r_{32}e\left(\alpha_{2}\right)\) we obtain \[\frac{z_3 - z_1}{z_3 - z_2} = \frac{r_{32}}{r_{32}}\, e\left(\alpha_{1}-\alpha_{2}\right)\] where the difference \(\alpha_{1}-\alpha_{2}\) is the angle between the vectors \(z_{3}-z_{1}\) and \(z_{2}-z_{3}\) . Similarly \[\frac{z_4 - z_1}{z_4 - z_2} = \frac{r_{41}}{r_{42}}\, e\left(\beta_{1}-\beta_{2}\right)\] where \(\beta=\beta_{1}-\beta_{2}\) is the angle between \(z_{4}-z_{1}\) and \(z_{2}-z_{4}\) . (We may assume \(0 \leq \alpha, \beta \leq \pi\) .)

The cross ratio of the four points now takes the form \begin{align} \left(z_{1}, z_{2}, z_{3}, z_{4}\right)&=\frac{z_3 - z_1}{z_3 - z_2} \frac{z_4 - z_2}{z_4 - z_1}\\ &= \frac{r_{31}}{r_{32}} \frac{r_{42}}{r_{41}}\, e(\alpha-\beta). \end{align} But the necessary and sufficient condition that \(z_{4}\) lie on the circle (in the extended sense) determined by \(z_{1}, z_{2}, z_{3}\) is that \(\alpha=\beta\) and the cross ratio is real if and only if \(\alpha-\beta=0\) . This concludes the proof. ◻

The circle preserving property of linear transformations follows from the invariance of the cross ratio.

The general linear transformation \[\tag{3.21} \zeta=\frac{az + b}{cz + d}, \quad \Delta = ad - bc \neq 0\] actually depends on no more than three complex parameters for by multiplying the numerator and denominator by a suitable constant we may prescribe the condition \(\Delta=a d-b c=1\) which enables us to give one parameter in terms of the other three. It is therefore to be expected that a linear transformation is uniquely prescribed by the condition that three given points \(z_{1}, z_{2}, z_{3}\) map into three given points \(\zeta_{1}, \zeta_{2}, \zeta_{3}\) . This is actually the case. The point \(\zeta\) corresponding to the independent variable \(z\) must satisfy the equation \[\tag{3.22} \frac{\zeta_{3}-\zeta_{1}}{\zeta_{3}-\zeta_{2}} \frac{\zeta-\zeta_{2}}{\zeta-\zeta_{1}}=\frac{z_3 - z_1}{z_3 - z_2} \frac{z-z_{2}}{z-z_{1}}.\] It is not difficult to verify that this is a linear transformation having the requisite properties. Furthermore the transformation specified by the mapping of three given points is unique. For proof consider the fixed points of the mapping (3.21) . These satisfy the condition \[z=\frac{a z+b}{c z+d},\] which is equivalent to the quadratic equation \[\tag{3.23} c z^{2}+(d-a) z-b=0.\] The two solutions of (3.23) are the fixed points of the transformation. Now if \(T_{1}\) and \(T_{2}\) are any two transformations which take \(z_{1}, z_{2}, z_{3}\) into \(\zeta_{1}, \zeta_{2}, \zeta_{3}\) respectively, then \(T=T_{1} T_{2}^{-1}\) , where \(T_2^{-1}\) is the inverse transformation to \(T_{2}\) , is a linear transformation which leaves \(z_{1}, z_{2}, z_{3}\) fixed. The quadratic equation (3.23) would then have three roots \(z_{1}, z_{2}, z_{3}\) which is impossible unless the coefficients are all zero; i.e. \(c=0\) , \(a=d\) , \(b=0\) . But the transformation with these coefficients is the identity \(I\) . It follows from \(S_{1} S_{2}^{-1}=I\) that \(S_{1}=S_{2}\) . Thus the uniqueness of the linear transformation (3.22) is proved.

A deeper insight (due to F. Klein) into the structure of linear transformation, is obtained by considering the behavior of the family \(K\) of circles through the fixed points. Let us assume first that the equation (3.23) has two distinct roots \(z_{1}\) and \(z_{2}\) . Since \(z_{1}\) and \(z_{2}\) are fixed points of the mapping the circles of \(K\) map into circles of the same family. From the conformality of the mapping it follows that the orthogonal family \(K'\) also maps into itself. Three possibilities occur:

image
  1. Each circle in \(\boldsymbol{K}\) goes over into itself . The circles of \(K\) may then be considered kinematically as the paths along which the points of the plane travel toward their images. Such a transformation is called hyperbolic .
  2. Each circle of the orthogonal family \(\boldsymbol{K'}\) maps onto itself . In this case the curves of \(K'\) are the paths for the points of the plane. The transformation is said to be elliptic .
  3. If neither of the first two cases holds the transformation is called loxodromic .

This classification leads naturally to a normal form of description for three kinds of linear transformation. Given \(z_{1}\) and \(z_{2}\) , the two fixed points, let \(z_{3}\) and \(z\) be two points on a circle through the fixed points \(z_{1}\) and \(z_{2}\) , \(\zeta_{3}\) and \(\zeta\) the respective images of \(z_{3}\) and \(z\) . The images \(\zeta_{1}\) and \(\zeta_{2}\) are the fixed points \(z_{1}\) and \(z_{2}\) themselves. From the invariance of the cross ratio we obtain \[\frac{z-z_{1}}{z-z_{2}} \frac{z_3 - z_2}{z_{3}-z_{1}}=\frac{\zeta-z_{1}}{\zeta-z_{2}} \frac{\zeta_{3} - z_2}{\zeta_3 - z_1}.\] The transformation may then be written in the normal form \[\tag{3.24} \frac{z-z_{1}}{z-z_{2}}=\alpha\, \frac{\zeta-z_{1}}{\zeta-z_{2}}, \quad \alpha \neq 0\] where \[\alpha=\frac{z_3 - z_1}{z_3 - z_2}\cdot\frac{\zeta_3 - z_2}{\zeta_3 - z_1}.\] In the hyperbolic case the points \(z_{1}, z_{2}, z_{3}, \zeta_{3}\) are on the same circle and therefore their cross ratio \[\frac{z_3 - z_1}{z_3 - z_2}\cdot\frac{\zeta_3 - z_2}{\zeta_3 - z_1}\] is a real constant. The hyperbolic transformation takes the form \[\tag{3.25} \frac{z-z_{1}}{z-z_{2}}=\alpha\,\frac{\zeta-z_{1}}{\zeta-z_{2}} \quad(\alpha \text { real} \neq 0)\] Conversely, it follows from (3.25) that \(\zeta\) , the image of \(z\) , lies on the circle through \(z, z_1, z_2\) and consequently that the transformation is of hyperbolic type.

For the elliptic transformation the theorem of Apollonius in elementary geometry (see Exercises 23 , 24 ), implies the relation \[\tag{3.26} \frac{\left|z-z_{1}\right|}{\left|z-z_{2}\right|}=\frac{\left|\zeta-z_{1}\right|}{\left|\zeta-z_{2}\right|}.\] We conclude that the elliptic transformation takes the form \[\tag{3.27} \frac{z-z_{1}}{z-z_{2}}=\alpha\, \frac{\zeta-z_{1}}{\zeta-z_{2}} \quad(|\alpha|=1, \alpha \neq 1).\] Conversely, it follows from (3.27) that (3.26) holds and therefore that \(\zeta\) and \(z\) lie on the same member of the orthogonal family.

The loxodromic transformation contains the remaining possibilities: \[\tag{3.28} \frac{z-z_{1}}{z-z_{2}}=\alpha\, \frac{\zeta-z_{1}}{\zeta-z_{2}} \quad(\alpha \text { non-real and }|\alpha| \neq 1).\] 

Finally, if the roots of the equation (3.23) coincide, the transformation is said to be parabolic . The derivation of the normal form of the parabolic equation is left as an exercise.

Exercises

Exercise 1.23 . Show by inversion that a circle intersects a given circle orthogonally if and only if it passes through a pair of inverse points with respect to the given circle.

Exercise 1.24 . Prove the theorem of Apollonius that a circle is the locus of all points in the plane, the ratio of whose distances from two fixed points \(P_{1}\) and \(P_{2}\) is constant. (Hint: \(P_{1}\) and \(P_{2}\) are conjugate points with respect to the circle.)

Exercise 1.25 . Under inversion with respect to the unit circle what happens to the set of

  1. Circles tangent at the origin to a fixed line through the origin.
  2. Circles passing through the origin and another fixed point A. What happens to the orthogonal family of circles?

Exercise 1.26 . Determine the normal forms of the elliptic, hyperbolic and loxodromic transformations in the event that one fixed point is at infinity.

Exercise 1.27 . In the event that the two fixed points of a linear transformation coincide the transformation is said to be of the parabolic type. Find its normal form if the fixed point is finite.

Exercise 1.28 . If a linear transformation maps the interior of a circle onto itself show that it cannot be loxodromic.

1.3.3 Special Mappings

A number of the mappings given by linear transformations are of particular importance. Specifically we consider mappings of circles into circles. As we have seen, the coefficients of the general linear transformation can be picked so that three arbitrarily designated points of the \(z\) -plane pass into three arbitrarily designated points of the \(\zeta\) -plane. Since circles transform into circles and a circle is determined by any three of its points it follows that there is a linear transformation which takes any given circle of the \(z\) -plane into any given circle of the \(\zeta\) -plane. In the determination of specific transformations we will need the following lemma:

Lemma 1.1 . Under linear transformation points that are inverses with respect to a given circle are mapped onto points which are inverses with respect to the image circle. 

This lemma follows directly from the result of Exercise 23 .

For the first example we take the general transformation which maps the real axis onto the unit circle so that the upper half-plane \(\operatorname{Im}(z)>0\) maps onto the interior of the circle \(|\zeta|<1\) . Let the point \(z=a\) with \(\operatorname{Im}(a)>0\) be mapped onto the origin \(\zeta=0\) . It follows by the lemma above that the point \(\bar{a}\) goes over to the point \(\zeta=\infty\) . The transformation must consequently be of the form \[\zeta=k\,\frac{z-a}{z-\bar{a}}.\] The complex constant \(k\) may then be determined if we observe that the real axis maps onto the unit circle and therefore \(|\zeta|=1\) for \(z\) real. It follows that \(|k|=1\) . With this restriction on \(k\) it is clear that \(\operatorname{Im}(z)>0\) for \(|\zeta|<1\) . By direct computation it may then be verified that the transformation \[\tag{3.30} w=k \frac{z-a}{z-\bar{a}}\quad \text{with}\quad |k|=1\quad\text{and}\quad \operatorname{Im}(a)>0\] maps the upper half-plane into the interior of the unit circle and is therefore the most general linear transformation of the prescribed type.

An interesting example is that of the transformation which maps the interior of the unit circle onto itself. We arbitrarily select a point \(z=a\) in the interior of the circle \((|a|<1)\) and map it on the origin \(\zeta=0\) . Applying our lemma we see that the inverse point \(\frac{1}{\bar{a}}\) must go into \(\zeta=\infty\) . The transformation can only be of the form \[\zeta=c\, \frac{z-a}{z-\frac{1}{\bar{a}}}=k \,\frac{z-a}{\bar{a} z-1}.\] Moreover, for \(|z|=1\) we have \(|\zeta|=1\) and therefore by the theorem of Apollonius \[|k|\,\frac{|z-a|}{|\bar{a} z-1|}=|k|=1.\] The transformation can now be written in the form \[\tag{3.31} \zeta=k\, \frac{z - a}{\bar{a}z - 1} \text { with } |k|=1 \text { and } |a| < 1.\] The verification that this satisfies the given requirements is a simple exercise.

From the form of the transformation (3.31) we conclude that in mapping the unit circle onto itself we are free to specify the mapping of any one point and, by adjusting the amplitude of \(k\) , to map any direction through the chosen point into any direction through its image.

For our last example we consider the most general linear transformation of the upper half-plane \(\operatorname{Im}(z) > 0\) onto itself. In the mapping of the real axis onto itself some point \(z=p\) must map into \(\zeta=0\) and for another real value \(z=q\) , \(\zeta=\infty\) . The transformation must therefore take the form \[\zeta=\lambda \frac{z-p}{z-q}\] Since \(\zeta\) is real when \(z\) is real it follows that \(\lambda\) is real. We have proven that the transformation can be written in the form \[\tag{3.32} \zeta=\frac{a z+b}{c z+d}, \quad (a, b, c, d \text { real})\] But we have not yet employed the condition that \(|\zeta|>0\) for \(|z|>0\) . This condition is equivalent to the requirement ( Exercise 30 ) that the determinant \[\tag{3.33} \Delta=a d-b c>0.\] In the same manner as before we can easily show that (3.32) is the most general transformation of this type.

One of the most remarkable applications of linear transformations is the elegant example due to Poincaré of the non-euclidean geometry in the upper half plane. In order to display this application to advantage we shall first enter a brief general discussion.

1.3.4 The Poincaré Geometry

Euclidean geometry may be said to describe our experiences with pencil diagrams on paper. Non-euclidean geometry, though at first a little foreign to our sense, furnishes a better description than ordinary geometry of certain other experiences, notably the facts of optics. There are, however, certain factors common to these geometries and it will illuminate the discussion to give these in some detail.

Our geometry will deal with the notions of points, lines, distances and displacements as in ordinary geometry. In the first place we restrict ourselves to the space consisting of the upper half-plane \(\operatorname{Im}(z) > 0\) . For points we take the ordinary euclidean points. For lines we take the circles with centers on the real axis – restricting ourselves to the semicircles in the upper half-plane. According to our convention these include the half-lines orthogonal to the real axis. For this definition of line we have the usual properties: There is exactly one line passing through any two points. Two lines intersect in no more than one point.

image

There are certain transformations of the space called displacements which have the property that lines map onto lines in such a way so that

  1. The order of points on a line is not changed by displacement
  2. Any given point may be displaced into any other point so that any ray originating at the mapped point may be transformed into any given ray at the image point.

As our displacements we take linear transformations of the upper half-plane \[\tag{3.40} \zeta=\frac{a z+b}{c z+d} \quad(a, b, c, d \text { real and } a d-b c>0).\] 

It is the analogy to the ordinary distance between two points that is of the greatest interest. There are certain requirements we naturally expect a distance function to satisfy. Namely, if \(P_{1}\) and \(P_{2}\) denote any two points of the space and \(D\left(P_{1}, P_{2}\right)\) is to be the distance between them we specify the following conditions:

  1. \(D\left(P_{1}, P_{2}\right)=D\left(P_{2}, P_{1}\right)\) 
  2. \(D\left(P_{1}, P_{2}\right)>0\) for \(P_{1} \neq P_{2}\) and \(D\left(P_{1}, P_{2}\right)=0\) if \(P_{1}=P_{2}\) 
  3. The triangle inequality \[D\left(P_{1}, P_{2}\right)+D\left(P_{2}, P_{3}\right) \geq D\left(P_{1}, P_{3}\right)\] where the sign of equality holds if and only if \(P_{1}, P_{2}, P_{3}\) lie on the same line in that order.

Finally we ask that the distance of two points remain unchanged in any displacement.

The possibilities of defining a distance function are somewhat restricted. If \(D\left(P_{1}, P_{2}\right)\) is any distance function then any other distance function \(R\left(P_{1}, P_{2}\right)\) can be written as a single-valued function of \(D\left(P_{1}, P_{2}\right)\) . In other words there is a one-to-one relationship between the values of \(D\) and the values of \(R\) .

In proof we show if \(z_{1}, z_{2}\) and \(\zeta_{1}, \zeta_{2}\) are any pairs of points with \(D\left(z_{1}, z_{2}\right)=D\left(\zeta_{1}, \zeta_{2}\right)\) that \(R\left(z_{1}, z_{2}\right)=R\left(\zeta_{1}, \zeta_{2}\right)\) . First, there is a displacement which takes \(z_{1}\) into \(\zeta_{1}\) and \(z_{2}\) into \(\zeta_{2}\) . For there is a displacement which maps \(z_{1}\) onto \(\zeta_{1}\) and transforms the ray \(\overrightarrow{z_{1} z_{2}}\) into the ray \(\overrightarrow{\zeta_{1} \zeta_{2}}\) . Now let \(w\) be the image of \(z_{2}\) . There are two possibilities; either \(w\) lies between \(\zeta_{1}\) and \(\zeta_{2}\) or \(\zeta_{2}\) lies between \(\zeta_{1}\) and \(w\) . In the first case we obtain by 3) \[D\left(\zeta_{1}, w\right)+D\left(w, \zeta_{2}\right)=D\left(\zeta_{1}, \zeta_{2}\right).\] But from the invariance of \(D\) under displacement \[D\left(\zeta_{1}, w\right)=D\left(z_{1}, z_{2}\right)=D\left(\zeta_{1}, \zeta_{2}\right).\] Thus \(D\left(w, \zeta_{2}\right)=0\) which can be true only if \(w=\zeta_{2}\) . The second possibility yields the same result. Now, the fact that there is a displacement which carries \(z_{1}\) into \(\zeta_{1}\) and \(z_{2}\) into \(\zeta_{2}\) shows that \[R\left(z_{1}, z_{2}\right)=R\left(\zeta_{1}, \zeta_{2}\right).\] The same proof may be used in the other direction to demonstrate the existence of a one-to-one correspondence between the values of \(D\) and the values of \(R\) .

image

In casting about for a distance function, as invariant under the transformation (3.40) , one naturally thinks of the cross ratio of four points. But in defining the distance \(D\left(P_{1}, P_{2}\right)\) what two auxiliary points shall we choose to complete the cross ratio? An obvious choice is the pair of axis-points \(Q_{1}\) and \(Q_{2}\) which lie at the extremities of the semicircle joining \(P_{1}\) and \(P_{2}\) . These have the special virtue of being defined in a unique fashion by \({P}_{1}\) and \({P}_{2}\) and of transforming into corresponding points for the images of \(P_{1}\) and \(P_{2}\) by displacement.

The cross ratio, however, does not satisfy the additive property of distance; that is, if \({P}_{1}, {P}_{2}, {P}_{3}\) are points on the same line in that order then \[D\left(P_{1}, P_{2}\right)+D\left(P_{2}, P_{3}\right)=D\left(P_{1}, P_{3}\right).\] However, the cross ratio does satisfy a multiplicative relationship \[\tag{3.41} \left(P_{1} P_{2} Q_{1} Q_{2}\right)\left(P_{2} P_{3} Q_{1} Q_{2}\right)=\left(P_{1} P_{3} Q_{1} Q_{2}\right).\] This suggests the use of the logarithm of the cross ratio as a suitable distance function. Since a distance function must be non-negative we take the absolute values: \[\tag{3.42} D\left(P_{1} P_{2}\right)=k\left|\log \left(P_{1} P_{2} Q_{1} Q_{2}\right)\right|\] where \(k > 0\) may be fixed arbitrarily.

The formula (3.42) is the most general distance function obtainable in the Poincaré geometry.

image

For the purposes of the proof we displace the semicircle joining the points \({P}_{1}, {P}_{2}\) into a ray perpendicular to the real axis. That we may do so is clear, for we have three real parameters at our disposal and this allows us to prescribe one point arbitrarily and to specify that the \(x\) -coordinate of the second point be the same as the first. The cross ratio \(\left(P_{1} P_{2} Q_{1} Q_{2}\right)\) becomes simply \(\frac{y_{2}}{y_{1}}\) . We then choose as our distance function \[D\left(P_{1}, P_{2}\right)=\left(P_{1}', P_{2}'\right)=\left|\log y_{2}-\log y_{1}\right|.\] 

Now, if \(R\left(P_{1}, P_{2}\right)\) is any other expression for distance it is a single-valued function of \(D\left(P_{1}, P_{2}\right)\) , \[R\left(P_{1}, P_{2}\right)=f(D)=f\left(\left|\log \frac{y_{1}}{y_{2}}\right|\right).\] In particular, if \({y}_{1}, {y}_{2}, {y}_{3}\) are points of the line arranged in that order we have \begin{align} f\left(\left|\log \frac{y_{2}}{y_{1}}\right|\right)+f\left(\left|\log \frac{y_{3}}{y_{2}}\right|\right)&=f\left(\left|\log \frac{y_{3}}{y_{1}}\right|\right)\\ &=f\left(\left|\log \frac{y_{3}}{y_{2}}+\log \frac{y_{2}}{y_{1}}\right|\right)\\ &=f\left(\left|\log \frac{y_{3}}{y_{2}}\right|+\left|\log \frac{y_{2}}{y_{1}}\right|\right). \end{align} Hence, for any positive values \(a, b\) , we have \[\tag{3.43} f(a)+f(b)=f(a+b).\] We shall show that \(f(x)\) must be a linear function \[f(x)=kx.\] From (3.43) it is immediately clear that \[f(n x)=n f(x)\] for all integers \(n\) . Furthermore, for any rational \(\frac{p}{q}\) we observe \[q f\left(\frac{p}{q} x\right)=p f(x)\] whence \[f\left(\frac{p}{q} x\right)=\frac{p}{q} f(x).\] The result is easily generalized to include all real numbers. If \(r\) is a real number it is exactly determined by the order relationships it satisfies with the rational numbers. Suppose, for example, that \(\alphawhere \(\alpha, \beta\) are rational. It follows that \[f(\alpha x) < f(rx) < f(\beta x)\] whence \[\alpha<\frac{f(r x)}{f(x)}<\beta.\] Thus \(\frac{f(r x)}{f(x)}\) satisfies the same inequalities with respect to the rational numbers as does the number \(r\) . We conclude that \[f(r x)=r f(x).\] 

Setting \(f(1)=k\) we then have \[f(r)=f(r.1)=k r\] In particular \[R\left(P_{1}, P_{2}\right)=R(D)=k D\left(P_{1}, P_{2}\right)=k\left|\log \frac{y_{2}}{y_{1}}\right|.\] 

The Riemann Concept of Length

Suppose we have a curve \(c\) in the upper half plane. How shall we define its length in the new geometry? The most natural way is to employ the method of Riemann exactly as in the euclidean case.

image

Consider a curve \(c\) given in parametric representation \begin{align} \left\{\begin{array}{l} x=x(t) \\ y=y(t) \end{array} \quad 0 \leq t \leq 1\right. \end{align} or in complex notation \[z=z(t)=x(t)+i y(t).\] We suppose that no point is covered by two values of the parameter, that the curve is piecewise smooth and that always \[z'(t)=x'(t)+i y'(t) \neq 0.\] In the ordinary euclidean case the length of the curve is evaluated by taking the limit of approximations to \(c\) by polygonal arcs. By subdividing the interval of \(t\) \[0=t_{0}

image

we obtain a subdivision of the curve given by the points \(\zeta_{0}=z\left(t_{0}\right), \ldots, \zeta_{N}=z\left(t_{N}\right)\) . If we join the successive points by straight line segments we obtain an approximating polygon of length \[\sum\left|\zeta_{n+1}-\zeta_{n}\right|=\sum\left|\frac{\Delta \zeta}{\Delta t}\right| \Delta t.\] The limit of this sum as \(\max{\Delta t} \rightarrow 0\) is the integral \[\int_{0}^{1}\left|\frac{d z}{d t}\right|\,d t=\int_{0}^{1} \sqrt{x'(t)^{2}+y'(t)^{2}}\, d t.\] 

In the Poincaré half plane the length of the curve is defined in the same manner. We approximate the curve by polygonal lines where for the line segment joining two points we take an arc of a circle orthogonal to the real axis. The length of the curve is then defined as the limit of the sum \[\sum D\left(\zeta_{n}, \zeta_{n+1}\right)\quad \text{as}\quad \max \left(t_{n+1}-t_{n}\right) \rightarrow 0.\] 

image

The distance between nearby points \(\zeta, \zeta+\Delta \zeta\) can be obtained by displacing the semi-circle connecting them onto a half line orthogonal to the real axis. Then, as before, we have \begin{align} D(\zeta, \zeta+\Delta \zeta)&=\left|\log \frac{y+\Delta y}{\Delta y}\right|\\ &=|\Delta \log y|\\ &=\frac{|\Delta y|}{y}+\varepsilon(\Delta y) \Delta y. \end{align} The differential of arc must then be of the form \(\dfrac{|d y|}{y}\) . In terms of the original point \(\zeta\) this becomes \(\dfrac{|d\zeta|}{\eta}\) where \(\eta=\operatorname{Im}(\zeta)\) . We conclude that the length of the curve \(c\) is \[\int_{0}^{1} \frac{\left|\frac{d z}{d t}\right|}{y(t)}\,d t.\] 

Exercises

Exercise 1.29 . Find a linear transformation which maps the region between the unit circle and an eccentric circle in its interior onto:

  1. The region between the unit circle and a concentric circle in its interior.
  2. The region exterior to two equal circles.

Exercise 1.30 . Verify that the linear transformations of the upper half plane (3.40) actually obey the conditions for displacements in the Poincaré geometry.

Exercise 1.31 . Verify the triangle inequality for the distance function (3.42) \[D\left(P_{1} P_{2}\right)=k\left|\log \left(P_{1} P_{2} Q_{1} Q_{2}\right)\right| \quad (k>0).\] 

Exercise 1.32 . Show that the length \(L\) of a curve \(c\) in the upper half plane where \[L=\int_{t_{0}}^{t_{1}} \frac{|d z|}{y}\] is an invariant under displacement (i.e. under linear transformation of the upper half plane).