Linear Fractional Transformations

 

1  Introduction

 

A linear fractional transformation (LFT) is defined as a function of the form

 

 

where a,b,c,d are complex constants. Every LFT defines a one-to-one mapping of the extended complex plane (C U ∞) onto itself. Two LFTs f and g are similar iff there exists a purely linear transformation L(z) = αz + β where α and β are complex constants such that

 

 

for every complex z. (The condition is reciprocal, because if L is a linear function, then so is the inverse of L.) This condition implies a mapping between the iterates of f and the iterates of g as shown schematically in Figure 1. In essence, letting zk denote the iterates of f for any given initial value z0, and letting wk denote the iterates of g with the corresponding initial value w0 = L(z0), it follows that if f and g are similar, then wk = L(zk) for all k. In other words, the iterates of f can be mapped to the iterates of g by the linear function L, and since a linear transformation of the complex plane consists of (at most) a simple translation, a rotation about the origin, and a uniform scale change, it follows that the configuration of points L(zk) is geometrically similar to the configuration of points zk.

 

mob2

 

To derive necessary and sufficient conditions for similarity, suppose the two LFTs

 

 

are similar. Then by definition there exists a linear function L(z) = αz + β such that L(f(z)) = g(L(z)), so we have

 

 

Substituting for f(z) and L(z) into this equation gives

 

 

If α was zero then L would simply be a constant, and if c or C was zero the respective LFT would simply be a linear function, so we set these degenerate cases aside and consider the general case with α,C,C all non-zero. We can therefore normalize each side of the preceding equation by dividing through by c or Cα respectively (so the coefficients of z in the denominators are zero), and then equate the three remaining corresponding coefficients, which gives the three conditions

 

 

Solving the first and third conditions for α and β we get

 

 

Substituting this expression for β into the second condition and solving for α gives

 

 

(assuming ad – bc ¹ 0). Equating this with the previous expression for α, we arrive at a necessary and sufficient condition for the LFTs f and g to be similar:

 

 

This shows that for any given LFT (az+b)/(cz+d), the similarity class is determined solely by the parameter (a+d)2/(ad–bc). Hereafter we will refer to this as the similarity parameter, and denote it by σ. Thus, for any LFT (az+b)/cz+d) we define

 

 

Two LFTs f and g are said to be conjugate if there exists an LFT h such that h(f(z)) = g(h(z)) for all z. This is identical to the definition of similarity, except that similarity requires h to be purely linear. Thus, similarity would seem to be a more restrictive condition than conjugacy. However, the necessary and sufficient condition for f and g to be conjugate is exactly the same as the condition for similarity (cf. Complex Functions, p 32, Jones and Singerman, 1987). The only strength gained by allowing h to be an LFT instead of a linear function is in the "singular" cases when the coefficients of the linear coupling function L are not well defined, such as when c, C, (a+d), or (A+D) are equal to 0. However, when (a+d) = 0 the only conjugate LFTs are those with A+D = 0. Thus, returning to the previous conditions and inserting (a+d) = (A+D) = 0, we find that in this case it is necessary and sufficient for α to be given by (1). Thus, the only case when conjugacy can exist without a well-defined linear similarity correspondence relationship between the two LFTs is if c or C equal zero. The first of the three conditions above implies that is either c or C is zero, then the other is also zero, unless α is zero, which would imply a degenerate linear function.) In these cases, by the inclusion of the "point at infinity" (which is the same expedient used to ensure closure of the LFT iterates) we can include among the linear similarity relations those with infinite coefficients (and, conversely, zero coefficients).

 

 

2  Closed Form Expressions For the nth Iterate

 

To find a closed-form expression for the nth iterate zn of the linear fractional transformation f(z) = (az+b)/(cz+d), it is most convenient to consider a similar function, g, having the sequence of iterates w0, w1, w2, ... where wk is related to zk by a linear function

 

 

for some complex constants α and β. Our objective is to find a linear transformation such that a closed-form expression for wn (the nth iterate of g) is easy to determine. In general, the original LFT can be expressed in terms of w as follows

 

 

Solving for wk+1 gives

 

where

 

Notice that if c is not equal to zero, we cannot force C to equal zero (except by putting α = 0, which is singular). This rules out any chance of finding coefficients α,β for which wk+1 = Kwk. However, there are several other "nice" LFT forms for which a closed-form solution can easily be found. One such, which we will call the "bi-polar" form, is

 

 

Every LFT (az+b)/(cz+d) with two distinct fixed points is similar to an LFT of this special form. The coefficients α and β of the linear similarity function are determined by setting B = 0, which implies that

 

 

We then impose the condition (C+D)/A = 1, which implies

 

 

Substituting the previous expression for β and solving for α gives

 

 

This is determined by mapping the two fixed points to the points 0 and 1 on the complex plane. We will refer to this as the bi-polar form of the given LFT similarity class. The value of K here is given by

 

 

Notice that this implies

 

 

The "K form" has fixed points at 0 and 1, and it can be shown by induction that zn has the convenient closed-form expression

 

 

In this case the periodicity requirement is simply . It's interesting that to convert a purely real LFT to this "bi-polar K" form, the real axis is mapped to the line 0.5 + μi, which figures in the Riemann Hypothesis.

 

The real and imaginary components of zn = xn + iyn can be expressed in closed form in terms of the components of K and z0. If we set

 

 

then the components of zn are given by

 

 

 

where f = nθ. This shows that the action of the LFT on the complex plane is entirely dependent on the magnitude and phase angle of the constant K, which, as we saw previously, is given by

 

 

If a,b,c,d are all real, then σ is real, in which case either K is real (σ>4 or σ<0) or K is complex (0<σ<4) with a magnitude (norm) of 1. However, if a,b,c,d are allowed to be complex, then K can be complex with a magnitude other than 1.

 

If K is real, then we can set R = K and θ = 0, so f equals zero for all n, and the equations for xn and yn reduce to

 

 

 

If (a+d) = 0 then K = –1, in which case the only conjugate LFTs are those with σ = 0.

 

The preceding expressions for the nth iterate of a linear fractional transformation assume the LFT has two distinct fixed points. More generally, given any linear fractional recurrence relation

 

 

we can derive a closed-form expression for the nth iterate zn in terms of n and z0. These expressions are summarized in Table 1 for the various cases. Notice that cases 2 and 3 are really linear transformations, since c = 0. Case 4 corresponds to the case when the two fixed points of the LFT are co-incident. In this "parabolic" case, if a + d = 0 then the LFT reduces to Case 1 with ad–bc = 0. Case 5 is the general case with two distinct fixed points. In this case, if a + d = 0 then σ = 0 and K = –1.

 

It's also interesting to note that the general expression for Case 5 can be written in the form

 

 

so if we define the LFT

we have

 

Setting n = 1 and noting that f(z0) = z1, we have

 

 

which shows that f(z) is conjugate to the simple function Kz. It's interesting to note that α + β is the "conjugate" of β, and so h(z) can be expressed as

 

 

where

 

Therefore, if c = 0, then either h(z) = 0 or h(z) is undefined.

 

 

3  General Periodicity Condition

 

It can be shown (see article on rational sines) that the sequence of iterates zk of an LFT is cyclical with a period of m (meaning that zk+m = zk for every integer n) if and only if

 

 

for some integer k. The "periodic" values of σ can also be expressed algebraically in the form

 

 

It's interesting that each –σm is a root of the polynomial

 

 

This gives the roots of polynomials whose coefficients are diagonal elements of Pascal's triangle

 

 

For example, the five roots of the quintic x5 + 9x4 + 28x3 + 35x2 + 15x + 1 are given by

 

 

and the three roots of the cubic x3 + 6x2 + 10 + 4 are given by

 

 

The factorizations of the first 24 "periodic σ polynomials" are presented in Table 2. If Fk(x) denotes the polynomial with roots –σk, then Fm(x) divides Fn(x) if and only if m divides n. This follows from the fact that if an LFT is cyclical with a period of m, and if m divides n, then the LFT also is has a (not fundamental) period of n.

 

Although the roots of each Fm(x) are easily expressed in trigonometric form, it's interesting to note the following algebraic solutions for m = 16, 20, and 24. These are found by solving the quartic factor of Fm(x) in each case, which gives the four values of σ corresponding to the LFTs with a fundamental period of p in each case.

 

 

 

 

Since the condition for having period m is entirely a function of σ, and two LFTs are similar if they have the same σ, if follows that the number of dissimilar LFT groups of order m is equal to the number of distinct values 4cos2(kπ/m). This is clearly half the number of integers less than and co-prime to m, so the number is f(m)/2, half the Euler totient function.

If an LFT has a periodic of m, then it is said to generate a (cyclic) group order m.

 

 

4  The σ Form

 

Given any linear fractional transformation f(z) = (az+b)/(cz+d) with ad ¹ bc and (a+d),c ¹ 0, we can solve the similarity parameter equation

 

for b to give

 

 

It follows that the function

 

 

is similar to f(z) for any choice of parameters a, b, and c (provided c and a+d do not equal zero). In particular, setting d=0 and a=c gives the simple form

 

 

This implies that the iterates of g(z) are given as simple continued fractions. For example, z3 can be expressed as a function of z0 as follows

 

 

 

5  Self-Similar Iterations

 

Given any linear fractional transformation f(z) = (az+b)/(cz+d), the first iterate of this function is

 

 

which has the similarity parameter

 

 

Therefore, the sequence of similarity parameters of the iterates of a linear fractional transformation satisfy the non-linear recurrence

 

 

Setting σk+1 = σk and solving the resulting quadratic shows that all the iterates of f are similar to each other if and only if σ = 1 or 4. The recurrence gives a chaotic sequence for a range of initial values, similar to the logistic equation.

 

 

6  The λ Form

 

One of the most interesting special forms of linear fractional transformations is

 

 

Every (non-degenerate) LFT is similar to an LFT of this special form. In fact, it is similar to exactly two LFTs of this form, because this special LFT is similar to the LFT given by replacing λ with 1/λ. This implies the important fact that every periodic λ is either purely real or it is complex with a magnitude of 1 (i.e., on the unit circle). Figure 2 shows the range of periodic l values in the complex plane.

 

mob3

 

The periodicity condition for this form is that zn+m = zn if and only if λ has one of the values

 

 

where k is an integer. These values can also be expressed in terms of the corresponding similarity parameters as

 

which implies

 

We saw in the previous section that iterations of f have the same similarity parameter σ if and only if σ equals 1 or 4. These values of σ gives the following values for the corresponding λ forms

 

 

Note that these are the extreme values of periodic λ on the imaginary axis and the real axis, respectively (excluding –1).

 

The values of λ that give periodic LFT iterations are the roots of a special set of polynomials whose coefficients are members of a generalized set of binomial coefficients. Each member of the generalized family of coefficient sets is denoted by a three-index symbol, defined as follows

 

 

Based on this definition, the following properties can be established:

 

Symmetry:

 

Unit Plane:

 

Binomial Plane:

 

Recurrence Relation:

 

 

Column Relations:

 

It is often more convenient to index the generalized coefficients in a manner similar to the indexing of the binomial coefficients. We define the generalized form as

 

 

with the understanding that if no J subscript is specified, the value J=1 is implied (giving the normal binomial coefficients). In these terms we can establish the following recurrence relating the sums of successive rows of the coefficients:

 

 

For J=1 this obviously gives the sums of the rows of binomial coefficients as powers of 2. The sum of the general row with alternating signs is given by

 

 

(with the stipulation that the sum equals 1 for the case J=1, m=0). The generalized coefficients with J = –1 are as follows

 

mob

 

The roots of the polynomials with coefficients from a row of this table are precisely the values of λ for which the LFT is periodic. Specifically, letting λq denote the values of λ for which λ(1+z)/(1–z) has period q, we have

 

 

Since the coefficients are symmetrical, we know that if λ is a root, then so is 1/λ. The general form of the roots of these polynomials was given earlier in terms of the cosine function. However, several of these can be expressed purely algebraically. A list of these is presented in Table 3.

 

 

7  The Sum and Difference Form

 

Another interesting special form is

 

 

which has the convenient closed form expression

 

 

The periodicity condition for this form is

 

 

 

8  The Complement Form

 

Another interesting form of linear fractional transformation is

 

 

The values of a for which zn+m = zn are 4sin2(kπ/2m), k = 1, 2,...

 

 

9  The Diagonal Form

 

Another interesting form is

 

 

This LFT is similar to any other LFT with a σ such that

 

 

The values of G for which the LFT has a finite period m are

 

 

The values of –Gm2 are precisely the roots of the polynomial whose coefficients are every second value in the mth row of Pascal's triangle. For example, consider the polynomial with coefficients taken from the 7th row

 

 

The three roots of the cubic x3 + 21x2 + 35x +7 are –tan2(kπ/7) with k=1,2,3. If, on the other hand, we took every other coefficient from the 7th row beginning with the second column, we know that the three roots of the cubic 7x3 + 35x2 + 21x + 1 are simply the inverses of the roots of the former polynomial, i.e., –1/tan2(kπ/7) with k = 1, 2, 3.

 

A convenient way of expressing these roots is to make use of the trigonometric identity

 

 

On this basis, we can say that the roots of the two polynomials with coefficients from the 7th row of Pascal's triangle are given by

 

 

Notice that we now have 2m instead of m in the denominator of the tangent. This same formulation works for even numbered rows as well. For example, the roots of the two polynomials taken from the 8th row are

 

 

 

10  Density of Real Iterates

 

Given the linear fractional recurrence

 

 

we can derive a closed-form expression for xn that is particularly useful when the sequence is purely real. Beginning with the closed-form solution presented in Table 1 for the general case 5, we can substitute the trigonometric for the exponential form and arrive at the expression

 

 

where we have defined the parameters

 

 

and

 

If we set x0 = μ, then the expression for xn reduces to

 

 

The function tan(x) is periodic modulo π, and assuming θ/π is irrational we can surmise that the values of nθ (mod π) for n = 0,1,2,...k... will approach an even distribution on the interval [0 , π] for sufficiently large k. Therefore, the asymptotic density of the iterates at any value x is proportional to the slope, i.e., derivative, of n with respect to x, since that is proportional to the number of n's that fall within a given interval Δx. The preceding equation gives

 

 

so the derivative of n with respect to x is

 

 

The asymptotic density of the iterates is proportional to this derivative, so we need only determine the appropriate scale factor to give the absolute density. The scale factor must be such that the integral of the density from x = –∞ to +∞ equals 1. In terms of the variable y = (x–μ)/ε, we know that

 

 

and since invtan(±∞) = ±π/2, it follows that

 

 

Therefore, the scale factor on 1/(1 + y2) must be 1/(επ), so the asymptotic density of iterates is given by

 

 

This is called the Cauchy distribution in probability theory. It's easy to show that half of the iterates will fall within the range μ ± ε, and two thirds of the iterates will fall within the range .

 

The Cauchy distribution has no well-defined mean or standard deviation, but the appearance of π in this density suggests that we could infer the value of π from a sequence of iterates. For example, if we compute N consecutive real-valued iterates of the linear fractional transformation f(x) = (ax+b)/(cx+d), and let n denote the number of these iterates that fall in the range –r/2 to +r/2 for some small real number r, then, provided f is not periodic, we expect the relation

 

 

Re-arranging and inserting the expressions for ε and μ in terms of the linear fractional transformation coefficients a,b,c,d, this gives

 

 

If we choose the coefficients such that (a–d)2 = –4b(b+c), the right-hand factor equals unity and the equation reduces to π ≈ rN/n. A slightly different approach can be based on the density in the small range of width r around an arbitrary value of x

 

 

Thus we have

 

Now, this applies for any arbitrarily small range r centered on any value x, and in each case n represents the number of iterates that fall within that range. Hence if we consider k consecutive such ranges, and we add together the estimates of 1/π from each range, and divide by k, we get the average of 1/π for all of those ranges. This gives kr in the denominator, which is simply the entire range Δr of the k consecutive intervals. Consequently, for any finite range Δr and initial value x0 we can compute 1/π from a series of iterates x1, x2, x3, …, xN by the formula

 

 

where Δx = xmax – xmin and q(x) = 1 or 0 accordingly as x is or is not in the range from xmin to xmax. This formula shows that we are essentially just computing the variance of the iterates, which behave (overall) like random samples from a Cauchy distribution, which is the normalized standard Cauchy distribution if μ = 0 and ε = 1. This occurs if and only if a = d and b = –c for non-zero values of a and b. Of course, the complete normalized standard Cauchy distribution

 

 

has no finite mean, standard deviation, or variance, but the truncated distribution over any finite range (–X,X) has the variance

 

 

Hence for any linear fractional transformation of the form f(x) = (ax + b)/(–bx + a) for any non-zero real coefficients a,b, (provided f is not periodic, i.e., provided the inverse tangent of b/a is not a rational multiple of π) we can say that the variance of the iterates that fall within the range (–X,X) is

 

 

where NX is the number of iterates in the summation. In particular, if we set X = 1, we have

 

 

The fact that the variance of the linear fractional iterates falling in the range from –1 to +1 does indeed converge on 4/π – 1 demonstrates that the values of nθ modulo π are (in the agregate) uniformly distributed over the interval 0 to π for most values of θ.

 

 

Table 1:  Closed Form Expressions For nth Iterates of  (az+b)/(cz+d)

 

Case 1:     ad = bc,   c ¹ 0

 

Case 2:   c = 0,   a = d ¹ 0

 

Case 3:   c = 0,    a ¹ d

 

 

Case 4:   c ¹ 0,   (a–d)2 = –4bc

 

 

Case 5:   c ¹ 0,   (a–d)2 ¹ –4bc

 

where

and

 

 

 

Table 2

mpolynomial for σm

3(x + 1)

4(x + 2)

5(x2 + 3x + 1)

6(x + 1)(x + 3)

7(x3 + 5x2 + 6x + 1)

8(x + 2)(x2 + 4x + 2)

9(x + 1)(x3 + 6x2 + 9x + 1)

10(x2 + 3x + 1)(x2 + 5x + 5)

11(x5 + 9x4 + 28x3 + 35x2 + 15x + 1)

12(x + 1)(x + 2)(x + 3)(x2 + 4x + 1)

13(x6 + 11x5 + 45x4 + 84x3 + 70x2 + 21x + 1)

14(x3 + 5x2 + 6x + 1)(x3 + 7x2 + 14x + 7)

15(x + 1)(x2 + 3x + 1)(x4 + 9x3 + 26x2 + 24x + 1)

16(x + 2)(x2 + 4x + 2)(x4 + 8x3 + 20x2 + 16x + 2)

17(x8 + 15x7 + 91x6 + 286x5 + 495x4 + 462x3 + 210x2 + 36x + 1)

18(x + 1)(x + 3)(x3 + 6x2 + 9x + 1)(x3 + 6x2 + 9x + 3)

19(x9 + 17x8 + 120x7 + 455x6 + 1001x5 + 1287x4 + 924x3 + 330x2 + 45x + 1)

20(x + 2)(x2 + 3x + 1)(x2 + 5x + 5)(x4 + 8x3 + 19x2 + 12x + 1)

21(x + 1)(x3 + 5x2 + 6x + 1)(x6 + 13x5 + 64x4 + 146x3 + 148x2 + 48x + 1)

22(x5 + 9x4 + 28x3 + 35x2 + 15x + 1)(x5 + 11x4 + 44x3 + 77x2 + 55x + 11)

23(x11 + 21x10 + 190x9 + 969x8 + 3060x7 + 6188x6 + 8008x5 + 6435x4 + 3003x3 + 715x2 + 66x + 1)

24(x + 1)(x + 2)(x + 3)(x2 + 4x + 1)(x2 + 4x + 2)(x4 + 8x3 + 20x2 + 16x + 1)

 

 

Table 3:  Algebraic Expressions For Selected λm

 

mλm

                                                                                               

            1                     

 

            2                      –1

 

            3                      ± i

 

            4                      +1

 

            5                     

 

            6                     

 

            8                     

 

            10                   

 

            12                   

 

            16                   

 

20                  

 

24