三角形式

It is now quite easy to prove the easiest one of the so-called canonical form theorems. Our assumption about the scalar field (namely, that it is algebraically closed) is still in force.

Theorem 1. If A is any linear transformation on an n -dimensional vector space 𝒱 , then there exist n + 1 subspaces 0 , 1 , , n 1 , n with the following properties:

  1. each j ( j = 0 , 1 , , n 1 , n ) is invariant under A ,
  2. the dimension of j is j ,
  3. ( 𝒪 = ) 0 1 n 1 n ( = 𝒱 ).

Proof. If n = 0 or n = 1 , the result is trivial; we proceed by induction, assuming that the statement is correct for n 1 . Consider the dual transformation on ; since it has at least one proper vector, say , there exists a one-dimensional subspace invariant under it, namely, the set of all multiples of . Let us denote by n 1 the annihilator (in ) of , n 1 = 0 ; then n 1 is an ( n 1 ) -dimensional subspace of 𝒱 , and n 1 is invariant under A . Consequently we may consider A as a linear transformation on n 1 alone, and we may find 0 , 1 , , n 2 , n 1 , satisfying the conditions (i), (ii), (iii). We write n = 𝒱 , and we are done. ◻

The chief interest of this theorem comes from its matricial interpretation. Since 1 is one-dimensional, we may find in it a vector x 1 0 . Since 1 2 , it follows that x 1 is also in 2 , and since 𝒩 2 is two-dimensional, we may find in it a vector x 2 such that x 1 and x 2 span 2 . We proceed in this way by induction, choosing vectors x j so that x 1 , , x j lie in j and span j for j = 1 , , n . We obtain finally a basis 𝒳 = { x 1 , , x n } in 𝒱 ; let us compute the matrix of A in this coordinate system. Since x j is in j and since j is invariant under A , it follows that A x j must be a linear combination of x 1 , , x j . Hence in the expression A x j = i α i j x i the coefficient of x i must vanish whenever i > j ; in other words, i > j implies α i j = 0 . Hence the matrix of A has the triangular form It is clear from this representation that det ( A α i i ) = 0 for i = 1 , , n , so that the α i i are the proper values of A , appearing on the main diagonal of [ A ] with the proper multiplicities. We sum up as follows.

Theorem 2. If A is a linear transformation on an n -dimensional vector space 𝒱 , then there exists a basis 𝒳 in 𝒱 such that the matrix [ A ; X ] is triangular; or, equivalently, if [ A ] is any matrix, there exists a non-singular matrix [ B ] such that [ B ] 1 [ A ] [ B ] is triangular.

The triangular form is useful for proving many results about linear transformations. It follows from it, for example, that for any polynomial p , the proper values of p ( A ) , including their algebraic multiplicities, are precisely the numbers p ( λ ) , where λ runs through the proper values of A .

A large part of the theory of linear transformations is devoted to improving the triangularization result just obtained. The best thing a matrix can be is not triangular but diagonal (that is, α i j = 0 unless i = j ); if a linear transformation is such that its matrix with respect to a suitable coordinate system is diagonal we shall call the transformation diagonable .

EXERCISES

Exercise 1. Interpret the following matrices as linear transformations on 2 and, in each case, find a basis of 2 such that the matrix of the transformation with respect to that basis is triangular.

  1. [ 1 1 0 1 ] .
  2. [ 1 1 1 0 ] .
  3. [ 1 0 1 1 ] .
  4. [ 1 1 1 1 ] .
  5. [ 0 1 0 0 0 1 1 0 0 ] .
  6. [ 0 1 1 0 0 1 1 0 0 ] .

Exercise 2. Two commutative linear transformations on a finite-dimensional vector space 𝒱 over an algebraically closed field can be simultaneously triangularized. In other words, if A B = B A , then there exists a basis 𝒳 such that both [ A ; 𝒳 ] and [ B ; 𝒳 ] are triangular. (Hint: to imitate the proof in Section: Triangular form , it is desirable to find a subspace of 𝒱 invariant under both A and B . With this in mind, consider any proper value λ of A and examine the set of all solutions of A x = λ x for the role of .)

Exercise 3. Formulate and prove the analogues of the results of Section: Triangular form for triangular matrices below the diagonal (instead of above it).

Exercise 4. Suppose that A is a linear transformation over an n -dimensional vector space. For every alternating n -linear form w , write A w for the function defined by

Since A w is an alternating n -linear form, and, in fact, A is a linear transformation on the (one-dimensional) space of such forms, it follows that A w = τ ( A ) w , where τ ( A ) is a scalar.

  1. τ ( 0 ) = 0 .
  2. τ ( 1 ) = n .
  3. τ ( A + B ) = τ ( A ) + τ ( B ) .
  4. τ ( α A ) = α τ ( A ) .
  5. If the scalar field has characteristic zero and if A is a projection, then τ ( A ) = ρ ( A ) .
  6. If ( α i j ) is the matrix of A in some coordinate system, then τ ( A ) = i α i i .
  7. .
  8. τ ( A B ) = τ ( B A ) .
  9. For which permutations π of the integers 1 , , k is it true that τ ( A 1 A k ) = τ ( A π ( 1 ) A π ( k ) ) for all k -tuples ( A 1 , , A k ) of linear transformations?
  10. If the field of scalars is algebraically closed, then τ ( A ) = tr A . (For this reason trace is usually defined to be τ ; the most popular procedure is to use (f) as the definition.)

Exercise 5. 

  1. Suppose that the scalar field has characteristic zero. Prove that if E 1 , , E k and E 1 + + E k are projections, then E i E j = 0 whenever i j . (Hint: from the fact that tr ( E 1 + + E k ) = tr ( E 1 ) + + tr ( E k ) conclude that the range of E 1 + + E k is the direct sum of the ranges of E 1 , , E k .)
  2. If A 1 , , A k are linear transformations on an n -dimensional vector space, and if A 1 + + A k = 1 and ρ ( A 1 ) + + ρ ( A k ) n , then each A i is a projection and A i A j = 0 whenever i j . (Start with k = 2 and proceed by induction; use a direct sum argument as in (a).)

Exercise 6. 

  1. If A is a linear transformation on a finite-dimensional vector space over a field of characteristic zero, and if tr A = 0 , then there exists a basis 𝒳 such that if [ A ; 𝒳 ] = ( α i j ) , then α i i = 0 for all i . (Hint: using the fact that A is not a scalar, prove first that there exists a vector x such that x and A x are linearly independent. This proves that α 11 can be made to vanish; proceed by induction.)
  2. Show that if the characteristic is not zero, the conclusion of (a) is false. (Hint: if the characteristic is 2 , compute B C C B , where B = [ 0 1 0 0 ] and C = [ 0 0 1 0 ] .)