A Similarity Classes of Matrices

Appendix A
Similarity Classes of Matrices

The classification of representations of GL_n(Fq) is closely analogous to the classification of conjugacy classes. The results in this chapter give a classification of the conjugacy classes in GL_n(Fq), along with representatives for each class. Descriptions of the centralisers are also given.

A.1. Basic properties of matrices

Let F be any field.

Definition A.1. Two matrices A and B with entries in F are said to be similar if there exists an invertible matrix X such that BX = XA.

Similarity is an equivalence relation on the set of all n×n matrices. The equivalence classes are called similarity classes. Given a matrix A M_n(F), for every vector x Fⁿ and every polynomial f(t) F[t] define fx = f(A)x. This endows Fⁿ with the structure of an F[t]-module, which will be denoted by M^A.

Exercise A.2. If A is similar to B, then M^A is isomorphic to M^B as an F[t]-module.

Conversely, given an F[t]-module M, pick any basis of M as an F-vector space. Let A_M be the matrix by which t acts on M with respect to this basis. A different basis of M would give rise to a matrix similar to A_M. Therefore, M determines a similarity class of matrices.

Proposition A.3. AM^A gives rise to a bijection between the set of similarity classes of matrices and the set of isomorphism classes of F[t]-modules.

Definition A.4 (Simple matrix). Recall that an F[t]-module is called simple if there is no non-trivial proper subspace of M which is preserved by F[t]. A matrix A is said to be simple if M^A is a simple F[t]-module.

Exercise A.5. Show that A is simple if and only if its characteristic polynomial is irreducible.

Exercise A.6. For any two matrices A and B, let A⊕B denote the block matrix (A 0)
0 B . A⊕B will be called the direct sum of A and B. Show that M^A⊕B = M^A ⊕ M^B (a canonical isomorphism of F[t]-modules).

Definition A.7 (Indecomposable matrix). A matrix is said to be indecomposable if it is not similar to a matrix of the form A ⊕ B, where A and B are two strictly smaller matrices. Equivalently, A is indecomposable if M^A is indecomposable as an F[t]-module.

Definition A.8 (Semisimple matrix). A matrix is said to be semisimple if it is similar to a direct sum of simple matrices. Equivalently, A is semisimple if M^A is a semisimple F[t]-module (i.e., M^A is a direct sum of simple F[t]-modules).

Exercise A.9. For any λ F, show that the matrix (λ 1)
0 λ is indecomposable, but not semisimple (and hence not simple either).

A.2. Primary decomposition

Let f(t) be any irreducible monic polynomial in F[t]. Given an F[t]-module M, its f-primary part is the submodule

M = {x ∈ M : fkx = 0 for some k ∈ N}. f

Theorem A.10 (Primary decomposition). [Jac84, Theorem 3.11] Let M be an F[t]-module which is also a finite dimensional F-vector space. Then M_f = 0 for all but finitely many irreducible monic polynomials f(t) F[t].

⊕ M = Mf , f

the sum being over all the irreducible monic polynomials f for which M_f

Let f F[t] be an irreducible monic polynomial. An F[t]-module M is called f-primary if M = M_f. M is called primary if it is f-primary for some f.

Exercise A.11. Let f(t) F[t] be an irreducible monic polynomial, and p(t) F[t] be any monic polynomial. Show that F[t]∕p(t) is f-primary if and only if p(t) = f(t)^r for some r ≥ 0.

Theorem A.12. Let f(t) F[t] be an irreducible monic polynomial, and A be a square matrix. Then M_f^A0 if and only if f(t) divides the characteristic polynomial of A.

Proof. Let χ_A denote the characteristic polynomial of A. If f is an irreducible polynomial that does not divide χ_A, then there exist polynomials r and s such that fr+χ_As = 1. Evaluating at A and applying the Cayley-Hamilton theorem shows that f(A)r(A) = I. It follows that f(A) is non-singular. Hence f(A)^k is also non-singular for every positive integer k. Therefore, M_f^A = 0.

Conversely, if M_f^A = 0, then f(A)^k is non-singular for every k N. In particular, f(A) is non-singular. Let E be a splitting field of f. Suppose that

∏h f (t) = (t- μi)mi, i=1

with μ₁,…,μ_h

E distinct, and m₁,…,m_h

N. Therefore,

∏h f(A) = (A - μiI)mi. i=1

Since f(A) is non-singular, so is A - μ_iI for each i. Therefore, no μ_i is an eigenvalue of A. It follows that f does not divide χ_A. □

If M^A is f-primary then the matrix A is called an f-primary matrix. It follows that a matrix is primary if and only if its characteristic polynomial has a unique irreducible factor.

Corollary A.13. Every matrix A M_n(F) is similar to a matrix of the form

⊕ Af, f∣χA

where A_f is an f-primary matrix, and the sum is over the irreducible factors of the characteristic polynomial of A. Moreover, for every f, the similarity class of A_f is uniquely determined by the similarity class of A.

Thus, the study of similarity classes of matrices is reduced to the study of similarity classes of primary matrices.

A.3. Structure of a primary matrix

Theorem A.14 (Structure theorem). [Jac84, Section 3.8] For every F[t]-module M, there exist non-constant monic polynomials f₁,…,f_r such that f₁∣ ⋅⋅⋅ ∣f_r and

M ~= F[t]∕f (t)⊕ ⋅⋅⋅⊕ F[t]∕f (t). 1 r

Fix an irreducible monic polynomial f(t) F[t]. If M is f-primary, then by Exercise A.11, each for each i, f_i = f^λ_i for some λ_i > 0. Therefore,

Corollary A.15 (Structure of a primary module). If M is an f-primary F[t]-module, then there exists a non-decreasing sequence of integers λ₁ ≤ ⋅⋅⋅ ≤ λ_r such that

M ~= F [t]∕f(t)λ1 ⊕ ⋅⋅⋅⊕ F[t]∕f(t)λr.

Definition A.16 (Partition). A partition is a finite sequence λ = (λ₁, ⋅⋅⋅ ,λ_r) of positive integers such that λ₁ ≤ ≤ λ_r. Define ∣λ∣ := λ₁ + + λ_r. One says that λ is a partition of ∣λ∣. The length of λ is the non-negative integer r (there is an ‘empty partition’ of length 0 denoted ∅, with ∣∅∣ = 0). Let Λ denote the set of all partitions.

Given a partition λ = (λ₁,…,λ_l), define an F[t]-module

λ1 λl Mf,λ = F[t]∕f(t) ⊕ ⋅⋅⋅⊕ F[t]∕f(t) .

Corollary A.15 says that every f-primary F[t]-module is isomorphic to M_f,λ for some partition λ.

Exercise A.17. Suppose that f and f′ are two irreducible monic polynomials, λ and λ′ two partitions. Show that the F[t]-modules M_f,λ and M_f′,λ′ are isomorphic if and only if f = f′ and λ = λ′.

Let S denote the set of all irreducible monic polynomials in F[t]. Given a function ψ : S → Λ such that ψ(f) = ∅ for all but finitely many f S, let M_φ denote the F[t]-module

⊕ M ψ = Mf,ψ(f). f∈S

Then dim_FM_ψ = ∑ _fSdeg(f)∣ψ(f)∣. Let n_ψ = dim_FM_ψ.

Theorem A.18 (Similarity classes of matrices). The map ψM_ψ is a bijective correspondence between the set of all functions S → Λ with the property that ψ(f) = ∅ for all but finitely many f S and n_ψ = n and the set of isomorphism classes of n-dimensional F[t]-modules (and hence the set of similarity classes of n × n matrices).

A.4. Block Jordan canonical form

There is a version of the Jordan canonical form for matrices for which the irreducible factors of the characteristic polynomial have derivatives which are not identically zero.

In order to obtain this form, we need the following result:

Theorem A.19. Suppose that f an irreducible monic polynomial in F[t] such that f′(t) is not identically zero. Let E denote the field F[t]∕f(t). Then the rings k[t]∕f(t)^r and E[u]∕u^r are isomorphic.

Proof. The main step in the proof is a version of Hensel’s Lemma

Lemma A.20 (Hensel). There exists q_r(t) F[t] such that q_r(t) ≡ t mod f(t), and f(q_r(t)) ≡ 0 mod f(t)^r.

Proof. The proof is by induction on r. When r = 1, one may take q₁(t) = t.

Suppose that q_r-1(t) F[t] is such that

r- 1 qr-1(t) ≡ t mod f(t) and f(qr-1(t)) ≡ 0 mod f(t) .

Then, using the Taylor expansion, for any h(t)

F[t],

r-1 r- 1 ′ r f (qr-1(t)+ f(t) h(t)) ≡ f (qr-1(t))+ f(t) h(t)f (qr-1(t)) mod f(t).

Since q_r-1(t) ≡ t mod f(t), f′(q_r-1(t) ≡ f′(t) mod f(t). By hypothesis f′(t) is a non-zero polynomial of degree strictly less than f(t). Therefore, f′(t) is not divisible by f(t). Since f(t) is irreducible, it means that there exists r(t),s(t)

F[t] such that f′r + fs = 1, which means that f′(t)r(t) ≡ 1 mod f(t). Since f(q_r-1(t)) ≡ 0 mod f(t)^r-1, there exists f₁(t)

F[t] such that

r- 1 f(qr-1(t)) = f(t) f1(t).

When h(t) = -f₁(t)r(t) and q_r(t) = q_r-1(t) + f(t)^r-1h(t), one has

q (t) ≡ t mod f(t) and f(q (t)) ≡ 0 mod f(t)r. r r

□

Given q_r(t) as in Hensel’s lemma, the map

φ : F[u,v]∕(f(v),ur) → F[t]∕f(t)r

given by setting φ(v) = q_r(t), and φ(u) = f(t) gives rise to a well defined ring homomorphism, since f(q_r(t)) ≡ 0 mod f(t)^r Since q_r(t) ≡ t mod f(t), and f(t) and q_r(t) lie in the image of φ, t also lies in the image of φ. This makes φ surjective. Moreover, φ is a linear transformation of F-vector spaces of dimension rd. Therefore, φ must be an isomorphism of rings. □

Definition A.21 (Companion matrix). Let f(t) = tⁿ-a_n-1t^n-1 - ⋅⋅⋅ -a₁t-a₀. Then the companion matrix of f is the n × n matrix:

( 0 0 ⋅⋅⋅ 0 a ) | 0 | || 1 0 ⋅⋅⋅ 0 a1 || Cf = || 0 1 ⋅⋅⋅ 0 a2 || . |( ... ... ... ... ... |) 0 0 ⋅⋅⋅ 1 an-1

Theorem A.22 (Block Jordan Canonical Form). Let A M_n(F) be such that for every irreducible factor f of the characteristic polynomial of A, f′ is not identically zero. Then A can be written as a block diagonal matrix with blocks of the form

( ) Cf 0 0 ⋅⋅⋅ 0 0 || I Cf 0 ⋅⋅⋅ 0 0 || || 0 I C ⋅⋅⋅ 0 0 || Jr(f) = || . . .f . . .|| , || .. .. .. .. .. ..|| |( 0 0 0 ⋅⋅⋅ Cf 0 |) 0 0 0 ⋅⋅⋅ I C f rd×rd

where d is the degree of f, an irreducible factor of the characteristic polynomial of A, C_f is the companion matrix of f, and r is a positive integer. Up to rearrangement of blocks, this canonical form is unique.

Proof. By Exercise A.6 and Theorem A.10 one may assume that A is f-primary, for some irreducible monic polynomial f. Let E = F[v]∕f(v). By Corollary A.15 and Theorem A.15, there exists a partition λ such that

A ~ λ1 λr M = E[u]∕u ⊕ ⋅⋅⋅⊕ E[u]∕u .

In the notation of the proof of Theorem A.19, let θ(t) = t - q(t), where we write q for q_{λ_i} for some i. Then θ(t)

(f(t)). But θ(t)

(f(t)²), for if it did, we would have

f(t) = f(θ(t) + q(t)) ~= f(q(t))+ θ(t)f′(q(t) mod (f(t)2) 2 = 0 mod (f(t)),

a contradiction. Therefore, θ(t) = αf(t), where α is a unit in F[t]∕f(t)^λ_i. In the isomorphism

r r r F[t]∕(f(t) ) → E[u]∕u = F[u,v]∕(u ,f(v)),

αu + v. Since A acts by t, with respect to the basis of E[u]∕u^λ_i over F given by

d-1 d- 1 λ -1 λ- 1 λ -1 d-1 1,v,...,v ,α,αv,...,αv ,...,α i ,α i v,...α i v ,

the matrix of multiplication by t = αu + v is J_{λ_i}(f). □

The hypothesis on A in Theorem A.22 always holds when F is a perfect field, as we shall see in Section A.6. By Corollary B.8 every finite field is perfect. Therefore, every matrix over a finite field has a Jordan canonical form.

A.5. Centralisers

For any A M_n(F) define

Z(A) = {B ∈ Mn(A) ∣AB = BA}.

Theorem A.23. Let A M_n(F) be a matrix such that for each irreducible factor f of the characteristic polynomial of A, f′ is not identically zero. Suppose that A is similar to ⊕_fA_f, where A_f is f-primary (see Corollary A.13). Then Z(A) ⊕_fZ(A_f). If A is f-primary, E = F[t]∕f(t), and λ is the partition associated to M^A in Corollary A.15, then

~ λ1 λr Z(A) = EndE[u](E[u]∕u ⊕ ⋅⋅⋅⊕ E[u]∕u ).

Note that the group of units of the centraliser algebra Z(A) will be the centraliser of A in GL_n(F).

Proof. The theorem follows easily from Theorem A.19, using the fact that End_F[t]M^AZ(A). □

A.6. Perfect fields

Definition A.24. A perfect field is either a field of characteristic zero, or a field of characteristic p > 0 for which the map xx^p is bijective.

Theorem A.25. Suppose that F is a perfect field and f(t) F[t] is a non-constant irreducible polynomial. Then f′(t) does not vanish identically.

Proof. If f′ = 0, then the characteristic of F must be p > 0 and f must be of the form

p 2p f(t) = a0 + a1t + a2t + ⋅⋅⋅ .

Since F is perfect, there exist b_i

E such that b_i^p = a_i. Then

f(t) = (b0 + b1t+ b2t2 + ⋅⋅⋅)p,

contradicting the irreducibility of f. □

[next] [prev] [prev-tail] [front] [up]

Appendix ASimilarity Classes of Matrices