5 Properties of the two dimensional \(\operatorname{SL}_2(F)\)
5.1 General Notation
Throughout this paper, \(F\) will denote an arbitrary algebraically closed field. For convenince we let \(L\) denote the infinite group \(\operatorname{SL}_2(F)\). The letter \(p\) will be used to denote the characteristic of \(F\). Recall that the characteristic of a field is the smallest number of times which the multilplicative identity of the field, say 1, needs to be summed to reach the additive identity of the field, say 0. If there is no such number, then we regard \(p\) as being zero, otherwise it is always a prime.
Unless otherwise stated, the letters \(\alpha , \beta , \gamma , \delta , \lambda , \gamma \), and \(\sigma \) will denote elements of \(F\) and \(\delta \) and \(\rho \) elements of \(F^\times \), where \(F^\times \) are the non-zero elements of \(F\).
5.2 Subsets of \(\pmb {L}\)
In this chapter we make some useful observations about specific elements and subgroups of \(L\). We define the following elements of \(L\) as follows.
Special matrices of \(\operatorname{SL}_2(F)\)
Given an element \(\delta \in F^\times \) we define the diagonal matrix:
Given an element \(\delta \in F\) we define the shear matrix:
We denote the matrix which corresponds to a rotation by \(\pi / 2\) radians to be:
The matrices \(d\), \(s\) and \(w\) satisfy the following relations:
For any \(\delta , \rho \in F^\times \) we have that
We verify by matrix multiplication that indeed:
For any \(\sigma , \gamma \in F\) we have that
We verify by matrix multiplication that indeed:
We have that for all \(\delta \in F^\times \) and \(\sigma \in F\)
We verify by matrix multiplication that indeed:
For any \(\delta \in F^\times \) we have:
We verify by matrix multiplication that indeed
From these relations we can now single out the following subgroups of \(\operatorname{SL}_2(F)\).
Special subgroups of \(\operatorname{SL}_2(F)\)
The set of diagonal matrices with matrix multiplication is a subgroup of \(\operatorname{SL}_2(F)\):
The set of shear matrices with matrix multiplication is a subgroup of \(\operatorname{SL}_2(F)\):
The set of lower triangular matrices with matrix multiplication is a subgroup of \(\operatorname{SL}_2(F)\)
Observe that \(L\) is the set of all lower triangular matrices in \(\operatorname{SL}_2(F)\) whilst \(Dw\) is the set of all anti-diagonal matrices.
These elements and subgroups are fundamental to this paper and this notation will be used throughout.
The map \(\phi : F^\times \overset {\sim }{\rightarrow } D\) defined by \(d_\delta \mapsto (d_\delta )_{11}\) defines a group isomorphism.
The function \(\psi : F^\times \rightarrow D\) defined by \(\psi (\delta ) = d_\delta \) is a homomorphism between the group \(F^\times \) under normal multiplication and \(D\) under normal matrix multiplication:
Observe that \(\psi \) is trivially injective and surjective and thus an isomorphism. So \(D\cong F^\times \) and \(D\) is a subgroup of \(L\).
The map \(\phi : F \overset {\sim }{\rightarrow } S\) defined by \(s_\sigma \mapsto (s_\sigma )_{11}\) defines a group isomorphism.
The function \(\phi : F \rightarrow T\) defined by \(\phi (\sigma ) = s_\sigma \) is a homomorphism between the group \(F\) under addition and \(S\) under normal matrix multiplication:
It’s clear that \(\phi \) is injective and surjective and thus an isomorphism. So \( S \cong F\) and \(S\) is a subgroup of \(L\).
\(S\) is a normal subgroup of \(H\)
Let \(s_\gamma \) and \(d_\delta s_\sigma \) be arbitrary elements of \(T\) and \(H\) respectively. Conjugating \(s_\gamma \) by \(d_\delta s_\sigma \) gives,
Since \(s_\gamma \) was chosen arbitrarily from \(T\) we have (\(d_\delta s_\sigma ) T (d_\delta s_\sigma )^{-1} = T\) and since \(d_\delta s_\sigma \) was chosen arbitrarily from \(H\), we have that \(T \vartriangleleft H\).
\(H / T \cong D\).
The function \(\pi : H \rightarrow D\) defined by \(\pi (d_\delta s_\sigma ) = d_\delta \) is a homomorphism between \(H\) under normal matrix multiplication and \(D\) under normal matrix multiplication:
We see that \(\pi \) is trivially surjective and has kernel
Thus by the First Isomorphism Theorem,
5.3 The Centre of \(\pmb {L}\)
The centre \(Z(G)\) of a group \(G\) is the set of elements of \(G\) that commute with every element of \(G\).
It is an immediate observation that \(Z(G)\) is a normal subgroup of \(G\), since for each \(z \in Z\), \(gzg^{-1} = gg^{-1}z = z\), \(\forall g \in G\). It’s also clear that a group is abelian if and only if \(Z(G)=G\).
For ease of notation, \(Z(L)\) will be denoted simply by \(Z\) throughout the rest of this paper.
\(Z = \langle - I_L \rangle \).
Take an arbitrary element \(x=\begin{bmatrix} \alpha & \beta \\ \gamma & \delta \end{bmatrix} \in L\) and an arbitrary element \(z = \begin{bmatrix} z_1 & z_2 \\ z_3 & z_4 \end{bmatrix} \in Z\) and consider their product:
Equating either the top left or bottom right entries, we see that \(z_2 \gamma = z_3 \beta \). Since \(\beta \) and \(\gamma \) can take any values in \(F\), for equality to always hold we must have \(z_2 = 0 = z_3\). Hence equation (3) simplifies to
Thus
Since we are working in the special linear group, det\((z)=1\), thus \(z_1 = \pm 1\) and \(Z = \langle - I_L \rangle \) as required. Observe that this is a cyclic group of order 2 except in the case of \(p=2\) where \(- I_L = I_L\).
If \(p\neq 2\), then \(L\) contains a unique element of order 2.
Consider an arbitrary element \(x \in L\) with order 2. That is \(x^2 = I_L\), \(x \neq I_L\) and thus \(x=x^{-1}\).
Thus \(\alpha = \delta \), \(\beta = - \beta \Rightarrow 2\beta = 0\) and \(\gamma = - \gamma \Rightarrow 2\gamma = 0\). In the case of \(p \neq 2\) this gives \(\beta = 0 = \gamma \). So
Also \(\alpha ^2 = 1\) since \(x \in \) \(\operatorname{SL}_2(F)\), so \(\alpha = \pm 1\). For \(x\) to have order 2, we must have \(\alpha = - 1\). Hence there is a unique element of order 2, namely \(- I_L\).
5.4 Conjugacy of the Elements of \(\pmb {L}\)
Each element of \(L\) is conjugate to either \(d_\delta \) for some \(\delta \in F^\times \), or to \(\pm s_\sigma \) for some \(\lambda \in F\).
Since \(F\) is algebraically closed, any element \(x \in L\) can be regarded as a linear transformation in the 2 dimensional vector space over \(F\), with the eigenvalues \(\pi _1\) and \(\pi _2\).
If \(\pi _1\) and \(\pi _2\) are distinct, then \(x\) is thus diagonalisable. That is, there exists an invertible matrix \(a \in GL(2, F)\) such that \(y = axa^{-1}\) is a diagonal matrix. Furthermore, we can multiply \(a\) by a suitable scalar to find an element in \(L\) which conjugates \(x\) and \(y\):
Observe that det\((b)=1\), hence \(x\) and \(y\) are conjugate in \(L\). Furthermore, since \(y\) is a diagonal matrix it must belong to the set \(D\), showing that \(x\) is conjugate to \(d_\delta \) for some \(\delta \in F^\times \).
If \(\pi _1 = \pi _2\) then \(x\) has just one repeated eigenvalue. Suppose that \(x\) is diagonalisable. Then there exists an element \(c \in GL(2, F)\) and a diagonal matrix \(\pi _1 I_G\) such that \(x = c(\pi _1 I_G)c^{-1} = \pi _1 I_G\). Thus \(x = \pm I_G\), which trivially belongs to both \(D\) and \(T \times Z\).
Now assume that \(x\) is not diagonalisable. Chapter 7 of
[
shows that there exists an element \(d \in GL(2, F)\), such that \(x= djd^{-1}\), where,
is the Jordan Normal Form of \(x\). By the method described above, we can multiply \(d\) by a suitable scalar to show that \(x\) is conjugate to \(j\) in \(L\). Now we conjugate \(j\) by an element of \(L\) whose top left entry is 0.
Now clearly the determinant of \(x\) is equal to the determinant of \(j\), namely 1, which means that \(\pi _1 = \pm 1\). This shows that \(j\) is conjugate in \(L\) to some element in \(T \times Z\) as well as \(x\). Furthermore, since conjugation is transitive, \(x\) is conjugate to \(\pm s_\sigma \) for some \(\lambda \in F\).
5.5 Centralisers & Normalisers
The centraliser \(C_G(H)\) of a subset \(H\) of a group \(G\) is the set of elements of \(G\) which commute with each element of \(H\).
The normaliser \(N_G(H)\) of a subset \(H\) of a group \(G\) is the set of elements of \(G\) which stabilise \(H\) under conjugation.
Both the centraliser and normaliser of a subset \(H\) are subgroups of \(G\). Note also that the centraliser is a stronger condition than the normaliser and any element in the centraliser of \(H\) is also in its normaliser. If \(H\) is a singleton then it’s clear that its centraliser and normaliser are equal.
(i) \(N_L(T_1) \subset H\), where \(T_1\) is any subgroup of \(T\) with order greater than 1.
(ii) \(C_L(\pm s_\sigma ) = T \times Z\) where \(\lambda \neq 0\).
(i) Let \(s_\sigma \) be an arbitary element of \(T_1\) with \(\lambda \neq 0\). To determine the normaliser of \(T_1\) in \(L\) we consider which \(x \in L\) satisfy \(x s_\sigma x^{-1} \in T_1\).
Since \(x s_\sigma x^{-1} \in T_1\) we have \(\minus \beta ^2 \lambda = 0\) and since \(\lambda \neq 0\), we have \(\beta = 0\). Since \(s_\sigma \) was chosen arbitrarily, any element which normalises \(T_1\) is a lower diagonal matrix and is therefore in \(H\) by (1). Thus \(N_L(T_1) \subset H\) as required.
(ii) To determine the centraliser of \(s_\sigma \) in \(L\), we consider which \(y \in L\) satisfy \(y s_\sigma = s_\sigma y\) for an arbitrarily chosen \(s_\sigma \), with \(\lambda \neq 0\).
Equating the top left entries of (4) gives \(\alpha + \beta \lambda = \alpha \) which means \(\beta = 0\) since \(\lambda \neq 0\) by assumption. Equating the bottom left entries gives that \(\alpha = \delta \). Finally, since det\((y) = 1\), we have \(\alpha \delta = 1\) so \(\alpha = \pm 1\). Thus a \(y \in C_L(s_\sigma )\) is
So \(y = \pm t_\sigma \) for some \(\sigma \in F\), and \(TZ = \{ \pm t_\sigma \} \subset C_L(s_\sigma )\). Now take an arbitrary \(s_\gamma z \in TZ\).
Thus \(s_\gamma z\) and indeed the whole of \(TZ\) is contained in \( C_L(s_\sigma )\), so \(C_L(s_\sigma ) = TZ\).
Since \(T\) commutes elementwise with \(Z\) and \(T \cap Z = \{ I_G \} \), we can apply Corollary 3.21 and assert that \(C_L(s_\sigma ) = TZ \cong T \times Z\) as required. The centraliser of \(\minus s_\sigma \) is also \(T \times Z\), since an element \(x\) commutes with \(\minus s_\sigma \) if and only if it commutes with \(s_\sigma \):
Note that in case of \(\lambda = 0\), \(\pm s_\sigma \in Z\) and thus it’s centraliser is the whole of \(L\).
(i) \(N_L(D_1) = \langle D , w \rangle \), where \(D_1\) is any subgroup of \(D\) with order greater than 2.
(ii) \(C_L(d_\delta )= D\) where \(\delta \neq \pm 1\).
(i) Since \(|D_1| {\gt} 3\), we can choose a \(d_\delta \in D_1 \! \setminus \! Z\), that is where \(\delta \neq 1\). To determine the normaliser of \(D_1\) in \(L\) we consider which \(x \in L\) satisfy \(x d_\delta x^{-1} \in D_1\).
Since (5) is in \(D_1\), the top right and bottom left entries must be 0. Since \(\delta \neq \pm 1\), we have \(\delta \neq \delta ^{-1}\) and so \(\alpha \beta = 0 = \gamma \delta \).
If \(\alpha = 0\), then \(\beta \) and \(\gamma \) are non-zero since det\((x) = 1\), thus \(\delta = 0\). So det\((x) = \minus \gamma \beta = 1\) and \(\minus \gamma = \beta ^{-1}\). (5) becomes
Since \(D_1\) is a group, it contains the inverse of each of it’s elements, so \(d^{-1}_\delta \in D_1\) as required. In this case we have \(x \in wD\).
If \(\alpha \neq 0\), then similarly \(\beta = 0\), \(\delta = \alpha ^{-1}\) and \(\gamma = 0\). (5) now becomes
This time we have \(x \in D\). So \(x \in D \cup wD = \langle D , w \rangle \) and any element which normalises \(D_1\) is in \(\langle D , w \rangle \), thus \(N_L(D_1) \subset \langle D , w \rangle \).
Now take an arbitrary \(y \in \langle D , w \rangle = D \cup wD\). If \(y \in D\) then \(y = d_{\rho 1}\), for some \(\rho 1 \in F^\times \).
If \(y \in wD\) then \(y = w d_{\rho 2}\), for some \( d_{\rho 2} \in F^\times \).
Thus \(y\) indeed who whole of \(\langle D , w \rangle \) is contained in \(N_L(D_1)\). This inclusion gives the desired result, \(N_L(D_1) = \langle D , w \rangle \).
(ii) Now we consider which \(y \in L\) satisfy \(y d_\delta = d_\delta y\) for an arbitrarily chosen \(d_\delta \), with \(\delta \neq \pm 1\).
Equating the top right and bottom left entries of (6) gives that \(\beta = 0 = \gamma \) since Since \(\delta \neq \delta ^{-1}\). Thus \(\delta = \alpha ^{-1}\) and
Thus \(x\) and indeed the whole of \(C_L(d_\delta )\) is contained in \(D\). Now take an arbitrary \(d_\rho \in D\).
So clearly \(D \subset C_L(d_\delta )\) and thus \(C_L(d_\delta ) = D\) as required.
Let \(a\) and \(b\) be conjugate elements in a group \(G\). Then \(\exists \, x \in G\) such that \(xC_G(a)x^{-1} = C_G(b)\).
This proposition essentially claims that conjugate elements have conjugate centralisers. Since \(a\) and \(b\) are conjugate there exists an \(x \! \in \! G\) such that \(b = xax^{-1}\). Let \(g\) be an arbitrary element of \(C_G(a)\). Then,
Thus \(xgx^{-1} \in C_G(xax^{-1})\). Since \(g\) was chosen arbitrarily,
Conversely, let \(h\) be an arbitary element of \(C_G(xax^{-1})\). Then,
So \(x^{-1}hx \in C_G(a)\) and since \(h\) was arbitrarily chosen from \(C_G(xax^{-1})\),
\(x^{-1}C_G(xax^{-1})x \subset C_G(a)\). Multiplication on the left by \(x\) and on the right by \(x^{-1}\) gives \(C_G(b) = C_G(xax^{-1}) \subset xC_G(a)x^{-1}\). Since we have shown that each set contains the other, \(xC_G(a)x^{-1} = C_G(b)\) as required.
The centraliser of an element \(x\) in \(L\) is abelian unless \(x\) belongs to the centre of \(L\).
This is almost an immediate consequence of the preceding results. Propositions 5.21 and 5.22 show that an element of the form \(\pm s_\sigma \) which does not lie in the centre of \(L\) has centraliser \(T \times Z\), whilst a non-central element of the form \(d_\delta \) has centraliser \(D\). Both \(T\) and \(D\) are abelian since they are isomoprhic to \(F\) and \(F^\times \) respectively. Let \(s_\sigma z_1\) and \(s_\gamma z_2\) be arbitrary elements of \(T \times Z\).
Thus \(T \times Z\) is also abelian. Since every element of \(L\) is conjugate to \(d_\delta \) or \(\pm s_\sigma \) by Proposition 5.18 and conjugate elements have conjugate centralisers by Proposition 5.23, the centraliser of each \(x \in L \setminus Z\) is conjugate to either \(T \times Z\) or \(D\). Proposition 3.17(iii) shows that conjugate subgroups are isomorphic and therefore have the same structure, thus since both \(T \times Z\) and \(D\) are abelian, \(C_L(x)\) is also abelian. Note that in general this does hold for \(x \in Z\), since its centraliser is the whole of \(L\) which is not abelian unless \(L = Z\).
5.6 The Projective Line & Triple Transitivity
It is convenient to sometimes take a geometric viewpoint and regard the elements of \(L\) as pairs of vectors in the 2-dimensional vector space over \(F\), which we will denote \(V\). An element of \(L\) is thus a linear transformation of \(V\).
Let \(\mathscr {L}\) be the set of all 1-dimensional subspaces of \(V\). A subset \(\mathscr {S}\) of \(\mathscr {L}\) is called a subspace of \(\mathscr {L}\) if there is a subspace \(U\) of \(V\) such that \(\mathscr {S}\) is the set of all 1-dimensional spaces of \(U\). We have dim \(U =\) dim \(\mathscr {S} + 1\). The set \(\mathscr {L}\) on which this concept of subspaces is defined is called the projective line on \(V\) and an element of \(\mathscr {L}\) is a 0-dimensional subspace of \(\mathscr {L}\) and consequently called a point. The projective line can be considered as a straight line in the field, plus a point at infinity.
Any 1-dimensional subspace of \(V\) is a set of vectors of the form \(\eta u\), where \(u\) is a non-zero vector of \(V\) and \(\eta \in F^\times \). Thus the points of \(\mathscr {L}\) are equivalence classes with the following relation defined on the set of vectors of \(V\).
Notice that \(u\) and \(v\) are equivalent if and only if \(u_1 v_2 = v_1 u_2\). Importantly each point \(P_i\) of \(\mathscr {L}\) can be represented by a corresponding equivalence class of vectors of \(V\), that is, \(P\) corresponds to \(u\) if \(P = u_1 / u_2\). In the case when \(u_2 = 0\), this corresponds to the point at infinity.
Let \(S\) be a permutation group which acts on a set \(X\) and \(\{ x_1, x_2, x_3 \} \) and \(\{ x_1', x_2', x_3' \} \) be two subsets of distinct elements of \(X\). Then \(S\) is said be triply transitive on \(X\) if there is an element \(\pi \in S\) such that,
Let \(\mathscr {L}\) be the projective line over the field \(F\). Then \(L\) is triply transitive on the set of the points of \(\mathscr {L}\).
Let \(P_1\), \(P_2\) and \(P_3\) be distinct points of \(\mathscr {L}\) and \(p_i\) be a vector in \(V\) corresponding to \(P_i\). Since each \(P_i\) is distinct, \(p_1\), \(p_2\) and \(p_3\) are thus pairwise linearly independent. Thus \(p_1\) and \(p_2\) form a basis for \(V\) and it’s clear that there exist \(\alpha , \beta \in F^\times \) such that,
Now, let \(Q_1\), \(Q_2\) and \(Q_3\) be three more distinct points of \(\mathscr {L}\) and \(q_i\) be a vector in \(V\) corresponding to \(Q_i\). Similarly, by the above argument, there exist \(\gamma , \delta \in F^\times \) such that,
Let \(\pi \in GL(2,F)\) be the linear transformation which sends \(\alpha p_1\) to \(\gamma q_1\) and \(\beta p_2\) to \(\delta q_2\). Thus,
Hence we get \(P^\pi _1 = Q_1\), \(P^\pi _2 = Q_2\) and \(P^\pi _3 = Q_3\) and \(GL(2,F)\) is triply transitive. Now set,
Consider the mapping \(\theta \) which sends \(\alpha p_1\) to \(\eta \gamma q_1\) and \(\beta p_2\) to \(\eta \delta q_2\). Observe that,
So \(\theta \in SL(2,F) = L\) and since \(P^\theta _1 = Q_1\), \(P^\theta _2 = Q_2\) and \(P^\theta _3 = Q_3\), we have that \(L\) is also triply transitive.
The following proposition looks at what happens when the group \(L\) acts on the projective line \(\mathscr {L}\).
(i) Each element of the form \(d_\delta \) (with \(\delta \neq \pm 1\)), fixes the same two points on the projective line \(\mathscr {L}\) and fix no other point.
(ii) Each element of the form \(\pm s_\sigma \) (with \(\lambda \neq 0\)), fixes the same point \(P\) on \(\mathscr {L}\) and fix no other point. Furthermore, Stab\((P) = H\).
(iii) All conjugate elements have the same number of fixed points on \(\mathscr {L}\).
(iv) Any noncentral element of \(L\) has at most 2 fixed points on \(\mathscr {L}\).
(i) Let \(P\) be a fixed a point of an arbitrary \(d_\delta \in D\), with \(\delta \neq \pm 1\) and let \(u\) belong to the corresponding equivalence class of vectors of \(V\) to \(P\).
Since \(\delta \neq \pm 1\), \(\delta \) does not equal \(\delta ^{-1}\), and so either \(u_1 = 0\) or \(u_2 = 0\). Thus \(u\) is equivalent to either the vector \(\begin{bmatrix} 0
\\ 1
\end{bmatrix}\) or \(\begin{bmatrix} 1
\\ 0
\end{bmatrix}\) and these correspond to 2 distinct points of \(\mathscr {L}\) which are fixed by \(d_\delta \).
(ii) Let \(P\) be a fixed a point of an arbitrary \(s_\sigma \), with \(\lambda \neq 0\), and let \(u\) be the corresponding element of \(V\) to \(P\).
This gives \({u_1}^2 \lambda = 0\) and since \(\lambda \neq 0\) we have \(u_1 = 0\). Thus \(s_\sigma \) has just one fixed point, \(P\) which corresponds to the equivalence class of \(\begin{bmatrix} 0 \\ 1 \end{bmatrix}\) in \(V\). We show also that \(P\) is also the only fixed point of \(-s_\sigma \), with \(\lambda \neq 0\).
So again \(u_1 =0\) and \(-s_\sigma \) fixes \(P\) and no other point. We now calculate the stabiliser of \(P\) in \(L\), by considering which \(x \in L\) fix \(P\).
Thus \(\beta = 0\) and \(x \in H\). Since \(x\) was chosen arbitrarily from Stab\((P)\), we have Stab\((P) \subset H\). Now let an arbitrarily chosen \(y \in H\) act on \(P\).
Thus \(y\) and indeed \(H\) is contained in Stab\((P)\), so Stab\((P) = H\) as desired.
(iii) Let \(P_i\) \((i = 1,2,...)\) be the fixed points of \(x\in L\) and let \(y\) be conjugate to \(x\) in \(L\). That is, there exists a \(g \in L\) such that \(x = gyg^{-1}\).
This shows that \(P_i\) is a fixed point of \(x\) if and only if \(g^{-1} P_i\) is a fixed point of \(y\). Thus conjugate elements have the same number of fixed points.
(iv) By Proposition 5.18(i), every element of \(L\) is conjugate to either \(d_\delta \) or \(\pm s_\sigma \), so since conjugate elements have the same number of fixed points, every element of \(L \! \setminus \! Z\) has either the same number of fixed points as \(d_\delta \) (with \(\delta \neq \pm 1\)), namely 2, or the same number as \(\pm s_\sigma \), (with \(\lambda \neq 0\)), namely 1.