Traces of nilpotent matrices

Posted on 18 November 2022 by Remy

I needed the following well-known result in the course I’m teaching:

Lemma. Let $k$ be a field and $M \in M_n(k)$ a nilpotent matrix. Then $\operatorname{tr} M = 0$ .

The classical proof uses the flag of subspaces

$0 \subseteq \ker M \subseteq \ker M^2 \subseteq \ldots \subseteq \ker M^n = k^n$

to produce a basis for which $M$ is upper triangular. Here is a slick basis-independent commutative algebra proof that shows something better:

Lemma. Let $R$ be a commutative ring with nilradical $\mathfrak r$ , and let $M \in M_n(k)$ be a nilpotent matrix. Then the characteristic polynomial $p_M(t) = \det(tI-M)$ satisfies

$p_M(t) \in t^n + \mathfrak r[t]_{< n}.$

Here we write $I[t]_{<n} \subseteq R$ for the polynomials in $R$ of degree smaller than $n$ whose coefficients lie in a given ideal $I \subseteq R$ .

Note that the formulation should ring a bell: in the previous post we saw that $R[t]^\times = R^\times + t\mathfrak r[t]$ . When $R$ is a domain, this reduces to $R[t]^\times = R^\times$ , and the lemma just says that $p_M(t) = t^n$ .

This suggests that we shouldn’t work with $\det(tI-M)$ but with its anadrome (or reciprocal) $t^n\det(t^{-1}I-M) = \det(I-tM)$ .

Proof of Lemma. We have to show that $\det(I-tM) \in 1 + t\mathfrak r[t]$ . Since $M$ is nilpotent, there exists $r \in \mathbf N$ with $M^{r+1} = 0$ , so $(I-tM)(I+tM+\ldots+t^rM^r) = I$ . Thus $\det(I-tM) \in R[t]^\times = R^\times + t\mathfrak r[t]$ . Evaluating at $t=0$ shows that the constant coefficient is 1. $\qedsymbol$

Invertible polynomials

Posted on 11 November 2022 by Remy

Here is a classic little lemma on polynomials:

Lemma. Let $R$ be a commutative ring with nilradical $\mathfrak r$ . Then $R[t]^\times = R^\times + t\mathfrak r[t]$ .

That is, a polynomial $a_nt^n + \ldots + a_1t + a_0 \in R[t]$ is invertible if and only if $a_0$ is invertible and $a_1,\ldots,a_n$ are nilpotent. This is the formulation you find in Atiyah–MacDonald, exercise 1.2(i). Note that when $R$ is a domain (or more generally a reduced ring), this just says that $R[t]^\times = R^\times$ (via the constant polynomials).

Proof. First suppose $f = \sum_{i=0}^n a_nt^n$ with $a_0 \in R^\times$ and $a_1,\ldots,a_n \in \mathfrak r$ . Multiplying by $a_0^{-1}$ we may assume $a_0 = 1$ ; this does not change the nilpotence assumption on $a_1,\ldots,a_n$ . Then $g := 1-f = -a_1t-\ldots - a_nt^n$ is nilpotent since it is a sum of nilpotent elements $-a_it^i$ in the commutative ring $R[t]$ . Thus we get $g^{r+1} = 0$ for some $r \in \mathbf N$ , and

$(1-g)(1+g+\ldots+g^r) = 1$

shows that $f \in R[t]^\times$ . Conversely, let $f \in R[t]^\times$ , and suppose $fg = 1$ for some polynomial $g = b_mt^m + \ldots + b_1t+b_0 \in R[t]$ . Without loss of generality we may assume $a_n \neq 0 \neq b_m$ . If $R$ is a domain, then the leading term of $fg$ is $a_nb_mt^{m+n} \neq 0$ , so we must have $n=m=0$ and $f$ and $g$ are constant. In the general case, we conclude that $\phi(f) \in S^\times \subseteq S[t]$ for any ring homomorphism $\phi \colon R \to S$ to a domain $S$ . In particular, taking $S = R/\mathfrak p$ for any prime ideal $\mathfrak p \subseteq R$ shows $a_1,\ldots,a_n \in \mathfrak p$ and $a_0 \not\in\mathfrak p$ . Taking the intersection over all prime ideals gives

$\qedsymbol$ $\begin{align*}a_1,\ldots,a_n &\in \bigcap_{\mathfrak p \text{ prime}} \mathfrak p = \mathfrak r,\\ a_0 &\in \bigcap_{\mathfrak p \text{ prime}} (R \setminus \mathfrak p) = R^\times. \end{align*}$

Union of hyperplanes over a finite field

Posted on 22 February 2022 by Remy

The following lemma is a (presumably well-known) result that Raymond Cheng and I happened upon while writing our paper Unbounded negativity on rational surfaces in positive characteristic (arXiv, DOI). Well, Raymond probably knew what he was doing, but to me it was a pleasant surprise.

Lemma. Let $q$ be a power of a prime $p$ , and let $x_0,\ldots,x_n \in \bar{\mathbf F}_q$ . Then $x_0,\ldots,x_n$ satisfy a linear relation over $\mathbf F_q$ if and only if

$\det \begin{pmatrix} x_0 & x_1 & \cdots & x_n \\ x_0^q & x_1^q & \cdots & x_n^q \\ \vdots & \vdots & \ddots & \vdots \\ x_0^{q^n} & x_1^{q^n} & \cdots & x_n^{q^n} \end{pmatrix} = 0.$

Proof. If $\sum_{i=0}^n c_ix_i = 0$ for $(c_0,\ldots,c_n) \in \mathbf F_q^n - \{0\}$ , then $c_i^{q^j} = c_i$ for all $i,j \in \{0,\ldots,n\}$ since $c_i \in \mathbf F_q$ . As $(-)^q \colon \bar{\mathbf F}_q \to \bar{\mathbf F}_q$ is a ring homomorphism, we find

$\begin{pmatrix} x_0 & x_1 & \cdots & x_n \\ x_0^q & x_1^q & \cdots & x_n^q \\ \vdots & \vdots & \ddots & \vdots \\ x_0^{q^n} & x_1^{q^n} & \cdots & x_n^{q^n} \end{pmatrix}\begin{pmatrix} c_0 \\ c_1 \\ \vdots \\ c_n \end{pmatrix} = 0,$

so the determinant is zero. Conversely, the union of $\mathbf F_q$ -rational hyperplanes $H \subseteq \mathbf P^n_{\mathbf F_q}$ is a hypersurface $Y$ of degree $|\check{\mathbf P}^n(\mathbf F_q)| = q^n + \ldots + q + 1$ (where $\check{\mathbf P}^n$ denotes the dual projective space parametrising hyperplanes in $\mathbf P^n$ ). Since the determinant above is a polynomial of the same degree $q^n + \ldots + q + 1$ that vanishes on all $\mathbf F_q$ -rational hyperplanes, we conclude that it is the polynomial cutting out $Y$ , so any $[x_0:\ldots:x_n] \in \mathbf P^n(\bar{\mathbf F_q})$ for which the determinant vanishes lies on one of the hyperplanes. $\qedsymbol$

Of course when the determinant is zero, one immediately gets a vector $(c_0,\ldots,c_n) \in \bar{\mathbf F}_q^{n+1} - \{0\}$ in the kernel. There may well be an immediate argument why this vector is proportional to an element of $\mathbf F_q^{n+1}$ , but the above cleverly circumvents this problem.

For concreteness, we can work out what this determinant is in small cases:

$n=0$ : a point $x_0 \in \bar{\mathbf F}_q$ only satisfies a linear relation over $\mathbf F_q$ if it is zero.
$n=1$ : the polynomial $x_0x_1^q-x_0^qx_1$ cuts out the $\mathbf F_q$ -rational points of $\mathbf P^1$ .
$n=2$ : the polynomial
$x_0x_1^qx_2^{q^2}+x_1x_2^qx_0^{q^2}+x_2x_0^qx_1^{q^2}-x_0^{q^2}x_1^qx_2-x_1^{q^2}x_2^qx_0-x_2^{q^2}x_0^qx_1$
cuts out the union of $\mathbf F_q$ -rational lines in $\mathbf P^2$ . This is the case considered in the paper.

Algebraic closure of the field of two elements

Posted on 1 April 2021 by Remy

I think I learned about this from a comment on MathOverflow.

Recall that the field of two elements $\mathbf F_2$ is the ring $\mathbf Z/2\mathbf Z$ of integers modulo $2$ . In other words, it consists of the elements $0$ and $1$ with addition $1+1 = 0$ and the obvious multiplication. Clearly every nonzero element is invertible, so $\mathbf F_2$ is a field.

Lemma. The field $\mathbf F_2$ is algebraically closed.

Proof. We need to show that every non-constant polynomial $f \in \mathbf F_2[x]$ has a root. Suppose $f$ does not have a root, so that $f(0) \neq 0$ and $f(1) \neq 0$ . Then $f(0) = f(1) = 1$ , so $f$ is the constant polynomial $1$ . This contradicts the assumption that $f$ is non-constant. $\qedsymbol$

Internal Hom

Posted on 31 January 2020 by Remy

This is an introductory post about some easy examples of internal Hom.

Definition. Let $(\mathscr C, \otimes)$ be a symmetric monoidal category, i.e. a category $\mathscr C$ with a functor $\otimes \colon \mathscr C \times \mathscr C \to \mathscr C$ that is associative, unital, and commutative up to natural isomorphism. Then an internal Hom in $\mathscr C$ is a functor

$\mathbf{Hom}(-,-) \colon \mathscr C\op \times \mathscr C \to \mathscr C$

such that $-\otimes Y$ is a left adjoint to $\mathbf{Hom}(Y,-)$ for any $Y \in \mathscr C$ , i.e. there are functorial isomorphisms

$\operatorname{Hom}(X \otimes Y, Z) \stackrel\sim\to \operatorname{Hom}(X, \mathbf{Hom}(Y,Z)).$

Remark. In the easiest examples, we typically think of $\mathbf{Hom}(Y,Z)$ as ‘upgrading $\operatorname{Hom}(Y,Z)$ to an object of $\mathscr C$ ‘:

Example. Let $R$ be a commutative ring, and let $\mathscr C = \mathbf{Mod}_R$ be the category of $R$ -modules, with $\otimes$ the tensor product. Then $\mathbf{Hom}(M,N) = \operatorname{Hom}_R(M,N)$ with its natural $R$ -module structure is an internal Hom, by the usual tensor-Hom adjunction:

$\operatorname{Hom}_R(M \otimes_R N, K) \cong \operatorname{Hom}_R(M, \mathbf{Hom}(N, K)).$

The same is true when $\mathscr C =\!\ _R\mathbf{Mod}_R$ is the category of $(R,R)$ -bimodules for a not necessarily commutative ring $R$ .

However, we cannot do this for left $R$ -modules over a noncommutative ring, because there is no natural $R$ -module structure on $\operatorname{Hom}_R(M,N)$ for left $R$ -modules $M$ and $N$ . In general, the tensor product takes an $(A,B)$ -bimodule $M$ and a $(B,C)$ -bimodule $N$ and produces an $(A,C)$ -bimodule $M \otimes_B N$ . Taking $A = C = \mathbf Z$ gives a way to tensor a right $R$ -module with a left $R$ -module, but there is no standard way to tensor two left $R$ -modules, let alone equip it with the structure of a left $R$ -module.

Example. Let $\mathscr C = \mathbf{Set}$ . Then $\mathbf{Hom}(X,Y) = \operatorname{Hom}(X,Y) = Y^X$ is naturally a set, making it into an internal Hom for $(\mathscr C, \times)$ :

$\operatorname{Hom}(X \times Y, Z) \stackrel\sim\to \operatorname{Hom}(X, \mathbf{Hom}(Y,Z)).$

When $\otimes$ is the categorical product $\times$ , the internal $\mathbf{Hom}(X,Y)$ (if it exists) is usually called an exponential object, in analogy with the case $\mathscr C = \mathbf{Set}$ above.

Example. Another example of exponential objects is from topology. Let $\mathscr C = \mathbf{Haus}$ be the category of locally compact Hausdorff topological spaces. Then the compact-open topology makes $\mathbf{Hom}(X,Y) := Y^X$ into an internal Hom of topological spaces. (There are mild generalisations of this beyond the compact Hausdorff case, but for an arbitrary topological space $X$ the functor $- \times X$ does not preserve colimits and hence cannot admit a right adjoint.)

Example. An example of a slightly different nature is chain complexes: let $R$ be a commutative ring, and let $\mathscr C = \mathbf{Ch}(\mathbf{Mod}_R)$ be the category of cochain complexes

$\ldots \to C^{i-1} \to C^i \to C^{i+1} \to \ldots$

of $R$ -modules (meaning each $C^i$ is an $R$ -module, and the $d^i \colon C^i \to C^{i+1}$ are $R$ -linear maps satisfying $d \circ d = 0$ ). Homomorphisms $f \colon C \to D$ are commutative diagrams

$\begin{array}{ccccccc}\ldots & \to & C^i & \to & C^{i+1} & \to & \ldots \\ & & \!\!\!\!\! f^i\downarrow & & \downarrow f^{i+1}\!\!\!\!\!\!\! & & \\ \ldots & \to & D^i & \to & D^{i+1} & \to & \ldots,\!\!\end{array}$

and the tensor product is given by the direct sum totalisation of the double complex of componentwise tensor products.

There isn’t a natural way to ‘endow $\operatorname{Hom}(C, D)$ with the structure of a chain complex’, but there is an internal Hom given by

$\mathbf{Hom}(C, D)^i = \prod_{m \in \mathbf Z} \operatorname{Hom}(C_m, D_{m+i}),$

with differentials given by

$d^if = d_D f - (-1)^i f d_C.$

Then we get for example

$\operatorname{Hom}(R[0], \mathbf{Hom}(C, D)) \cong \operatorname{Hom}(C, D),$

since a morphism $R[0] \to \mathbf{Hom}(C, D)$ is given by an element $f \in \mathbf{Hom}(C, D)^0$ such that $df = 0$ , i.e. $d_Df = f d_C$ , meaning that $f$ is a morphism of cochain complexes.

Example. The final example for today is presheaves and sheaves. If $X$ is a topological space, then the category $\mathbf{Ab}(X)$ of abelian sheaves on $X$ has an internal Hom given by

$\mathbf{Hom}(\mathscr F, \mathscr G)(U) = \operatorname{Hom}(\mathscr F|_U, \mathscr G|_U),$

with the obvious transition maps for inclusions $V \subseteq U$ of open sets. This is usually called the sheaf Hom. A similar statement holds for presheaves.

Local structure of finite unramified morphisms

Posted on 27 November 2019 by Remy

It is well known that a finite étale morphism $f \colon X \to Y$ of schemes is étale locally given by a disjoint union of isomorphisms, i.e. there exists an étale cover $Y' \to Y$ such that the pullback $X' \to Y'$ is given by $X' = \coprod_{i=1}^n Y' \to Y'$ . Something similar is true for finite unramified morphisms:

Lemma. Let $f \colon X \to Y$ be a finite unramified¹ morphism of schemes. Then there exists an étale cover $Y' \to Y$ such that the pullback $X' \to Y'$ is given by $X' = \coprod_i Z_i \to Y'$ , where $Z_i \hookrightarrow Y'$ are closed immersions of finite presentation.

Proof. Let $y \in Y$ be a point, let $A = \mathcal O_{Y,y}^{\operatorname{sh}}$ be the strict henselisation of $Y$ at $y$ , and let $\Spec B \to \Spec A$ be the base change of $X \to Y$ along $\Spec A \to Y$ . Then $A \to B$ is unramified, so by Tag 04GL it splits as

$B = A_1 \times \ldots \times A_r \times C$

where $A \to A_i$ is surjective for each $i$ and no prime of $C$ lies above $\mathfrak m_y \subseteq A$ . But $A \to C$ is also finite, so by Tag 00GU the map $\Spec C \to \Spec A$ hits the maximal ideal if $\Spec C \neq \varnothing$ . Thus, we conclude that $C = 0$ , hence $B$ is a product of quotients of $A$ .

But $A$ is the colimit of $\mathcal O_{Y',y'}$ for $(Y',y') \to (Y,y)$ an étale neighbourhood inducing a separable extension $\kappa(y) \to \kappa(y')$ . Since $f$ is of finite presentation, each of the ideals $\ker(A \to A_i)$ and the projections $B \to A_i$ are defined over some étale neighbourhood $(Y',y') \to (Y,y)$ . Then the pullback $X' \to Y'$ is given by a finite disjoint union of closed immersions in $Y'$ .

Then $Y' \to Y$ might not be a covering, but since $y \in Y$ was arbitrary we can do this for each point separately and take a disjoint union. $\qedsymbol$

Remark. The number of $Z_i$ needed is locally bounded, but if $Y$ is not quasi-compact it might be infinite. For example, we can take $X \cong Y = \coprod_{i \in \N} \Spec k$ an infinite disjoint union of points, and $f \colon X \to Y$ such that the fibre over $y_i \in Y$ for $i \in \N$ has $i$ points.

Remark. In the étale case, we may actually take $Y' \to Y$ finite étale, by taking $Y'$ to be the Galois closure of $X \to Y$ , which exists in reasonable cases². For example, if $Y$ is normal, we may take $Y'$ to be the integral closure of $Y$ in the field extension corresponding to the Galois closure of $k(Y) \to k(X)$ . In general, if $Y$ is connected it follows from Tag 0BN2 that a suitable component of the $\deg(f)$ -fold fibre product of $X$ over $Y$ is a Galois closure $Y' \to Y$ of $X \to Y$ . If the connected components of $Y$ are open, apply this construction to each component.

In the unramified case, this is too much to hope for. For example, if $Y = \mathbf P^2_{\mathbf C}$ , then we may take $X$ to be a nontrivial finite étale cover of an elliptic curve $E \subseteq Y$ . This is finite and unramified, but does not split over any finite étale cover of $\mathbf P^2$ since there aren’t any. In fact, it cannot split over any connected étale cover $Y' \to \mathbf P^2$ whose image contains $E$ , since that implies the image only misses finitely many points (as $E$ is ample), which is again impossible since $\pi_1(\mathbf P^2 \setminus \{p_1,\ldots,p_r\}) = 0$ .

¹For the purposes of this post, unramified means in the sense of Grothendieck, i.e. including the finite presentation hypothesis. In Raynaud’s work on henselisations, this was weakened to finite type. See Tag 00US for definitions.

²I’m not sure what happens in general.

Rings that are localisations of each other

Posted on 5 September 2019 by Remy

This is a post about an answer I gave on MathOverflow in 2016. Most people who have ever clicked on my profile will probably have seen it.

Question. If $A$ and $B$ are rings that are localisations of each other, are they necessarily isomorphic?

In other words, does the category of rings whose morphisms are localisations form a partial order?

In my previous post, I explained why $k[x]$ and $k[x,x^{-1}]$ are not isomorphic, even as rings. With this example in mind, it’s tempting to try the following:

Example. Let $k$ be a field, and let $K = k(x_1, x_2, \ldots)$ . Let

$A = K[x_0,x_{-1},\ldots]$

be an infinite-dimensional polynomial ring over $K$ , and let

$B = A\left[\frac{1}{x_0}\right].$

Then $B$ is a localisation of $A$ , and we can localise $B$ further to obtain the ring

$k(x_0,x_1,\ldots)[x_{-1},x_{-2},\ldots]$

isomorphic to $A$ by shifting all the indices by 1. To see that $A$ and $B$ are not isomorphic as rings, note that $A^\times \cup \{0\}$ is closed under addition, and the same is not true in $B$ . $\qed$

Is there a moral to this story? Not sure. Maybe the lesson is to do mathematics your own stupid way, because the weird arguments you come up with yourself may help you solve other problems in the future. The process is more important than the outcome.

Is the affine line isomorphic to the punctured affine line?

Posted on 1 September 2019 by Remy

This is the story of Johan Commelin and myself working through the first sections of Hartshorne almost 10 years ago (nothing creates a bond like reading Hartshorne together…). This post is about problem I.1.1(b), which is essentially the following:

Exercise. Let $k$ be a field. Show that $k[x]$ and $k[x,x^{-1}]$ are not isomorphic.

In my next post, I will explain why I’m coming back to exactly this problem. There are many ways to solve it, for example:

Solution 1. The $k$ -algebra $k[x]$ represents the forgetful functor $\mathbf{Alg}_k \to \mathbf{Set}$ , whereas $k[x,x^{-1}]$ represents the unit group functor $R \mapsto R^\times$ . These functors are not isomorphic, for example because the inclusion $k \to k[x]$ induces an isomorphism on unit groups, but not on additive groups. $\qed$

A less fancy way to say the same thing is that all $k$ -algebra maps $k[x,x^{-1}] \to k[x]$ factor through $k$ , while the same evidently does not hold for $k$ -algebra maps $k[x] \to k[x]$ .

However, we didn’t like this because it only shows that $k[x]$ and $k[x,x^{-1}]$ are not isomorphic as $k$ -algebras (rather than as rings). Literal as we were (because we’re undergraduates? Lenstra’s influence?), we thought that this does not answer the question. After finishing all unstarred problems from section I.1 and a few days of being unhappy about this particular problem, we finally came up with:

Solution 2. The set $k[x]^\times \cup \{0\}$ is closed under addition, whereas $k[x,x^{-1}]^\times \cup \{0\}$ is not. $\qed$

This shows more generally that $k[x]$ and $\ell[x,x^{-1}]$ are never isomorphic as rings for any fields $k$ and $\ell$ .

Rings are abelian

Posted on 14 November 2018 by Remy

In this post, we prove the following well-known lemma:

Lemma. Let $(R,+,\times,0,1)$ satisfy all axioms of a ring, except possibly the commutativity $a + b = b + a$ . Then $(R,+)$ is abelian.

That is, additive commutativity of a ring is implied by the other axioms.

Proof. By distributivity, we have $2(a+b) = 2a + 2b$ , so multiplication by $2$ is a homomorphism. By our previous post, this implies $R$ is abelian. $\qedsymbol$

Finiteness is not a local property

Posted on 27 January 2018 by Remy

In this post, we consider the following question:

Question. Let $A$ be a Noetherian ring, and $M$ and $A$ -module. If $M_\mathfrak p$ is a finite $A_\mathfrak p$ -module for all primes $\mathfrak p \subseteq A$ , is $M$ finite?

That is, is finiteness a local property?

For the statement where local means the property is true on a cover by Zariski opens, see Tag 01XZ. Some properties (e.g. flatness) can also be checked at the level of local rings; however, we show that this is not true for finiteness.

Example 1. Let $A = \mathbb Z$ , and let $M = \bigoplus_{p \text{ prime}} \mathbb Z/p\mathbb Z$ . Then $M_{(p)} = \mathbb Z/p\mathbb Z$ , because localisation commutes with direct sums and $(\mathbb Z/q\mathbb Z)_{(p)} = 0$ if $q \neq p$ is prime. Thus, $M_{(p)}$ is finitely generated for all primes $p$ . Finally, $M_{(0)} = 0$ , because $M$ is torsion. But $M$ is obviously not finitely generated.

Example 2. Again, let $A = \mathbb Z$ , and let $M \subseteq \mathbb Q$ be the subgroup of fractions $\frac{a}{b}$ with $\gcd(a,b) = 1$ such that $b$ is squarefree. This is a subgroup because $\frac{a}{b} + \frac{c}{d}$ can be written with denominator $\lcm(b,d)$ , and that number is squarefree if $b$ and $d$ are. Clearly $M$ is not finitely generated, because the denominators can be arbitrarily large. But $M_{(0)} = \mathbb Q$ , which is finitely generated over $\mathbb Q$ . If $p$ is a prime, then $M_{(p)} \subseteq \mathbb Z_{(p)}$ is the submodule $\frac{1}{p}\mathbb Z_{(p)}$ , which is finitely generated over $\mathbb Z_{(p)}$ .

Another way to write $M$ is $\sum_{p \text{ prime}} \frac{1}{p}\mathbb Z \subseteq \mathbb Q$ .

Remark. The second example shows that over a PID, the property that $M$ is free of rank $1$ can not be checked at the stalks. Of course it can be if $M$ is finitely generated, for then $M$ is finite projective [Tag 00NX] of rank $1$ , hence free since $A$ is a PID.

Lovely little lemmas

Or maybe ‘amusing observations’

Category Archives: Commutative algebra