Divulgaciones Matemáticas Vol. 22, No. 1 (2021), pp. 90–95

A proof of a version of Hensel’s lemma


Dinamérico P. Pombo Jr. (dpombojr@gmail.com)
Instituto de Matemática e Estatística
Universidade Federal Fluminense
Rua Professor Marcos Waldemar de Freitas Reis, s/n
Bloco G, Campus do Gragoatá
24210-201 Niterói, RJ, Brasil

Abstract

By using a few basic facts, a proof of a known version of Hensel’s lemma in the context of local rings is presented.

Key words and phrases: local rings, discrete valuation rings, Hensel’s lemma.


1 Introduction

A classical and fundamental result, known as Hensel’s lemma, is discussed in [1], [3], [5], [6] and [7], for instance. A quite general form of Hensel’s lemma may be found in Chapter III of [2], although special cases of it may also be very important, such as the one valid in the framework of local rings and presented in Chapter II of [6]. The main purpose of this note is to offer an elementary proof of the last-mentioned form of Hensel’s lemma, as well as to derive a few consequences of it.

2 A proof of a version of Hensel’s lemma

Definition 2.1 (cf. [2, p. 80]). A commutative ring $R$ with an identity element $1 \neq 0$ is said to be a local ring if it contains a unique maximal ideal $I_1$, namely, the set of non-invertible elements of $R$. If $K$ is the quotient ring $R/I_1$, which is a field,

\[ \lambda \in R \longmapsto \bar{\lambda} \in K \]

will denote the canonical surjection. For $f(X) = a_0 + a_1 X + \cdots + a_n X^n \in R[X]$, we will write

\[ \bar{f}(X) = \bar{a}_0 + \bar{a}_1 X + \cdots + \bar{a}_n X^n \in K[X]. \]

Received 29/06/2021. Revised 30/06/2021. Accepted 26/07/2021.

MSC (2010): Primary 12J25, 13F30; Secondary 13H99, 13J10, 13B25.

Corresponding author: Dinamérico P. Pombo Jr.


Example 2.1 (cf. [6]). Let $R$ be a discrete valuation ring and $I_1$ the maximal ideal of $R$, which may be written as $I_1 = \pi R$. We have that

\[ I_1 = \pi R \supset I_2 = \pi^2 R \supset \cdots \supset I_n = \pi^n R \supset I_{n+1} = \pi^{n+1} R \supset \cdots \]

is a decreasing sequence of ideals of $R$ such that $I_n^2 \subset I_{n+1}$ for each integer $n \geq 1$ and $\bigcap_{n \geq 1} I_n = \{0\}$.

Example 2.2 (cf. [3]). Let $K$ be a field endowed with a non-trivial discrete valuation $|\cdot|$, $R = \{\lambda \in K;\ |\lambda| \leq 1\}$ the ring of integers of $(K, |\cdot|)$ and $I_1 = \{\lambda \in R;\ |\lambda| < 1\}$ the maximal ideal of $R$. Let $\mu \in I_1$ be such that $|\mu| = \sup\{|\lambda|;\ \lambda \in I_1\}$. Then

\[ I_1 = \mu R \supset I_2 = \mu^2 R \supset \cdots \supset I_n = \mu^n R \supset I_{n+1} = \mu^{n+1} R \supset \cdots \]

is a decreasing sequence of ideals of $R$ such that $I_n^2 \subset I_{n+1}$ for each integer $n \geq 1$ and $\bigcap_{n \geq 1} I_n = \{0\}$.

It may be seen that every discrete valuation ring may be regarded as the ring of integers of a field endowed with a non-trivial discrete valuation.

Let us recall that, if $X$ is a non-empty set, a mapping

\[ d \colon X \times X \longrightarrow \mathbb{R} \]

is an ultrametric on $X$ if the following conditions hold for all $x, y, z \in X$:

(a) $d(x, y) = 0$ if and only if $x = y$;

(b) $d(x, y) = d(y, x)$;

(c) $d(x, y) \leq \max\{d(x, z), d(z, y)\}$.

By induction,

\[ d(x_1, x_n) \leq \max\{d(x_1, x_2), \dots, d(x_{n-1}, x_n)\} \]

for $n = 2, 3, \dots$ and $x_1, \dots, x_n \in X$. And, since $\max\{d(x, z), d(z, y)\} \leq d(x, z) + d(z, y)$, $d$ is a metric on $X$.
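As an illustration of the definition just recalled, the following minimal sketch checks the ultrametric conditions by brute force on a finite set of integers: separation, symmetry and the strong triangle inequality $d(x, y) \leq \max\{d(x, z), d(z, y)\}$. The choice $d(x, y) = |x - y|_p$ (the p-adic absolute value on the rationals) and the helper names `padic_abs`, `dist` and `satisfies_ultrametric_axioms` are assumptions made for the example only; they do not appear in the text above.

```python
from fractions import Fraction
from itertools import product


def padic_abs(x, p):
    """Return |x|_p = p**(-v), where v is the p-adic valuation of the rational x."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:  # powers of p in the numerator raise the valuation
        num //= p
        v += 1
    while den % p == 0:  # powers of p in the denominator lower it
        den //= p
        v -= 1
    return Fraction(1, p) ** v


def dist(x, y, p):
    """Candidate ultrametric d(x, y) = |x - y|_p (an assumed example)."""
    return padic_abs(Fraction(x) - Fraction(y), p)


def satisfies_ultrametric_axioms(points, p):
    """Check separation, symmetry and the strong triangle inequality on all triples."""
    points = list(points)
    for x, y, z in product(points, repeat=3):
        if (dist(x, y, p) == 0) != (x == y):
            return False
        if dist(x, y, p) != dist(y, x, p):
            return False
        if dist(x, y, p) > max(dist(x, z, p), dist(z, y, p)):
            return False
    return True


if __name__ == "__main__":
    print(satisfies_ultrametric_axioms(range(-8, 9), 3))
    print(dist(1, 10, 3))  # 1 - 10 = -9 = -(3**2), so the distance is 1/9
```

Exact rational arithmetic (`Fraction`) is used so that the comparisons are not affected by floating-point rounding. A finite check of this kind is of course no proof; it only exercises the three conditions on sample points.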

We shall present an elementary proof of the following form of Hensel’s lemma [6, p. 43]:

Proposition 2.1. Let $R$ be a local ring and $I_1$ its maximal ideal, and assume the existence of a decreasing sequence $I_1 \supset I_2 \supset \cdots \supset I_n \supset I_{n+1} \supset \cdots$ of ideals of $R$ such that $I_n^2 \subset I_{n+1}$ for each integer $n \geq 1$ and $\bigcap_{n \geq 1} I_n = \{0\}$. Then there exists a translation-invariant ultrametric $d$ on $R$ such that

\[ I_n = \left\{ \lambda \in R;\ d(\lambda, 0) \leq \frac{1}{2^n} \right\} \]

for each integer $n \geq 1$ (thus $(I_n)_{n \geq 1}$ is a fundamental system of neighborhoods of 0 in $R$ with respect to the topology defined by $d$) and the mappings

\[ (\lambda, \mu) \in R \times R \longmapsto \lambda + \mu \in R \quad \text{and} \quad (\lambda, \mu) \in R \times R \longmapsto \lambda\mu \in R \]

are continuous. Moreover, if the metric space $(R, d)$ is complete and if $f(X) \in R[X]$ is such that $\bar{f}(X)$ admits a simple root $\theta$ in $K$, then there exists a unique root $\lambda$ of $f(X)$ in $R$ such that $\bar{\lambda} = \theta$.


In order to prove Proposition 2.1 we shall need an auxiliary result:

Lemma 2.1. Let $(G, +)$ be a commutative group and $H_1 \supset H_2 \supset \cdots \supset H_n \supset H_{n+1} \supset \cdots$ a decreasing sequence of subgroups of $G$ such that $\bigcap_{n \geq 1} H_n = \{0\}$. Then there exists a translation-invariant ultrametric $d$ on $G$ such that

\[ H_n = \left\{ x \in G;\ d(x, 0) \leq \frac{1}{2^n} \right\} \]

for each integer $n \geq 1$ (thus $(H_n)_{n \geq 1}$ is a fundamental system of neighborhoods of 0 in $G$ with respect to the topology defined by $d$) and the mapping

\[ (x, y) \in G \times G \longmapsto x + y \in G \]

is continuous.

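Proof of Lemma 2.1. We shall use a classical argument. Put $H_0 = G$ and let $g \colon G \to \mathbb{R}$ be the mapping given by $g(0) = 0$ and $g(x) = \frac{1}{2^n}$ if $x \in H_n \setminus H_{n+1}$ ($n = 0, 1, 2, \dots$). Obviously, $g(x) > 0$ if $x \in G \setminus \{0\}$, $g(-x) = g(x)$ if $x \in G$ and

\[ H_n = \left\{ x \in G;\ g(x) \leq \frac{1}{2^n} \right\} \]

for $n = 0, 1, 2, \dots$. Moreover, $g(x + y) \leq \max\{g(x), g(y)\}$ for all $x, y \in G$, which is clear if $x = 0$ or $y = 0$. Indeed, if $x, y \in G \setminus \{0\}$, $x \in H_k \setminus H_{k+1}$, $y \in H_\ell \setminus H_{\ell+1}$, with, say, $\ell \geq k \geq 0$, then $g(x) = \frac{1}{2^k}$ and $g(y) = \frac{1}{2^\ell} \leq \frac{1}{2^k}$. But, since $H_\ell \subset H_k$, $x + y \in H_k$, and hence $g(x + y) \leq \frac{1}{2^k} = \max\{g(x), g(y)\}$.

Therefore the mapping

\[ d \colon G \times G \longrightarrow \mathbb{R} \]

defined by $d(x, y) = g(x - y)$ is a translation-invariant ultrametric on $G$ such that

\[ H_n = \left\{ t \in G;\ d(t, 0) \leq \frac{1}{2^n} \right\} \]

for each integer $n \geq 0$. Consequently,

\[ x + H_n = \left\{ t \in G;\ d(t, x) \leq \frac{1}{2^n} \right\} \]

if $x \in G$ and $n = 0, 1, 2, \dots$ are arbitrary.

Finally, if $x_0, y_0 \in G$ and $n = 0, 1, 2, \dots$ are arbitrary,

\[ (x_0 + H_n) + (y_0 + H_n) \subset (x_0 + y_0) + H_n, \]

proving the continuity of the mapping

\[ (x, y) \in G \times G \longmapsto x + y \in G \]

at $(x_0, y_0)$.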
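The construction used in the proof of Lemma 2.1 can be tried out on a concrete chain of subgroups. The sketch below assumes $G = \mathbb{Z}$ and $H_n = p^n\mathbb{Z}$ for a prime $p$ (a hypothetical choice made only for illustration) and takes the values $g(x) = 1/2^n$ for $x \in H_n \setminus H_{n+1}$; the function names `level`, `g`, `d` and `in_H` are likewise assumptions of the example.

```python
from fractions import Fraction


def level(x, p):
    """Largest n >= 0 with x in H_n = p**n * Z, for a nonzero integer x."""
    n = 0
    while x % p == 0:
        x //= p
        n += 1
    return n


def g(x, p):
    """g(0) = 0 and g(x) = 1/2**n when x lies in H_n but not in H_{n+1}."""
    if x == 0:
        return Fraction(0)
    return Fraction(1, 2 ** level(x, p))


def d(x, y, p):
    """Translation-invariant distance d(x, y) = g(x - y)."""
    return g(x - y, p)


def in_H(x, n, p):
    """Membership test for the subgroup H_n = p**n * Z."""
    return x % p ** n == 0


if __name__ == "__main__":
    p = 3
    sample = range(-40, 41)
    # H_n coincides with the closed ball of radius 1/2**n around 0.
    print(all(in_H(x, n, p) == (g(x, p) <= Fraction(1, 2 ** n))
              for x in sample for n in range(5)))
    # Strong triangle inequality for g, hence for d.
    print(all(g(x + y, p) <= max(g(x, p), g(y, p))
              for x in sample for y in sample))
```

Both checks reflect the two facts the proof relies on: the subgroups are exactly the closed balls around 0, and $g(x + y) \leq \max\{g(x), g(y)\}$ because the smaller of the two subgroups containing $x$ and $y$ sits inside the larger.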

Now, let us turn to the


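Proof of Proposition 2.1. By Lemma 2.1 there is a translation-invariant ultrametric $d$ on $R$ such that

\[ I_n = \left\{ \lambda \in R;\ d(\lambda, 0) \leq \frac{1}{2^n} \right\} \]

for each integer $n \geq 1$, and the operation of addition in $R$ is continuous. Moreover, if $(\lambda_0, \mu_0) \in R \times R$ and $n = 1, 2, \dots$ are arbitrary, the relations $\lambda \in \lambda_0 + I_n$, $\mu \in \mu_0 + I_n$ imply

\[ \lambda\mu - \lambda_0\mu_0 = \lambda\mu - \lambda_0\mu + \lambda_0\mu - \lambda_0\mu_0 = \mu(\lambda - \lambda_0) + \lambda_0(\mu - \mu_0) \in I_n + I_n \subset I_n, \]

proving the continuity of the mapping

\[ (\lambda, \mu) \in R \times R \longmapsto \lambda\mu \in R \]

at $(\lambda_0, \mu_0)$.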

Now, assume that $(R, d)$ is complete and let $f(X)$, $\bar{f}(X)$ and $\theta$ be as in the statement of the proposition. In order to conclude the proof we shall apply Newton’s approximation method, as on p. 44 of [6]. Let us first observe that, if $h(X) \in R[X]$ and $\gamma \in R$, then $\overline{h(\gamma)} = \bar{h}(\bar{\gamma})$.

To prove the uniqueness, let $\lambda, \mu \in R$ be roots of $f(X)$ with $\bar{\lambda} = \bar{\mu} = \theta$. Since $f(\lambda) = 0$ and $\bar{\lambda} = \theta$ is a simple root of $\bar{f}(X)$, there is a $g(X) \in R[X]$ such that $f(X) = (X - \lambda)\, g(X)$ and $\bar{g}(\theta) \neq 0$; thus

\[ 0 = f(\mu) = (\mu - \lambda)\, g(\mu). \]

Therefore, since $\overline{g(\mu)} = \bar{g}(\theta) \neq 0$, we conclude that $g(\mu)$ is an invertible element of $R$; hence $\lambda = \mu$.

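To prove the existence, we claim that there is a sequence $(\lambda_n)_{n \geq 1}$ in $R$ so that $\bar{\lambda}_n = \theta$, $f(\lambda_n) \in I_n$ and $\lambda_{n+1} - \lambda_n \in I_n$ for each integer $n \geq 1$. Indeed, let $\lambda_1 \in R$ be such that $\bar{\lambda}_1 = \theta$. Then $\overline{f(\lambda_1)} = \bar{f}(\theta) = 0$, that is, $f(\lambda_1) \in I_1$. Now, let $n \geq 1$ be arbitrary, and suppose the existence of a $\lambda_n \in R$ such that $\bar{\lambda}_n = \theta$ and $f(\lambda_n) \in I_n$. Then, for every $h \in I_n$, $(\lambda_n + h) - \lambda_n \in I_n$ and $\overline{(\lambda_n + h)} = \bar{\lambda}_n + \bar{h} = \theta$. We shall show the existence of an $h \in I_n$ with $f(\lambda_n + h) \in I_{n+1}$. In fact, by Taylor’s formula [4, p. 387], there is a $\xi \in R$ so that

\[ f(\lambda_n + h) = f(\lambda_n) + h f'(\lambda_n) + h^2 \xi. \]

And, by hypothesis, $h^2 \xi = h(h\xi) \in I_n^2 \subset I_{n+1}$. But, since $\theta$ is a simple root of $\bar{f}(X)$, $\overline{f'(\lambda_n)} = (\bar{f})'(\theta) \neq 0$, that is, $f'(\lambda_n)$ is an invertible element of $R$. Thus, by taking $h = -f(\lambda_n)\,(f'(\lambda_n))^{-1} \in I_n$ and $\lambda_{n+1} = \lambda_n + h$, we arrive at $\bar{\lambda}_{n+1} = \theta$, $f(\lambda_{n+1}) \in I_{n+1}$ and $\lambda_{n+1} - \lambda_n \in I_n$, as desired.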

Finally, $(f(\lambda_n))_{n \geq 1}$ converges to 0 in $R$, because $d(f(\lambda_n), 0) \leq \frac{1}{2^n}$ for $n = 1, 2, \dots$. On the other hand, for $n, \ell = 1, 2, \dots$,

\[ d(\lambda_{n+\ell}, \lambda_n) \leq \max\{d(\lambda_{n+\ell}, \lambda_{n+\ell-1}), \dots, d(\lambda_{n+1}, \lambda_n)\} \leq \max\left\{\frac{1}{2^{n+\ell-1}}, \dots, \frac{1}{2^n}\right\} = \frac{1}{2^n}, \]

and hence $(\lambda_n)_{n \geq 1}$ is a Cauchy sequence in $(R, d)$. By the completeness of $(R, d)$, there is a $\lambda \in R$ to which $(\lambda_n)_{n \geq 1}$ converges. Consequently, in view of the continuity of the mappings

\[ (\alpha, \beta) \in R \times R \longmapsto \alpha + \beta \in R \quad \text{and} \quad (\alpha, \beta) \in R \times R \longmapsto \alpha\beta \in R, \]

$(f(\lambda_n))_{n \geq 1}$ converges to $f(\lambda)$; thus $f(\lambda) = 0$.

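Now, let us consider $K = R/I_1$ endowed with the discrete ultrametric $d_0$, given by $d_0(s, s) = 0$ and $d_0(s, t) = 1$ if $s \neq t$ ($s, t \in K$). Since the canonical surjection

\[ \lambda \in (R, d) \longmapsto \bar{\lambda} \in (K, d_0) \]

is continuous ($\overline{I_1} = \{0\}$) and $(\lambda_n)_{n \geq 1}$ converges to $\lambda$, $(\bar{\lambda}_n)_{n \geq 1}$ converges to $\bar{\lambda}$. Therefore $\bar{\lambda} = \theta$.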
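The statement just proved can be observed by exhaustive search in a small finite case. The sketch below assumes the ring $R = \mathbb{Z}/p^k\mathbb{Z}$ with the ideals $I_n = p^n R$ (zero for $n \geq k$): this is a finite local ring with residue field $\mathbb{Z}/p\mathbb{Z}$, the ideals satisfy $I_n^2 \subset I_{n+1}$ and have zero intersection, and a finite metric space is complete, so the hypotheses of Proposition 2.1 hold. The choice of ring and the helper names (`poly_eval`, `derivative`, `roots_mod`, `lifts_of_simple_roots`) are assumptions of the example, not part of the text.

```python
def poly_eval(coeffs, x, m):
    """Evaluate a_0 + a_1 x + ... + a_n x**n modulo m (coefficients in increasing degree)."""
    acc = 0
    for a in reversed(coeffs):  # Horner's rule
        acc = (acc * x + a) % m
    return acc


def derivative(coeffs):
    """Coefficients of the formal derivative, in increasing degree."""
    return [i * a for i, a in enumerate(coeffs)][1:]


def roots_mod(coeffs, m):
    """All roots of the polynomial in Z/mZ, found by exhaustive search."""
    return [x for x in range(m) if poly_eval(coeffs, x, m) == 0]


def lifts_of_simple_roots(coeffs, p, k):
    """Map each simple root theta modulo p to the roots modulo p**k that reduce to theta."""
    dcoeffs = derivative(coeffs)
    roots_big = roots_mod(coeffs, p ** k)
    lifts = {}
    for theta in roots_mod(coeffs, p):
        if poly_eval(dcoeffs, theta, p) != 0:  # theta is a simple root of the reduction
            lifts[theta] = [r for r in roots_big if r % p == theta]
    return lifts


if __name__ == "__main__":
    # f(X) = X**2 - 2 over Z/343Z, with p = 7 and k = 3.
    print(lifts_of_simple_roots([-2, 0, 1], 7, 3))  # {3: [108], 4: [235]}
```

Each simple root modulo $p$ has exactly one root above it, as the proposition predicts. For $f(X) = X^2$ the root $0$ modulo $p$ is not simple, so it is skipped by the function; the proposition makes no claim in that case.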

Corollary 2.1. Let $R$ be a discrete valuation ring which is complete under the ultrametric $d$ given in Proposition 2.1. Let $f(X) \in R[X]$ be such that $\bar{f}(X) \in K[X]$ admits a simple root $\theta$. Then there exists a unique root $\lambda$ of $f(X)$ such that $\bar{\lambda} = \theta$.

Proof. Follows immediately from Proposition 2.1, by recalling Example 2.1.

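Remark 2.1. Let $(K, |\cdot|)$, $\mu$ and $I_n$ ($n = 1, 2, \dots$) be as in Example 2.2. Then

\[ \tilde{d}(\alpha, \beta) = |\alpha - \beta| \]

is an ultrametric on $K$, and hence its restriction to $R \times R$ is an ultrametric on $R$ (which we shall also denote by $\tilde{d}$). Since, for $n = 1, 2, \dots$,

\[ \left\{ \lambda \in R;\ \tilde{d}(\lambda, 0) = |\lambda| \leq |\mu|^n \right\} = I_n = \left\{ \lambda \in R;\ d(\lambda, 0) \leq \frac{1}{2^n} \right\}, \]

$d$ being as in Proposition 2.1, it follows that $\tilde{d}$ and $d$ are equivalent.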

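Corollary 2.2. Let $(K, |\cdot|)$, $R$, $I_1$ and $\mu$ be as in Example 2.2, and assume that $(K, \tilde{d})$ is complete, where $\tilde{d}(\alpha, \beta) = |\alpha - \beta|$. If $f(X) \in R[X]$ and $\bar{f}(X) \in (R/I_1)[X]$ admits a simple root $\theta$, then there is a unique root $\lambda$ of $f(X)$ so that $|\lambda - \xi| \leq |\mu|$ (where $\xi \in R$ and $\bar{\xi} = \theta$).

Proof. Follows immediately from Remark 2.1 and Proposition 2.1.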

Corollary 2.3 (cf. [5, p. 16]). Let $p$ be a prime number, $\mathbb{Z}_p = \{\lambda \in \mathbb{Q}_p;\ |\lambda|_p \leq 1\}$ the ring of p-adic integers and $f(X) \in \mathbb{Z}_p[X]$. If there is an $a_0 \in \mathbb{Z}_p$ such that $|f(a_0)|_p < 1$ and $|f'(a_0)|_p = 1$, then there is a unique $a \in \mathbb{Z}_p$ such that $f(a) = 0$ and $|a - a_0|_p \leq \frac{1}{p}$.

Proof. Since the condition “$|f(a_0)|_p < 1$” is equivalent to the condition “$\bar{f}(\bar{a}_0) = \overline{f(a_0)} = 0$”, and the condition “$|f'(a_0)|_p = 1$” is equivalent to the condition “$(\bar{f})'(\bar{a}_0) = \overline{f'(a_0)} \neq 0$”, Theorem 6, p. 391 of [4] guarantees that $\bar{a}_0$ is a simple root of $\bar{f}(X)$. Therefore the result follows from Corollary 2.2.
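For a polynomial with integer coefficients, the Newton step used in the proof of Proposition 2.1 can be carried out modulo successive powers of $p$, which yields the first $k$ p-adic digits of the root described in Corollary 2.3. The following is a minimal sketch under these assumptions: the inputs are ordinary integers, so that “$|f(a_0)|_p < 1$” means that $p$ divides $f(a_0)$ and “$|f'(a_0)|_p = 1$” means that $p$ does not divide $f'(a_0)$; the function name `hensel_lift` and its signature are hypothetical.

```python
def poly_eval(coeffs, x, m):
    """Evaluate a_0 + a_1 x + ... + a_n x**n modulo m (coefficients in increasing degree)."""
    acc = 0
    for a in reversed(coeffs):  # Horner's rule
        acc = (acc * x + a) % m
    return acc


def derivative(coeffs):
    """Coefficients of the formal derivative, in increasing degree."""
    return [i * a for i, a in enumerate(coeffs)][1:]


def hensel_lift(coeffs, p, a0, k):
    """Return the residue a modulo p**k with f(a) = 0 (mod p**k) and a = a0 (mod p)."""
    dcoeffs = derivative(coeffs)
    if poly_eval(coeffs, a0, p) != 0:
        raise ValueError("p does not divide f(a0), so |f(a0)|_p < 1 fails")
    if poly_eval(dcoeffs, a0, p) == 0:
        raise ValueError("p divides f'(a0), so |f'(a0)|_p = 1 fails")
    a = a0 % p
    for n in range(1, k):
        m = p ** (n + 1)
        # Here f(a) is divisible by p**n and f'(a) is a unit modulo m.
        inv = pow(poly_eval(dcoeffs, a, m), -1, m)  # modular inverse (Python 3.8+)
        # Newton step h = -f(a) / f'(a); h is divisible by p**n, so the
        # Taylor remainder h**2 * xi vanishes modulo p**(n + 1).
        a = (a - poly_eval(coeffs, a, m) * inv) % m
    return a


if __name__ == "__main__":
    a = hensel_lift([-2, 0, 1], 7, 3, 4)  # f(X) = X**2 - 2, p = 7, a0 = 3
    print(a, (a * a - 2) % 7 ** 4)  # 2166 0
```

Each pass through the loop raises the precision by one power of $p$, mirroring the inclusion $I_n^2 \subset I_{n+1}$ with $I_n = p^n\mathbb{Z}_p$. The returned integer is only a truncation of the p-adic root, not the root itself.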

Example 2.3 (cf. [3, p. 52]). Let $p$ be a prime number, $p \neq 2$, and let $b \in \mathbb{Z}_p$ with $|b|_p = 1$. If there is an $a_0 \in \mathbb{Z}_p$ such that $|a_0^2 - b|_p < 1$, then $b = a^2$ for a unique $a \in \mathbb{Z}_p$ such that $|a - a_0|_p \leq \frac{1}{p}$.

Indeed, put $f(X) = X^2 - b \in \mathbb{Z}_p[X]$. Then $|f(a_0)|_p = |a_0^2 - b|_p < 1$ and $|f'(a_0)|_p = |2a_0|_p = |2|_p\, |a_0|_p = |a_0|_p = 1$ (the relation $|a_0^2 - b|_p < 1 = |b|_p$ implies $|a_0|_p^2 = |(a_0^2 - b) + b|_p = |b|_p = 1$). Thus the result follows from Corollary 2.3.

In the same vein one shows that if $p$ is a prime number, $p \neq 3$, $c \in \mathbb{Z}_p$, $|c|_p = 1$, and there is an $f_0 \in \mathbb{Z}_p$ such that $|f_0^3 - c|_p < 1$, then $c = f^3$ for a unique $f \in \mathbb{Z}_p$ such that $|f - f_0|_p \leq \frac{1}{p}$.
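The square-root and cube-root statements above can be tested numerically for integer data. The sketch below assumes that $b$ and $a_0$ are ordinary integers and applies the Newton step to $X^e - b$ for an exponent $e$ not divisible by $p$ (so $e = 2$ with $p \neq 2$, or $e = 3$ with $p \neq 3$); the function name `lift_root` and its argument order are hypothetical.

```python
def lift_root(b, e, p, a0, k):
    """Return a modulo p**k with a**e = b (mod p**k) and a = a0 (mod p)."""
    if e % p == 0:
        raise ValueError("the exponent must not be divisible by p")
    if b % p == 0:
        raise ValueError("b must be a unit, i.e. |b|_p = 1")
    if (pow(a0, e, p) - b) % p != 0:
        raise ValueError("a0**e is not congruent to b modulo p")
    a = a0 % p
    for n in range(1, k):
        m = p ** (n + 1)
        f_a = (pow(a, e, m) - b) % m        # f(a) = a**e - b
        df_a = (e * pow(a, e - 1, m)) % m   # f'(a) = e * a**(e - 1), a unit modulo m
        a = (a - f_a * pow(df_a, -1, m)) % m  # Newton step modulo p**(n + 1)
    return a


if __name__ == "__main__":
    r = lift_root(2, 2, 7, 3, 3)   # square root of 2 modulo 7**3 lying above 3
    print(r, (r * r) % 7 ** 3)     # 108 2
    s = lift_root(2, 3, 5, 3, 3)   # cube root of 2 modulo 5**3 lying above 3
    print(pow(s, 3, 5 ** 3))       # 2
```

The derivative $e\,a^{e-1}$ is invertible because $a$ is a unit (its $e$-th power is congruent to the unit $b$) and $p$ does not divide $e$, which is exactly where the restrictions $p \neq 2$ and $p \neq 3$ enter.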


References

[1] E. Artin. Algebraic Numbers and Algebraic Functions, American Mathematical Society, Providence, Rhode Island, 2005.

[2] N. Bourbaki. Commutative Algebra, Hermann and Addison-Wesley, Paris and Reading, Massachusetts, 1972.

[3] J. W. Cassels. Local Fields, London Mathematical Society Student Texts 3, Cambridge University Press, Cambridge, 1986.

[4] R. Godement. Cours d’algèbre, Troisième édition, Enseignement des Sciences, Hermann, Paris, 1966.

[5] N. Koblitz. p-adic Numbers, p-adic Analysis, and Zeta-Functions, Second edition, Springer-Verlag, Berlin, Heidelberg, New York, 1984.

[6] J. P. Serre. Corps Locaux, Quatrième édition, Actualités Scientifiques et Industrielles 1296, Hermann, Paris, 1968.

[7] J. P. Serre. A Course in Arithmetic, Third printing, Graduate Texts in Mathematics 7, Springer-Verlag, Berlin, Heidelberg, New York, 1985.
