Section 7.2 From Linear to General
ΒΆSubsection 7.2.1 Combining solutions
ΒΆOne of the most important things we can do is study congruences with prime (power) modulus, because we can combine their solutions to get solutions for any congruences when we combine the Chinese Remainder Theorem and Fundamental Theorem of Arithmetic (recall Proposition 6.5.1). Even more interestingly, we can combine the numbers of solutions. Informally, if you want to get the total number of solutions of a polynomial congruence, just write the modulus as a product of prime powers n=βki=1peii, find out how many solutions the congruence has with each prime power modulus, then multiply those numbers for the total number of solutions.Example 7.2.1.
For instance, if f(x)β‘0 has 2 solutions modulo 3, 1 solution modulo 5, and 3 solutions modulo 7, it would have 2β 1β 3=6 solutions modulo 105=3β 5β 7.
Fact 7.2.2.
Let n1,n2,β―,nk be a set of k mutually coprime moduli. Suppose that for some polynomial f(x) you know that there are Ni (congruence classes of) solutions to
Then the congruence
Proof.
For all \(i\text{,}\) among the \(N_i\) solutions to the \(i\)th congruence choose a solution \(a_i\text{,}\) so that
Since the moduli \(n_i\) for these congruences are coprime, we can use the Chinese Remainder Theorem to obtain one number \(a\) such that \(a\equiv a_i\) (mod \(n_i\)) for all \(i\text{.}\)
Since polynomials are exclusively made up of addition and multiplication, and addition and multiplication are well-defined, we also have \(f(a)\equiv f(a_i)\equiv 0\) (mod \(n_i\)), so as promised we have a solution
Each such set of \(a_i\) will yield a solution, and if \(\{a_i\}_{i=1}^k\neq \{b_i\}_{i=1}^k\) then if \(a_j\not\equiv b_j\) (mod \(n_j\)) they certainly are not equivalent modulo \(\prod_{i=1}^k n_i\) either.
Now multiply how many solutions there are for each \(n_i\) to get the total number of combinations of solutions. If there are \(N_i\) solutions modulo \(n_i\text{,}\) we would get \(\prod_{i=1}^k N_i\text{.}\) There aren't any additional answers, because any answer to the βbigβ congruence automatically also satisfies the βlittleβ ones; if \(\prod_{i=1}^k n_i\mid f(a)\text{,}\) then certainly \(n_i\mid f(a)\) as well.
Subsection 7.2.2 Prime power congruences
ΒΆWe have already discussed prime power congruences in Subsection 6.5.2. Recall that in Examples 6.5.3 and 6.5.4 we took the (obvious) solution of 4xβ‘1 (mod 7) (namely, x=[2]), and got solutions (mod 49) and even (mod 343) from it relatively easily. But that is essentially the same as asking for solutions to 4xβ1β‘0, a linear congruence. Let's see if we can generalize this method for more general polynomial congruences. The key was taking the already known fact 7β£1β4β 2 and then cancelling out 7 from the entire congruence to get thatTheorem 7.2.3. Hensel's Lemma.
For p prime and eβ₯2, suppose you already know a solution equivalence class xeβ1 (mod peβ1) of the (polynomial) congruence
Assume the technical condition that gcd(p,fβ²(xeβ1))=1. Then there is also a solutionβ1βGiven these conditions, it will be the only one of this form. to
of the form
where k satisfies
Proof.
If \(p\) and \(f'(x_{e-1})\) are relatively prime, then by Proposition 5.1.1 any linear congruence of the form \(f'(x_{e-1})k\equiv b \text{ (mod }p)\) with coefficient \(a=f'(x_{e-1})\text{,}\) unknown \(k\text{,}\) and known \(b\) can be solved (uniquely modulo \(p\text{,}\) given the \(\gcd\) condition). Since \(x_{e-1}\) is a known zero of \(f(x)\) for modulus \(p^{e-1}\text{,}\) we know that as an integer (not modulo anything) \(p^{e-1}\mid f(x_{e-1})\text{.}\)
This means that \(-\frac{f(x_{e-1})}{p^{e-1}}\) exists in \(\mathbb{Z}\text{,}\) so if we set \(b=-\frac{f(x_{e-1})}{p^{e-1}}\) there will indeed be a solution \(k\) to the congruence \(\frac{f(x_{e-1})}{p^{e-1}}+k\cdot f'(x_{e-1})\equiv 0\text{ (mod }p\text{)}\text{.}\) Then the only question becomes why \(x_{e}=x_{e-1}+kp^{e-1}\) is actually a solution to \(f(x)\equiv 0\text{ (mod }p^{e})\text{.}\)
To see this, think of \(f\) as a polynomial with terms of the form \(c_i x^i\text{,}\) e.g. \(f(x)=\sum_{i=0}^n c_i x^i\text{.}\) Then \(f(x_{e-1}+kp^{e-1})\) can be expanded out term-by-term in the following form:
Let's break this down on a term-by-term basis in the sum. Each term will look like
Since \(e\geq 2\) in this context, the extra terms (from Taylor or binomial series)β2βOne way or another one of these series will have to enter in, unfortunately; [C.2.1, Section 4.3] has more of a binomial theorem-esque treatment, while [C.2.13, Theorem 4.7] and [C.5.1, Theorem 6.2] more explicitly invoke Taylor series. involving \(p^{(e-1)/2}\) will be divisible by at least \(p^e\) and hence be irrelevant in that modulus, so each term in the sum will be equivalent to
We're nearly done with the individual terms. Recall Proposition 5.2.6 where we are allowed to cancel a nonzero divisor from βall three sidesβ of a congruence. That motivates dividing each term and the modulus by \(p^{e-1}\) to get
Now add up the terms of the sum for all \(i\) to find out something about \(f(x_e)\text{.}\) Summing up the \(x_{e-1}^i\) will give us \(f(x_{e-1})\text{,}\) and summing up \(i x_{e-1}^{i-1}\) is adding terms that look like the derivative of polynomials, so modulo \(p\) we have
which is divisible by \(p\) by hypothesis.
Now we have that
so we multiply everything by \(p^{e-1}\) (this time actually using Proposition 5.2.6) and get \(f(x_{e})\equiv 0\text{ (mod }p^e)\) as desired.
Example 7.2.4.
First we solve x2+1β‘0 (mod 25). By inspection, the solutions modulo 5 are [2],[3] (or [Β±2]). So solutions modulo 25 will look like 3+kβ 5 or 2+kβ 5. Further, fβ²(x)=2x, so for either solution modulo 5 the technical derivative condition is met.
Let x1=3. Then the condition for k is
which simplifies to 2+6kβ‘0, which solves to kβ‘β2β‘3. Then our solution to the congruence modulo 25 would be
And indeed 182+1=325 is divisible by twenty-five.
Now try the same procedure with x1=2 to get the solution x2β‘7 in Exercise 7.7.3. (If you get stuck, see Example 16.1.3.)
Example 7.2.5.
We can try the same process with e=3. Taking (from the previous example) x2β‘7 yields, as a condition for k,
This gives k=2, and indeed
yields
It's good practice to try the same process with x1=18 instead in Exercise 7.7.3.