# Apostol: Introduction to Analytic Number Theory Solutions

## Foreword: Tommy, Greg, Sean and me


I thought I'd take a run at analytic number theory as a self-study course
by means of Professor Apostol's fine introductory book. (I last had the
privilege of enjoying one of Professor Apostol's lectures in person in 1983.
At that time my fellow classmates affectionately referred to him (or his calculus books,
depending on context) as *Tommy*.) Anyway, I am enthusiastic about
both the topic and the process of learning it.


On starting to work through the book I found I
understood theorems and proofs in the 'in the moment' sense;
but internalizing them proved challenging.
This became more evident as I tried exercises with
very little success.
Casting about for guidance on the web I came across
solutions by both Sean Li and Greg Hurst. These proved
extremely helpful; and as a consequence the solutions I'm
writing here are frequently those of Messrs. Li and Hurst
filtered through my lens.



Thanks Sean and Greg.


Thanks Tommy.

## Solution style elements


More than anything the key element to progress learning this material
is time and patience (plus cash to buy scratch paper). In the sense of
accumulating tools, the plan is to pick them up and internalize them
in passing. I will give two examples; and for the record many many
others can be found in Aigner and Ziegler's 
excellent **Proofs From The Book**.


Example: It can be useful to 'multiply something by one'; and
$c \cdot 1$ is fine as far as it goes. But your professional
number theorist will take a given 
$(a, b) = 1$ and proceed to $c = c \cdot (ax + by)$ 
for some $x$ and $y$. 


Example: $\alpha^n-1$ may suggest a factorization that uses the
$- \; 1$ to telescope a product back to an original form. 
Your professional
number theorist will go further, taking $\alpha^n + 1$ as an 
invitation to create a factorization that uses alternating
signs $+ \; - \; + \; - \; \dots - \; +$ to collapse the telescope. 
(See problems **1.16** and **1.17**.)


In computer programming there is a notion of *code golf* where a given
piece of code is shortened as much as possible. *Solution golf* is a
temptation in writing up answers to problems here. The result is often
text that is tedious to parse; so I'll try and go back over solutions
and expand them out into a more conversational style. 


On method: I've noticed *ducking the wave* is tempting for some 
problems. An intended elegant solution
(see *hints*, for example) can be avoided through pedestrian means;
drudgery, solving for a set of cases that span $Z$
and so on. I often start working a problem by working
examples; and resulting patterns can
suggest *mechanical* solutions that duck the wave. 
Problem 2.5 is an example. 


Problem solutions ought to strive
for two things: They should enhance, not impede, one's 
understanding; and they should be fun to read.

## Chapter 1



### Some useful ideas:


* 'a divides b' can be de-mystified by writing 
it as multiplication by some number. 
$a | b$ means $b = a q$. I tend to make this substitution
without explaining *some $q$ exists for which...*


* Another substitution principle: 
$(a, b) = c \implies \exists \; x \textrm{ and } y \backepsilon c = xa + yb$.
This does not mean *any* $x$ and $y$ will suffice. One must find the 
*correct* $x$ and $y$. However as an intermezzo I will mention that there are two 
important qualifiers for any $x$ and $y$, i.e. for an
arbitrary linear combination of $a$ and $b$.
    * An arbitrary linear combination $xa + yb$ will never be a positive 
common factor of $a$ and $b$ *less than* $(a, b)$. 
    * An arbitrary linear combination $xa + yb$ will 
always be a *multiple* of $(a, b)$.<BR><BR>
* Having $c$ is like replacing $1$ with a new atomic unit that re-scales the number line. 
This atomic nature of the gcd is apparent in un-solvable gallon puzzles, for example 
'You have two containers with capacities four gallons and six gallons. The horses 
need exactly five gallons of water.' Might as well just give them six and suffer their
disapprobation.
* Corollary: The linear combination for $c$ is reversible: 
$x$ and $y$ can be presumed to exist and thus $ax + by$ 
can be substituted for $c$. 
* Corollary: The linear combination that yields $c$ is 
not unique. Suppose $x$ and $y$ are established to have definite 
values to yield $c$. There are any number of other pairs for 
$x$ and $y$ that yield this same gcd. Take for example $a=15$
and $b=24$. Then $c=3=15\cdot(-3)+24\cdot(2)$. So $x=-3$ and
$y=2$ give $c=3$. For any $n$ we have $n(8a-5b)=0$ so, to generalize,
$x=-3+8n$ and $y=2-5n$ give $c=3$ for any choice of $n$.
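
The non-uniqueness corollary can be tried out numerically. What follows is a minimal sketch of the extended Euclidean algorithm, not code from the `ANT_examples` notebook; the helper name `extended_gcd` and the variable `family` are illustrative choices of mine.

```python
def extended_gcd(a, b):
    """Return (g, x, y) with g = gcd(a, b) and a*x + b*y == g."""
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        # Each row keeps the invariant a*old_x + b*old_y == old_r.
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


g, x, y = extended_gcd(15, 24)

# Shifting (x, y) by multiples of (b/g, -a/g) = (8, -5) leaves a*x + b*y unchanged.
family = [(x + 8 * n, y - 5 * n) for n in range(-2, 3)]
for u, v in family:
    print(u, v, 15 * u + 24 * v)
```

Every printed row should end in the same gcd, which is the point of the corollary: one pair of coefficients generates a whole family.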


For more on this gcd thought process see the remark in problem 1.2.
    

Finally a remark on the obvious, one of many to follow. If $a | b$ and $a | (b + c)$
then $a | c$. Makes sense. But check out how this is used in problem 1.18 where
$c$ is prime (in this case $2$) and $a$ is the gcd of two numbers. This gives us
the logical result that $a$ must be $1$ or $c$. I'm not sure this sort of thing 
is *obvious* obvious until it is introduced a bit. 
    
    
The **`ANT_examples`** notebook has some code for working through actual cases.
    
    
#### Problems 1--6: 'Prove the statements'

##### 1.1 Show: If $(a, b) = 1$ and $c|a$ and $d|b$ then $(c, d) = 1$.




$a=cq$ and $b=dr$. Hence $1=ax+by=cqx+dry=(qx)c + (ry)d$ so $(c, d)=1 \; {\Box}$


...taking advantage of 'one and only one' in Theorem 1.3. (Sean Li does proof by contradiction.)

##### 1.2 Show: If $(a,b) = (a,c) = 1$ then $(a, bc) = 1$.

The inspiration here is to show that $x$ and $y$ exist (in some form) for $(a, bc)$ with
$a \cdot x + bc \cdot y = 1$. That being the case we are led to understand that there is 
no *larger* value available for the gcd of $a$ and $bc$. Let me return to that after the
arithmetic, thus: 

$(a,b)=(a,c)=1 \implies 1 \cdot 1=(ax+by)\cdot(au+cv)=a(aux+cvx+buy)+bc(yv)$

In other words $aq + (bc)r = 1$ so $(a, bc) = 1$. So that earns the $\Box$ but I'm
very puzzled by something here; so let's digress for a moment. 


Theorem 1.2: A common divisor exists and can be expressed as a linear combination (LC) 
of $a$ and $b$. So far so good. 


Confusing: An arbitrary linear combination (choice of $x$ and $y$) does not imply
$ax+by$ is a divisor of $a$ and $b$. Take $(7, 3) = 1$ for example: A trivial LC
gives $2$. 


Try to force an LC to give an inferior common divisor: This can't be done. 
The gcd acts as a building block of both $a$ and $b$ so LCs will always be
expressed in terms of this building block. This quantization concept is helpful.


Result: Only *one* divisor, the gcd of $a$ and $b$ can be expressed as an LC of $a$
and $b$. Other divisors: No, can't get there from here. However *multiples* of this
gcd can also be produced by LC: Just multiply the gcd LC by whatever factor you
like. Here is an instructive example: 


$a=2 \cdot 3 \cdot 11$ and $b=2 \cdot 5 \cdot 7$.


$(66, 70) = 2 = 66 \cdot x + 70 \cdot y$. ($x=-18$ and $y=17$.)


Multiply $x$ and $y$ by $3$ to arrive at $6$. Now $6$ is not a common divisor 
of $66$ and $70$ but it *is* an LC of $66$ and $70$. So if I create 
an LC of two numbers that is less than both of them: That number is an upper
bound for the gcd. Of course if it is equal to $1$ then it *is* the gcd
because we may not go any lower than $1$.


This gives me a better feeling about reaching an $LC = 1$ as a means to
concluding $a$ and $b$ are relatively prime. (Or for this problem 
that $a$ and $bc$ are relatively prime.) 


Final observation: Any LC of $a$ and $b$ will be a multiple of $(a, b)$. 
Or in other words $(a,b)|(ax+by)$ for any $x$ and any $y$.
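
The quantization idea is easy to poke at by brute force. This is a sketch under my own assumptions: the coefficient window of $-40$ to $40$ is arbitrary, and the names `combos` and `smallest_positive` are illustrative.

```python
from math import gcd

a, b = 66, 70
g = gcd(a, b)

# Every linear combination a*x + b*y with coefficients in a small window.
combos = {a * x + b * y for x in range(-40, 41) for y in range(-40, 41)}

# The smallest positive combination found in the window.
smallest_positive = min(c for c in combos if c > 0)

print(g, smallest_positive)
print(all(c % g == 0 for c in combos))
```

The window is wide enough to contain the pair $x=-18$, $y=17$ from the example above, so the smallest positive combination it finds is the gcd itself.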

##### 1.3 If $(a, b) = 1$ then $(a^n,b^k)=1$ for all $n \ge 1, \; k \ge 1$.

Suppose $(a^n, b^k) > 1$. Then $\exists \textrm{ some prime } p
\backepsilon p|a^n \textrm{ and }
p|b^k$. Being prime, this $p$ must therefore divide both $a$ and $b$, reaching a
contradiction of the given that $(a, b) = 1$. $\Box$

##### 1.4 If $(a, b) = 1$ then $(a+b, a-b)$ is either $1$ or $2$.

Since $1$ is the least of gcd values, and since I can provide
an example with $a=10 \textrm{ and } b=7 \textrm{ where }(17,3)=1$: 
The value of $(a+b, a-b)$ can be 1. Similarly it might be $2$ by
example, say $a=9 \textrm{ and } b=7$. Can $(a+b,a-b)$ be some $c>2$?


If so: $a+b=c \alpha \textrm{ and } a-b=c \beta$.
Their sum gives $2a=c(\alpha + \beta)$ and their difference gives $2b=c(\alpha - \beta)$. That is,
$c | 2a$ and $c | 2b$, so $c$ divides their gcd $(2a, 2b) = 2(a, b) = 2$.
But $c > 2$ cannot divide $2$,
a contradiction. The degenerate case
$\alpha = - \beta$ is harmless: it implies $a=0$ and $(0, b) = |b|$.
But again we have it from a reliable source that $(a, b) = 1$
so $b=\pm 1$ and we have $(0 + b, 0 - b) = 1$.


This excludes $c > 2$ and we are left with $(a + b, a - b) = 1 \textrm{ or } 2$.
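
A quick sanity check, not a proof: collect every value of $(a+b, a-b)$ over coprime pairs in a small range. The range bound of 60 and the name `values_1_4` are arbitrary choices for this sketch.

```python
from math import gcd

# All gcd values (a+b, a-b) seen over coprime pairs 1 <= a, b < 60.
values_1_4 = {
    gcd(a + b, a - b)
    for a in range(1, 60)
    for b in range(1, 60)
    if gcd(a, b) == 1
}
print(sorted(values_1_4))
```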

##### 1.5 Show that if $(a,b)=1$ then $(a+b, a^2-ab+b^2)$ is either $1$ or $3$.


As in **1.4**: Suppose $(a+b, a^2-ab+b^2) = c$. $c = 1$ is the floor value; is it also
demonstrable? Take $a=4 \textrm{ and }b=3: \; (7,13)=1$, so yes. How about $c=3$?
Take $a=5 \textrm{ and } b=7 \textrm{: This results in }(12, 39)=3$. So both $c=1$ and $c=3$ 
are possible gcd values. Are other gcd values possible? Let's suppose
so for some $c > 1$. $c$ does not divide one of $a$ or $b$ since it would then divide 
the other (as it divides $a+b$) which would violate $(a,b)=1$. 


We write $'(a+b, a^2-ab+b^2) = c' \textrm{ as }a+b=c \alpha \textrm{ and }a^2-ab+b^2 = c \beta$.
Squaring the first and subtracting the second gives $3ab=c(c \alpha^2 - \beta)$. In other words
$3ab = c \gamma$, presuming $c > 1$. 


Let $p$ be ***any*** prime factor of $c$. Can this $p$ divide either $a$ or $b$?
Suppose $p|a$.
Because $c | (a+b)$ it is also the case that
$p | (a+b)$. If $p|(a+b)$ and $p|a$ then $p|b$; but this
violates $(a,b)=1$. Consequently $p \nmid a$ and by the same argument $p \nmid b$.
No prime factor of $c$ divides $ab$, so $(c, ab) = 1$; and then $c | 3ab$ forces $c | 3$.
Therefore $c=3$ is the only alternative to $c=1$. $\Box$


The same rationale used in the previous problem applies to setting $\gamma = 0$ where
the end result is a gcd of $1$. 


I think the crux here is the transition from a hypothetical gcd $c$ to *any* of its prime 
factors in the reasoning. Not as successful: Thinking in terms of *some* single prime factor
of $c$, call it $p$. This is muddier because it does not yield the clear implication
that $c \nmid ab$. 
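
As with **1.4**, a brute-force look at small cases is reassuring. This is only a sketch; the range bound and the name `values_1_5` are my own choices.

```python
from math import gcd

# All gcd values (a+b, a^2 - ab + b^2) seen over coprime pairs 1 <= a, b < 60.
values_1_5 = {
    gcd(a + b, a * a - a * b + b * b)
    for a in range(1, 60)
    for b in range(1, 60)
    if gcd(a, b) == 1
}
print(sorted(values_1_5))
```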


##### 1.6 Show that if $(a, b)=1$ and $d|(a+b)$ then $(d,a)=(d,b)=1$.

Set $(d,a)=c$. Then $d = c \alpha \textrm{ and } a = c \beta$. 
As $d|(a + b) \textrm{ it is also the case that } c | (a + b): \; c \gamma = a + b = c \beta + b$.
Hence $b = c(\gamma - \beta) \textrm { meaning that } c|b$. From $a = c \beta$ we also have $c|a$.
This means $(a, b) \ge c$ but we are given that $(a, b) = 1$ so $c = 1 = (d, a)$. 
By symmetry $(d, b) = 1$. $\Box$

##### 1.7 If the sum of two reduced fractions is an integer, say $a/b + c/d = n$ where $(a,b)=(c,d)=1$, show that $|b|=|d|$. 

$n = \frac{ad + bc}{bd} \implies b|(ad+bc)$ and $d|(ad+bc)$. That is, the integer $n$
requires the denominator cancels out. 


Now here is the 
insight: If $q|(r+s)$ and if $q|r$ then $q|s$; so as $b|bc$ it must also
divide $ad$. But $b$ shares no factor with $a$ (recall $(a,b)=1$) so $b|d$ by Euclid's lemma. By
the same reasoning $d|b$. So they divide one another and are the same
give or take a minus sign: $|b|=|d|, \;\Box$


This problem is of particular help in getting on with problem 1.13.

##### 1.8 A number is *squarefree* if it is not divisible by the square of any prime. Prove that for every $n \ge 1$ there exist unique $a > 0$ and $b > 0$ such that $n = a^2 b$ with b *squarefree*.

The numbers $a$ and $b$ can be built by a decomposition of each
$\alpha_{i}$ prime exponent of $n$ as 
$2 \cdot \lfloor {\alpha_i/2} \rfloor + \alpha_{i} \; mod \; 2$.
We then have $a=\prod p_i^{\lfloor {\alpha_i/2} \rfloor} \textrm{ and }
b=\prod p_i^{\alpha_{i} \; mod \; 2}$ so that $n=a^2b$.


To demonstrate this factorization is unique, frankly I'd just
cite the FTA. I suppose we could also say that any migration
of a prime factor from $b$ to $a^2$ is impossible since $b$
is squarefree; and likewise in the other direction.
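
The floor-and-mod construction translates directly into code. The following is a sketch using plain trial division; the function name `square_part` is hypothetical and chosen only for this illustration.

```python
def square_part(n):
    """Return (a, b) with n == a*a*b and b squarefree, for n >= 1."""
    a, b = 1, 1
    p = 2
    while p * p <= n:
        e = 0
        while n % p == 0:
            n //= p
            e += 1
        a *= p ** (e // 2)  # floor(alpha_i / 2) copies of p go into a
        b *= p ** (e % 2)   # alpha_i mod 2 copies of p go into b
        p += 1
    # Whatever remains is 1 or a single prime with exponent 1.
    return a, b * n


print(square_part(360))
```

For $360 = 2^3 \cdot 3^2 \cdot 5$ this gives $a = 6$ and $b = 10$, and indeed $6^2 \cdot 10 = 360$.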


##### 1.9 a) Show or give a counterexample: If $b^{2}|n$ and $a^{2}|n$ and $a^2 \le b^2$ then $a | b$.

For $n = 36, a = 2, b = 3$ the conditions are met but $ 2 \nmid 3, \; \otimes$.

##### 1.9 b) Likewise: If $b^2$ is the largest square divisor of $n$, then $a^2 | n$ implies $a | b$. 

True by decomposition: As in **1.8** we can manufacture $b$ from the prime factorization $n = \prod {p_i}^{\alpha_i}$:
$b = \prod {p_i}^{\lfloor \alpha_i / 2 \rfloor}$. As $a^2|n$, each prime $p_i$ appears in $a$ with some exponent $e_i$
where $2 e_i \le \alpha_i$, that is $e_i \le \lfloor \alpha_i / 2 \rfloor$. Every prime power in $a$ therefore also divides $b$, so $a | b$.


Proofs like this one live somewhere in the grey area between *proof by it's obvious* and
*proof by assertion*. Greg Hurst does a much better job being rigorous.

##### 1.10 Given $x$ and $y$, let $m = ax + by$ and $n = cx + dy$ where $ad - bc = \pm 1$. Prove $(m, n) = (x, y)$.


Part 1: Looking ahead we need a general
result, taking $q, r, s > 0$ to avoid absolute value clutter:
If $q|r$ and $q|s$ then $q |(r, s)$. This is easy to 
show: $q|r \implies r = q \alpha$ and likewise $s = q \beta$.
Then $(r, s) = (q \alpha, q \beta) = q (\alpha, \beta)$ from theorem 1.3c; 
so $q|(r,s)$.


Part 2: 
The condition imposed on 
$a, b, c, \textrm{and} \; d$ 
suggests the determinant of a 2D linear transform. 
From linear algebra: The inverse of a $2 \times 2$ matrix
$
A=\left[{\begin{array}{cc}
   a & b \\
   c & d \\
  \end{array}}\right]
\; \; \; $
is
$\; \; \; A^{-1} = \frac{1}{det A} \cdot \left[{\begin{array}{cc}
   d & -b \\
   -c & a \\
  \end{array} }\right]$ with $det A = ad - bc$.
  
  
Writing $x$ and $y$ as a column vector
$\tilde{v}=\left[{\begin{array}{c}x\\y\end{array}}\right]$
and the result of the transform as
$\tilde{k}=\left[{\begin{array}{c}m\\n\end{array}}\right]$
we have $A \tilde{v} = \tilde{k}$ and $\tilde{v}=A^{-1}\tilde{k}$.

$\left[{\begin{array}{c}x\\y\end{array}}\right] =
\frac{1}{det A} \cdot \left[{\begin{array}{cc}
   d & -b \\
   -c & a \\
  \end{array} }\right]
\left[{\begin{array}{c}m\\n\end{array}}\right] =
\pm 1 \cdot \left[{\begin{array}{cc}
   d & -b \\
   -c & a \\
  \end{array} }\right]
\left[{\begin{array}{c}m\\n\end{array}}\right] 
$

In rude mechanical terms
$x = \pm (md-nb) \textrm{ and } y = \pm (na-mc)$.
However we do not need to write this out as such.
Rather say: 
$x$ and $y$ are linear combinations of $m$ and $n$
so $(m, n)|x$ and $(m, n)|y$. This in turn (from part 1)
means that $(m, n)|(x, y)$. 

In the problem statement we see $m$ and $n$ are linear combinations 
of $x$ and $y$. By the same argument then $(x, y)|(m, n)$.
As $(x, y)|(m, n)$ and $(m, n)|(x,y)$ the two are equal,
$(m, n)=(x,y)$, $\Box$.
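
The claim can also be spot-checked over all small integer matrices of determinant $\pm 1$. This is a sketch only: the entry range $-3$ to $3$ and the names `unimodular` and `all_equal` are arbitrary choices of mine.

```python
from itertools import product
from math import gcd

entries = range(-3, 4)

# All 2x2 integer matrices [[a, b], [c, d]] with small entries and determinant +1 or -1.
unimodular = [
    (a, b, c, d)
    for a, b, c, d in product(entries, repeat=4)
    if a * d - b * c in (1, -1)
]

# math.gcd ignores signs and returns gcd(0, 0) == 0, which suits this check.
all_equal = all(
    gcd(a * x + b * y, c * x + d * y) == gcd(x, y)
    for a, b, c, d in unimodular
    for x, y in product(range(-6, 7), repeat=2)
)
print(len(unimodular), all_equal)
```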

##### 1.11 Prove $n^{4}+4$ is composite for $n > 1$.


An approach would be to factor this expression, for example: $n^{4}+4 = (n^2+an+2)(n^2+bn+2)$.
Happily this works out using $a=2$ and $b=-2$. Both resulting factors are greater than $1$
for all values of $n > 1$, since the smaller one is $n^2-2n+2 = (n-1)^2+1$. $\Box$

***In 12, 13 and 14 we have $a, b, c, m, n$ denoting positive integers.***

##### 1.12 a) Prove or provide a counterexample: $a^n | b^n \implies a | b$


For $n = 1$ the assertion is true. Continuing with $n > 1$:
Set $d = (a, b)$ and write $a = d \alpha$ and $b = d \beta$ with $(\alpha, \beta) = 1$.
Cancelling $d^n$ from $a^n | b^n$ leaves $\alpha^n | \beta^n$.
Problem **1.3** gives $(\alpha^n, \beta^n) = 1$; and since $\alpha^n$ divides both
$\alpha^n$ and $\beta^n$ it divides their gcd, so $\alpha^n | 1$.
This last step implies $\alpha=1$, hence $a = d$ and $a | b$. $\Box$


##### 1.12 b) Prove or provide a counterexample: $n^n | m^m \implies n | m$. 


This assertion is false. The premise implies $n \le m$. Suppose that $n \nmid m$ even though all
the prime factors of $n$ are also prime factors of $m$. For example $25$ and $55$. 
That statement concerns $n$ and $m$ without exponentiation. 


Now let's transition to $m^m$. This can stack up a lot of prime factors, eventually reaching or exceeding
the number of prime factors found in $n^n$. Whereupon it will be the case that $n^n | m^m$. 
Calculations follow for three examples. In the first case $4 \nmid 10$ but $10^{10}$ contains
$10$ factors of $2$ exceeding the $8$ found in $4^4$; so $4^4 \mid 10^{10}$ and 
this is a counterexample. $\otimes$

See the **`ANT_examples.ipynb`** notebook for some examples.
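
Independently of that notebook, here is a small brute-force search for counterexamples. It is a sketch only; the search ranges and the name `counterexamples` are arbitrary choices of mine.

```python
# Pairs (n, m) with n^n dividing m^m even though n does not divide m.
counterexamples = [
    (n, m)
    for n in range(2, 13)
    for m in range(n, 40)
    if m % n != 0 and pow(m, m) % pow(n, n) == 0
]
print(counterexamples[:5])
```

The pair $(4, 10)$ from the discussion above should appear in the list.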

##### 1.12 c) Unsolved: For $n > 1$ show $a^n | (2b^n) \implies a|b$. 


If $a$ is odd we have $a^n|b^n$ and **1.12a** establishes the implication $a|b$.


If $a$ is even then suppose $a=2^sc$ and so $2^{ns} \cdot c^n | 2 \cdot b^n$.



##### 1.13 a) If $(a, b)=1$ and $(a/b)^m=n$ show that $b=1$.

In problem 1.7 we show that if the sum of two reduced fractions is an integer then their denominators are the same (or negatives). 
In this case we have $\frac{a^m}{b^m}-\frac{n}{1}=0$, where $\frac{a^m}{b^m}$ is reduced by problem 1.3; and since $b$ is positive we have $b^m=1$ and therefore $b=1. \; \Box$

##### 1.13 b) If $n$ is not the $m$th power of a positive integer, show that $n^{1/m}$ is irrational.

Requiring $n > 1$ let us suppose the contrary, that $n^{1/m}=a/b$, a rational number.
Equivalently $n = \frac{a^m}{b^m}$. From 1.13a we have $b=1$ and $n=a^m$. Thusly
$n$ *is* the $m$th power of a positive integer, a contradiction. $\Box$ 

##### INCOMPLETE 1.14 If $(a, b) = 1$ and $ab=c^n$ show that $x$ and $y$ exist such that $a=x^n$ and $b=y^n$. [Hint: Consider $d=(a,c)$.]


$d = (a,c) = au + cv$ and $e = (b,c) = bq + cr$. 

$c^n = \left( \frac{d-au}{v} \right)^{n} = ab$.

##### 1.15 Show that every $n \ge 12$ is the sum of two composite numbers. 

$n$ even can be written $n = 4 + (n - 4) = 2\cdot 2 + 2 \cdot (\frac{n}{2}-2)$, a sum of two composites.

$n$ odd can be written $n = 9 + (n - 9)$ and noting $n - 9$ is even we have $n = 3\cdot 3 + 2\cdot(\frac{n-9}{2})$, again a sum of two composites.

The even case works from $n=8$ and the odd case from $n=13$, which together cover every $n \ge 12$. $\Box$
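
The two-case recipe is mechanical enough to check by machine. A sketch, with the hypothetical helper names `is_composite` and `two_composites`:

```python
from math import isqrt


def is_composite(k):
    """True when k has a divisor strictly between 1 and k."""
    return k > 3 and any(k % d == 0 for d in range(2, isqrt(k) + 1))


def two_composites(n):
    """Split n >= 12 as 4 + (n - 4) when n is even, 9 + (n - 9) when n is odd."""
    first = 4 if n % 2 == 0 else 9
    return first, n - first


for n in (12, 13, 14, 101):
    print(n, two_composites(n))
```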

##### 1.16 Show that if $2^n-1$ is prime, $n$ is prime.

The appearance of $x^\alpha-1$ is
suggestive of a telescoping factorization.
Factors in turn imply composite; so this proceeds by way of
contradiction. Assume $n$ is *not* prime, $n = ab$
for $a, b > 1$. Factor $2^{ab}-1$, establishing that both factors
are greater than one, the requisite contradiction;
hence $n$ must be prime.


$2^{ab}-1 = (2^a-1)(2^X + 2^{X-a} + 2^{X-2a} + \dots + 2^{a} + 2^{0})$.


What must $X$ be in order for the product of the two first terms 
$2^a \cdot 2^X$ to be $2^{ab}$? 
$X = ab - a = a(b-1)$. Now all the other product terms arising from $2^a$ 
cancel those from $(-1)$ but for the last, $-1 \cdot 1 = -1$ and the telescoping
factorization is ok. As $a>1$ the first factor is greater than $2$. Likewise 
$a(b-1)>1$. 'Both factors greater than one' implies the 
contradiction that $2^n-1$ is not prime if $n$ is not prime. $\Box$
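
To see the telescope close up numerically, here is a sketch in which the exponents of the second factor step down in multiples of $a$, from $a(b-1)$ to $0$. The function name `mersenne_split` is hypothetical.

```python
def mersenne_split(a, b):
    """Return the two factors of 2**(a*b) - 1 used in the telescoping argument."""
    first = 2 ** a - 1
    # Exponents a*(b-1), a*(b-2), ..., a, 0: b terms in all.
    second = sum(2 ** (a * j) for j in range(b))
    return first, second


first, second = mersenne_split(3, 4)
print(first, second, first * second == 2 ** 12 - 1)
```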

##### 1.17 Show that if $2^n + 1$ is prime, $n = 2^k$ for some $k \ge 0$.

This follows a similar approach to that used in **1.16** above; with the
telescoping factorization relying on alternating signs in the second factor. 
Here we suppose $n=2^k\cdot b$ with $b$ an odd number. The contradiction 
is reached by assuming $b \ge 3$ and factoring $2^n+1$ so as to imply it is
composite. The conclusion is then that $b = 1$ and $n$ is simply $2^k$.


Begin by presuming a factorization:


$2^n+1 = 2^{2^{k}b}+1=(2^{2^{k}}+1)\cdot(Y^X - Y^{X-1} + Y^{X-2} - \dots - Y^{1} + Y^{0})$


Now to sort both $Y$ and $X$; noting that the second factor must have
an odd number of terms for the alternating signs to resolve the telescope properly.
Noting the 'counting down' exponents $X, X-1, \dots, 1$: $X$ must
be even.


First term product of these factors is $2^n$ so 
$2^n = 2^{2^{k}b} = 2^{2^{k}}\cdot Y^X$ so $Y=2^{2^{k}}$ and $X=(b-1) \ge 2$.
$b$ is odd so $X$ is even as required in the factorization. Both
factors are greater than one; where the right factor might
require some staring. One approach is to pair the terms of the sum
as $positive + negative$ with a decreasing exponent. 
Example $Y^8-Y^7$ where $Y$ is a positive integer. The result is
that each pair has a net positive sign; plus the trailing $1$
giving a total $> 1$. 


As a result the factorization arrives at the
contradiction that $2^n+1$ is composite. $\Box$
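
The alternating-sign version can be checked the same way. A sketch only, with the hypothetical name `fermat_split`; it assumes $b$ is odd, as in the argument above.

```python
def fermat_split(k, b):
    """For odd b, return the two factors of 2**(2**k * b) + 1 with Y = 2**(2**k)."""
    Y = 2 ** (2 ** k)
    first = Y + 1
    # Y^(b-1) - Y^(b-2) + ... - Y + 1: signs alternate, and the top term is positive
    # because b - 1 is even.
    second = sum((-1) ** j * Y ** j for j in range(b))
    return first, second


first, second = fermat_split(1, 3)
print(first, second, first * second == 2 ** 6 + 1)
```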

##### 1.18 If $m > n$ compute the gcd $g = (a^{2^{m}} + 1, a^{2^{n}} + 1)$ in terms of $a$. 


[***Hint: Define $A_n = a^{2^{n}}+1$ and show that $A_n | A_m - 2$.***]


That's a good hint. This write-up will rely on two points: First that $m > n \ge 1$ and that consequently $2^m$ can be divided by
$2^n$ an even number of times, in fact $2^{m-n}$ times. I state the obvious as I had tremendous difficulty working with the double 
exponential term $a^{2^{n}}$; so I proceed by trying to stay in a 'self-evident' line of thinking. The telescoping product
that follows echoes machinery in problem 1.16 above.


The key idea here is to set up the divisibility hint: $A_m - 2 = A_n \cdot \varphi$. Anticipating a telescoping product
let's presume $\varphi$ will be a sequence of alternating-sign terms and arrive at a (presumed) factorization.


$A_m-2 = a^{2^m} - 1 = A_n \cdot \varphi = (a^{2^n} + 1) \cdot (A - B + C \dots + Y - Z)$


From here we can state some 'must be' from the telescope logic. That is, the $a^{2^n}$ multiplied
by term $K$ will be the negative of the $+1$ multiplied by term $J$ so they sum to $0$.


It must be that $a^{2^n} \cdot A = a^{2^m}$, the non-telescoping lead term.

Likewise $a^{2^n} \cdot B = A \cdot 1$, the first telescoping pair.

Also $a^{2^n} \cdot C = B$

...und so weiter to the end...

$a^{2^n} \cdot Z = Y$

and finally $Z = 1$, the second non-telescoping term, which will be the $-1$ of $A_m - 2$.

This result $Z = 1$ gives $a^{2^n} = Y$. Hence the multiplying factor to go from one term
to the next in $\varphi$ must be $\frac{1}{a^{2^n}}$. I'll write this as $a^{-2^n}$.

From above: $A=\frac{a^{2^m}}{a^{2^n}} = a^{2^m} \cdot a^{-2^n} = a^{2^m-2^n}$. This is the first term of $\varphi$.

Likewise $B=\frac{A}{a^{2^n}}=a^{2^m-2 \cdot 2^n} \textrm{ and } C=\frac{B}{a^{2^n}}=a^{2^m-3 \cdot 2^n}$.

That is, each term in $\varphi$ is the prior term scaled down by $a^{2^n}$; and as noted this 
produces an even number of terms. As the terms alternate sign and start positive the even
count ensures the final $1$ is negative as required. Now the end is in sight.


We have gcd $g=(A_n, A_m) \textrm{ so } g|A_m$. While $A_n \nmid A_m$ we do have 
$A_n | A_m - 2$ and therefore $g | A_m - 2$ as well. So $g|A_m \textrm{ and } g|A_m-2$.
Hence $g$ can only be $1$ or $2$. If $a$ is odd then $A_n$ and $A_m$ will be even
and $g=2$. If $a$ is even then $A_n$ and $A_m$ will be odd and $g=1$. $\Box$
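
Double exponentials are much friendlier when the machine does the arithmetic. This sketch tabulates the gcd and the hint's divisibility for small cases; the ranges and the name `checks` are arbitrary.

```python
from math import gcd


def A(a, n):
    """A_n = a^(2^n) + 1, as defined in the hint."""
    return a ** (2 ** n) + 1


# Tuples (a, n, m, gcd(A_m, A_n), (A_m - 2) mod A_n) for small a and n < m.
checks = [
    (a, n, m, gcd(A(a, m), A(a, n)), (A(a, m) - 2) % A(a, n))
    for a in range(1, 8)
    for n in range(0, 4)
    for m in range(n + 1, 5)
]
for row in checks[:4]:
    print(row)
```

The last entry of each tuple is the remainder of $A_m - 2$ on division by $A_n$, so a zero there confirms the hint for that case.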



##### 1.19 Prove for the Fibonacci sequence that $(f_{n}, f_{n+1}) = 1$.

Note Theorem 1.1c) implies that $(a, b) = (a, b + na)$; so $(a, a + b) = (a, b)$. 
For the first several Fibonacci numbers 1, 1, 2, 3, 5, 8 etcetera the assertion is true. 
This is an opportunity for induction proof. Assume the assertion is true up to 
$(f_{n-1}, f_{n})=1$. Then $(f_{n}, f_{n+1}) = (f_{n}, f_{n} + f_{n-1}) = (f_{n}, f_{n-1}) = 1$. $\Box$

#### 1.20

#### 1.21

#### 1.22

#### 1.23

#### 1.24

#### 1.25

#### 1.26

#### 1.27

#### 1.28

#### 1.29

##### 1.30 If $n>1$ prove $\sum_{k=1}^{n}{\frac{1}{k}}$ is not an integer. 


This sum is called the [nth Harmonic number](https://en.wikipedia.org/wiki/Harmonic_number).


There are two proofs easily found on the web: The *Bertrand Conjecture proof* 
and the *2-adic proof*.


Bertrand's Postulate aka Chebyshev's Theorem I remember using:


> *Chebyshev said it*<br>
> *I'll say it again*<br>
> *There's always a prime*<br>
> *Between $n$ and $2n$.*


The following is a rewrite of Anton Geraschenko's 'Bertrand' proof mentioned in passing in an online post dated to 2010.

Take $p$ to be the largest prime with $p \le n$; Bertrand's postulate guarantees $p>n/2$. Then $\sum_{i=1}^{n}\frac{1}{i}=\frac{1}{p} + \frac{a}{b}$,
using $\frac{a}{b}$ as a collective 'sum of everything else'.
The key idea is that $p$ does not divide the common denominator $b$: as $2p > n$ no other term has a denominator divisible by $p$, so the factors of $b$ are all primes less than $p$.
Suppose $\frac{1}{p} + \frac{a}{b}$ is an integer;
then this number multiplied by $b$ is likewise an integer; but *that* number would be $\frac{b}{p} + a$ and
as $(b,p)=1$ the fraction $\frac{b}{p}$ is not an integer, a contradiction. $\Box$


The 2-adic proof was given by [József Kürschák](https://en.wikipedia.org/wiki/J%C3%B3zsef_K%C3%BCrsch%C3%A1k)
in a Hungarian Math/Physics journal; see ***A Harmonikus Sorról, Mat. és Fiz. Lapok, 27 (1918), 299--300***.
One approach to stating this (mentioned by Anton Geraschenko) is to substitute the largest possible
$2^s \le n$ for $p$
in the above argument, with the condition $(b,p)=1$ replaced by $2^s \nmid b$.
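
Exact rational arithmetic makes the 2-adic flavor visible. This is a sketch using Python's `fractions` module; the helper name `harmonic` is my own.

```python
from fractions import Fraction


def harmonic(n):
    """Return the nth harmonic number as an exact fraction in lowest terms."""
    return sum(Fraction(1, k) for k in range(1, n + 1))


for n in range(2, 8):
    h = harmonic(n)
    print(n, h, h.denominator % 2 == 0)
```

An even denominator in lowest terms rules out an integer, which is what the 2-adic argument predicts for every $n > 1$.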

## Chapter 2

### Some useful ideas


I will assume unless stated otherwise that $k$ is the number of unique prime factors of $n$.

### 2.1

Find $n$ for which (a) $\varphi(n) = \frac{n}{2}$; (b) $\varphi(n) = \varphi(2n)$; (c) $\varphi(n) = 12$.

(a) $\varphi(n) = \frac{n}{2}$. We have
$\varphi(1) = 1$ (so $n=1$ is not a solution) and $\varphi(2) = 1$ (a first solution);
so let's proceed to $n > 2$, the regime where $\varphi(n)$ is always even.
A solution must then have $\frac{n}{2} = \varphi(n)$ even.
Combine these to conclude $n$ is a multiple of $4$
so taking $\alpha > 1$ we have

$$n = 2^{\alpha} \cdot \prod_{p \; odd}{p_i}^{a_i}$$

Observing $(2^\alpha, \Pi) = 1$ gives us the second equality in

$$\varphi(n) = \varphi(2^\alpha \cdot \Pi) = \varphi(2^\alpha) \cdot \varphi(\Pi) = 2^{\alpha-1} \cdot \varphi(\Pi) = \frac{n}{2}$$

Powers of two are relatively prime to the odd numbers so $\varphi(2^d)=2^{d-1}.$
Now to use $\varphi(n) = n$ only for $n=1$:

$$2^{\alpha-1} \cdot \varphi(\Pi) = 2^{\alpha-1} \cdot \Pi \implies \Pi = 1$$

Consequently $n = 2^\alpha = \{ 2, 4, 8, \dots \}$.

(b) Useful observation: $\varphi(2^s)=\varphi(2^{s+1})$ only for $s=0$. 
Taking $n=2^s\cdot d$ we want $n$ for which $\varphi(n) = \varphi(2n)$.
As in part (a): $\varphi(2^s)\cdot\varphi(d)=\varphi(2^{s+1})\cdot\varphi(d)$. 
Hence $d$ is any odd number and $s=0$; so $n = \{1, 3, 5, 7, 9, \dots\}$.

(c) Observe $12 + 1$ is prime so $n_1 = 13$. From part (b) above: $n_2 = 26$. 
Likewise any other odd $n$ with $\varphi=12$ will admit a second solution $2n$.

The product form of the totient (for this problem) is 


$$\varphi(n) = n \cdot \prod_{p|n} \frac{p-1}{p} = 12 = 2 \cdot 2 \cdot 3$$
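
A brute-force search is one way to see which $n$ to aim for in part (c). This is a sketch: `phi` is a deliberately naive totient, and the search limit of 300 leans on the standard bound $\varphi(n) \ge \sqrt{n/2}$, which I am assuming here rather than proving.

```python
from math import gcd


def phi(n):
    """Naive Euler totient: count 1 <= k <= n with gcd(k, n) == 1."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)


# If phi(n) = 12 then the assumed bound gives n <= 2 * 12**2 = 288.
solutions = [n for n in range(1, 301) if phi(n) == 12]
print(solutions)
```

The list should come out as 13, 21, 26, 28, 36, 42, which includes the odd-then-doubled pairs (13 and 26, 21 and 42) noted above.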

### 2.2 Prove or find counterexamples $\otimes$ for...

(a) Show $(m,n) = 1 \implies (\varphi(m), \varphi(n))=1$. 
Choosing $m=3,\;n=4$ both give $\varphi = 2$, a counterexample. $\otimes$

(b) Show $n \; composite \; \implies (n, \varphi(n)) > 1$.
Choose $n = 15$ and check: $(15, 8) = 1$, a counterexample. $\otimes$

(c) If the same primes divide $m$ and $n$ then $n\cdot\varphi(m) = m\cdot\varphi(n)$.

$\frac{\varphi(m)}{m}=\prod_{p|m}\left(1-p^{-1}\right)$. Since exactly the same primes $p$ divide $m$ and $n$
the product on the right is also equal to $\frac{\varphi(n)}{n}$; cross-multiplying gives $n\cdot\varphi(m) = m\cdot\varphi(n). \; \Box$

### 2.3

### 2.4 

Show that $\varphi(n) > \frac{n}{6} \; \forall \; n $ with $k \le 8$ prime factors.


This is an example of proof by calculation.


Taking $\varphi(n) = n \cdot \prod_{p|n}\frac{p-1}{p}$ the goal is to show the
product $\prod > \frac{1}{6}$. Calculate $\prod$ for
the first eight primes: $\frac{1}{2} \cdot \frac{2}{3} \cdot \cdots \cdot \frac{18}{19}$.
The result is a number greater than $1/6$ (approximately $0.171$). Choosing any other set of eight (larger)
primes results in factors *closer* to 1, yielding a bigger product. Likewise choosing
numbers with fewer than eight unique prime factors results in a larger resulting
product: There are fewer factors where each is less than one. In both cases, therefore,
the product above is a lower bound. $\Box$
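
Since this is proof by calculation, here is the calculation done exactly. A sketch using `fractions.Fraction`; the variable names are mine.

```python
from fractions import Fraction

first_eight = [2, 3, 5, 7, 11, 13, 17, 19]

# Product of (p - 1) / p over the first eight primes, kept as an exact fraction.
ratio = Fraction(1)
for p in first_eight:
    ratio *= Fraction(p - 1, p)

print(ratio, float(ratio), ratio > Fraction(1, 6))

# Including the ninth prime, 23, shows why the problem stops at eight.
ninth = ratio * Fraction(22, 23)
print(float(ninth), ninth > Fraction(1, 6))
```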

### 2.5


Show the Dirichlet product $f = \mu * \nu$ is 0 or 1 where $\nu(n)$ is the number of distinct prime factors of 
$n$. Note $\nu(1)=0$. The number of unique prime factors of $n$ is $k$. 


Useful observation:
Define an alternating-sign weighted sum of binomial coefficients as 
$A_a(k) = \sum_{i = 0}^{k} (-1)^i \cdot \binom k i \cdot i^a$.
Then

\begin{equation}
A_0(k) = \sum_{i=0}^{k} (-1)^i \cdot \binom k i
\end{equation}

\begin{equation}
A_1(k) = \sum_{i=0}^{k} (-1)^i \cdot \binom k i \cdot i
\end{equation}


Evaluating: $A_1(1) = -1$ and $A_0(k \ge 1) = A_1(k > 1) = 0$.
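
These sums are quick to evaluate directly from the definition. A sketch with the hypothetical helper name `alternating_sum`, using `math.comb` (available in Python 3.8 and later):

```python
from math import comb


def alternating_sum(a, k):
    """A_a(k): sum over i = 0..k of (-1)^i * C(k, i) * i^a."""
    # Python evaluates 0 ** 0 as 1, so the i = 0 term of A_0 counts as C(k, 0).
    return sum((-1) ** i * comb(k, i) * i ** a for i in range(k + 1))


a0_values = [alternating_sum(0, k) for k in range(1, 8)]
a1_values = [alternating_sum(1, k) for k in range(1, 8)]
print(a0_values)
print(a1_values)
```

A handy hand check on the sign: the only nonzero term of $A_1(1)$ is $(-1)^1 \cdot \binom{1}{1} \cdot 1$.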


To the problem then: After noting $f(1) = 0$ we consider $n > 1$ with $n = \prod_{i=1}^{k \ge 1} {p_i}^{a_i}$.



Define $m$ to be the number of prime factors of $n$ that are squarish: They have $a_i \ge 2$.
For example $360 = 2^3 \cdot 3^2 \cdot 5^1$ has $m=2$. 
This parameter is motivated by terms in the Dirichlet convolution sum where 
a divisor $d$ of $n$ includes a
prime factor ${p_i}^1$ with a corresponding $a_i \ge 2$. In this case $p_i$ is present both
in the argument in $\mu(d)$ and in the argument in $\nu(\frac{n}{d})$.

As these $m$ squarish primes must be factors of both $d$ and $n/d$ they impact both
$\mu(d)$ and $\nu(n/d)$. Only square-free 
divisors $d$ have non-zero Mobius value. These we choose to group by number of 
prime factors. If there were five values of $d$ that each have four prime factors:
Each contributes a Mobius function value of $\mu(d) = 1$. 


For each of these groups we then want to add up the total number of 
prime factors in the complementary divisor; add all of the $\nu(n/d)$ values.


Notice this product enumerates all square-free divisors of $n$:


\begin{equation}
\prod_{i=1}^{k}(1+p_i)=1 + p_1 + p_2 + \dots + p_k + p_1 \cdot p_2 + \dots + p_{k-1} \cdot p_{k} + \dots + p_1 \cdot p_2 \cdot p_3 \cdots p_k.
\end{equation}


Grouping the factors $d$ by the number of prime factors in each: 


- group 0: no prime factors (i.e. $\{ 1 \}$): one element
- group 1: individual primes: $\{ p_1, p_2, \dots, p_k \}$: $k$ elements
- group 2, pairs of primes $\{ p_1 \cdot p_2, p_1\cdot p_3, \dots, p_{k-1} \cdot p_{k}\}$: $\binom{k}{2}$ elements
- $\dots$ 
- group $g$: $\binom{k}{g}$ elements
- $\dots$
- group $k$: $\{ p_1\cdot p_2 \cdot \cdots \cdot p_k \}$: one element. 


Write the Dirichlet convolution with the outer sum over these $k+1$ groups:


\begin{equation}
f = \mu * \nu = \sum_{g=0}^{k} \sum_{i=1}^{\binom k g} \mu(d_{g,i}) \cdot \nu(\frac{n}{d_{g,i}}) 
\end{equation}


When $m = 0$ the number $n$ is square-free, a product of a set 
of distinct primes. Here for group $g$ the function $\nu(n/d)$ 
will be $(k - g)$: The number of remaining prime factors from $k$
total after $g$ of them are multiplied to produce $d$. 
The above sum becomes


\begin{equation}
f = \mu * \nu = \sum_{g=0}^{k} {\binom k g} \cdot (-1)^g \cdot (k - g)
\end{equation}

With $k = 1$: $n$ is prime and $f(n_{prime}) = (1 \cdot 1 \cdot 1) + (1 \cdot -1 \cdot 0) = 1$, which matches $A_1(1)$ up to sign.


With $k > 1$, still retaining the square-free condition ($m=0$),
we have equivalence by symmetry to $A_1(k > 1)$ give or take a factor of $-1$:
substituting $i = k - g$ turns the sum into $(-1)^k \cdot A_1(k)$.
In this case the sum is $f(n) = 0$.

$$
f(n_{squarefree}) = \mu * \nu = \sum_{g=0}^{k} {\binom k g} \cdot (-1)^g \cdot (k - g) = 0.
$$
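
While the general case is still open in these notes, the claim itself can be checked numerically straight from the definition of the Dirichlet product. This is a sketch with naive helpers; the names `prime_factors`, `mu`, `nu` and `f` are my own and nothing here is tuned for speed.

```python
def prime_factors(n):
    """Return {prime: exponent} for n >= 1 by trial division."""
    out, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            out[p] = out.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        out[n] = out.get(n, 0) + 1
    return out


def mu(n):
    """Mobius function: 0 if a square divides n, else (-1)^(number of primes)."""
    factors = prime_factors(n)
    if any(e > 1 for e in factors.values()):
        return 0
    return (-1) ** len(factors)


def nu(n):
    """Number of distinct prime factors of n, with nu(1) = 0."""
    return len(prime_factors(n))


def f(n):
    """Dirichlet product (mu * nu)(n): sum over divisors d of mu(d) * nu(n / d)."""
    return sum(mu(d) * nu(n // d) for d in range(1, n + 1) if n % d == 0)


print([f(n) for n in range(1, 31)])
```

In this range the value $1$ shows up exactly when $n$ is prime, matching the $k=1$, $m=0$ case worked above; every other $n$ gives $0$, including those with $m > 0$.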


For $m > 0$ the value of $\nu(\frac{n}{d})$ will vary for elements within a group.
However only $d$ values that are squarefree give non-zero values of $\mu$. 


##### INCOMPLETE