# **First Order Linear Equations**

---

### **Introduction**
This notebook goes over the solution of first order linear equations.

---

### **Author**
**Junichi Koganemaru**  

---

### **Last Updated**
**January 14, 2025**

In this course we will navigate different techniques for finding solutions to differential equations and IVPs. Our main focus will be on identifying explicit solution formulas for linear ODEs and IVPs, though in specialized cases we can also solve some simple nonlinear equations.

### A Precautionary Tale
**Example**
> Suppose we want to find all solutions to $x + \sqrt{x} = 0$. Upon first try most students would likely write something like this:
> $$
> x + \sqrt{x} = 0 \implies x = -\sqrt{x} \implies x^2 = (-\sqrt{x})^2 = x  \implies x^2 - x = 0 \implies x(x-1) = 0  \implies x = 0 \; \text{or} \; x =  1,
> $$
> and then conclude that the solutions are $x=0$ and $x=1$.
> However, this isn't quite right, because what we've really shown above is that "if $x$ solves $x + \sqrt{x} = 0$, then the candidate solutions are $x = 0$ or $x=1$."
> In order to verify that the candidate solutions are actual solutions, we need to check the converse by substituting each candidate back into the original equation. 
> We note that if $x = 0$, then $0 + \sqrt{0} = 0$, so $x=0$ is a solution, but if $x =1$, then $1 + \sqrt{1} = 2 \neq 0$, so $x = 1$ is not a solution. So the correct conclusion is that the only solution to the equation is $x = 0$. 
> In this case we can write the conclusion as a bi-implication,
> $$
> x + \sqrt{x } = 0 \; \text{if and only if} \; x = 0.
> $$
> Now if in the above every forward implication is a bi-implication, then the verification step can be skipped, but as we see in the example above when you naively manipulate equations sometimes the implication only goes one way (e.g. $x = 1$ implies $x^2 = 1$ but $x^2 = 1$ does not imply $x=1$ since it's possible for $x=-1$).
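
The verification step in this example can also be carried out mechanically. The following is a minimal sketch in Python (the names `residual`, `candidates` and `solutions` are illustrative and not part of the notes): it substitutes each candidate back into the original equation and keeps only those that actually satisfy it.

```python
import math

def residual(x):
    """Left-hand side of x + sqrt(x) = 0, evaluated at x."""
    return x + math.sqrt(x)

# Candidates produced by the one-way chain of implications.
candidates = [0.0, 1.0]

# Keep only the candidates that satisfy the original equation.
solutions = [x for x in candidates if math.isclose(residual(x), 0.0, abs_tol=1e-12)]
print(solutions)  # [0.0]
```

The candidate $x = 1$ is rejected because its residual is $2$, matching the hand computation above.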

The upshot here is that when one naively manipulates equations and does not pay close attention to the direction of the implications, the final "answer" is really a candidate set of solutions to the original equation, and a verification step is necessary to find the actual solution set. 

This idea applies when we solve differential equations too: students are often taught to naively manipulate equations and to stop once they reach the "answer". In reality the final "answer" is only a candidate solution set, because one typically does not pay attention to the direction of the implications, and a final verification step is necessary for the argument to be mathematically and logically precise. 

  Furthermore, to be absolutely precise one also should identify the interval of existence associated to the solution. 

### Strategy for Solving Differential Equations
1. Recognize the type of differential equation you are trying to solve (classification).
2. Apply appropriate techniques to manipulate the equation and derive a candidate solution.
3. From the candidate solution set, identify the interval(s) $J$ on which the solution(s) can be defined.
4. Verify that the candidate solution(s) solve the differential equation on $J$.
5. For IVPs, you also need to identify the interval(s) $J$ that contains $t_0$ and check that the initial conditions specified at $t_0$ are satisfied.



## Solving First Order Linear Equations

In this section we study first order linear equations of the form
$$
     a_1(t) y'(t) + a_0(t) y(t) = f(t) \; \text{for all} \; t \in I,
$$
where $a_1, a_0, f: I \to \mathbb{R}$ are continuous functions on some interval $I \subset \mathbb{R}$.

### The Simplest Case

The simplest case is when the equation is of the following form:
$$
      y'(x) = f(x) \; \text{for all} \; x \in \mathbb{R},
$$
where $f: \mathbb{R} \to \mathbb{R}$ is a given continuous function. For this to be satisfied, $y$ must be an antiderivative of $f$, and by the fundamental theorem of calculus such an antiderivative exists because $f$ is continuous. In other words, we can identify the solution directly through integration. The family of solutions to the equation is then given by  
$$
      y(x) = \int f(x) \; dx + C \; \text{for all} \; x \in \mathbb{R},
$$
for any arbitrary constant $C$. 

We refer to this as a *one-parameter family of solutions* as it is parametrized by the constant $C$.
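
As a quick numerical illustration of the one-parameter family, here is a small sketch assuming the specific choice $f(x) = \cos x$, so that the family is $y(x) = \sin x + C$. The choice of $f$, the sample points, and the helper `central_difference` are assumptions made for this example only. The check approximates $y'$ by a central difference and compares it with $f$ for several values of $C$.

```python
import math

def central_difference(y, x, h=1e-5):
    """Approximate y'(x) with a symmetric difference quotient."""
    return (y(x + h) - y(x - h)) / (2 * h)

def f(x):
    return math.cos(x)

def make_solution(C):
    """Member of the family y(x) = sin(x) + C."""
    return lambda x: math.sin(x) + C

max_error = 0.0
for C in (-2.0, 0.0, 3.5):
    y = make_solution(C)
    for x in (-1.0, 0.0, 0.5, 2.0):
        max_error = max(max_error, abs(central_difference(y, x) - f(x)))

print(max_error < 1e-7)  # True: every member satisfies y' = f up to rounding
```

The constant $C$ drops out when differentiating, which is why every member of the family passes the same check.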

### The General Case

Next we study equations of the form 
$$
y'(x) + P(x) y(x) = f(x) \; \text{for all} \; x \in I
$$
i.e. when the coefficient in front of $y'$ is 1. 

It turns out that it will be important to distinguish the cases when $f$ is the zero function and when $f$ is not the zero function, which motivates the following terminology.

> **Definition**
> If $f$ is the zero function, then we refer to the equation as a *homogeneous equation*. If $f$ is not the zero function, then we refer to the equation as an *inhomogeneous equation*.


### The Method of Integrating Factors
Here we'll introduce the *method of integrating factors* which allows us to solve the equation explicitly. The idea is to multiply the equation by some differentiable function $\mu: I \to \mathbb{R}$ so that the left hand side can be realized as the derivative of some other function.

Suppose $y: I \to \mathbb{R}$ is a solution to the original equation. Multiplying the equation by $\mu$ gives us 
$$
\mu(x)y'(x) +\mu(x) P(x) y(x) =\mu(x) f(x) \; \text{for all} \; x \in I
$$
and notice that if $y,\mu: I \to \mathbb{R}$ are differentiable functions, then by the product rule we have
$$
\frac{d}{dx} \left( \mu(x) y(x) \right) = \mu(x) y'(x) + \mu'(x) y(x) \; \text{for all} \; x \in I.
$$

So if we can find a specific function $\mu$ such that 
$$
\mu'(x) = \mu(x) P(x) \; \text{for all} \; x \in I
$$
then the LHS of the new equation becomes the derivative of $\mu y$: 
$$
\frac{d}{dx}(\mu(x) y(x)) = \mu(x) f(x) \; \text{for all} \; x \in I.
$$
Once we have this equation, we can identify the solution $y$ through direct integration. 


**Remark**
In some sense, we're just reverse engineering the product rule.

### Finding $\mu$
Now we need to find a differentiable function $\mu$ that satisfies
$$
\mu'(x) = \mu(x) P(x) \; \text{for all} \; x \in I
$$

If we ignore the values of $x$ for which $\mu$ can be zero, then we can divide both sides by $\mu$ to obtain
$$
\frac{\mu'(x)}{\mu(x)} =  P(x) \; \text{for all} \; x \in I \; \text{s.t.} \; \mu(x) \neq 0.
$$

Upon integrating both sides we have
$$
\int \frac{\mu'(x)}{\mu(x)} \; dx = \int P(x) \; dx. 
$$
Recall that 
$$
\int \frac{\mu'(x)}{\mu(x)} \; dx = \ln | \mu(x) | + C,
$$
so from this we see that 
$$
\ln | \mu(x) | = \int P(x) \; dx,
$$
and thus 
$$
\mu(x) = C \exp \left( \int P(x) \; dx \right), 
$$
where we "absorbed" the $\pm 1$ from the absolute value into the constant $C$ (to make this "absorption" argument rigorous we need to specify the domain and use a continuity argument).

Since we're looking for any function $\mu$ that satisfies $\mu' = \mu P$, we can simply choose $C = 1$ and also ignore the constant of integration coming from $\int P(x) \; dx$, as 
$$
\exp\left( g(x) + D \right) = e^D \exp(g(x)),
$$
meaning that the constant of integration can be absorbed into the constant $C$ in the equation. This also makes sense since we're multiplying both sides of the original equation by $\mu$, so the exact constant in front of the exponential term is not significant. 

Therefore by convention, we define the *integrating factor* associated to the equation as the function $\mu: I \to \mathbb{R}$ given by 
$$
\mu(x) = \exp \left( \int P(x) \; dx \right) \; \text{for all} \; x \in I
$$
where we also ignore the constant of integration in the integral term. 

By the chain rule, we may verify that 
$$
\begin{aligned}
\mu'(x) &= \underbrace{\left( \frac{d}{dx} \int P(x) \; dx \right)}_{= P(x)} \underbrace{\exp \left( \int P(x) \; dx \right)}_{= \mu(x)} \\
&= \mu(x) P(x) \; \text{for all} \; x \in I.
\end{aligned}
$$
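
The identity $\mu' = \mu P$ can also be sanity-checked numerically. The sketch below assumes the sample coefficient $P(x) = 2x$ and fixes the antiderivative of $P$ that vanishes at $0$, which corresponds to dropping the constant of integration. The helper `integrate` is an illustrative composite Simpson rule, not something defined in the notes.

```python
import math

def integrate(g, a, b, n=2000):
    """Composite Simpson rule for the integral of g from a to b (n must be even)."""
    h = (b - a) / n
    total = g(a) + g(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * g(a + k * h)
    return total * h / 3

def P(x):
    return 2 * x

def mu(x):
    # exp of the antiderivative of P that vanishes at 0.
    return math.exp(integrate(P, 0.0, x))

# Compare a finite-difference approximation of mu' with mu * P.
h = 1e-5
mismatch = max(
    abs((mu(x + h) - mu(x - h)) / (2 * h) - mu(x) * P(x))
    for x in (-1.0, 0.3, 1.2)
)
print(mismatch < 1e-6)  # True
```

For this choice of $P$ the routine reproduces $\mu(x) = e^{x^2}$ up to rounding.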




## Solving The Original Equation
The point is that once we've identified the integrating factor $\mu$, the equation now becomes 
$$
\frac{d}{dx} \left( \mu(x) y(x) \right) = \mu(x) f(x) \; \text{for all} \; x \in I,
$$
and via direct integration with respect to $x$ we have
$$
\mu(x) y(x) = \int \mu(x) f(x) \; dx + C  \; \text{for all} \; x \in I.
$$

We note that since $\mu(x) > 0$ for any $x \in I$, we can divide both sides by $\mu$ to obtain 
$$
y(x) = \frac{\int \mu(x) f(x) \; dx}{\mu(x)} + \frac{C}{\mu(x)} \; \text{for all} \; x \in I
$$
where $C$ is an arbitrary constant and 
$$
\mu(x) = \exp \left( \int P(x) \; dx \right) \; \text{for all} \; x \in I.
$$

**Remark**
What we have shown is that if $y$ solves the original equation, then it must belong to the candidate solution set given via $y(x) = \frac{\int \mu(x) f(x) \; dx}{\mu(x)} + \frac{C}{\mu(x)} \; \text{for all} \; x \in I$. To show that the candidates in the form of $y(x) = \frac{\int \mu(x) f(x) \; dx}{\mu(x)} + \frac{C}{\mu(x)} \; \text{for all} \; x \in I$ satisfy the equation requires a verification step.

**Remark**
Once the verification step is completed, we can conclude that a solution exists to the original equation, and the solution is given by $y(x) = \frac{\int \mu(x) f(x) \; dx}{\mu(x)} + \frac{C}{\mu(x)} \; \text{for all} \; x \in I$. Note that the solution is *global* in the sense that the interval of existence is the entirety of $I$. This is typical of linear equations, whereas for nonlinear equations the interval of existence can be a strict subset of $I$.

On the homework, you are asked to complete the verification step.
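
Before doing the verification by hand, it can be helpful to see the solution formula in action. The following is a rough numerical sketch, not a proof: it evaluates the formula with definite integrals starting at a point $x_0$, so that $\mu(x_0) = 1$ and the constant $C$ equals $y(x_0)$. The function names `simpson` and `solve_linear_ivp`, the test equation $y' + 3x^2 y = x^2$, and the initial condition $y(0) = 1$ are all assumptions chosen for illustration.

```python
import math

def simpson(g, a, b, n=400):
    """Composite Simpson rule for the integral of g from a to b (n must be even)."""
    h = (b - a) / n
    total = g(a) + g(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * g(a + k * h)
    return total * h / 3

def solve_linear_ivp(P, f, x0, y0, x, n=400):
    """Evaluate y(x) for y' + P y = f with y(x0) = y0 via the integrating factor."""
    mu = lambda s: math.exp(simpson(P, x0, s, n))        # mu(x0) = 1
    integral = simpson(lambda s: mu(s) * f(s), x0, x, n)  # integral of mu * f
    return (integral + y0) / mu(x)

P = lambda x: 3 * x**2
f = lambda x: x**2
exact = lambda x: 1 / 3 + (2 / 3) * math.exp(-x**3)  # closed form for y(0) = 1

for x in (0.5, 1.0):
    print(x, solve_linear_ivp(P, f, 0.0, 1.0, x), exact(x))
```

The two printed columns should agree to many digits, which is consistent with (but does not replace) the verification step.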

### Examples 

> **Example**
> Find the general solution to the first order linear ODE $y' + 2xy = 0$, $x \in \mathbb{R}$.
> 
> **Solution**
> The integrating factor here is given by $\mu: \mathbb{R} \to \mathbb{R}$ defined via $\mu(x) = e^{x^2}$, so multiplying both sides by $\mu$ gives us 
> $$
> e^{x^2} y'(x) + e^{x^2} 2x y(x) = 0 \; \text{for all} \; x \in \mathbb{R}
> $$
> Notice that the left hand side can be written as 
> $$
> \frac{d}{dx} \left( e^{x^2} y(x) \right) = 0 \; \text{for all} \; x \in \mathbb{R},
> $$
> so we get $e^{x^2} y(x) = C$ for all $x \in \mathbb{R}$ for some constant $C$. So the general solution $y$ to the original equation is given by 
> $$
> y(x)= Ce^{-x^2} \; \text{for all} \; x \in \mathbb{R},
> $$
> where $C$ is an arbitrary constant.
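
For this example the verification step is short enough to write out. As a sketch: differentiating the candidate $y(x) = Ce^{-x^2}$ with the chain rule gives $y'(x) = -2x C e^{-x^2}$, and therefore

$$
y'(x) + 2x y(x) = -2x C e^{-x^2} + 2x C e^{-x^2} = 0 \; \text{for all} \; x \in \mathbb{R}.
$$

So every member of the candidate family is indeed a solution on $J = \mathbb{R}$.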


> **Example**
> Find the general solution to the linear ODE $$y'(x) + \frac{1}{x} y(x) = x, \; x > 0.$$
> 
> **Solution**
> The integrating factor here is $\mu: (0,\infty) \to \mathbb{R}$ defined via $\mu(x) = e^{\ln |x|} = x$ (since $x > 0$), so multiplying both sides by $\mu$ gives us the equation
> $$
> \frac{d}{dx} \left( x y(x) \right) =  x y'(x) + y(x) = x^2 \; \text{for all} \; x > 0.
> $$
> To solve this we perform direct integration, and we find that the solution $y$ that satisfies the equation is given by
> $$
> y(x) = \frac{x^2}{3} + \frac{C}{x} \; \text{for all} \; x > 0,
> $$
> where $C$ is an arbitrary constant.
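
A numerical spot check of this candidate family is sketched below. It approximates $y'$ with a central difference and evaluates the residual $y' + y/x - x$ at a few points in $(0, \infty)$. The sample values of $C$ and $x$ and the function names are assumptions made for the example.

```python
def y(x, C):
    """Candidate solution y(x) = x^2/3 + C/x on x > 0."""
    return x**2 / 3 + C / x

def residual(x, C, h=1e-6):
    """Approximate y'(x) + y(x)/x - x using a central difference for y'."""
    dy = (y(x + h, C) - y(x - h, C)) / (2 * h)
    return dy + y(x, C) / x - x

worst = max(
    abs(residual(x, C))
    for C in (-4.0, 0.0, 2.5)
    for x in (0.5, 1.0, 3.0)
)
print(worst < 1e-6)  # True
```

A small residual at sample points is only evidence, not a proof. The analytic verification differentiates the candidate directly.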

> **Example**
> Consider the equation 
> $$
> y'(x) + 3x^2 y(x) = x^2, \; x \in \mathbb{R}.
> $$
> The candidate integrating factor for this equation can be chosen to be 
> $$
> \mu(x) = \exp\left( \int 3x^2 \; dx \right) = e^{x^3}.
> $$
> Upon multiplying the original equation by $e^{x^3}$, we find that 
> $$
> \frac{d}{dx}[e^{x^3} y(x)] = e^{x^3} y'(x) + 3x^2 e^{x^3}y(x) = e^{x^3} x^2, \; x \in \mathbb{R}.
> $$
> Then
> $$
> e^{x^3} y(x) = \int x^2 e^{x^3} \; dx = \frac{1}{3} \int 3x^2 e^{x^3} \; dx = \frac{1}{3}e^{x^3} + C, \; x \in \mathbb{R}
> $$
> for an arbitrary constant $C$. Therefore if $y$ solves the original equation, then 
> $$
> y(x) = \frac{1}{3} + Ce^{-x^3}, \; x \in \mathbb{R},
> $$
> for some constant $C$.

> **Example**
> Consider the equation
> $$
> x^2 y'(x) + 3x y(x) = \frac{e^x}{x}, \; x > 0.
> $$
> If $y$ is a solution to the equation, then 
> $$
> y'(x) + \frac{3}{x} y(x) = \frac{e^x}{x^3}, \; x > 0.
> $$
> The integrating factor associated to this equation can be chosen to be
> $$
> \mu(x) = \exp \left( \int \frac{3}{x} \; dx \right)= \exp (3 \ln |x|) = \exp ( \ln |x|^3 ) =  |x|^3 = x^3, \; x > 0 
> $$
> Thus multiplying the equation by $\mu$ tells us that  
> $$
> x^3 y'(x) + 3x^2 y(x) = e^x, \; x >0
> $$
> The left hand side can be written as the derivative of $\mu y$, so we have 
> $$
> \frac{d}{dx} \left( x^3 y(x) \right) = e^x, \; x > 0
> $$
> and upon direct integration we find that 
> $$
> x^3 y(x) = e^x +C, \; x > 0.
> $$
> So if $y$ solves the original equation, then it is given by
> $$
> y(x) = x^{-3}e^x + C x^{-3}, \; x > 0,
> $$
> for an arbitrary constant $C$.
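
To close the loop on this example, here is a sketch of the verification step. Writing the candidate as $y(x) = x^{-3}(e^x + C)$, the product rule gives

$$
y'(x) = -3x^{-4}(e^x + C) + x^{-3} e^x, \; x > 0,
$$

and substituting into the left hand side of the original equation yields

$$
x^2 y'(x) + 3x y(x) = -3x^{-2}(e^x + C) + x^{-1} e^x + 3x^{-2}(e^x + C) = \frac{e^x}{x}, \; x > 0.
$$

So each candidate is a solution on $J = (0,\infty)$.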

> **Example**
> Consider the equation 
> $$
> 2x y'(x) + y(x) = 2, \; x \in \mathbb{R}.
> $$
> Suppose $y$ is a solution to this equation. Then for $x \neq 0$, we have 
> $$
> y'(x) + \frac{1}{2x}y(x) = \frac{1}{x}.
> $$
> Note that as a differential equation, this is not equivalent to the original equation, because it fails to be well-defined at $x = 0$. Nevertheless, we can try to use the method of integrating factors to identify a candidate solution that is well-defined for all $x \in \mathbb{R}$. 
> The candidate integrating factor for this problem is 
> $$
> \mu(x) = \exp \left( \int \frac{1}{2x} \; dx \right) = \exp \left( \frac{1}{2} \ln |x| \right) =  \exp \left(  \ln |x|^\frac{1}{2} \right) = |x|^{1/2} = \sqrt{|x|},\; x \neq 0.
> $$
> Note that the expression $\sqrt{x}$ is undefined for $x < 0$, so the absolute value sign here is essential. We then multiply both sides of the reduced equation by $\mu$ to obtain 
> $$
> \frac{d}{dx}[|x|^{1/2} y(x)]  = \frac{|x|^{1/2}}{x}, \; x \neq 0.
> $$
> We note that 
> $$
> \frac{|x|^{1/2}}{x} = \begin{cases}
> x^{-1/2}, & x > 0 \\
> -(-x)^{-1/2}, & x < 0.
> \end{cases}
> $$
> Therefore 
> $$
> \int \frac{|x|^{1/2}}{x} \; dx = \begin{cases}
> 2 x^{1/2} + C, & x > 0 \\
> 2(-x)^{1/2} + C, & x < 0
> \end{cases} = 2|x|^{1/2} + C, \;  x \neq 0.
> $$
> We therefore find that 
> $$
> |x|^{1/2} y(x) = 2 |x|^{1/2} + C, \; x \neq 0,
> $$
> and therefore 
> $$
> y(x) = 2 + \frac{C}{|x|^{1/2}}, \; x \neq 0.
> $$
> This gives us a candidate family of solutions on each of the intervals $J = (0,\infty)$ and $J = (-\infty,0)$. Note that for $C \neq 0$ the candidate is unbounded as $x \to 0$, so the only candidate that is well-defined for $J = \mathbb{R}$ is the constant solution 
> $$
> y(x) = 2 , \; x \in \mathbb{R}.
> $$
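
The behaviour described in this example can be explored numerically. The sketch below (function names and sample values are assumptions for illustration) checks that $y(x) = 2 + C/\sqrt{|x|}$ satisfies $2x y' + y = 2$ at points on both sides of the origin, and shows that for $C \neq 0$ the values grow without bound as $x \to 0^+$.

```python
import math

def y(x, C):
    """Candidate solution y(x) = 2 + C / sqrt(|x|), defined for x != 0."""
    return 2 + C / math.sqrt(abs(x))

def residual(x, C, h=1e-6):
    """Approximate 2x y'(x) + y(x) - 2 using a central difference for y'."""
    dy = (y(x + h, C) - y(x - h, C)) / (2 * h)
    return 2 * x * dy + y(x, C) - 2

worst = max(
    abs(residual(x, C))
    for C in (-1.0, 0.0, 5.0)
    for x in (-3.0, -0.5, 0.5, 3.0)
)
print(worst < 1e-6)  # True on both (-inf, 0) and (0, inf)

# With C = 5 the candidate blows up as x approaches 0 from the right.
near_zero = [y(x, 5.0) for x in (1e-2, 1e-4, 1e-6)]
print(near_zero)
```

Only $C = 0$ avoids the blow-up, which is consistent with the constant solution being the only candidate defined on all of $\mathbb{R}$.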


## Rate Problems 
Some physical problems involving rates can be modeled via first order linear problems. The guiding principle for these types of problems is that 
$$
    \text{rate of change} \; = \; \text{rate in} \; - \; \text{rate out}.
$$

> **Problem**
> Consider the following problem. 
>
> A vat initially (at time $t=0$) contains 100 $\ell$ of pure water. An iodine solution with a concentration of $e^{-t/20}$ g/$\ell$ is added to the vat at a rate of 3 $\ell$/min. At the same time the resulting solution is drained from the vat, also at a rate of 3 $\ell$/min. Throughout this process the solution in the vat is kept well mixed.
>
> Find an expression for the amount of iodine in the vat at time $t$.

**Solution**
Let $V(t)$ denote the volume (in $\ell$) of the mixture at time $t \ge 0$. Since the solution is entering the vat at 3 $\ell$/min and also being drained at 3 $\ell$/min, we have 
$$
    V'(t) = 3 - 3 = 0 \; \text{for all} \; t \ge 0.
$$
This implies that $V(t) = V(0) = 100$ for all $t \ge 0$.

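Let $I(t)$ denote the amount of iodine (in g) at time $t$. The amount of iodine is equal to the concentration of the mixture times the volume, so the concentration in the well-mixed vat is $I(t)/V(t)$ g/$\ell$ and iodine leaves the vat at a rate of $3 I(t)/V(t)$ g/min. Therefore $I$ satisfies 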
$$
     \frac{d I}{dt} (t) = \underbrace{3 e^{-t/20}}_{\text{rate in}} - \underbrace{3 \frac{I(t)}{V(t)}}_{\text{rate out}} = 3 e^{-t/20} - \frac{3}{100}I(t), \; t \ge 0. 
$$
We then find that $I$ satisfies the first order IVP
$$
      \begin{cases}
        I'(t) + \frac{3}{100} I(t) =3e^{-t/20}, & t > 0 \\
        I(0) = 0.
      \end{cases}
$$

To solve for $I$, we use the method of integrating factors. We note that the integrating factor for this problem can be chosen to be 
$$
      \mu(t) = \exp\left( \int \frac{3}{100} \; dt\right) = e^{3t/100}, \; t > 0.
$$
Thus if $I$ solves the original equation, then 
$$
  \frac{d}{dt} \left[ e^{3t/100} I(t) \right] =  3  e^{-t/50} , \; t > 0.
$$

By direct integration we then find that 
$$
  e^{3t/100} I(t) = -150 e^{-t/50} +C, \; t > 0
$$
and thus 
$$
     I(t) =   -150 e^{-t/20}   + C e^{-3t/100}, \; t > 0.
$$
Since $I(0) = 0$, we find that 
$$
  0 = -150 + C \implies C =150. 
$$
Therefore the amount of iodine at time $t \ge 0$ is given by 
$$
      I(t) = -150 e^{-t/20}   + 150 e^{-3t/100}, \; t \ge 0.
$$ 
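
As an independent check of this formula, one can integrate the IVP numerically and compare. The sketch below uses a classical fourth-order Runge-Kutta scheme, which is a technique swapped in here for comparison and not the method of the notes. The names `rhs`, `exact` and `rk4`, the step count, and the sample times are assumptions for illustration.

```python
import math

def rhs(t, I):
    """Right-hand side of I'(t) = 3 e^{-t/20} - (3/100) I(t)."""
    return 3 * math.exp(-t / 20) - 3 * I / 100

def exact(t):
    """Closed-form solution obtained with the integrating factor."""
    return -150 * math.exp(-t / 20) + 150 * math.exp(-3 * t / 100)

def rk4(t_end, steps=1000):
    """Integrate the IVP from t = 0, I(0) = 0 up to t_end with classical RK4."""
    t, I = 0.0, 0.0
    h = t_end / steps
    for _ in range(steps):
        k1 = rhs(t, I)
        k2 = rhs(t + h / 2, I + h * k1 / 2)
        k3 = rhs(t + h / 2, I + h * k2 / 2)
        k4 = rhs(t + h, I + h * k3)
        I += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return I

for t_end in (10.0, 30.0, 60.0):
    print(t_end, rk4(t_end), exact(t_end))
```

The numerical and closed-form values should agree closely at each sample time, and `exact(0.0)` returns $0$, matching the initial condition $I(0) = 0$.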
