# Introduction to systems of linear equations

One algebraic tool is essential in almost every area of applied and theoretical mathematics. Whether you analyze large datasets, study abstract algebra or vector calculus, or perform simple practical calculations, you will almost certainly encounter **systems of linear equations (SLEs)**.

Systems of linear equations are generalizations of linear equations with a single unknown variable, for example:
$$
3x + 5 = 0.
$$

They allow us to determine a set of unknown variables using combinations of different relations between them. The word *linear* here means, in a precise sense, that these relations are the simplest possible.

However, this does **not** imply that systems of linear equations are a limited tool. On the contrary:

- Many **non-linear problems** can be solved approximately using systems of linear equations.
- There are also problems that can be solved **only** by reducing them to systems of linear equations.

For these reasons, systems of linear equations play a central role across mathematics, science, and engineering.

## Systems of linear equations in real life

Let’s say you live and work in Liverpool, and you get paid in pounds sterling. At the end of the month, you find out that you have £100 unspent. You decide to keep your savings in dollars. Knowing that
$$
1\text{ USD} = 0.83\text{ GBP},
$$
what amount of dollars can you get for your pounds?

This is an easy question. Denoting the required sum in dollars by $D$, you obtain the equation
$$
0.83D = 100.
$$
Obviously, you are going to get something around \$120.5:
$$
D = \frac{100}{0.83} \approx 120.5.
$$

### Diversifying savings

Now suppose you want to diversify your savings. You decide to convert:
- one part into dollars (\$1 = £0.83),
- another part into euros (€1 = £0.85),
- and keep the remaining amount in pounds.

Let:
- $D$ be the number of dollars,
- $E$ be the number of euros,
- $P$ be the remaining pounds.

These three values are related by the equation
$$
0.83D + 0.85E + P = 100.
$$

Note that now there are **three unknowns** instead of one, and there are many possible choices of $D$, $E$, and $P$ that satisfy this equality. For example:

- If you have $D = 40$, $E = 30$, and $P = 41.3$, then
  $$
  0.83 \cdot 40 + 0.85 \cdot 30 + 41.3 = 100
  $$

- Likewise, you could have $D = 25$, $E = 45$, and $P = 41$, because
  $$
  0.83 \cdot 25 + 0.85 \cdot 45 + 41 = 100
  $$
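
The two splits above can be checked mechanically. The following is a minimal sketch, assuming the exchange rates quoted in the text; the names `GBP_PER_USD`, `GBP_PER_EUR`, and `total_in_pounds` are illustrative and do not come from the original.

```python
import math

# Exchange rates from the text: pounds per dollar and pounds per euro.
GBP_PER_USD = 0.83
GBP_PER_EUR = 0.85


def total_in_pounds(dollars, euros, pounds):
    """Value of a (dollars, euros, pounds) split, expressed in pounds."""
    return GBP_PER_USD * dollars + GBP_PER_EUR * euros + pounds


# The two splits listed above.
splits = [(40, 30, 41.3), (25, 45, 41)]
for d, e, p in splits:
    # Compare with a tolerance: 0.83 and 0.85 are not exact in binary floating point.
    print((d, e, p), math.isclose(total_in_pounds(d, e, p), 100))
```

Any other triple for which the check succeeds is an equally valid way to split the £100, which is why a single equation cannot pin down all three unknowns.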


### Adding more conditions

To be more specific, suppose you decide that:
1. The total value of all currencies must be equivalent to £100.
2. The combined value of dollars and euros must be **three times** the amount kept in pounds.
3. The pound equivalents of dollars and euros must be equal.

These conditions give the following system of equations:
$$
\begin{cases}
0.83D + 0.85E + P = 100, \\
0.83D + 0.85E = 3P, \\
0.83D = 0.85E.
\end{cases}
$$

You can check that the rounded values
$$
D \approx 45.2, \quad E \approx 44.1, \quad P = 25
$$
satisfy all three conditions up to rounding error.
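
These values can be recovered by substitution. The sketch below is one possible way to do it, not a general solving method: it uses the second condition to replace $0.83D + 0.85E$ by $3P$ in the first, and then splits the remaining value equally as the third condition requires. The variable names are illustrative.

```python
import math

GBP_PER_USD = 0.83
GBP_PER_EUR = 0.85
TOTAL = 100

# Conditions 1 and 2 together give 3P + P = 100.
pounds = TOTAL / 4

# Condition 3: dollars and euros each account for half of the remaining 3P (in pounds).
half = 3 * pounds / 2
dollars = half / GBP_PER_USD
euros = half / GBP_PER_EUR

# Substitute back into all three conditions.
value_usd = GBP_PER_USD * dollars
value_eur = GBP_PER_EUR * euros
print(math.isclose(value_usd + value_eur + pounds, TOTAL))
print(math.isclose(value_usd + value_eur, 3 * pounds))
print(math.isclose(value_usd, value_eur))
```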

### Key takeaway

The main tool we used here to deal with currency conversions is a **system of linear equations with multiple variables**. The key idea is that, given some unknowns and a set of relations between them, we can write down a system of equations. From this system, as we will see later, we can determine the unknown quantities.

With this motivation in mind, we are now ready to move on to precise definitions.

## Main definitions

In the example above, each relation became an equation that contains only sums and differences of the unknown variables multiplied by numerical coefficients. This observation leads to the following definition.

The formal equality
$$
a_1 x_1 + a_2 x_2 + \cdots + a_n x_n = b,
$$
where $a_1, a_2, \dots, a_n, b$ are given numbers (for example, real or complex), and
$$
x_1, x_2, \dots, x_n
$$
are $n$ unknown variables, is called a **linear equation in $n$ variables**.

We often say that this equation gives a *linear relation* between the variables $x_1, x_2, \dots, x_n$.
The most elementary way to understand the word *linear* here is to note that all variables appear only to the first power and are not multiplied by one another. For example,
$$
x_1^2 - x_2 = 0
$$
is **not** a linear equation.

Variables can be denoted by any symbols as long as they represent unknowns. For example:
- $5y_1 - 4y_2 = 5$ is a linear equation in variables $y_1$ and $y_2$,
- $-\tfrac{3}{2}x + 3y - z + \tfrac{2}{3}t = -\tfrac{1}{2}$ is a linear equation in variables $x, y, z, t$,
- $0.83D + 0.85E + P = 100$ is a linear equation in variables $D, E, P$.

### Solutions of a linear equation

Consider the equation
$$
a_1 x_1 + a_2 x_2 + \cdots + a_n x_n = b.
$$
An ordered tuple of numbers
$$
(\chi_1, \chi_2, \dots, \chi_n)
$$
is called a **solution** if
$$
a_1 \chi_1 + a_2 \chi_2 + \cdots + a_n \chi_n = b.
$$

That is, substituting these values for the variables makes the equation true.
Unlike linear equations in one variable, equations with multiple variables usually have **more than one solution**.

For example, both
$$
(5, 5) \quad \text{and} \quad (1, 0)
$$
are solutions of
$$
5y_1 - 4y_2 = 5,
$$
because
$$
5\cdot5 - 4\cdot5 = 5 \quad \text{and} \quad 5\cdot1 - 4\cdot0 = 5.
$$

In fact, for any number $\gamma$, setting
$$
y_1 = \gamma, \qquad y_2 = \frac{5(\gamma - 1)}{4}
$$
produces a solution
$$
\left(\gamma, \frac{5(\gamma - 1)}{4}\right).
$$
Hence, this equation has **infinitely many solutions**.
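
The parametric family can be spot-checked with exact arithmetic. This is a small sketch under the assumption that rational values of $\gamma$ are representative enough for illustration; `solution_for` and `satisfies` are hypothetical helper names.

```python
from fractions import Fraction


def solution_for(gamma):
    """Return the pair (y1, y2) obtained from the parameter gamma."""
    gamma = Fraction(gamma)
    return gamma, 5 * (gamma - 1) / 4


def satisfies(y1, y2):
    """Check the equation 5*y1 - 4*y2 = 5 exactly."""
    return 5 * y1 - 4 * y2 == 5


for gamma in [5, 1, 0, Fraction(7, 3), -10]:
    y1, y2 = solution_for(gamma)
    print(y1, y2, satisfies(y1, y2))
```

The choices $\gamma = 5$ and $\gamma = 1$ reproduce the two solutions $(5, 5)$ and $(1, 0)$ given earlier.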

### Equivalent equations

An equation is called **linear** if it can be reduced to linear form using elementary operations such as:
- adding the same expression to both sides,
- simplifying terms,
- multiplying both sides by a non-zero number.

For example,
$$
5y - 4xz + 3y - z - 5 = -4xz + 6x - 7
$$
is linear because it simplifies to
$$
-6x + 8y - z = -2.
$$
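
As a sanity check on the simplification (a spot check on a grid of integers, not a proof), one can confirm that the original equation and the reduced one hold at exactly the same points. The function names below are illustrative.

```python
from itertools import product


def original_holds(x, y, z):
    """The equation as first written, including the products x*z."""
    return 5 * y - 4 * x * z + 3 * y - z - 5 == -4 * x * z + 6 * x - 7


def reduced_holds(x, y, z):
    """The reduced linear form -6x + 8y - z = -2."""
    return -6 * x + 8 * y - z == -2


grid = range(-5, 6)
agree = all(
    original_holds(x, y, z) == reduced_holds(x, y, z)
    for x, y, z in product(grid, repeat=3)
)
print(agree)
```

The terms $-4xz$ cancel on both sides, which is why the non-linear-looking original still counts as a linear equation.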

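Two equations that have the same solution set are called **equivalent**. Each of the operations listed above turns an equation into an equivalent one, so the two equations in this example have exactly the same solutions.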


### Systems of linear equations

A collection of $m$ linear equations in $n$ variables,
$$
\begin{cases}
a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n = b_1, \\
a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n = b_2, \\
\vdots \\
a_{m1}x_1 + a_{m2}x_2 + \cdots + a_{mn}x_n = b_m,
\end{cases}
$$
is called a **system of linear equations**.

A vector
$$
(\chi_1, \chi_2, \dots, \chi_n)
$$
is a solution of the system if it satisfies **every equation** in the system.


### Examples

All of the following are systems of linear equations:
$$
\begin{cases}
3x + 2y = 2, \\
4x + 3y = 1,
\end{cases}
$$

$$
\begin{cases}
t_1 - 3t_2 = 5, \\
t_2 + t_3 = 4, \\
t_1 - 7t_2 = 0,
\end{cases}
$$

$$
\begin{cases}
x + y + z = -6, \\
2x - z = 1, \\
-2z = y + 4, \\
t = 12.
\end{cases}
$$

If a variable does not explicitly appear in an equation, it is assumed to have coefficient $0$. For example, the last system can be rewritten as:
$$
\begin{cases}
x + y + z + 0t = -6, \\
2x + 0y - z + 0t = 1, \\
0x - y - 2z + 0t = 4, \\
0x + 0y + 0z + t = 12.
\end{cases}
$$
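
Writing every coefficient explicitly is what makes a system easy to store as a table of numbers. The sketch below assumes one possible layout, a list of rows with the right-hand side in the last position; `AUGMENTED` and `is_solution` are illustrative names.

```python
# Each row holds the coefficients of x, y, z, t followed by the right-hand side.
# The third row comes from rewriting -2z = y + 4 as -y - 2z = 4.
AUGMENTED = [
    [1,  1,  1, 0, -6],
    [2,  0, -1, 0,  1],
    [0, -1, -2, 0,  4],
    [0,  0,  0, 1, 12],
]


def is_solution(augmented, values):
    """True if `values` satisfies every equation encoded in `augmented`."""
    return all(
        sum(coeff * v for coeff, v in zip(row[:-1], values)) == row[-1]
        for row in augmented
    )


print(is_solution(AUGMENTED, (3, -14, 5, 12)))  # True
print(is_solution(AUGMENTED, (0, 0, 0, 12)))    # False: the first equation fails
```

A candidate counts as a solution only if it passes the check for every row, matching the definition given above.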

## The number of solutions and the geometrical interpretation of SLEs

Note that $(1, 3)$ is a solution of the system
$$
\begin{cases}
3x - 2y + 3 = 0, \\
2x + y - 5 = 0,
\end{cases}
$$
since
$$
\begin{cases}
3\cdot 1 - 2\cdot 3 + 3 = 3 - 6 + 3 = 0, \\
2\cdot 1 + 3 - 5 = 2 + 3 - 5 = 0.
\end{cases}
$$

Furthermore, we can show that this is the **unique solution** of the system.

### Geometrical interpretation

This fact can be easily illustrated by the geometric meaning of linear equations.
The set of solutions of a linear equation in two variables is a set of pairs $(x, y)$, which can be interpreted as points on the plane.

Any linear equation in two variables in which at least one variable has a non-zero coefficient defines a **straight line** on the plane.
(We will leave this fact without proof, but it is worth reflecting on it, as it gives another perspective on the word *linear*.)

Since exactly one straight line passes through any two distinct points on a plane, we can find two particular solutions of each equation and draw the corresponding lines.

For example, for the equation
$$
3x - 2y + 3 = 0,
$$
two solutions are
$$
(-1, 0) \quad \text{and} \quad (3, 6).
$$
The straight line passing through these points represents all solutions of this equation.

Applying the same procedure to the second equation,
$$
2x + y - 5 = 0,
$$
we obtain another straight line.

The **intersection point** of these two lines corresponds to the pair of numbers that satisfies **both equations simultaneously**. Therefore, the point
$$
(1, 3)
$$
is indeed the unique solution of the system.
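
The intersection point can also be computed rather than read off a drawing. The sketch below uses Cramer's rule, a technique the text does not introduce, so treat it as an illustration only; `solve_2x2` is a hypothetical helper that expects integer coefficients. When the determinant is non-zero, the two lines cross at exactly one point.

```python
from fractions import Fraction


def solve_2x2(a1, b1, c1, a2, b2, c2):
    """Solve a1*x + b1*y = c1, a2*x + b2*y = c2 by Cramer's rule.

    Expects integer coefficients. Returns (x, y), or None when the
    determinant is zero and the rule does not apply.
    """
    det = a1 * b2 - a2 * b1
    if det == 0:
        return None
    x = Fraction(c1 * b2 - c2 * b1, det)
    y = Fraction(a1 * c2 - a2 * c1, det)
    return x, y


# 3x - 2y + 3 = 0 and 2x + y - 5 = 0, with the constants moved to the right-hand side.
x, y = solve_2x2(3, -2, -3, 2, 1, 5)
print(x, y)
```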

### Number of equations vs number of solutions

Adding more equations introduces more constraints on the variables, which restricts the set of possible solutions. Intuitively, it might seem that when the number of unknowns equals the number of equations, the solution must be unique. This intuition is close to the truth, but not always correct.

Consider the following two systems:
$$
\begin{cases}
3a + 6b = 9, \\
a + 2b = 3,
\end{cases}
\qquad \text{and} \qquad
\begin{cases}
3a + 6b = 9, \\
a + 2b = 2.
\end{cases}
$$

- The **first system** has solutions such as $(1, 1)$ and $(3, 0)$, and in fact has **infinitely many solutions**.
- The **second system** has **no solutions**.

Geometrically, the first system corresponds to **two identical lines**, so every point on that line is a solution.
The second system corresponds to **two parallel lines**, which never intersect, hence no solution exists.
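
The three geometric cases can be told apart from the coefficients alone. The following is a minimal sketch for two equations in two unknowns, assuming each equation has at least one non-zero coefficient; `classify` is an illustrative name.

```python
def classify(a1, b1, c1, a2, b2, c2):
    """Classify the system a1*x + b1*y = c1, a2*x + b2*y = c2.

    Assumes each equation has at least one non-zero coefficient,
    so that each one really describes a line.
    """
    det = a1 * b2 - a2 * b1
    if det != 0:
        return "unique"  # the lines cross at one point
    # det == 0: the lines are either parallel or identical.
    same_line = (a1 * c2 - a2 * c1 == 0) and (b1 * c2 - b2 * c1 == 0)
    return "infinitely many" if same_line else "none"


print(classify(3, 6, 9, 1, 2, 3))    # infinitely many
print(classify(3, 6, 9, 1, 2, 2))    # none
print(classify(3, -2, -3, 2, 1, 5))  # unique
```

The last call uses the system from the start of this section, whose lines intersect at $(1, 3)$.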


The problem of solving arbitrary systems of linear equations is not as difficult as it may appear, and the geometric interpretation can be extended to higher dimensions. Next, we will explore more specific applications of these ideas.

## Some examples of SLE applications

### Example 1: Interpolation by a quadratic polynomial

Consider a process described by the model
$$
f(t) = at^2 + bt + c
$$
(for example, the dependence of the coordinate of a thrown body on time).

We know the functional form, but the coefficients $a$, $b$, and $c$ are unknown.
After three experiments, we obtain:
$$
f(0) = 6, \quad f(1) = 2, \quad f(2) = -12.
$$

This allows us to determine $f(t)$ using a system of linear equations.

$$
\begin{cases}
6 = f(0) = a \cdot 0^2 + b \cdot 0 + c = c, \\
2 = f(1) = a \cdot 1^2 + b \cdot 1 + c = a + b + c, \\
-12 = f(2) = a \cdot 2^2 + b \cdot 2 + c = 4a + 2b + c.
\end{cases}
$$

Simplifying, we obtain:
$$
\begin{cases}
c = 6, \\
a + b + c = 2, \\
4a + 2b + c = -12.
\end{cases}
$$

It is easy to check that the unique solution is:
$$
a = -5, \quad b = 1, \quad c = 6.
$$

Therefore, the process is described by the function
$$
f(t) = -5t^2 + t + 6.
$$
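
The coefficients can also be computed instead of checked by hand. The sketch below uses Gauss-Jordan elimination with exact fractions, a method the text has not introduced yet; `solve_unique` and the row layout are illustrative choices, and the function assumes the system has exactly one solution.

```python
from fractions import Fraction


def solve_unique(augmented):
    """Gauss-Jordan elimination on an n x (n + 1) augmented matrix.

    Assumes the system has exactly one solution; raises ValueError otherwise.
    """
    rows = [[Fraction(v) for v in row] for row in augmented]
    n = len(rows)
    for col in range(n):
        # Find a row at or below the diagonal with a non-zero entry in this column.
        pivot = next((r for r in range(col, n) if rows[r][col] != 0), None)
        if pivot is None:
            raise ValueError("no unique solution")
        rows[col], rows[pivot] = rows[pivot], rows[col]
        # Scale the pivot row so that the pivot equals 1.
        p = rows[col][col]
        rows[col] = [v / p for v in rows[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                factor = rows[r][col]
                rows[r] = [v - factor * w for v, w in zip(rows[r], rows[col])]
    return [row[n] for row in rows]


# Unknowns in the order (a, b, c); the last column is the right-hand side.
system = [
    [0, 0, 1, 6],
    [1, 1, 1, 2],
    [4, 2, 1, -12],
]
a, b, c = solve_unique(system)
print(a, b, c)
```

Using `Fraction` keeps every intermediate value exact, so the result can be compared directly with the coefficients stated above.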

The problem we have just solved is called the **search for an interpolation polynomial**.
In general, many interpolation and regression problems are solved using systems of linear equations.
For example, the well-known **least squares method** in linear regression is based on solving a specific SLE.

### Example 2: Finding the intersection of two lines

Now let us consider a geometric application.

Suppose we have a coordinate plane with two straight lines:
- The first line passes through points $(-2, 5)$ and $(4, 3)$.
- The second line passes through points $(-1, 0)$ and $(-2, -2)$.

We want to find their intersection point (if it exists).

To do this, we first need to determine the equations of these lines.
Any straight line on a plane can be uniquely determined by two distinct points.

In this example, the equations are:
$$
x + 3y - 13 = 0 \quad \text{(first line)}
$$
and
$$
-2x + y - 2 = 0 \quad \text{(second line)}.
$$

Although these equations were obtained by guessing, we can verify them.

For the first line:
- Substituting $(-2, 5)$:
$$
-2 + 3 \cdot 5 - 13 = -2 + 15 - 13 = 0,
$$
- Substituting $(4, 3)$:
$$
4 + 3 \cdot 3 - 13 = 4 + 9 - 13 = 0.
$$

Both points satisfy the equation, so it is correct.
A similar check can be done for the second line.
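
The equations do not have to be guessed. One standard construction, sketched here with the hypothetical helpers `line_through` and `on_line`, takes two distinct points $(x_1, y_1)$ and $(x_2, y_2)$ and uses the coefficients $A = y_1 - y_2$, $B = x_2 - x_1$, $C = x_1 y_2 - x_2 y_1$ in $Ax + By + C = 0$. The result may differ from the equations above by a non-zero factor, which does not change the line.

```python
def line_through(p, q):
    """Coefficients (A, B, C) of a line A*x + B*y + C = 0 through distinct points p and q."""
    (x1, y1), (x2, y2) = p, q
    if (x1, y1) == (x2, y2):
        raise ValueError("the two points must be distinct")
    return y1 - y2, x2 - x1, x1 * y2 - x2 * y1


def on_line(coeffs, point):
    """True if `point` satisfies A*x + B*y + C = 0."""
    A, B, C = coeffs
    x, y = point
    return A * x + B * y + C == 0


first = line_through((-2, 5), (4, 3))     # (2, 6, -26), i.e. 2 * (x + 3y - 13) = 0
second = line_through((-1, 0), (-2, -2))  # (2, -1, 2), i.e. -1 * (-2x + y - 2) = 0

print(on_line(first, (-2, 5)), on_line(first, (4, 3)))
print(on_line(second, (-1, 0)), on_line(second, (-2, -2)))
```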

Now we can write the system of linear equations:
$$
\begin{cases}
x + 3y - 13 = 0, \\
-2x + y - 2 = 0.
\end{cases}
$$

Solving this system, we obtain the solution:
$$
(x, y) = (1, 4).
$$

Since $(1, 4)$ satisfies both equations, it lies on both lines.
Therefore, this point is the **intersection point** of the two straight lines.

## Conclusion

Solving systems of linear equations (SLEs) is an essential technique for anyone whose work involves mathematics or computation.

An equation is called **linear** if it can be reduced to the form
$$
a_1 x_1 + a_2 x_2 + \cdots + a_n x_n = b,
$$
where $a_1, a_2, \ldots, a_n$ and $b$ are fixed numbers, and $x_1, x_2, \ldots, x_n$ are unknown variables.

A set of linear equations involving the same variables
$$
x_1, x_2, \ldots, x_n
$$
is called a **system of linear equations**.

Any set of numbers that satisfies **every equation** in the system is called a **solution** of the system.

A system of linear equations may have:
- a **unique solution**,
- **infinitely many solutions**, or
- **no solutions at all**.

This is also true in the special case when an SLE contains only one equation (then it is simply a linear equation).

Linear equations in two variables describe **straight lines on a plane**.
By solving systems of linear equations, we can find the **intersection points** of these lines.