## 2. Counting

Counting is not always easy. Some counting problems are complex, for example: in how many different ways can 5 different genes be arranged to effectively activate a biological process? Or: in how many different ways can a group of images be selected from a set of 10?

---

## The Multiplication principle

**Combinatorics:** counting outcomes by listing them directly is laborious and time-consuming. We rely on mathematical rules to answer such questions quickly.

Let's imagine two different events A and B from different processes. For example, A can be rolling a die and B drawing a card from a deck of 52 cards.
A has 6 possible outcomes and B has 52 possible outcomes.

Let's define variables.

If event A has m possible outcomes and event B has n possible outcomes, then the number of possible combined outcomes of A and B is $m \times n$.

![title](combinatorials.png)

In our previous example we have $6 \times 52 = 312$ possible different outcomes.
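As a quick check of the multiplication principle, the pairs can be enumerated directly. This is a minimal sketch in base R; the names `die`, `cards`, and `outcomes` are illustrative and the cards are simply labelled 1 to 52.

```r
# Enumerate every (die, card) pair; cards are labelled 1..52 for illustration only
die <- 1:6
cards <- 1:52
outcomes <- expand.grid(die = die, card = cards)

# The number of rows equals m * n
nrow(outcomes)
length(die) * length(cards)
```

Listing the pairs is feasible here, but the table grows quickly, which is why the product rule is preferred.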

Example 2: A computer manufacturer has 15 monitors, each of which can be connected to any one of 12 systems for shipping out to retail outlets. How many different combinations of systems and monitors are there?

<h4><center>**There are 12 X 15 = 180 different combinations**!</center></h4>



#### We can generalize the rule to be applied to multiple events.

If k experiments have $n_1, n_2, n_3, \ldots, n_k$ possible outcomes respectively, then together the k experiments have a total of $n_1 \times n_2 \times n_3 \times \cdots \times n_k$ possible outcomes.

For example, suppose a protein complex consists of 5 proteins encoded by 5 different genes. Suppose there are 4 alleles for protein 1, 2 alleles for protein 2, 9 alleles for protein 3, 11 alleles for protein 4, and 6 alleles for protein 5. How many different allele combinations are possible for this protein complex?

The answer is a straightforward multiplication of the number of alleles of each of the 5 proteins:

In [None]:
# Total number of allele combinations across the 5 proteins
4*2*9*11*6
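The same count can be written with base R's `prod()`, which multiplies all elements of a vector and scales to any number of experiments. The names `alleles` and `n_complexes` are illustrative.

```r
# Number of alleles for proteins 1 to 5 (values from the example above)
alleles <- c(4, 2, 9, 11, 6)

# Multiplication principle: n_1 * n_2 * ... * n_k
n_complexes <- prod(alleles)
n_complexes
```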

---

### Ordered Selection With Replacement - Permutations with replacement

The multiplication principle counts the combined outcomes of separate experiments. In many cases we instead draw repeatedly from the same set and want to count the number of arrangements when order matters and repeated elements are allowed.

For example, take a set of n = 3 elements, {1,2,3}, from which we draw k = 2 elements at a time.

In general, there are k positions in the chosen list: (Position 1, Position 2, ..., Position k). There are n options for each position. Thus, when ordering matters and repetition is allowed, the total number of ways to choose k objects from a set with n elements is

$$n \times n \times \cdots \times n = n^k$$

Note that this is a special case of the multiplication principle where there are k "experiments" and each experiment has n possible outcomes.

In our example we have n = 3 (also called the size) and k = 2 (the number of trials); this notation will become very important later on.

$$3^2=9$$


In [None]:
library(gtools)
# All ordered selections of k = 2 elements from n = 3, repetition allowed
permutations(3, 2, repeats.allowed = TRUE)
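If `gtools` is not available, the same enumeration can be sketched in base R with `expand.grid()`, which builds one row per ordered selection. The variable name `draws` is illustrative.

```r
n <- 3
k <- 2

# One column per position, each ranging over the n elements
draws <- expand.grid(rep(list(1:n), k))

nrow(draws)  # number of ordered selections with replacement
n^k          # the formula gives the same count
```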

Example 2:

What is the probability that 10 persons, picked at random, are all born in May? 
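One way to approach this example is to treat each person's birth month as an independent draw with replacement. The sketch below relies on simplifying assumptions that are not stated in the text: either all 12 months are equally likely, or all 365 days of a non-leap year are equally likely (May has 31 days).

```r
k <- 10

# Assumption 1: 12 equally likely months; of the 12^k ordered outcomes, one is "all May"
p_months <- (1 / 12)^k

# Assumption 2: 365 equally likely birthdays, 31 of which fall in May
p_days <- (31 / 365)^k

p_months
p_days
```

Under either assumption the probability is extremely small, because the favourable outcomes are a tiny fraction of the $n^k$ possible ones.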


---

### Permutations without replacement: Arrangements on same set

**In permutations the order matters**

In a protein complex, there are 4 subunits that together can activate different pathways depending on the order in which the subunits are arranged. Let's call these subunits {A,B,C,D}. What are all the possible permutations of these four subunits?

In [None]:
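subun = c("A","B","C","D")
# List every ordering of the 4 subunits (requires gtools, loaded above)
permutations(4, 4, subun, repeats.allowed = FALSE)
# Count the orderings: one row per permutation
nrow(permutations(4, 4))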

Writing out all possible arrangements is tedious and error-prone, particularly when we have a large number of elements. Let's analyze the problem from a mathematical perspective.

If we name $S_1$ the first subunit, we can choose any of the four, so we have 4 options. For $S_2$ we have only 3 options left, as we have already chosen one; for $S_3$ there are 2, and for $S_4$ there is 1. Using our multiplication rule we have

$$4 \times 3 \times 2 \times 1 = 24$$

#### Generalizing:

The number of different arrangements (permutations) of n objects in a row is $n(n-1)(n-2) \cdots 3 \cdot 2 \cdot 1$.

This product is called a **factorial** and has the special notation n!, read as "n factorial".

4! = 24

In [None]:
factorial(4)

An important property of factorials is that

0! = 1
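One common way to motivate this convention (a short sketch, not a formal proof) is to extend the recurrence $n! = n \cdot (n-1)!$ down to n = 1:

$$1! = 1 \cdot 0! \quad\Rightarrow\quad 0! = \frac{1!}{1} = 1$$

It also matches the counting interpretation: there is exactly one way to arrange zero objects, namely the empty arrangement.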

Example: 

A CD holder has two compartments, each of which holds five CDs. If I have five rock CDs and five classical CDs, in how many different ways can I store them if:

(a) they are all mixed together, 

(b) the rock and classical CDs are to be stored separately?
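A possible reading of this example, sketched in R: the holder offers 10 distinct slots and all 10 CDs are distinguishable. For (b) the sketch assumes that either genre may go in either compartment, which contributes a factor of 2; if the compartments are assigned in advance, that factor should be dropped. The variable names are illustrative.

```r
# (a) All 10 CDs mixed: any arrangement of 10 distinct CDs in 10 slots
mixed <- factorial(10)

# (b) Genres kept apart: 5! orders within each compartment,
#     times 2 for which compartment holds the rock CDs (an assumption)
separate <- 2 * factorial(5) * factorial(5)

mixed
separate
```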



### Arrangements without replacement 

Going back to sampling with replacement, recall that we chose k elements from a set of n elements and the same element could be drawn more than once. In many cases, however, we do not return the element to the main group after drawing it. The main group then shrinks by one after each draw.

#### Order Relevant - Permutations nPk

If the order of our arrangement is relevant and we are not replacing objects, there are n ways of drawing the first element, (n-1) ways of drawing the second, and in general (n-k+1) ways of drawing the kth element. By the multiplication principle, the number of ordered selections is

$$n(n-1)(n-2) \cdots (n-k+1)$$

This expression can also be written as 
\begin{equation*}nPk = \frac{n!}{(n-k)!}\end{equation*}
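To see that the two expressions agree, here is a minimal sketch in base R. The helper `n_perm` is hypothetical (it is not part of `gtools` or base R) and is written only to illustrate the formula for small n.

```r
# Hypothetical helper: number of ordered selections of k elements
# from n without replacement, computed as n! / (n - k)!
n_perm <- function(n, k) factorial(n) / factorial(n - k)

n <- 4
k <- 2

n_perm(n, k)          # n! / (n - k)!
prod((n - k + 1):n)   # n (n-1) ... (n-k+1) written out; valid for k >= 1
```

When k = n the denominator is 0! = 1, so the count reduces to n!, the number of arrangements of the whole set.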

 

In [None]:
## in our subunit example: n = k = 4, so nPk = 4!/0! = 24
nrow(permutations(4, 4, subun, set = TRUE, repeats.allowed = FALSE))

#### Order Irrelevant - Combinations nCk

**In combinations order doesn't matter**

If order does not matter, the k! different orderings of the same k elements all count as a single selection. We therefore divide the number of permutations nPk by k! to eliminate these duplicates:

\begin{equation*}nCk = \frac{n!}{(n-k)!k!}\end{equation*}

This quantity is called the *binomial coefficient* and can also be denoted as

\begin{equation*}{n \choose k} = \frac{n!}{(n-k)!k!} \end{equation*}

In [None]:
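# Number of ways to choose 4 of the 4 subunits when order is irrelevant
nrow(combinations(4, 4, subun, set = TRUE, repeats.allowed = FALSE))
# The single combination itself
combinations(4, 4, subun, set = TRUE, repeats.allowed = FALSE)
# Same count from the binomial coefficient
choose(4, 4)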
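Choosing all 4 of 4 subunits gives only one combination, so a smaller k makes the effect of dividing by k! easier to see. The sketch below uses base R's `combn()` with k = 2; the names `subunits` and `pair_sets` are illustrative.

```r
subunits <- c("A", "B", "C", "D")
k <- 2

# combn() returns one column per unordered selection of k elements
pair_sets <- combn(subunits, k)

ncol(pair_sets)                                  # enumerated count
choose(4, k)                                     # binomial coefficient
factorial(4) / factorial(4 - k) / factorial(k)   # nPk divided by k!
```

Each unordered pair such as {A, B} corresponds to 2! = 2 ordered arrangements (AB and BA), which is why the 12 permutations collapse to 6 combinations.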

Exercise:

How many selections of 3 songs can we obtain from a list of the 8 greatest hits of all time?

a. Sample with replacement (order is relevant)

<h3><center>$n^k$</center></h3>

b. Sample without replacement but order is relevant

<h3><center>nPk</center></h3>

c. Sample without replacement but order is irrelevant

<h3><center>nCk</center></h3>