The Merge Sort Algorithm may seem complicated to implement because it uses a recursive approach, which trips up a lot of people. The easiest way to reason about merge sort is to imagine you have a big pile of cards that you want to sort.
The principle is simple: divide the pile of cards into 2 piles (left & right) until the smaller piles consist of 1 card only. Then start merging the left & right piles back together until you reach the top of the pile.
When merging 2 piles of cards, compare the top card of each pile and place the cards into the new pile in the correct (sorted) order.
At the end of this recursive process we end up with the whole pile of cards sorted.
To make what was described even clearer check out the following example with cards:
Now that we know what the Merge Sort Algorithm is in a nutshell, let us define a couple of rules and technical steps required for this algorithm to work.
- On every call find the middle of the incoming array/slice (divide it in 2)
- Sort the LEFT side (recursively)
- Sort the RIGHT side (recursively)
- Merge the sorted LEFT & RIGHT sides (at every level, as the recursion unwinds)
- Array/Slice of 1 element is considered sorted
- Repeat the process using the Divide & Conquer principle (Recursive)
Here’s a small example of what the process looks like:
While every problem out there can be solved using the good old procedural approach, in many cases a Divide & Conquer (recursive) approach makes the problem much easier and simpler to solve.
The Divide & Conquer principle consists of 3 simple steps:
- Divide the problem into smaller problems of the same type
- Repeat the process till the problem can no longer be divided
- Collect & Merge indivisible problems (solutions) or Conquer
So, we’re getting closer to putting the Merge Sort Algorithm together; we just need to establish a couple of things we’ll need and how we plan to structure them:
- We need a `merge` function that merges two arrays/slices into one, sorting the result in the process
- We need a recursive `sort` function
- Call `sort` recursively for the LEFT & RIGHT sides
- Call `merge` on the sorted LEFT & RIGHT sides at every level of the recursion
- Return the sorted slice back to the caller
Right off the bat, the merge function’s responsibility is to merge 2 given arrays/slices into 1, sorting the result in the merging process.
The merge function will primarily do these 3 things:
- Compare elements from the LEFT & RIGHT sides for as long as both sides still have elements left. If we pass a slice of length 7 and a slice of length 3, the comparison loop runs at least 3 times and stops as soon as either side runs out of elements, so we never do more comparisons than necessary.
- On each iteration we compare the current elements of the LEFT & RIGHT sides and insert the smaller one into the resulting array/slice, until one side is exhausted.
- When the comparison loop ends, we insert all the remaining elements. This covers the case when one side holds most of the smaller elements: the other side will still have elements we have not yet inserted into the resulting array/slice.
While the merge function does everything we enumerated above, there is 1 thing it does not do, which we, the developers, are responsible for before calling it. The merge function expects the following precondition in order to work correctly:
The RIGHT & LEFT arrays/slices MUST be Sorted
The merge function must be able to work with different array/slice lengths. However, because on every sort call we divide the array/slice in 2, the merge function will always be called with sides whose lengths differ by at most 1 (n and n, or n and n+1 elements).
Here’s a small overview of the merge function and how it should work when for example we call it with the following LEFT & RIGHT sides:
merge([]int{4,5,7}, []int{1,3,6})
function merge(L, R) {
    let A[0..L.length+R.length]
    i, j, k = 0, 0, 0
    // compare LEFT & RIGHT side elements before merging
    while i < L.length and j < R.length {
        if L[i] < R[j] {
            A[k] = L[i]
            i++
        } else {
            A[k] = R[j]
            j++
        }
        k++
    }
    // check if any elements from the left/right side
    // were missed in the comparison section above
    while i < L.length {
        A[k] = L[i]
        i++
        k++
    }
    while j < R.length {
        A[k] = R[j]
        j++
        k++
    }
    return A
}
As you can see from the pseudocode above, we’re mainly doing 2 things:
- Compare the elements and decide which one goes first in the resulting A array/slice.
...
    while i < L.length and j < R.length {
        if L[i] < R[j] {
            A[k] = L[i]
            i++
        } else {
            A[k] = R[j]
            j++
        }
        k++
    }
...
Also, notice the sorting condition above is hardcoded. Instead, we could receive a callback function that accepts `a` and `b` and returns a `bool`, giving control of the sorting order to the caller. Something like the following:
type sortFunc func(a, b int) bool
- At the end we check for any leftover elements and add them at the end of the resulting array/slice A
...
    while i < L.length {
        A[k] = L[i]
        i++
        k++
    }
    while j < R.length {
        A[k] = R[j]
        j++
        k++
    }
    return A
return A
...
For the Go implementation check out the merge function
You may be asking yourself: how the hell do we sort things when all we do is merge 2 already sorted arrays/slices and call a function that uses merge recursively? Well, that’s the beauty of the Divide & Conquer principle.
By dividing the initial array/slice, which may consist of a huge number of elements, into smaller arrays/slices and repeating the process, we reach the very bottom of the recursion, where an array/slice consisting of 1 element is by definition trivially considered already sorted. This means that when we start merging we meet both conditions:
- We always provide sorted LEFT & RIGHT arrays/slices (starting from arrays/slices of length 1).
- We repeat the process until we’ve merged everything back into the initial big array/slice, this time in sorted order.
Have a look at how exactly the sort function will get called for a simple example like the following:
sort([]int{6,2,1,3,5,4}) // [1, 2, 3, 4, 5, 6]
function sort(A) {
    if A.length > 1 {
        n1 = A.length / 2
        n2 = A.length - n1
        let L[0..n1] and R[0..n2]
        // split A in half => L and R arrays/slices
        for i=0 to n1-1 {
            L[i] = A[i]
        }
        for j=0 to n2-1 {
            R[j] = A[n1+j]
        }
        // for languages that use clever constructs
        // m = A.length / 2
        // L = A[:m]
        // R = A[m:]
        // recursively sort & merge L & R arrays/slices
        // A starts growing when recursion reaches the very bottom
        A = merge(sort(L), sort(R))
    }
    // return when array/slice consists of 1 element
    return A
}
The sort function primarily consists of 3 steps:
- Split the incoming array/slice into 2 sides on every recursive call & create the LEFT & RIGHT arrays/slices.
...
    n1 = A.length / 2
    n2 = A.length - n1
    let L[0..n1] and R[0..n2]
    // split A in half => L and R arrays/slices
    for i=0 to n1-1 {
        L[i] = A[i]
    }
    for j=0 to n2-1 {
        R[j] = A[n1+j]
    }
...
- Return array/slice when it cannot be divided anymore, or it has only 1 element.
    if A.length > 1 {
        ...
    }
    // Here we have an array/slice of 1 element only
    return A
- Call sort and merge for LEFT & RIGHT sides recursively, till exit condition is met (array/slice has 1 element only or both sides were merged).
...
// recursively sort & merge L & R arrays/slices
// A starts growing when recursion reaches the very bottom
A = merge(sort(L), sort(R))
...
For the Go implementation check out the sort function and insertionMergeSort function
As I say in every one of my tutorials, there is really no perfect tool, solution, or algorithm that does it all. You really have to use the right tool for the right problem, or rather the right Sorting Algorithm for the right situation. That’s why, before we wrap this up, let’s explore a couple of facts about the Merge Sort Algorithm, so that you know when it is best to use it and when it is not:
- Merge Sort is Efficient & Very Fast for big data sets
- Merge Sort Time Complexity is: O(n*log(n))
- Merge Sort Space Complexity is: O(n)

Here are a couple of facts about the Insertion Sort Algorithm:

- Insertion Sort is Efficient for small data sets
- Insertion Sort Best Time Complexity is: O(n)
- Insertion Sort Worst/Average Time Complexity is: O(n*n)
- Insertion Sort Space Complexity is: O(1)
Knowing the above facts helps us identify & diagnose the sorting problem and gives us an idea of how we should proceed with sorting and which kind of sorting algorithm to use.
# cd into the package
cd sort
# run benchmark tests for BenchmarkInsertionMergeSortFunc
# feel free to change the above benchmark name with your own
go test -v -bench 'BenchmarkInsertionMergeSortFunc'
The current benchmarks ran on a relatively small data set of 100 000 randomly generated integers. In real life, data sets can be much bigger, and the distribution of the data can be worse in terms of how sorted it already is.
Thus, as you can see, the Merge Sort Algorithm, as well as the hybrid algorithm that combines the Insertion Sort Algorithm with the merge step, performs way better, and both are pretty stable algorithms, which makes the results very predictable.
The results may vary, due to the randomly generated numbers, as well as depending on the specific environment and machine they are run on.
These are the results I got on my personal Macbook Pro 2019 machine, yours might be different:
Bottom line, the indicator that one algorithm performs best is how many times it can be run in a time window & how many ns/op it takes.
Pay attention to the 1st number, which shows how many times the function was executed in a small time window (bigger is better), and to the 2nd number, which shows how many nanoseconds per operation it took (smaller is better).
goos: darwin
goarch: amd64
pkg: github.com/algorithms-go/sorting/sort
cpu: Intel(R) Core(TM) i9-9880H CPU @ 2.30GHz
BenchmarkInsertionSort
BenchmarkInsertionSort-16 21 52535921 ns/op
PASS
goos: darwin
goarch: amd64
pkg: github.com/algorithms-go/sorting/sort
cpu: Intel(R) Core(TM) i9-9880H CPU @ 2.30GHz
BenchmarkMergeSort
BenchmarkMergeSort-16 747 1608912 ns/op
PASS
goos: darwin
goarch: amd64
pkg: github.com/algorithms-go/sorting/sort
cpu: Intel(R) Core(TM) i9-9880H CPU @ 2.30GHz
BenchmarkMergeSortFunc
BenchmarkMergeSortFunc-16 705 1614946 ns/op
PASS
goos: darwin
goarch: amd64
pkg: github.com/algorithms-go/sorting/sort
cpu: Intel(R) Core(TM) i9-9880H CPU @ 2.30GHz
BenchmarkInsertionMergeSortFunc
BenchmarkInsertionMergeSortFunc-16 915 1293846 ns/op