# Nearest neighbour algorithm

The nearest neighbour algorithm was one of the first algorithms used to solve the travelling salesman problem approximately. The salesman starts at a random city and repeatedly visits the nearest unvisited city until all cities have been visited. The algorithm quickly yields a short tour, but usually not the optimal one.

## Algorithm

These are the steps of the algorithm:

1. Select an arbitrary vertex and set it as the current vertex u. Mark u as visited.
1. Find the shortest edge connecting the current vertex u to an unvisited vertex v. Set v as the current vertex u. Mark v as visited.
1. If all vertices have been visited, terminate. Otherwise, go to step 2.

If the steps above are not yet clear, don't worry. The next section works through them on a concrete example.

## Step by step

First, we import the Python dependencies required for this section.

In [1]:
import numpy as np

Next, we define a travel graph containing four cities a, b, c and d. The distances between them are given by a distance matrix.

|  | a  | b  | c  | d  |
|--------|----|----|----|----|
| a      | 0  | 20 | 15 | 35 |
| b      | 20 | 0  | 10 | 25 |
| c      | 15 | 10 | 0  | 12 |
| d      | 35 | 25 | 12 | 0  |

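Each element of this matrix is the distance between the cities of its row and its column. For example, the distance between a and c is 15. Note that because the distance between any two cities is the same in both directions, the distance matrix is symmetric.

The code is as follows: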

In [2]:
label = ['a', 'b', 'c', 'd']
G = [
    [0,20,15,35],
    [20,0,10,25],
    [15,10,0,12],
    [35,25,12,0]
]
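
As a quick sanity check, the symmetry noted above can be verified with NumPy. This is only a minimal sketch: the name `D` is introduced here for illustration and holds the same values as `G`.

```python
import numpy as np

# Distance matrix from the table above (rows and columns in the order a, b, c, d).
# `D` is a hypothetical name used only in this sketch.
D = np.array([
    [0, 20, 15, 35],
    [20, 0, 10, 25],
    [15, 10, 0, 12],
    [35, 25, 12, 0],
])

# A matrix is symmetric when it equals its own transpose.
is_symmetric = np.array_equal(D, D.T)

# The distance from a city to itself is zero, so the diagonal should be all zeros.
has_zero_diagonal = not np.diag(D).any()

print(is_symmetric, has_zero_diagonal)
```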

#### Step 1. Choose any starting node

Select an arbitrary vertex, set it as the current vertex u. Mark u as visited.

In [3]:
city = np.random.randint(len(label))
current, tour, tour_length, tour_lengths = city, [city], 0, []
print(f'Start node: {current}, Tour: {tour}')

Start node: 2, Tour: [2]


#### Step 2. Consider the arcs which join the node just chosen to nodes as yet unchosen.  Pick the one with minimum weight and add it to the cycle

Find out the shortest edge connecting the current vertex u and an unvisited vertex v. Set v as the current vertex u. Mark v as visited.


We define a function to implement this step.

In [4]:
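def find_closest_neighbor(city, G, label, tour):
    """Return the index of the unvisited city closest to `city`, or -1 if none is left."""
    w = float('inf')  # smallest distance found so far
    index = -1
    for i in range(len(label)):
        # Use the `city` argument (not a global) and skip cities already in the tour.
        if i not in tour and i != city and G[city][i] < w:
            w = G[city][i]
            index = i
    return index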

closest_neighbor = find_closest_neighbor(current, G, label, tour)
print(f'Node: {current}, Closest neighbor: {closest_neighbor}')

Node: 2, Closest neighbor: 1


#### Step 3. Repeat step 2 until all nodes have been chosen

If all vertices have been visited, terminate. Otherwise, go to step 2.

In [5]:
while len(tour) != len(label):
    index = find_closest_neighbor(current, G, label, tour)
    current = index
    tour.append(current)
print(f'Final Tour: {list(map(lambda x:label[x], tour))}')

Final Tour: ['c', 'b', 'a', 'd']


The sequence of the visited vertices is the output of the algorithm.
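
The walkthrough above records only the order of the visit, not its length. The sketch below shows one way the length could be computed, assuming the salesman returns to the start city at the end. The helper name `route_length` and its `closed` parameter are hypothetical and not part of the notebook.

```python
G = [
    [0, 20, 15, 35],
    [20, 0, 10, 25],
    [15, 10, 0, 12],
    [35, 25, 12, 0],
]
tour = [2, 1, 0, 3]  # the tour c, b, a, d found above, as indices


def route_length(tour, G, closed=True):
    """Sum the edge weights along the tour; if closed, add the edge back to the start."""
    length = sum(G[u][v] for u, v in zip(tour, tour[1:]))
    if closed and len(tour) > 1:
        length += G[tour[-1]][tour[0]]
    return length


print(route_length(tour, G, closed=False))  # c -> b -> a -> d
print(route_length(tour, G))                # c -> b -> a -> d -> c
```

The open path costs 10 + 20 + 35 = 65, and closing the cycle adds the edge d to c of length 12, for a total of 77.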

## Note

The nearest neighbour algorithm is easy to implement and executes quickly, but it can sometimes miss shorter routes which are easily noticed with human insight, due to its "greedy" nature. As a general guide, if the last few stages of the tour are comparable in length to the first stages, then the tour is reasonable; if they are much greater, then it is likely that much better tours exist. Another check is to use an algorithm such as the lower bound algorithm to estimate if this tour is good enough.

In the worst case, the algorithm results in a tour that is much longer than the optimal tour. To be precise, for every constant r there is an instance of the travelling salesman problem such that the length of the tour computed by the nearest neighbour algorithm is greater than r times the length of the optimal tour. Moreover, for each number of cities there is an assignment of distances between the cities for which the nearest neighbour heuristic produces the unique worst possible tour. (If the algorithm is applied with every vertex as the starting vertex, the best path found will be better than at least N/2-1 other tours, where N is the number of vertices.)
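
The parenthetical remark above suggests a cheap improvement: run the heuristic once from every start vertex and keep the shortest result. The following is a minimal sketch of that idea on the four-city example. The function names `nearest_neighbour_tour` and `closed_length` are hypothetical, and tours are assumed to return to their start city.

```python
G = [
    [0, 20, 15, 35],
    [20, 0, 10, 25],
    [15, 10, 0, 12],
    [35, 25, 12, 0],
]


def nearest_neighbour_tour(G, start):
    """Greedy tour on a complete graph: always move to the closest unvisited vertex."""
    n = len(G)
    tour = [start]
    visited = {start}
    while len(tour) < n:
        current = tour[-1]
        nxt = min((v for v in range(n) if v not in visited),
                  key=lambda v: G[current][v])
        tour.append(nxt)
        visited.add(nxt)
    return tour


def closed_length(tour, G):
    """Length of the tour including the edge from the last vertex back to the first."""
    return sum(G[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))


# One tour per possible start vertex; keep the shortest.
tours = [nearest_neighbour_tour(G, s) for s in range(len(G))]
best = min(tours, key=lambda t: closed_length(t, G))

for t in tours:
    print(t, closed_length(t, G))
print('Best:', best, closed_length(best, G))
```

On this instance, starting from a gives a closed tour of length 85, while the other three starts each give 77, so the choice of start vertex does affect the result.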

On a graph that is not complete, the nearest neighbour algorithm may not find a feasible tour at all, even when one exists.
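
A small example may make this failure concrete. The sketch below uses a hypothetical four-vertex graph `H` in which one edge is missing. Representing a missing edge by `math.inf` is an assumption of this sketch, as is the function name `nearest_neighbour_path`.

```python
import math

INF = math.inf  # marks a missing edge (a convention chosen for this sketch)

# Hypothetical graph: the cycle 0-1-2-3-0 (weight 2 per edge) plus a cheap chord 0-2.
# The edge between 1 and 3 does not exist.
H = [
    [0,   2,   1,   2],
    [2,   0,   2,   INF],
    [1,   2,   0,   2],
    [2,   INF, 2,   0],
]


def nearest_neighbour_path(H, start):
    """Return the visited sequence, or None if the walk gets stuck before visiting all vertices."""
    tour = [start]
    while len(tour) < len(H):
        current = tour[-1]
        # Only unvisited vertices that are actually connected to the current one.
        candidates = [v for v in range(len(H))
                      if v not in tour and H[current][v] < INF]
        if not candidates:
            return None
        tour.append(min(candidates, key=lambda v: H[current][v]))
    return tour


print(nearest_neighbour_path(H, 0))
```

Starting at 0, the greedy choice takes the cheap chord to 2 and then moves to 1, from which the only unvisited vertex, 3, is unreachable. The tour 0, 1, 2, 3 and back to 0 uses only existing edges, so a feasible tour does exist.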

You can find the implementation of the nearest neighbour algorithm in pyTSP at the following location:

`pyTSP/source/algorithms/tour_construction.py#L33`

## Exercises

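 - Is the path found by this method guaranteed to be the shortest? Verify your answer with code (see the function that computes the path length in permutation.ipynb).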