# XOR Gate using Backpropagation in Neural Networks

Relevant [walkthrough](https://towardsdatascience.com/implementing-the-xor-gate-using-backpropagation-in-neural-networks-c1f255b4f20d)

![alt text](https://cdn-images-1.medium.com/max/800/1*_HLG8KlGJFZxtWoB8J1kFA.png)

In [1]:
import numpy as np
# np.random.seed(0)  # uncomment for reproducible initial weights

In [2]:
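def sigmoid(x):
    return 1 / (1 + np.exp(-x))

def sigmoid_derivative(x):
    # Expects x to already be a sigmoid output, i.e. x = sigmoid(z),
    # so this returns sigmoid'(z) = sigmoid(z) * (1 - sigmoid(z)).
    return x * (1 - x)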
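
Note that `sigmoid_derivative` is applied to values that have already passed through `sigmoid`, which is how the training loop calls it. A minimal sketch of a sanity check, assuming a central finite difference with step `h` (the names `h`, `numeric`, `analytic`, and `max_abs_diff` are illustrative and not part of the notebook):

```python
import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

def sigmoid_derivative(s):
    # s is assumed to be a sigmoid output, s = sigmoid(x)
    return s * (1 - s)

x = np.linspace(-4.0, 4.0, 9)
s = sigmoid(x)

# Central finite difference approximation of d/dx sigmoid(x)
h = 1e-5
numeric = (sigmoid(x + h) - sigmoid(x - h)) / (2 * h)

# Analytic derivative, computed from the sigmoid output rather than from x
analytic = sigmoid_derivative(s)

max_abs_diff = np.max(np.abs(numeric - analytic))
print("largest difference:", max_abs_diff)
```

Passing the raw input `x` instead of `sigmoid(x)` would give a different (and wrong) result, so the calling convention matters.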

![Truth table](https://cdn-images-1.medium.com/max/800/1*01idVj7sVw2ZnGZFapvW4A.png)

In [7]:
# Input dataset
# We initialize our inputs and expected outputs as per the truth table of XOR.
inputs = np.array([[0,0],[0,1],[1,0],[1,1]])
expected_output = np.array([[0],[1],[1],[0]])

In [4]:
epochs = 10000
lr = 0.1
inputLayerNeurons, hiddenLayerNeurons, outputLayerNeurons = 2,2,1

In [5]:
# Random weights and bias initialization
hidden_weights = np.random.uniform(size=(inputLayerNeurons, hiddenLayerNeurons))
hidden_bias = np.random.uniform(size=(1, hiddenLayerNeurons))
output_weights = np.random.uniform(size=(hiddenLayerNeurons, outputLayerNeurons))
output_bias = np.random.uniform(size=(1, outputLayerNeurons))

print("Initial hidden weights: ",end='')
print(*hidden_weights)
print("Initial hidden biases: ",end='')
print(*hidden_bias)
print("Initial output weights: ",end='')
print(*output_weights)
print("Initial output biases: ",end='')
print(*output_bias)

Initial hidden weights: [0.52399551 0.12936348] [0.37207811 0.09731689]
Initial hidden biases: [0.76282442 0.91433687]
Initial output weights: [0.31447765] [0.90309069]
Initial output biases: [0.46824842]


![alt text](https://cdn-images-1.medium.com/max/800/1*qXt_iBvWods-FOvTldxYFw.png)

In [6]:
#Training algorithm
for _ in range(epochs):
	#Forward Propagation
	hidden_layer_activation = np.dot(inputs,hidden_weights)
	hidden_layer_activation += hidden_bias
	hidden_layer_output = sigmoid(hidden_layer_activation)

	output_layer_activation = np.dot(hidden_layer_output,output_weights)
	output_layer_activation += output_bias
	predicted_output = sigmoid(output_layer_activation)

	#Backpropagation
	# error is (target - prediction), so the "+=" updates below perform gradient descent on the squared error
	error = expected_output - predicted_output
	d_predicted_output = error * sigmoid_derivative(predicted_output)

	error_hidden_layer = d_predicted_output.dot(output_weights.T)
	d_hidden_layer = error_hidden_layer * sigmoid_derivative(hidden_layer_output)

	#Updating Weights and Biases
	output_weights += hidden_layer_output.T.dot(d_predicted_output) * lr
	output_bias += np.sum(d_predicted_output,axis=0,keepdims=True) * lr
	hidden_weights += inputs.T.dot(d_hidden_layer) * lr
	hidden_bias += np.sum(d_hidden_layer,axis=0,keepdims=True) * lr

print("Final hidden weights: ",end='')
print(*hidden_weights)
print("Final hidden bias: ",end='')
print(*hidden_bias)
print("Final output weights: ",end='')
print(*output_weights)
print("Final output bias: ",end='')
print(*output_bias)

print(f"\nOutput from neural network after {epochs:,} epochs: ",end='')
print(*predicted_output)

Final hidden weights: [5.45203876 2.9984076 ] [5.45119708 2.99828539]
Final hidden bias: [-2.02338284 -4.52453236]
Final output weights: [6.31321859] [-6.68334086]
Final output bias: [-2.8176994]

Output from neural network after 10,000 epochs: [0.10415523] [0.89123295] [0.89123701] [0.12553166]
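
The printed predictions are continuous values between 0 and 1 rather than exact bits. A small sketch of how they could be mapped to binary outputs, assuming a decision threshold of 0.5 (the threshold and the name `binary_output` are assumptions for illustration, not part of the notebook); the prediction values are copied from the output above:

```python
import numpy as np

# Predictions copied from the notebook output above
predicted_output = np.array([[0.10415523], [0.89123295], [0.89123701], [0.12553166]])
expected_output = np.array([[0], [1], [1], [0]])

# Assumed decision rule: values above 0.5 count as 1, everything else as 0
binary_output = (predicted_output > 0.5).astype(int)

print(binary_output.ravel())                           # → [0 1 1 0]
print(np.array_equal(binary_output, expected_output))  # → True
```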


**Hence, the neural network's predictions are close to the expected output:**
[0] [1] [1] [0]. Each prediction lies within about 0.13 of its target. The epoch vs error graph shows how the error is minimized during training.

![Epoch vs error graph](https://cdn-images-1.medium.com/max/800/1*lZ0aYuPWcDaGUgbgSFE90A.png)
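
The training cell does not record the error, so a curve like this cannot be reproduced from it directly. Below is a rough sketch of one way to collect an error value per epoch. It assumes the mean squared error as the metric and a seeded generator for repeatability; both are assumptions, as are the names `rng` and `mse_history`. The update rule is the same as in the training cell.

```python
import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

def sigmoid_derivative(s):
    # s is a sigmoid output, matching how the notebook calls this function
    return s * (1 - s)

# Assumption: a seeded generator, used only so this sketch is repeatable
rng = np.random.default_rng(0)

inputs = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
expected_output = np.array([[0], [1], [1], [0]])
epochs, lr = 10000, 0.1

hidden_weights = rng.uniform(size=(2, 2))
hidden_bias = rng.uniform(size=(1, 2))
output_weights = rng.uniform(size=(2, 1))
output_bias = rng.uniform(size=(1, 1))

mse_history = []  # hypothetical name: one mean squared error value per epoch
for _ in range(epochs):
    # Forward pass
    hidden_layer_output = sigmoid(inputs @ hidden_weights + hidden_bias)
    predicted_output = sigmoid(hidden_layer_output @ output_weights + output_bias)

    # Record the error measured before this epoch's update
    error = expected_output - predicted_output
    mse_history.append(float(np.mean(error ** 2)))

    # Backward pass and updates, same rule as the training cell
    d_predicted_output = error * sigmoid_derivative(predicted_output)
    d_hidden_layer = (d_predicted_output @ output_weights.T) * sigmoid_derivative(hidden_layer_output)
    output_weights += hidden_layer_output.T @ d_predicted_output * lr
    output_bias += np.sum(d_predicted_output, axis=0, keepdims=True) * lr
    hidden_weights += inputs.T @ d_hidden_layer * lr
    hidden_bias += np.sum(d_hidden_layer, axis=0, keepdims=True) * lr

print("first epoch MSE:", mse_history[0])
print("last epoch MSE:", mse_history[-1])
```

Plotting `mse_history` against the epoch index with any plotting library should give a comparable curve, though the exact shape depends on the initial weights.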