**Objective**:
Write a program to implement a multi-layer perceptron (MLP) network with one hidden layer
using NumPy in Python. Demonstrate that it can learn the XOR Boolean function.

**Model Description**
Input Layer → 2 neurons (XOR inputs).
Hidden Layer → 4 perceptrons with a step function activation.
Output Layer → 1 perceptron with a step function activation.
Learning Rate → 0.1
Epochs → 100 (reduced for step-by-step tracking)
Loss Calculation → Mean Squared Error (MSE).
Evaluation Metrics → Accuracy.
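
Before training, it can help to confirm that a network with step activations is able to represent XOR at all. The sketch below hand-picks weights so that one hidden unit acts as OR, another as AND, and the output fires when OR is on and AND is off. It uses only two hidden units instead of four, and the weight values and the names `W1_manual`, `b1_manual`, `W2_manual`, and `b2_manual` are illustrative assumptions, not weights produced by the training code.

```python
import numpy as np

def step(x):
    """Step activation: 1 where x >= 0, else 0."""
    return np.where(x >= 0, 1, 0)

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])

# Hand-picked (hypothetical) weights, not learned ones.
# Hidden unit 0 computes OR, hidden unit 1 computes AND.
W1_manual = np.array([[1.0, 1.0],
                      [1.0, 1.0]])
b1_manual = np.array([-0.5, -1.5])

# The output fires when OR is on and AND is off, which is XOR.
W2_manual = np.array([[1.0], [-1.0]])
b2_manual = np.array([-0.5])

hidden = step(X @ W1_manual + b1_manual)
xor_out = step(hidden @ W2_manual + b2_manual)
print(xor_out.ravel())  # [0 1 1 0]
```

Since a solution exists with two hidden units, the four hidden units used here leave spare capacity; whether training finds a solution is a separate question.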

In [1]:
import numpy as np

In [2]:
class MLP:
    def __init__(self, input_size, hidden_size, output_size, learning_rate=0.1, epochs=100):
        # Weights start as standard-normal samples; biases start at zero
        self.W1 = np.random.randn(input_size, hidden_size)
        self.b1 = np.zeros(hidden_size)
        self.W2 = np.random.randn(hidden_size, output_size)
        self.b2 = np.zeros(output_size)
        self.lr = learning_rate
        self.epochs = epochs

    def step_function(self, x):
        """Step activation function"""
        return np.where(x >= 0, 1, 0)

    def forward(self, X):
        """Forward pass"""
        self.z1 = np.dot(X, self.W1) + self.b1
        self.a1 = self.step_function(self.z1)
        self.z2 = np.dot(self.a1, self.W2) + self.b2
        self.a2 = self.step_function(self.z2)
        return self.a2

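    def backward(self, X, y, output):
        """Heuristic backward pass: perceptron-style updates, no activation derivative"""
        error = y - output  # Per-sample output error, each entry in {-1, 0, 1}

        # Output layer: perceptron learning rule on the hidden activations
        self.W2 += self.lr * np.dot(self.a1.T, error)
        self.b2 += self.lr * np.sum(error, axis=0)

        # Hidden layer: project the output error back through W2
        # (W2 has already been updated at this point)
        hidden_error = np.dot(error, self.W2.T)
        self.W1 += self.lr * np.dot(X.T, hidden_error)
        self.b1 += self.lr * np.sum(hidden_error, axis=0)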

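    def train(self, X, y):
        """Train the MLP with full-batch updates, logging loss and accuracy per epoch"""
        for epoch in range(self.epochs):
            output = self.forward(X)
            self.backward(X, y, output)
            # Loss uses the pre-update output; accuracy runs a new forward pass
            # after the update, so the two can disagree within one epoch.
            loss = np.mean((y - output) ** 2)  # Mean Squared Error
            acc = self.accuracy(X, y)
            print(f"Epoch {epoch + 1}/{self.epochs}, Loss: {loss:.4f}, Accuracy: {acc:.2f}%")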

    def predict(self, X):
        """Make predictions"""
        return self.forward(X)

    def accuracy(self, X, y):
        """Calculate accuracy"""
        predictions = self.predict(X)
        correct = np.sum(predictions == y)
        return correct / len(y) * 100  # Accuracy in percentage

In [3]:
# XOR Dataset
X_xor = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y_xor = np.array([[0], [1], [1], [0]])  # XOR Truth Table

In [5]:
# Train MLP
mlp = MLP(input_size=2, hidden_size=4, output_size=1, learning_rate=0.1, epochs=100)
mlp.train(X_xor, y_xor)

Epoch 1/100, Loss: 0.5000, Accuracy: 75.00%
Epoch 2/100, Loss: 0.2500, Accuracy: 50.00%
Epoch 3/100, Loss: 0.5000, Accuracy: 75.00%
Epoch 4/100, Loss: 0.2500, Accuracy: 50.00%
Epoch 5/100, Loss: 0.5000, Accuracy: 75.00%
Epoch 6/100, Loss: 0.2500, Accuracy: 75.00%
Epoch 7/100, Loss: 0.2500, Accuracy: 75.00%
Epoch 8/100, Loss: 0.2500, Accuracy: 75.00%
Epoch 9/100, Loss: 0.2500, Accuracy: 50.00%
Epoch 10/100, Loss: 0.5000, Accuracy: 75.00%
Epoch 11/100, Loss: 0.2500, Accuracy: 75.00%
Epoch 12/100, Loss: 0.2500, Accuracy: 50.00%
Epoch 13/100, Loss: 0.5000, Accuracy: 75.00%
Epoch 14/100, Loss: 0.2500, Accuracy: 75.00%
Epoch 15/100, Loss: 0.2500, Accuracy: 50.00%
Epoch 16/100, Loss: 0.5000, Accuracy: 50.00%
Epoch 17/100, Loss: 0.5000, Accuracy: 50.00%
Epoch 18/100, Loss: 0.5000, Accuracy: 50.00%
Epoch 19/100, Loss: 0.5000, Accuracy: 75.00%
Epoch 20/100, Loss: 0.2500, Accuracy: 50.00%
Epoch 21/100, Loss: 0.5000, Accuracy: 50.00%
Epoch 22/100, Loss: 0.5000, Accuracy: 75.00%
Epoch 23/100, Loss:

In [6]:
# Test Predictions
print("\nXOR Predictions:")
for x in X_xor:
    print(f"Input: {x}, Output: {mlp.predict([x])[0]}")


XOR Predictions:
Input: [0 0], Output: [0]
Input: [0 1], Output: [1]
Input: [1 0], Output: [1]
Input: [1 1], Output: [0]


In [7]:
# Final Accuracy
accuracy = mlp.accuracy(X_xor, y_xor)
print(f"\nFinal Model Accuracy: {accuracy:.2f}%")


Final Model Accuracy: 100.00%


**Description of the Code**
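1. Initialization (__init__)
   Draws the hidden- and output-layer weights from a standard normal distribution and sets the biases to zero.
   Defaults to learning rate = 0.1 and epochs = 100.
2. Activation (step_function)
   Returns 1 where the input is >= 0 and 0 otherwise, so every neuron outputs a binary value.
3. Forward Propagation (forward)
   Computes the weighted sum and step activation for the hidden layer, then for the output layer.
4. Backward Propagation (backward)
   Applies the perceptron weight update rule to the output layer and passes the output error back through W2 to update the hidden layer.
   No activation derivative is used, so this is a heuristic rather than gradient-based backpropagation.
5. Training (train)
   Runs full-batch updates for the configured number of epochs (100 here), printing loss and accuracy for each epoch.
6. Prediction (predict)
   Runs a forward pass with the current weights to classify inputs.
7. Accuracy Calculation (accuracy)
   Returns the percentage of predictions that match the target values.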
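
Because predictions and targets are both 0 or 1, each squared error is also 0 or 1, so the MSE equals the fraction of misclassified samples and loss = 1 - accuracy/100 for the same set of predictions. The sketch below checks this on a made-up prediction vector (`y_pred` is hypothetical, not model output). In `train`, the logged loss comes from the output before the weight update while `accuracy` runs a new forward pass after it, which is why a line in the log can pair a loss of 0.5000 with an accuracy of 75.00%.

```python
import numpy as np

y_true = np.array([[0], [1], [1], [0]])  # XOR targets
y_pred = np.array([[0], [1], [0], [0]])  # hypothetical predictions, one of four wrong

mse = np.mean((y_true - y_pred) ** 2)
accuracy = np.sum(y_pred == y_true) / len(y_true) * 100

print(f"MSE: {mse:.4f}, Accuracy: {accuracy:.2f}%")  # MSE: 0.2500, Accuracy: 75.00%
```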

**Limitations**
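  The step function has zero gradient everywhere except at 0, where it is undefined, so gradient-based backpropagation cannot be applied and the weight update is only a heuristic.
  Convergence is not guaranteed: the visible part of the training log oscillates between 50% and 75% accuracy, and the final result depends on the random weight initialization because no seed is set.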
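
One common way around the step function's lack of a usable gradient is to train with a sigmoid activation and gradient-based backpropagation, then threshold the output at 0.5. The following is a minimal sketch of that alternative and is not part of the original program: the function name `train_sigmoid_mlp`, the seed, the learning rate of 1.0, and the 5000 epochs are all assumptions chosen for illustration, and whether the thresholded predictions match XOR still depends on the initialization.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_sigmoid_mlp(X, y, hidden_size=4, lr=1.0, epochs=5000, seed=0):
    """Full-batch gradient descent on MSE with sigmoid activations (illustrative)."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    W1 = rng.standard_normal((X.shape[1], hidden_size))
    b1 = np.zeros(hidden_size)
    W2 = rng.standard_normal((hidden_size, y.shape[1]))
    b2 = np.zeros(y.shape[1])
    losses = []

    for _ in range(epochs):
        # Forward pass
        a1 = sigmoid(X @ W1 + b1)
        a2 = sigmoid(a1 @ W2 + b2)
        losses.append(np.mean((y - a2) ** 2))

        # Backward pass: chain rule, using sigmoid'(z) = a * (1 - a).
        # The constant factor 2 from the MSE derivative is folded into lr.
        delta2 = (a2 - y) * a2 * (1 - a2) / n
        delta1 = (delta2 @ W2.T) * a1 * (1 - a1)

        # Gradient descent step
        W2 -= lr * (a1.T @ delta2)
        b2 -= lr * delta2.sum(axis=0)
        W1 -= lr * (X.T @ delta1)
        b1 -= lr * delta1.sum(axis=0)

    def predict(X_new):
        """Threshold the sigmoid output at 0.5 to get a 0/1 label."""
        hidden = sigmoid(X_new @ W1 + b1)
        return (sigmoid(hidden @ W2 + b2) >= 0.5).astype(int)

    return predict, losses

X_xor = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y_xor = np.array([[0], [1], [1], [0]])

predict, losses = train_sigmoid_mlp(X_xor, y_xor)
print(f"Initial loss: {losses[0]:.4f}, final loss: {losses[-1]:.4f}")
print("Predictions:", predict(X_xor).ravel())
```

Unlike the step-function version, the hidden-layer error here is computed before `W2` changes and is scaled by the activation derivative, so each update follows the gradient of the loss.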