# Exercise Sheet 4: Machine Learning Fundamentals & Linear Regression (Deadline: 01 Dec 23:59)

# ML Fundamentals(7 points)
For theoretical tasks you are encouraged to write in $\\LaTeX$. Jupyter notebooks support them by default. For reference, please have a look at the examples in this short excellent guide: [Typesetting Equations](http://nbviewer.jupyter.org/github/ipython/ipython/blob/3.x/examples/Notebook/Typesetting%20Equations.ipynb)

Alternatively, you can upload the solutions in the written form as images and paste them inside the cells. But if you do this, **make sure** that the images are of high quality, so that we can read them without any problems.

###### 1. Sigmoid Function (1.5 points)
The special case of the logistic function is the *sigmoid function* which is defined as:

\begin{equation*}
  \sigma(a) = \frac{1}{1 + e^{-a}}
\end{equation*}

a) Compute its gradient analytically. (0.5 points)

### a)
$\frac{\partial}{\partial a} \sigma(a) = \frac{\partial}{\partial a} \frac{1}{1 + e^{-a}} = \frac{e^{-a}}{(1 + e^{-a})^2} = 1 + \frac{1 + e^{-a} -1}{(1 + e^{-a})^2} = \frac{1 + e^{-a}}{(1 + e^{-a})^2} + \frac{-1}{(1 + e^{-a})^2} = \frac{1}{1 + e^{-a}} - (\frac{1}{1 + e^{-a}})^2 = \sigma(a) - \sigma(a)^2 = \sigma(a)(1 - \sigma(a)) $

b) What are the inherent properties that you observe from the above computed gradient? (0.5 points) <br />
   *Hint: Think about how would the gradient signal be for the whole domain of the sigmoid function*

c) Prove that the sigmoid function is symmetric. (0.5 points)

###### 2. Regularization (3.5 points)

In the lecture, we've seen that we can add a *regularizer* to our cost function to avoid *over or underfitting*. For example, consider the following training criterion for linear regression:

\begin{equation*}
  J(\textbf{w}) = \frac{1}{m}\sum_{i=1}^{m} \Vert\hat{y}^{(i)} - y^{(i)}\Vert^{2} + \lambda\Omega(\textbf{w})
\end{equation*}
where $\Omega(\textbf{w}) = \textbf{w}^{T}\textbf{w}$ is the regularizer.

a) In the above criterion, what is the role of the regularization parameter $\lambda$ on the regularizer (i.e. parameters of our model) while minimizing $J(\textbf{w})$? (1.0 point)

b) Is $\lambda$ the model parameter or a hyperparameter? Justify.(0.5 points)

c) Derive the closed form solution for the weights ($\textbf{w}$) in the above criterion.(2.0 points)

###### 3. Maximum Likelihood Estimation (MLE) (2 points)
Consider the density function of a ***univariate Gaussian distribution***


\begin{equation*}
 p(x;\mu,\sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}}exp\left(-\frac{1}{2\sigma^2}(x-\mu)^{2}\right)
\end{equation*}
where $\mu$ is the $\textit{mean}$ and $\sigma^{2}$ is the $\textit{variance}$. 

Let's say you're given *N* samples (i.e. $x_1, x_2, x_3, ..., x_N$) which are drawn from the above stated distribution. Also, you can assume that these samples are **i.i.d** (i.e. [independent and identically distributed](https://en.wikipedia.org/wiki/Independent_and_identically_distributed_random_variables)).

Now, please derive the *MLE step-by-step* for:

a) *mean* $(\mu)$. (1.0 point)

b) *variance* $(\sigma^2)$. (1.0 point)

# Multiple Linear Regression (13 points)

#### 1. Introduction
As we have seen in first assignment sheet, when we have one independent (or explanatory) variable and a scalar dependent variable, it is called **simple linear regression**.
But, when there are more than one explanatory variable (i.e. $x^{(1)}, x^{(2)}, ...,x^{(k)}$), and a single scalar dependent variable (*y*), then it's called $\textit{multiple linear regression}$. (Please don't confuse this with *multivariate linear regression* where we predict more than one (correlated) dependent variable.)

Here, we will implement a **multiple linear regression** model in Python/NumPy using the *Gradient Descent* algorithm. Particularly, we will be using $\textit{stochastic gradient descent}$ (*SGD*) where one performs the update step using a small set of training samples of size *batch_size* which we will set to 64. This is again a hyperparameter but in this exercise we will just use a fixed batch-size of *64* (i.e. we go through the training samples sampling 64 at a time and perform gradient descent.) Such a procedure is sometimes called *mini-batch gradient descent* in the deep learning community.

Going through all the training samples *once* is called an **epoch**. Ideally, the algorithm has to go through multiple epochs over the training samples, each time shuffling it, until a convergence criterion has been satisfied. <br />

Here, we will set a *tolerance value* for the difference in error (i.e. change in MSE values between subsequent epochs) that we will accept. Once this difference falls below the *tolerance value*, we terminate our training phase and return the parameters. 

We repeat the above training procedure for all possible hyperparameter combinations. Later on, using these parameters (*i.e. weight vectors*), we compute the prediction for validation data and the corresponding MSE values. And then, we pick the hyperparameter combination which yielded the least MSE.

As a next step, we will combine training data and validation data and make it as our *new training data*. We keep the test data as it is. Using the hyperparameter combination (for the least MSE) that we found above, we train the model again with the *new training data* and obtain the parameter (*i.e. weight vector*) after convergence according to our *tolerance value*.

Phew! That will be our much desired *weight vector*. This is then used on the *test data*, which has not been seen by our algorithm so far, to make a prediction. The resulting MSE value will be the so-called [*generalization error*](https://en.wikipedia.org/wiki/Generalization_error).

It is this *generalization error* that we want it to be as low as possible for *unseen data* (implies that we can achieve higher accuracy).

#### 2. Dataset
For our task, we will be using the *Wine Quality* dataset and predict the quality of white wine based on 11 features such as acidity, citric acid content, residual sugar etc. .

In [1]:
%matplotlib inline
import itertools
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# get data
data_url = 'http://mlr.cs.umass.edu/ml/machine-learning-databases/wine-quality/winequality-white.csv'
data = pd.read_csv(data_url, sep=';')

# inspect data
#print(data.head())
#print(data.shape)

# data as np array
data_npr = data.values
#print (data_npr.shape)

#### 3. Loss function
We will use a *regularized* form of the MSE loss function. In matrix form it can be written as follows:

\begin{equation*}
    J(\textbf{w}) = \frac{1}{2} \Vert{X\textbf{w}-\textbf{y}}\Vert^{2} + \frac{\lambda}{2}\Vert{\textbf{w}}\Vert^{2}
\end{equation*}

It's important to note that, in the above equation, $X$, called *design matrix*, is the horizontal concatenation of shape *(batch_size, num_features)* according to the *order* of the polynomial. To make things easier, you can add the *bias* term as the first column of $X$. Take care to have the *weight* vector $\textbf{w}$ with matching dimensions.

$\textit{Hint}$: see [Design_matrix#Multiple_regression](https://en.wikipedia.org/wiki/Design_matrix#Multiple_regression) for how $X$ with 2 features looks like for $1^{st}$ degree polynomial.

a) Derive the gradient (w.r.t $\textbf{w}$) for the regularized loss function given in **3**. (1.0 point)

### Solution

\begin{equation*}
   \frac{\partial}{\partial w} J(\textbf{w}) = \Vert{X\textbf{w}-\textbf{y}}\Vert * X + \lambda \Vert{\textbf{w}}\Vert
\end{equation*}


#### 4. Matrix format for higher order polynomial

Written in matrix form, linear regression model for second order would look like: <br />
$$\hat{\textbf{y}} = X\textbf{w}_{1} + X^{2}\textbf{w}_{2} + \textbf{b}$$

where $X^{2}$ is the element-wise squaring of the original design matrix $X$, $\textbf{w}_1$ and $\textbf{w}_2$ are the *weight* vectors, and **b** is the *bias* vector.

a) Now, please write down the matrix format for a $9^{th}$ order linear regression model (0.5 points)

### Solution 
$$\hat{\textbf{y}} = X\textbf{w}_{1} + X^{2}\textbf{w}_{2} + X^{3}\textbf{w}_{3} + X^{4}\textbf{w}_{4} + X^{5}\textbf{w}_{5} + X^{6}\textbf{w}_{6} + X^{7}\textbf{w}_{7} + X^{8}\textbf{w}_{8} + X^{9}\textbf{w}_{9} + \textbf{b}$$

#### 5. Hyperparameters
we will experiment with three hyperparameters:

i) regularization parameter $\lambda$ <br />
ii) learning rate $\epsilon$ <br />
iii) order of polynomial *p*

And do a grid search over the values that these hyperparameters can take in order to select the best combination (i.e. the one that achieves lowest test error). This approach is called **hyperparameter optimization or tuning**.

In [2]:
polynomial_order = [1, 5, 9]
learning_rates = [1e-5, 1e-8]
lambdas = [0.1, 0.8]

#hyperparams combination
comb_gen = itertools.product(*(polynomial_order, learning_rates, lambdas))
hparams_comb = list(comb_gen)
#print (hparams_comb.shape)
#print (hparams_comb)
batch_size = 64

#### 6. Normalization
First of all, inspect the data, and understand its structure and features. Ideally, before starting to train our learning algorithm, we would want the data to be normalized. Here, we normalize the data (i.e. normalize each column) using the formula:

\begin{equation*}
  norm\_x_i = \frac{x_i - min(x)}{max(x) - min(x)}
\end{equation*}
where $x_i$ is the $i^{th}$ sample in feature $x$

a) Complete the following function which performs normalization (i.e. normalizes columns of $X$). (0.5 points)

In [3]:
def data_normalization(data):
    # TODO: implement
    #print ("data shape inside data_normalization method")
    #print (data.shape)
    
    data_normalized = []
    for column in data:
        data_normalized_column = (column - min(column)) / (max(column) - min(column))
        data_normalized.append(data_normalized_column)
    #data_normalized = (data - min(data)) / (max(data) - min(data))
    #print (type(data_normalized))
    return np.asarray(data_normalized)

#print ("data_npr before normalization shape")
#print (data_npr.shape)
# perform data normalization
data_normalized = data_normalization(data_npr)
data_npr = data_normalized
#print (data_npr)
#print ("data_npr shape")
#print (data_npr.shape)

In [4]:
def split_data(data_npr):
    # (in-place) shuffling of data_npr along axis 0
    np.random.shuffle(data_npr)

    n_tr = 3898
    n_va = n_tr + 500
    n_te = n_va + 500
    
    X_train = data_npr[0:n_tr, 0:-1]
    Y_train = data_npr[0:n_tr, -1]
    
    X_val = data_npr[n_tr:n_va, 0:-1]
    Y_val = data_npr[n_tr:n_va, -1]
    
    X_test = data_npr[n_va:, 0:-1]
    Y_test = data_npr[n_va:, -1]
    
    return [(X_train, Y_train), (X_val, Y_val), (X_test, Y_test)]


# shuffle only the training data along axis 0
def shuffle_train_data(X_train, Y_train):
    """called after each epoch"""
    perm = np.random.permutation(len(Y_train))
    Xtr_shuf = X_train[perm]
    Ytr_shuf = Y_train[perm]
    
    return Xtr_shuf, Ytr_shuf

###### 7. Implementation of required functions

Complete the following function which computes the MSE value. (0.5 point) <br />
(i.e. just a vanilla version of it.) That is, you can ignore the regularization term and also the constants $\frac{1}{2}$

In [5]:
def compute_mse(prediction, ground_truth):
    # TODO: implement
    mse = np.sum((prediction - ground_truth)**2) / prediction.size
    return mse

Implement a function which computes the prediction of your model. (0.5 point)

In [6]:
def get_prediction(X, W):
    # TODO: implement
    #print ("shape of training data")
    #print (X.shape)
    #print ("shape of parameters W")
    #print (W.shape)
    Yhat = X.dot(W).flatten()
    return Yhat

Implement a function which computes the gradient of your loss function. (1.0 point) <br />
*Hint: Just implementing the gradient computed in **3.** (a)*

In [7]:
def compute_gradient(X, Y, Yhat, W, lambda_):
    # TODO: implement
    error = Y - Yhat
    gradient = np.sum(error.dot(X)) - ((lambda_) * np.sum(W))
    return gradient

Implement a function which performs a single update step of SGD. (0.5 point)

In [8]:
# Hint: avoid in-place modification
def sgd(gradient, lr, cur_W):
    # TODO: implement
    new_W = cur_W - (lr * gradient)
    return new_W

Complete the following function which reformats your data as a design matrix. (0.5 point)

In [9]:
# concatenate X acc. to order of polynomial; likewise do it for W
# where X is design matrix, W is the corresponding weight vector
# [1 X X^2 X^3], [1 W1 W2 W3].T
def prepare_data_matrix(X, W, order):
    # TODO: implement
    X_mat = np.ones((X.shape[0], 1))
    
    W_vec = [1]
    for i in range(order):
        X_mat = np.append(X_mat, np.power(X, order + 1), axis=1)
        W_vec = np.append(W_vec, W, axis=0)
    print (X_mat.shape)
    print (W_vec.shape)
    return X_mat, W_vec


###### 8. Training
Complete the code in the following cell such that it performs **mini-batch gradient descent** on the training data for all possible hyperparameter combinations. (4.0 points)

Note: You can also define a function, named appropriately, which performs training. But, take care to do correct bookkeeping of hyperparameter combinations, weight vectors, and the MSE values.

In [None]:
splits = split_data(data_npr)
X_train, Y_train, X_val, Y_val, X_test, Y_test = itertools.chain(*splits)

tolerance = 1e-3
## different tolerance values. check how many epochs for each tolerance value
#tolerance = 1e-4
#tolerance = 1e-5
#tolerance = 1e-6
start = 1

# initialize weight vector from normal distribution
# TODO: implement
w_shape = X_train.shape[1]
W_init = np.random.randn(w_shape)
# cache weights for each hyperparam combination
# TODO: implement
weights_hist = {key: None for key in hparams_comb}
#print (hparams_comb)
# keep track of MSE for each hparam combination. will be useful for plotting
# TODO: implement
mse_hist = {key: [] for key in hparams_comb}

# find optimal hyperparameters
for order in polynomial_order:
    for lr in learning_rates:
        for lamb in lambdas:
            # initialize necessary stuffs
            # TODO: implement
            #iterations = 0
            key = order, lr, lamb
            # design matrix needed at this point
            # use the function that we defined above
            # TODO: implement
            X_mat, W_vec = prepare_data_matrix(X_train, W_init, order)
            epochs = 1
            # goes through multiple epochs
            while True:
                # good idea to shuffle the train data
                # TODO: implement
                Xtr_shuf, Ytr_shuf = shuffle_train_data(X_mat, Y_train)
                # some more initialization
                # TODO: implement
                bs = 0
                nsamples = data_npr.shape[0]
                # goes through 1 epoch
                while bs < nsamples:
                    # complete code for 1 epoch
                    # TODO: implement
                    tx = Xtr_shuf[bs : bs+batch_size]
                    ty = Ytr_shuf[bs : bs+batch_size]
                    prediction = get_prediction(tx, W_vec)
                    gradient = compute_gradient(tx, ty, prediction, W_vec, lamb)
                    
                    error = compute_mse(prediction, ty)
                    W_vec = sgd(gradient, lr, W_vec)
                    bs += batch_size
                
                # after each epoch
                # get prediction for whole X_train
                # compute the MSE
                # might need to do bookkeeping of mse values as well


                # stopping/convergence criterion
                # check whether diff-in-mse < tolerance
                # TODO: implement
                
                prediction = get_prediction(X_mat, W_vec)
                new_error = compute_mse(prediction, Y_train)
                #new_error = get_gradient(W, X_train, Y_train)[1]
                print ("Epoch: %d - Error: %.4f" %(epochs, new_error))
    
                # Stopping Condition
                if abs(new_error - error) < tolerance:
                    print ("Converged.")
                    break
                    # cache weight vector for later use
                    # but we also need the hparam combination
                    # TODO: implement

                    # print("order: {} , learning rate: {} , regularizer: {} ".format(order, lr, lamb))
                    # print("Convergence after epoch {} with MSE {}".format(epochs, ...), "\n")
                    break
                epochs += 1

(3898, 12)
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data


  This is separate from the ipykernel package so we can avoid doing imports until



(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sh

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of 

Epoch: 35 - Error: 385912.3392
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of 

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of trainin

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 67 - Error: 22243791759.2702
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shap

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 90 - Error: 58746620327322.7969
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of 

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 98 - Error: 910259418525567.2500
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shap

(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 106 - Error: 14104168188456440.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of tra

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of trainin

shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 146 - Error: 12596764578625716486144.0000
shape of training data
(64, 12)
sha

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 170 - Error: 46860390153042450173657088.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 203 - Error: 3804589937131827660756659208192.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of trainin

(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameter

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 226 - Error: 10048053360075178663763104917094400.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sha

shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 234 - Error: 155691261101817643655168284038266880.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 266 - Error: 8974152883137576605939450332873238773760.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shap

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape 

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape o

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(

shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 323 - Error: 2710456169097464750871906707965897215951805349888.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shap

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameter

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of trainin

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of pa

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of train

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
sha

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 481 - Error: 869246191530822844899286054274613011788936272014089891322961503350423552.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape 

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of train

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of p

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of 

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 544 - Error: 2050351862025372893946289424781137518840824789119387041595111782

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shap

shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 560 - Error: 492258249638447717955945856884892085401793734228176150298639485784888588481807253504.0000
sh

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of 

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 639 - Error: 278768193711744825292523982117845754574561953861849633823302859597722721490062755446416235036672.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of p

(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 647 - Error: 4319420899368388302985498355547113814669035503803487742718187596075245794902515333318598021087232.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training

shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 663 - Error: 1037027111126785918916810675920656657063850223288695354799841161604767129698054344967654080839155712.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of trainin

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of trai

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training 

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 695 - Error: 59774967285734283675144042133506160406987799219233286854696742896383736991887516857628794716443182628864.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 702 - Error: 657550125445545192459155059014143971803121757944658741747585005005438358402043861250802364603330988081152.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data

(3898, 12)
shape of parameters W
(12,)
Epoch: 710 - Error: 10188521383461574946273592587217366066001160897371791544590699789084083044451736710212352891577895183450112.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of paramet

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data


shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape 

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of param

(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 767 - Error: 30772304728446817992972069854827520917998970790569428338

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0,

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 870 - Error: 6482717962813161184880793967337895074411213155367066990046634154454305623317158530519349607775724634391714270322182389405900079104.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shap

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of trainin

shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 894 - Error: 24115935505526096612888696740680636730194067957728236849862456546271538583054800261373044016677664692173753773021225187530348802605056.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
s

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 950 - Error: 5171067399795949816534084517897143113505022617011274365384835767311863539873158550644181685689345201954002288850331767083686688679728364126208.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of par

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 998 - Error: 71560613469749337300751429554206769674545712072954999116001024708696089128538313529861715112458548612604103176076563652360890682658500067748227317760.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
sh

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 1022 - Error: 266207919457978241743972849677761999615666737237040746364334855946242328583237657769090260761453728077589117594621023057567633219500050386472368684924928.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
s

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of trainin

(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
sha

shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data


shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 1062 - Error: 237756554645075199347704166231241820792514895198523711372660651206936801564872872195075486075497597683011098199890997633180106874322709397053408126170180878336.0000
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of pa

(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 1070 - Error: 368395919822498578174355522379175312414080162944371104318891265465353326381369249547292062303769617024271590212450485091277279882294139519146120

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(58, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of para

shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of training data
(64, 12)
shape of parameters W
(12,)
shape of tra

shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(0, 12)
shape of parameters W
(12,)
shape of training data
(3898, 12)
shape of parameters W
(12,)
Epoch: 1102 - Error: 212345964237860877022767144888757455721724540301591850871018760494156733964972755523054335340455614722619805423164458553624757277088944769844479350908419487104499712.0000
shape of training data
(64, 12)
shape 

Complete the following function which selects the best hyperparameter combination (i.e. the one that gives lowest MSE on **validation data**). (0.5 point)

In [None]:
# find hparams of minimum MSE on Validation data
def find_best_hparams(weights_hist):
    # TODO: implement
    hpm_best, mse_best = None
    return hpm_best, mse_best

best_hpm_combination = find_best_hparams(weights_hist)

###### 9. Re-Training on Train + Validation data
Complete the following function which does re-training on the combined training and validation data. (**1 point**)

In [None]:
# re-run the training on X_train + X_val combined
# Later test it on X_test; That will be our best possible MSE on test data
# this will be more or less the same training code as you did above
# but, here we just have only one value for each hyperparameter.

# TODO: implement

In [None]:
# plot the convergence of MSE values using matplotlib
# i.e. #epochs on X-axis and MSE values on Y-axis
# TODO: implement

###### 10. Evaluation on Test set
Evaluate your model on test data. (1.0 point)

**Please note that you should keep X_test undisturbed throughout this whole phase.** Else restart the kernel and start from beginning. The whole point of this exercise would not make sense if test data has been *seen in training*.

In [None]:
# finally!!!
# test it on X_test with the Weight vector that you found above
# this will be the generalization error of our model!!
# TODO: implement

#print("Finally!!! MSE achieved on X_test is : {}".format(round(mse_test, 6)))

###### 11. Results
Please report the following

a) MSE value on Test data. (0.5 points)

b) Which hyperparameter combination turned out to be the best? In your understanding, why do you think such a combination turned out to be the best for this task? (1.0 point)

# Bonus (2 points)

Now, please repeat the whole *training, validation, re-training, and testing* procedure that we talked about above with the following hyperparameter combination:

In [None]:
polynomial_order = [1]
learning_rates = [0.1]
lambdas = [0.1]

What are your observations during the training phase? Please explain why such a behaviour happened.

---

## Submission instructions
You should provide a single Jupyter notebook as the solution. The naming should include the assignment number and matriculation IDs of all members in your team in the following format:
**assignment-4_matriculation1_matriculation2_matriculation3.ipynb** (in case of 3 members in a team). 
Make sure to keep the order matriculation1_matriculation2_matriculation3 the same for all assignments.

Please submit the solution to your tutor (with **[NNIA][assignment-4]** in email subject):
1. Maksym Andriushchenko <s8mmandr@stud.uni-saarland.de>
2. Marius Mosbach <s9msmosb@stud.uni-saarland.de>
3. Rajarshi Biswas <rbisw17@gmail.com>
4. Marimuthu Kalimuthu <s8makali@stud.uni-saarland.de>

Note: **If you are in a team, please submit only 1 solution to only 1 tutor.**