## Softmax classifier approach
---
This approach assumes that quantifiers are learned as a group: each example of a quantifier q also serves as a negative example for every other quantifier q'.

In effect, the classifier solves for the quantifier q that makes the sentence "q As are Bs" most likely given an input scene s.

This enables us to label scenes not only with the quantifiers' own evaluation methods but also with a trained classifier, which yields a teacher-student scheme.
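The idea of "solving for the most likely q" can be sketched in plain NumPy. This is only an illustration: the logits are made-up numbers standing in for the output of a trained network, and the names `softmax`, `quantifier_names`, and `target` are hypothetical rather than taken from `quants` or `models`.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)

# Hypothetical class order and made-up network outputs for one scene s.
quantifier_names = ["All", "Both", "Most", "No", "Some", "The"]
logits = np.array([2.0, -1.0, 0.5, -3.0, 1.0, -0.5])

# P(q | s) for every quantifier q, and the most likely quantifier.
probs = softmax(logits)
best = quantifier_names[int(np.argmax(probs))]

# One-hot target: a scene labeled "Some" is a positive example for "Some"
# and, implicitly, a negative example for every other quantifier.
target = np.zeros(len(quantifier_names))
target[quantifier_names.index("Some")] = 1.0

# Categorical cross-entropy for this single example.
loss = -np.sum(target * np.log(probs))

print(best, loss)
```

Because the softmax probabilities sum to one, raising the probability of the labeled quantifier necessarily lowers the others, which is how a single positive label acts as a negative example for the rest of the group.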

## Imports

### my class imports

In [1]:
%load_ext autoreload
%autoreload 2

In [2]:
from quants import *
from models import Classifier

Using TensorFlow backend.


### Global imports

In [3]:
import numpy as np
import pandas as pd

### keras and TF imports

In [4]:
import tensorflow as tf

print("TensorFlow version: ", tf.__version__)

import keras

from keras.models import Sequential
from keras.layers import SimpleRNN, LSTM, Embedding, Dense, Conv1D, Input, Bidirectional, RepeatVector, Dropout, LeakyReLU, Flatten
from keras.preprocessing.text import one_hot
from keras.preprocessing.sequence import pad_sequences
from keras.optimizers import SGD, Adam

TensorFlow version:  2.2.0


In [5]:
gpu_options = tf.compat.v1.GPUOptions(per_process_gpu_memory_fraction=0.1)
sess = tf.compat.v1.Session(config=tf.compat.v1.ConfigProto(gpu_options=gpu_options))
print("Keras backend: ", tf.keras.backend.backend())
tf.compat.v1.keras.backend.set_session(sess)
tf.config.list_logical_devices()


Keras backend:  tensorflow


[LogicalDevice(name='/device:CPU:0', device_type='CPU'),
 LogicalDevice(name='/device:XLA_CPU:0', device_type='XLA_CPU')]

In [6]:
# from functools import partial, update_wrapper

# def wrapped_partial(func, *args, **kwargs):
#     partial_func = partial(func, *args, **kwargs)
#     update_wrapper(partial_func, func)
#     return partial_func

### Classifier models

In [7]:
# deep dense classifier model builder method
def DDNNBuilder(quantifiers):
    model= Sequential()
    model.add(Dense(scene_len, activation="relu", name="input"))
    model.add(Dropout(0.25, name="dropout_1"))
    model.add(Dense(100, activation="relu", name="dense_2"))
    model.add(Dropout(0.25, name="dropout_2"))
    model.add(Dense(50, activation="relu", name="dense_3"))
    model.add(Dropout(0.25, name="dropout_3"))
    model.add(Dense(len(quantifiers), activation='softmax', name="softmax_1"))
    # Compile model
    model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=[tf.keras.metrics.Precision(),
                                                                              tf.keras.metrics.Recall()])
    return model, False

In [8]:
# dense classifier model builder method
def DNNBuilder(quantifiers):
    model= Sequential()
    model.add(Dense(scene_len, activation="relu", name="input"))
    model.add(Dropout(0.5, name="dropout_1"))
    model.add(Dense(len(quantifiers), activation='softmax', name="softmax_1"))
    # Compile model
    model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=[tf.keras.metrics.Precision(),
                                                                              tf.keras.metrics.Recall()])
    return model, False

In [9]:
from tensorflow.keras import initializers

# Convolutional classifier model builder method
def CNNBuilder(quantifiers):
    model= Sequential()
    model.add(Conv1D(filters=2, kernel_size=1, 
                     use_bias=False, 
                     input_shape=(scene_len, len(symbols)), name="conv_1"))
    model.add(Dropout(0.5, name="dropout_1"))
    model.add(Flatten())
    model.add(Dense(len(quantifiers),
#                     kernel_initializer="constant", trainable=False, use_bias=False, 
                    activation='softmax', name="softmax_1"))
    # Compile model
    model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=[tf.keras.metrics.Precision(),
                                                                              tf.keras.metrics.Recall()])
    return model, True
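The `Conv1D` layer in `CNNBuilder` uses `kernel_size=1` and no bias, so it applies the same linear map to every scene position independently. The NumPy sketch below illustrates this. It assumes, since the notebook does not show it, that scenes are one-hot encoded over `symbols`. The sizes and the random kernel are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumptions, not the notebook's actual values).
n_symbols, n_filters, length = 4, 2, 6

# A scene as a sequence of symbol ids, then one-hot encoded: (length, n_symbols).
symbol_ids = rng.integers(0, n_symbols, size=length)
scene = np.eye(n_symbols)[symbol_ids]

# Stand-in for the Conv1D kernel, whose shape is (1, n_symbols, n_filters);
# with kernel_size=1 only the (n_symbols, n_filters) slice matters.
kernel = rng.normal(size=(n_symbols, n_filters))

# A width-1 convolution without bias is a per-position matrix product.
features = scene @ kernel  # (length, n_filters)

# With one-hot input this is just a lookup of one kernel row per symbol.
assert np.allclose(features, kernel[symbol_ids])

# Flatten() then concatenates the per-position features for the softmax layer.
flat = features.reshape(-1)
print(flat.shape)
```

Under this reading, the two filters act like a learned 2-dimensional embedding of each symbol, and only the final dense layer sees the positions jointly.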

## Quantifier sets for learning

In [10]:
natural_quantifiers = [The(), Both(), No(), All(), Some(), Most()]

In [11]:
unnatural_quantifiers = [MinMax(2, 10), MinMax(3, 6), Or([MinMax(2, 5), MinMax(10, 20)])]
# unnatural_quantifiers = [MinMax(2, 5), MinMax(8, 10), MinMax(12, 15), MinMax(17, 20), MinMax(24, 30), MinMax(37, 50)]

In [62]:
def teach(classifier, min_len=0, max_len=scene_len, repeat=1, epochs=50, batch_size=10):
    """
    Teach a classifier to classify its quantifiers.

    repeat: number of teacher-student rounds; after the first round, the previous round's model generates the labels
    epochs, batch_size: parameters passed to classifier.fit
    min_len, max_len: limits on the generated training scene length (to test generalization)
    """
    last_classifier = None
    with tf.device("/cpu:0"):
#     with tf.device("/gpu:0"):
        # iterate while using the previous model as label generator
        for _ in range(repeat):
            # generate fit and test model
            if last_classifier:
                train_scenes_labels = classifier.generate_labeled_scenes(last_classifier, min_len, max_len)
                test_scenes_labels = classifier.generate_labeled_scenes(last_classifier)
            else:
                train_scenes_labels = classifier.generate_labeled_scenes(min_len, max_len)
                test_scenes_labels = classifier.generate_labeled_scenes()
            classifier.fit(*train_scenes_labels, epochs=epochs, batch_size=batch_size)
            classifier.test(*test_scenes_labels)
            classifier.test_random(1000)
            last_classifier = classifier.clone()
        return classifier
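The teacher-student loop in `teach` can be illustrated with a toy, self-contained version. This is only a sketch under assumptions: `CentroidClassifier`, `ground_truth`, and `teach_rounds` are hypothetical stand-ins, not part of `quants` or `models`. Unlike `teach`, the sketch fits a fresh student each round instead of continuing to train the same model.

```python
import numpy as np

class CentroidClassifier:
    """Hypothetical stand-in for a trainable classifier (nearest class centroid)."""

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.centroids_ = np.stack([X[y == c].mean(axis=0) for c in self.classes_])
        return self

    def predict(self, X):
        # Squared distance from every sample to every centroid: (n_samples, n_classes).
        dists = ((X[:, None, :] - self.centroids_[None, :, :]) ** 2).sum(axis=2)
        return self.classes_[dists.argmin(axis=1)]

def ground_truth(X):
    """Toy labeling rule playing the role of the quantifiers' own evaluation."""
    return (X[:, 0] > 0).astype(int)

def teach_rounds(n_rounds, n_samples=500, seed=0):
    rng = np.random.default_rng(seed)
    teacher = None
    for _ in range(n_rounds):
        X = rng.normal(size=(n_samples, 2))
        # First round: true labels. Later rounds: labels from the previous model.
        y = ground_truth(X) if teacher is None else teacher.predict(X)
        student = CentroidClassifier().fit(X, y)
        teacher = student  # the student becomes the next round's teacher
    return teacher

final = teach_rounds(3)
X_test = np.random.default_rng(1).normal(size=(1000, 2))
accuracy = np.mean(final.predict(X_test) == ground_truth(X_test))
print(accuracy)
```

Comparing the final model against the original labeling rule, as in the last lines, is one way to see how much is lost when labels are passed down through successive rounds.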

In [63]:
natural_classifier = teach(Classifier(natural_quantifiers, CNNBuilder), epochs=50, max_len=100)
# natural_classifier = teach(Classifier(natural_quantifiers, DNNBuilder), epochs=500, repeat=3)

CNNBuilder model classifies ['All()' 'Both()' 'Most()' 'No()' 'Some()' 'The()']
Epoch 1/50
Epoch 2/50
Epoch 3/50
Epoch 4/50
Epoch 5/50
Epoch 6/50
Epoch 7/50
Epoch 8/50
Epoch 9/50
Epoch 10/50
Epoch 11/50
Epoch 12/50
Epoch 13/50
Epoch 14/50
Epoch 15/50
Epoch 16/50
Epoch 17/50
Epoch 18/50
Epoch 19/50
Epoch 20/50
Epoch 21/50
Epoch 22/50
Epoch 23/50
Epoch 24/50
Epoch 25/50
Epoch 26/50
Epoch 27/50
Epoch 28/50
Epoch 29/50
Epoch 30/50
Epoch 31/50
Epoch 32/50
Epoch 33/50
Epoch 34/50
Epoch 35/50
Epoch 36/50
Epoch 37/50
Epoch 38/50
Epoch 39/50
Epoch 40/50
Epoch 41/50
Epoch 42/50
Epoch 43/50
Epoch 44/50
Epoch 45/50
Epoch 46/50
Epoch 47/50
Epoch 48/50
Epoch 49/50
Epoch 50/50
Evaluation metrics: 
[1.1788522141774496, 0.7336052656173706, 0.6105473041534424]
Confusion matrix: 
[[808  53  76   0  27  36]
 [  0 188   0   0   0 812]
 [312  56 175   4 378  75]
 [  0  58   0 813   1 128]
 [298   5  83 146 459   9]
 [  0 150   0   0   0 850]]
Classification report: 
              precision    recall  f1-sco
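Per-class precision and recall can be recomputed directly from the confusion matrix above. The sketch below assumes that rows are true classes and columns are predicted classes, which is consistent with every row summing to 1000, and that the class order follows the list printed at the start of the output. Both are assumptions about the `Classifier.test` implementation, which is not shown here.

```python
import numpy as np

# Assumed class order, taken from the "CNNBuilder model classifies" line.
labels = ["All", "Both", "Most", "No", "Some", "The"]

# Confusion matrix copied from the output above (rows: true, columns: predicted).
cm = np.array([
    [808,  53,  76,   0,  27,  36],
    [  0, 188,   0,   0,   0, 812],
    [312,  56, 175,   4, 378,  75],
    [  0,  58,   0, 813,   1, 128],
    [298,   5,  83, 146, 459,   9],
    [  0, 150,   0,   0,   0, 850],
])

true_pos = np.diag(cm)
recall = true_pos / cm.sum(axis=1)     # fraction of each true class recovered
precision = true_pos / cm.sum(axis=0)  # fraction of each prediction that is right
f1 = 2 * precision * recall / (precision + recall)
accuracy = true_pos.sum() / cm.sum()

for name, p, r, f in zip(labels, precision, recall, f1):
    print(f"{name:>5}  precision={p:.4f}  recall={r:.4f}  f1={f:.4f}")
print(f"accuracy={accuracy:.4f}")
```

Under this reading, 812 of the 1000 `Both()` scenes are predicted as `The()`, and `Most()` is spread mainly across `All()` and `Some()`, so most of the error comes from those confusions rather than from `No()`.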

In [67]:
# unnatural_model = teach(Classifier(unnatural_quantifiers, CNNBuilder), epochs=50, max_len=100)
unnatural_model = teach(Classifier(unnatural_quantifiers, DNNBuilder), epochs=50, max_len=100)

DNNBuilder model classifies ['MinMax(m=2,M=10)' 'MinMax(m=3,M=6)'
 'Or(MinMax(m=2,M=5),MinMax(m=10,M=20))']
Epoch 1/50
Epoch 2/50
Epoch 3/50
Epoch 4/50
Epoch 5/50
Epoch 6/50
Epoch 7/50
Epoch 8/50
Epoch 9/50
Epoch 10/50
Epoch 11/50
Epoch 12/50
Epoch 13/50
Epoch 14/50
Epoch 15/50
Epoch 16/50
Epoch 17/50
Epoch 18/50
Epoch 19/50
Epoch 20/50
Epoch 21/50
Epoch 22/50
Epoch 23/50
Epoch 24/50
Epoch 25/50
Epoch 26/50
Epoch 27/50
Epoch 28/50
Epoch 29/50
Epoch 30/50
Epoch 31/50
Epoch 32/50
Epoch 33/50
Epoch 34/50
Epoch 35/50
Epoch 36/50
Epoch 37/50
Epoch 38/50
Epoch 39/50
Epoch 40/50
Epoch 41/50
Epoch 42/50
Epoch 43/50
Epoch 44/50
Epoch 45/50
Epoch 46/50
Epoch 47/50
Epoch 48/50
Epoch 49/50
Epoch 50/50
Evaluation metrics: 
[1.252001072883606, 0.6154882311820984, 0.24415980279445648]
Confusion matrix: 
[[521  58 421]
 [531  44 425]
 [549  48 403]]
Classification report: 
                                       precision    recall  f1-score   support

                     MinMax(m=2,M=10)     0.3254  