
Build LSTM logic for local chatbot model #74

Closed
jeff1evesque opened this issue Dec 17, 2018 · 4 comments
Comments


jeff1evesque commented Dec 17, 2018

Currently, the Vagrantfile deploys a prebuilt chatbot, executed by run.py. However, we need to adjust run.py to optionally build our own model. This can be implemented with an optional flag which, when set, creates our own local chatbot model instead. Though many different solutions exist, we will initially try Keras, to reduce the amount of boilerplate syntax.
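
A minimal sketch of the flag handling (the argparse usage and the op='chat' fallback are assumptions for illustration; only main(op='train') is visible in the tracebacks below):

import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    '--train',
    action='store_true',
    help='build a local chatbot model instead of using the prebuilt one',
)
args = parser.parse_args()

if args.train:
    main(op='train')  # build and train our own local model
else:
    main(op='chat')   # assumed fallback: run the prebuilt chatbot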

jeff1evesque changed the title from "Build LSTM logic local chatbot model." to "Build LSTM logic for local chatbot model." Dec 17, 2018
jeff1evesque added a commit that referenced this issue Dec 17, 2018
jeff1evesque added a commit that referenced this issue Dec 17, 2018
jeff1evesque added a commit that referenced this issue Dec 17, 2018
jeff1evesque added a commit that referenced this issue Dec 18, 2018
jeff1evesque commented:

ef6f289: both TensorFlow and Keras need to be compatible versions.
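
For reference, pinning both packages together in requirements.txt avoids the mismatch; the exact versions below are illustrative of compatible releases from late 2018, not necessarily the ones used in ef6f289:

# requirements.txt: pin compatible releases together (versions illustrative)
tensorflow==1.12.0
keras==2.2.4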

jeff1evesque added a commit that referenced this issue Dec 19, 2018
#74: Add missing 'docker-compose.yml'
jeff1evesque added a commit that referenced this issue Dec 19, 2018
#74: Add missing 'docker-compose.yml' (revert)
jeff1evesque added a commit that referenced this issue Dec 19, 2018
@jeff1evesque jeff1evesque reopened this Dec 19, 2018
jeff1evesque added a commit that referenced this issue Dec 19, 2018
jeff1evesque added a commit that referenced this issue Dec 25, 2018
jeff1evesque added a commit that referenced this issue Dec 25, 2018
jeff1evesque added a commit that referenced this issue Dec 25, 2018
jeff1evesque changed the title from "Build LSTM logic for local chatbot model." to "Build LSTM logic for local chatbot model" Dec 25, 2018
jeff1evesque commented:

Running --train locally, against the full dataset loaded by --insert, results in a MemoryError:

root@development:/vagrant# python3 run.py --train
Using TensorFlow backend.
vocabulary size: 21458
Traceback (most recent call last):
  File "run.py", line 120, in <module>
    main(op='train')
  File "run.py", line 70, in main
    model = train(posts, comments, cwd=cwd)
  File "/vagrant/chatbot/app/train.py", line 56, in train
    posts_train = create_posts(posts, vocab_size, post_maxlen, word2idx)
  File "/vagrant/chatbot/app/train.py", line 138, in create_posts
    post_idx = np.zeros(shape=(len(posts), post_maxlen, vocab_size))
MemoryError
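
The failure is expected: create_posts allocates a dense one-hot tensor of shape (len(posts), post_maxlen, vocab_size), and np.zeros defaults to float64, i.e. 8 bytes per cell. With vocab_size = 21458 and post_maxlen = 10 (matching the model summary below), the cost grows quickly:

# bytes = len(posts) * post_maxlen * vocab_size * 8
# e.g. an illustrative 10,000 posts:
10000 * 10 * 21458 * 8  # ~1.7e10 bytes, i.e. ~17 GB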

If we decrease our vocab size:

$ git diff chatbot/
diff --git a/chatbot/app/train.py b/chatbot/app/train.py
index eb707da..f512eaa 100644
--- a/chatbot/app/train.py
+++ b/chatbot/app/train.py
@@ -50,7 +50,7 @@ def train(
     word2idx = {w:(i+1) for i,(w,_) in enumerate(counter.most_common())}
     idx2word = {v:k for k,v in word2idx.items()}
     idx2word[0] = 'PAD'
-    vocab_size = len(word2idx) + 1
+    vocab_size = int((len(word2idx) + 1)/1000)
     print('vocabulary size: {vocab}'.format(vocab=vocab_size))

     posts_train = create_posts(posts, vocab_size, post_maxlen, word2idx)

We instead get an out-of-bounds error:

root@development:/vagrant# python3 run.py --train
Using TensorFlow backend.
vocabulary size: 21
i: 0, w: Your
Traceback (most recent call last):
  File "run.py", line 120, in <module>
    main(op='train')
  File "run.py", line 70, in main
    model = train(posts, comments, cwd=cwd)
  File "/vagrant/chatbot/app/train.py", line 56, in train
    posts_train = create_posts(posts, vocab_size, post_maxlen, word2idx)
  File "/vagrant/chatbot/app/train.py", line 140, in create_posts
    post = encode(posts[p], post_maxlen,vocab_size, word2idx)
  File "/vagrant/chatbot/app/train.py", line 128, in encode
    indices[i, word2idx[w]] = 1
IndexError: index 486 is out of bounds for axis 1 with size 21
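
This follows from the diff above: word2idx still maps words to indices up to 21458, while the one-hot axis is now only 21 wide. A sketch of capping the vocabulary properly instead (MAX_VOCAB and the UNK handling are assumptions, not part of the current train.py):

# Keep only the N most common words; map everything else to a shared UNK index.
MAX_VOCAB = 5000
word2idx = {w: i + 2 for i, (w, _) in enumerate(counter.most_common(MAX_VOCAB - 2))}
idx2word = {v: k for k, v in word2idx.items()}
idx2word[0] = 'PAD'
idx2word[1] = 'UNK'
vocab_size = MAX_VOCAB

# encode() would then look words up defensively:
#     indices[i, word2idx.get(w, 1)] = 1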

jeff1evesque commented:

We temporarily moved all of our sample dataset, except the first month, into chatbot/data2:

root@development:/vagrant# date
Thu Dec 27 04:10:13 UTC 2018
$ git status
On branch feature-74
Changes not staged for commit:
  (use "git add/rm <file>..." to update what will be committed)
  (use "git checkout -- <file>..." to discard changes in working directory)

        modified:   chatbot/data/reddit-2005-12
        deleted:    chatbot/data/reddit-2006-01
        deleted:    chatbot/data/reddit-2006-02
        deleted:    chatbot/data/reddit-2006-03
        deleted:    chatbot/data/reddit-2006-04
        deleted:    chatbot/data/reddit-2006-05

Untracked files:
  (use "git add <file>..." to include in what will be committed)

        chatbot/data2/

no changes added to commit (use "git add" and/or "git commit -a")

Then we removed approximately half of the existing content from chatbot/data/reddit-2005-12 (the one remaining data file, per the git status above), and executed python3 run.py --insert followed by python3 run.py --train:

root@development:/vagrant# python3 run.py --train
Using TensorFlow backend.
vocabulary size: 1865
_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
input_1 (InputLayer)         (None, 10, 1865)          0
_________________________________________________________________
lstm_1 (LSTM)                (None, 128)               1020928
_________________________________________________________________
repeat_vector_1 (RepeatVecto (None, 20, 128)           0
_________________________________________________________________
time_distributed_1 (TimeDist (None, 20, 1865)          240585
_________________________________________________________________
activity_regularization_1 (A (None, 20, 1865)          0
_________________________________________________________________
activation_1 (Activation)    (None, 20, 1865)          0
=================================================================
Total params: 1,261,513
Trainable params: 1,261,513
Non-trainable params: 0
_________________________________________________________________
None
Train on 54 samples, validate on 14 samples
Epoch 1/1
2018-12-27 04:12:37.075971: I tensorflow/core/platform/cpu_feature_guard.cc:137] Your CPU supports instructions that this TensorFlow binary was not compiled to use: SSE4.1 SSE4.2 AVX AVX2
32/54 [================>.............] - ETA: 1s - loss: 26.9587 - acc: 0.0000e+00Epoch 00001: saving model to /vagrant/model/checkpoint.ckpt
54/54 [==============================] - 4s 66ms/step - loss: 23.9524 - acc: 0.0000e+00 - val_loss: 14.3032 - val_acc: 0.0000e+00
root@development:/vagrant# ls -l model/
total 29644
-rwxrwxrwx 1 vagrant vagrant 15165772 Dec 27 04:12 chatbot.h5
-rwxrwxrwx 1 vagrant vagrant 15165772 Dec 27 04:12 checkpoint.ckpt
-rwxrwxrwx 1 vagrant vagrant    18185 Dec 27 04:12 idx2word.pkl
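
A sketch of loading these artifacts for inference (load_model and pickle.load are standard calls; the decoding loop that turns predictions back into words is omitted):

import pickle
from keras.models import load_model

# Load the trained seq2seq model and the index-to-word lookup saved by --train.
model = load_model('/vagrant/model/chatbot.h5')
with open('/vagrant/model/idx2word.pkl', 'rb') as f:
    idx2word = pickle.load(f)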

jeff1evesque added a commit that referenced this issue Dec 27, 2018
#74: Build LSTM logic for local chatbot model