
Conversation

damienlancry (Contributor) commented Jun 17, 2019

I created an example script trying to reproduce the results of Deep Bayesian Active Learning with Image Data using modAL.
I used this Keras code from one of the authors.
I cannot think of anything I am doing differently, and yet their code works and mine does not.
For the acquisition function, instead of using their modified Keras, I used Yarin Gal's (the first author's) implementation.
Can you spot any mistake in my code?
EDIT: I actually found a mistake in my code: I was not really computing the entropy but rather the other half of the BALD function. I fixed this mistake and am currently rerunning the code.
EDIT 2: Still not working.
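
For concreteness, here is a minimal sketch of what the entropy acquisition is meant to compute (this is not the PR code: it assumes a tf.keras model where calling with training=True keeps dropout active, and the `learner.estimator.model` attribute and `n_query` parameter are assumptions):

```python
import numpy as np

def max_entropy(learner, X_pool, T=100, n_query=10):
    """Score pool points by the entropy of the mean MC-dropout prediction."""
    # T stochastic forward passes; training=True keeps dropout active (tf.keras).
    probas = np.stack([learner.estimator.model(X_pool, training=True).numpy()
                       for _ in range(T)])        # shape (T, n_pool, n_classes)
    mean_proba = probas.mean(axis=0)              # predictive distribution per point
    # Entropy of the mean prediction, not the mean of the per-pass entropies
    # (the latter is the other half of the BALD objective).
    entropy = -(mean_proba * np.log(mean_proba + 1e-10)).sum(axis=-1)
    query_idx = np.argsort(entropy)[-n_query:]    # highest-entropy points
    return query_idx, X_pool[query_idx]
```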

codecov-io commented Jun 17, 2019

Codecov Report

Merging #48 into dev will not change coverage.
The diff coverage is n/a.


@@           Coverage Diff           @@
##              dev      #48   +/-   ##
=======================================
  Coverage   97.17%   97.17%           
=======================================
  Files          31       31           
  Lines        1629     1629           
=======================================
  Hits         1583     1583           
  Misses         46       46

Continue to review full report at Codecov.

Powered by Codecov. Last update 3c01821...300a518. Read the comment docs.

damienlancry (Contributor, Author) commented Jun 17, 2019

It seems to be working much better now with a pool2d of size 5 (I was doing pool2d((2,2)) in my code).
Edit: it's better with max_entropy, but it's equally better with random acquisition, so no improvement...

cosmic-cortex (Member)

Thanks! At first glance I don't know what might be wrong, so I'll take a detailed look ASAP, hopefully today! I'll also merge the PR then.

print('Accuracy after query {n}: {acc:0.4f}'.format(n=index + 1, acc=model_accuracy))
perf_hist = [model_accuracy]

np.save('/home/damien/Results/keras_modal_entropy.npy', perf_hist)
Member

Hardcoded path, should be removed eventually!

Contributor (Author)

Oh yes, sure, my bad.

Member

np :)

cosmic-cortex (Member) commented Jun 19, 2019

I have checked the code, along with Yarin Gal's implementation. What is missing from his implementation is the actual training part, which might be crucial. Here, when you call
learner.teach(X_pool[query_idx], y_pool[query_idx], epochs=50, batch_size=128, verbose=0),
you actually append the new training instances to the old ones and run the training on all of the data. This is not a problem for classical methods such as those in scikit-learn, because every call to .fit() retrains the model from scratch. However, this is not the case for neural networks in TensorFlow or PyTorch: there, training continues from the current state, so in effect, after say the 100th query, the initial data has been shown 100*n_epoch times, while the last query has been shown only once. This can create an imbalance. To solve this, pass the only_new=True argument to the .teach() method of the ActiveLearner. With this, the model is trained on the new data only.

So, I started to experiment with this; I'll let you know the results!

(Also, I have pointed out a hardcoded path in the code that should be removed eventually.)
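
For reference, the suggested call would look like this (same arguments as in the example, with only_new added):

```python
# Fit only on the newly queried instances; without only_new=True, the Keras
# model continues training on the full accumulated set from its current state.
learner.teach(
    X_pool[query_idx], y_pool[query_idx],
    only_new=True,
    epochs=50, batch_size=128, verbose=0,
)
```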

damienlancry (Contributor, Author) commented Jun 19, 2019

> (quoting cosmic-cortex's comment above)

Yes, there is no training in Yarin Gal's code, just an acquisition example. On the other hand, there is training in Riashat Islam's code; I find that implementation very messy, but it works.
OK, let's try with only_new=True, but I do not think that is what is recommended in the paper. I think the paper suggests training from scratch after every acquisition; I thought that was what I was doing, but it wasn't. So I'm going to try this next.

Btw, to this end, maybe a _fit_from_scratch method could be useful; what do you think?
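
For illustration, a hypothetical sketch of what such a method might wrap (build_fn is an assumed function returning a fresh, compiled Keras model, and the bookkeeping is done outside modAL here):

```python
import numpy as np

def fit_from_scratch(build_fn, X_train, y_train, X_new, y_new, **fit_kwargs):
    """Append the new labels, rebuild the net with fresh weights, refit on everything."""
    X_train = np.concatenate([X_train, X_new])
    y_train = np.concatenate([y_train, y_new])
    model = build_fn()                    # fresh weights on every acquisition
    model.fit(X_train, y_train, **fit_kwargs)
    return model, X_train, y_train
```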

Also, I think there might be a mistake in my max_entropy query strategy: I first take a random subset of the pool, then evaluate the acquisition function on that subset, and then take the argmax indices within the subset, so they are not the right pool indices. I'm working on fixing that too.
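
The fix boils down to mapping the subset-space argmax back to pool coordinates; a rough sketch (the helper name, subset size, and acquisition callable are placeholders):

```python
import numpy as np

def query_on_subset(acquisition, X_pool, subset_size=2000, n_instances=10):
    # Score only a random subset of the pool to keep the acquisition cheap.
    subset_idx = np.random.choice(len(X_pool), size=subset_size, replace=False)
    scores = acquisition(X_pool[subset_idx])            # scores live in subset space
    best_in_subset = np.argsort(scores)[-n_instances:]  # argmax within the subset
    return subset_idx[best_in_subset]                   # translate back to pool indices
```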

damienlancry (Contributor, Author) commented Jun 19, 2019

OK, I fixed the max_entropy acquisition function and it worked! I think this is ready to be merged now!
[image: dbal-modal results plot]

cosmic-cortex merged commit 300a518 into modAL-python:dev on Jun 19, 2019
cosmic-cortex (Member)

Cool! I have merged the PR, thank you! Also, I propose implementing the acquisition functions directly in modAL as a feature, not just as a custom query strategy in the example. I have just created the feature/bayesianDL branch for this purpose.

One challenge would be to write these functions in a backend-agnostic way, which may be difficult. I'll take a shot tomorrow; feel free to contribute if you are interested!
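
One possible shape for this (a sketch, not the eventual modAL API): keep the acquisition math in pure NumPy over a (T, n_samples, n_classes) array of MC-dropout probabilities, and leave producing that array to a backend-specific callable. For example, BALD in that style (the function name and shape convention are assumptions):

```python
import numpy as np

def bald_scores(mc_probas, eps=1e-10):
    """mc_probas: (T, n_samples, n_classes) MC-dropout class probabilities."""
    mean = mc_probas.mean(axis=0)
    # Entropy of the mean prediction ...
    H = -(mean * np.log(mean + eps)).sum(axis=-1)
    # ... minus the mean entropy of the individual passes: the mutual information.
    E_H = -(mc_probas * np.log(mc_probas + eps)).sum(axis=-1).mean(axis=0)
    return H - E_H
```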

Thanks again for the PR!

damienlancry (Contributor, Author)

I am totally interested in contributing!

damienlancry mentioned this pull request on Jun 21, 2019