
Add new example notebook for active learning #910

Draft: wants to merge 7 commits into base `main`
Conversation


@joelnkn joelnkn commented Jun 7, 2024

Adding a notebook for active learning following the checklist in #772 (and #556).

@joelnkn joelnkn self-assigned this Jun 7, 2024
@kevingreenman kevingreenman added this to the v2.1.0 milestone Jun 7, 2024
@kevingreenman kevingreenman linked an issue Jun 7, 2024 that may be closed by this pull request
@kevingreenman (Member) commented:

@joelnkn thanks for adding this! It looks like a great start to me at first glance. A couple notes:

  • You mentioned on Slack a concern about the error metrics getting worse with more active learning iterations. I agree that this would normally be concerning, but in this case it may be caused by the very small size of the dataset: with only 100 total points in the best case, the active learning progression is very noisy, and the model is apparently getting lucky with the training points it chooses in early iterations, which leads to better metrics. We could consider using a larger dataset for this notebook, but we'll have to think about what makes sense, since we don't want to unnecessarily bloat the GitHub repo with large files.
  • If we find a good dataset to use where the metrics decrease with more active learning iterations, it would be nice to make a plot at the end of the notebook to visualize this.
  • Right now I think random is a good choice for the priority function (aka acquisition function). Once we add the uncertainty functionality, I think people would also like to see an example of how using uncertainty-based sampling might improve results more efficiently than sampling randomly.
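For reference, the loop with a random priority function could be sketched like this. This is a minimal pure-Python sketch, not Chemprop's actual API; `run_active_learning`, `priority_fn`, and `al_batch_size` are hypothetical names, and the model (re)training step is elided:

```python
import random

def run_active_learning(pool, train, n_iterations, al_batch_size, priority_fn):
    """Generic active-learning loop: each iteration scores the unlabeled
    pool with priority_fn, moves the top al_batch_size points into the
    training set, and (in the real notebook) would retrain the model and
    record its error metrics."""
    history = []
    for _ in range(n_iterations):
        if not pool:
            break
        # Highest-priority points are acquired first.
        scored = sorted(pool, key=priority_fn, reverse=True)
        acquired, pool = scored[:al_batch_size], scored[al_batch_size:]
        train = train + acquired
        # ... retrain model on `train` and evaluate here ...
        history.append(len(train))
    return train, pool, history

# Random acquisition: every pool point gets a random priority score,
# so each iteration acquires a uniformly random batch.
random.seed(0)
random_priority = lambda x: random.random()

train, pool, history = run_active_learning(
    pool=list(range(90)), train=list(range(90, 100)),
    n_iterations=5, al_batch_size=10, priority_fn=random_priority)
```

An uncertainty-based strategy would then just swap `random_priority` for a function that scores each pool point by the model's predicted uncertainty, leaving the loop unchanged.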

@kevingreenman (Member) commented:

One other note: I suggest renaming your `batch_size` variable to something like `al_batch_size` to avoid confusion with the `batch_size` hyperparameter used when training the model without active learning.
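The rename might look like this in the notebook (values are hypothetical; `al_batch_size` is just the suggested variable name, not an existing Chemprop parameter):

```python
# Hypothetical values for illustration only.
al_batch_size = 10  # pool points acquired per active-learning iteration
batch_size = 64     # minibatch-size hyperparameter for model training
```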

Successfully merging this pull request may close these issues.

[TODO]: Add example notebooks to the docs