Support batch-mode queries?

Hi,

I've run into a bit of a use-case that I'm not sure is _quite_ supported by `modAL` – nor the broader libraries for active learning – but would be relatively simple to implement. After reviewing `modAL`'s internals a bit, I don't think it officially supports **active learning with batch-mode queries**. 

The sampling strategies (for example, [uncertainty sampling](https://github.com/cosmic-cortex/modAL/blob/master/modAL/uncertainty.py#L105)) do support the `n_instances` parameter, but from what I can tell, uncertainty sampling may return redundant/sub-optimal queries if we return more than one instance from the unlabeled set. This is a bit prohibitive in settings where we'd like to ask an active learner to return multiple (if not all) examples from the unlabeled set/pool, and the computational cost for re-training an active learning model goes without saying.

I found requests for batch-mode support in the popular `libact` library (issues [#57](https://github.com/ntucllab/libact/issues/57) and [#89](https://github.com/ntucllab/libact/issues/89)) but, to the best of my knowledge, I'm not sure they were addressed in any of their PRs.

In that case, does it make sense to implement something like [Ranked batch-mode active learning] by Cardoso et al.? I took a crack at it this weekend for a better personal understanding, but if it's worth integrating and supporting in `modAL` I'm happy to polish it and talk it through in a PR.

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support batch-mode queries? #11

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Support batch-mode queries? #11

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions