Add zero-shot classification #121

Merged
merged 14 commits into main on Jan 3, 2023
Conversation

seanmor5 (Contributor)

I will add tests tomorrow when I get the multi-prompt case down, but right now it seems to be working fine:

[Screenshot: Screen Shot 2022-12-15 at 6.45.05 PM]

Btw, is there a reason we hid documentation for TokenClassification?

@jonatanklosko (Member)

Awesome! I think ideally we should follow the same format as image/text classification, so %{predictions: [%{label: ..., score: ...}, ...]}. For now I wouldn't include prompt and label in the output, since the user already has the input, and for multiple inputs they can just zip them with the predictions.
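For example, a single prompt classified against three candidate labels might come back as (hypothetical scores):

%{
  predictions: [
    %{label: "traveling", score: 0.98},
    %{label: "cooking", score: 0.01},
    %{label: "dancing", score: 0.01}
  ]
}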

Btw, is there a reason we hid documentation for TokenClassification?

We use defdelegate in the Bumblebee.Text module and that's where all the docs live :)
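Roughly, the delegation looks like this (a simplified sketch, not the exact code in the repo):

defmodule Bumblebee.Text do
  @doc """
  Builds a serving for token classification.
  """
  defdelegate token_classification(model_info, tokenizer, opts \\ []),
    to: Bumblebee.Text.TokenClassification
end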

{labels, hypothesis} = Enum.unzip(labels_and_hypothesis)

all_inputs =
  Bumblebee.apply_tokenizer(tokenizer, Enum.zip(prompts, hypothesis),

One thing to note is that here a single input becomes multiple inputs in the batch (one per {prompt, hypothesis} pair). We could group them by adding an additional leading axis, however the number of hypotheses may differ for every input in the batch, so we can't really do that. Probably just documenting it is the way to go?
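For instance, a single prompt with three candidate labels expands into three tokenizer inputs (a toy sketch; the hypothesis template is illustrative):

prompt = "one day I will see the world"
labels = ["cooking", "traveling", "dancing"]

# each {prompt, hypothesis} pair becomes its own entry in the model batch
pairs = for label <- labels, do: {prompt, "This example is #{label}."}
# => [
#   {"one day I will see the world", "This example is cooking."},
#   {"one day I will see the world", "This example is traveling."},
#   {"one day I will see the world", "This example is dancing."}
# ]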

For Stable Diffusion, if someone sets num_images_per_prompt: 2, the number of sub-inputs is fixed and we treat it as a single member of the batch (by adding a leading axis). I wonder if we should instead treat num_images_per_prompt: 2 as two inputs, so for a batch size of 1 we would generate each image separately. This way, if someone sets 4 images and a batch size of 1, it would take more time but wouldn't blow up memory, and they could set the batch size to 4 to generate everything at once. This decoupling gives more control to the user.

@seanmor5 @josevalim thoughts?


FTR we decided to stick to the current approach, that is, :batch_size always refers to the serving input. The effective model batch may be inflated by options such as num_images_per_prompt, or by the number of labels in this case.
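Concretely, that means the model can see more sequences than :batch_size suggests (a toy calculation with hypothetical numbers):

# :batch_size counts serving inputs, not model-level sequences
batch_size = 2
num_labels = 3

# each input contributes one {prompt, hypothesis} pair per label
effective_model_batch = batch_size * num_labels
# => 6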

@jonatanklosko (Member)

@seanmor5 I fixed the post-processing to apply softmax per batch entry over the entailment logits. I also adjusted the assertions based on this:

from transformers import pipeline

p = pipeline("zero-shot-classification", model="facebook/bart-large-mnli", candidate_labels=["cooking", "traveling", "dancing"])

p("one day I will see the world")

@jonatanklosko (Member) left a comment

:shipit:

@seanmor5 merged commit cfbd909 into main on Jan 3, 2023
@jonatanklosko deleted the sm-zero-shot-classification branch on January 3, 2023 at 11:01