No "batched" Inference #170

Open
michaelfeil opened this issue Jul 24, 2024 · 4 comments
Labels: bug (Something isn't working), priority-p1, triage

michaelfeil commented Jul 24, 2024

I noticed that a number of things here are incorrectly implemented.

classifier = pipeline("sentiment-analysis", device="cpu",
                model="distilbert/distilbert-base-uncased-finetuned-sst-2-english")

def is_positive_dialogue_ending(file) -> bool:
    dialogue_ending = file.read()[-512:] # 512 chars != 512 tokens
    return classifier(dialogue_ending)[0]["label"] == "POSITIVE" # NOT performing batched inference, performs threaded, blocking single inference -> really bad once you are on GPU.

To perform batched inference, you would need to pass multiple sentences at once, with a batch size > 1, to the sentence-classification pipeline.

classifier = pipeline(
    "sentiment-analysis",
    device="cpu",
    # ideally the tokenizer should truncate from the LEFT, so the dialogue
    # *ending* survives truncation; I think this is not possible to set
    # directly on the pipeline
    model="distilbert/distilbert-base-uncased-finetuned-sst-2-english",
)

def are_positive_dialogue_endings(files: list) -> list[bool]:
    dialogue_endings = [file.read() for file in files]  # something like this
    return [result["label"] == "POSITIVE" for result in classifier(dialogue_endings)]  # something like this
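Incidentally, left-side truncation does seem achievable on the tokenizer itself rather than on the pipeline; this is an assumption about the transformers API (recent versions expose a `truncation_side` attribute on tokenizers). A minimal sketch:

```python
def build_left_truncating_classifier(
    model_name: str = "distilbert/distilbert-base-uncased-finetuned-sst-2-english",
):
    """Build a sentiment pipeline whose tokenizer truncates from the LEFT,
    so the *end* of a long dialogue survives truncation.
    (Assumption: transformers >= 4.15, which supports `truncation_side`.)"""
    from transformers import AutoTokenizer, pipeline  # imported lazily

    tokenizer = AutoTokenizer.from_pretrained(model_name, truncation_side="left")
    return pipeline(
        "sentiment-analysis", model=model_name, tokenizer=tokenizer, device="cpu"
    )


def are_positive_dialogue_endings(classifier, texts: list) -> list:
    # one batched, token-level-truncated call instead of N single-item calls
    results = classifier(texts, truncation=True, batch_size=8)
    return [r["label"] == "POSITIVE" for r in results]


if __name__ == "__main__":
    clf = build_left_truncating_classifier()
    print(are_positive_dialogue_endings(clf, ["What a lovely ending!", "Everyone died."]))
```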

As a result, I launched https://github.com/michaelfeil/embed and https://github.com/michaelfeil/infinity for sentence classification. The backend queues and batches incoming requests, which allows more efficient batched execution. This is nice on CPU, but crucial on GPU!

from embed import BatchedInference
from concurrent.futures import Future

# Run any model
register = BatchedInference(
    model_id=[
        # classification models
        "distilbert/distilbert-base-uncased-finetuned-sst-2-english",
    ],
    # engine: `torch` or `optimum`
    engine="torch",
    # device: `cuda` (Nvidia/AMD) or `cpu`
    device="cpu",
)


def is_positive_dialogue_ending(file) -> bool:
    """Multiprocessing is not recommended (neither is it for transformers); threading is encouraged."""
    dialogue_ending = file.read()[-512:]  # 512 chars != 512 tokens
    future: "Future" = register.classify(
        model_id="distilbert/distilbert-base-uncased-finetuned-sst-2-english",
        sentences=[dialogue_ending],
    )
    # best: defer resolving the future to a later stage
    return future.result()[0]["label"] == "POSITIVE"
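For intuition, the queue-and-batch pattern behind such a backend can be sketched in a few lines of plain Python. This is a toy illustration of the idea, not the actual infinity/embed implementation:

```python
import queue
import threading
from concurrent.futures import Future


class MicroBatcher:
    """Toy sketch of queue-and-batch inference: callers submit single items
    and get Futures back; a worker thread drains the queue and runs the
    model once per batch. Error handling is omitted for brevity."""

    def __init__(self, batch_fn, max_batch_size: int = 8):
        self._batch_fn = batch_fn
        self._max = max_batch_size
        self._q: "queue.Queue" = queue.Queue()
        threading.Thread(target=self._run, daemon=True).start()

    def submit(self, item) -> Future:
        fut: Future = Future()
        self._q.put((item, fut))
        return fut

    def _run(self):
        while True:
            batch = [self._q.get()]            # block until one item arrives
            while len(batch) < self._max:      # greedily drain whatever is queued
                try:
                    batch.append(self._q.get_nowait())
                except queue.Empty:
                    break
            results = self._batch_fn([item for item, _ in batch])  # ONE batched call
            for (_, fut), result in zip(batch, results):
                fut.set_result(result)
```

Many concurrent `submit()` calls then collapse into a few batched model calls, which is exactly what a GPU wants.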
@shcheklein shcheklein added bug Something isn't working triage priority-p1 labels Jul 24, 2024
volkfox (Contributor) commented Jul 24, 2024

need to fix batching ASAP for many similar HuggingFace models to work

dmpetrov (Member) commented
This is related to batch_map #84. I prioritized this one.

@dmpetrov changed the title from “Inefficency - no "batched" Inference, and some major” to “No "batched" Inference, and some major” Jul 25, 2024
@dmpetrov changed the title from “No "batched" Inference, and some major” to “No "batched" Inference” Jul 25, 2024
shcheklein (Member) commented
@dberenbaum can this be closed now?

dberenbaum (Collaborator) commented
See the note from #191:

I think we should keep open #170. That request seems to be specifically about using futures to batch individual results using the existing .map without needing a separate .batch_map(). I think .batch_map() may be both simpler to implement and explain for now, but I think we could come back to the ideas in #170 in the future.
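For contrast, the `.batch_map()` idea can be sketched as an eager chunk-and-flatten helper. The signature here is hypothetical; the real DataChain API may differ:

```python
def batch_map(items: list, fn, batch_size: int = 8) -> list:
    """Hypothetical batch_map: call `fn` once per chunk of up to
    `batch_size` items and flatten the per-batch results, preserving order."""
    out = []
    for start in range(0, len(items), batch_size):
        out.extend(fn(items[start : start + batch_size]))
    return out
```

Compared with the futures approach in #170, this is simpler to implement and explain, but it batches only within one explicit call rather than across concurrent callers.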
