
Add 🤗 hub support #13

Merged
monk1337 merged 8 commits into promptslab:main from add-hf-hub-support on Feb 8, 2023

Conversation

@Wauplin (Contributor) commented Jan 31, 2023

This is very preliminary work to integrate models from the Hugging Face Hub (see issue #5).

I tested it only with the google/flan-t5-xl model. It seems to perform well on some classification tasks but definitely needs more evaluation. The good part is that it should be easy to compare different models hosted on the Hub simply by changing the model id.

I'm still not 100% sure about the API design, and it needs refinement (docs, examples, tests?). Feedback is welcome :)

EDIT: I added a notebook guide explaining how to use the HubModel class. I didn't want to add too much to the README, so I placed it separately. Happy to move it if you prefer it somewhere else.

# Import path assumed here; adjust to match the package layout on this branch.
from promptify import HubModel, Prompter

# prompt_examples: a list of few-shot examples, defined elsewhere.
model = HubModel(model_id_or_url="google/flan-t5-xl")
prompter = Prompter(model)

output = prompter.fit(
    "binary_classification.jinja",
    label_0="positive",
    label_1="negative",
    examples=prompt_examples,
    text_input="I've read many post on here saying Israel gives training to US law enforcement .",
)
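
As an illustration of the "swap the model id" workflow, the same prompt can be sent to a second Hub model. The id below (google/flan-t5-large) is only an example and was not evaluated in this PR:

# Illustrative sketch: only the model id changes between runs.
other_model = HubModel(model_id_or_url="google/flan-t5-large")
other_prompter = Prompter(other_model)
other_output = other_prompter.fit(
    "binary_classification.jinja",
    label_0="positive",
    label_1="negative",
    examples=prompt_examples,
    text_input="I've read many post on here saying Israel gives training to US law enforcement .",
)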

Overall I think the prompts are currently optimized for OpenAI models. That's not really a problem, since models on the Hub are very diverse anyway. In any case, there are no open-source competitors to models like ChatGPT (at least for now).

Comment on lines -44 to -49
{
"text": "i have been with petronas for years i feel that petronas has performed well and made a huge profit",
"labels": "negative",
"score": "",
"complexity": ""
},
@Wauplin (Contributor, Author) commented Feb 2, 2023

That was a duplicate of the previous example.

@Wauplin marked this pull request as ready for review February 2, 2023 16:54
@Wauplin changed the title from "[WIP] Add 🤗 hub support" to "Add 🤗 hub support" Feb 2, 2023
@jpoles1 commented Feb 5, 2023

Is it possible to use this for NER or "token classification" tasks? Or is it limited to text-generation tasks?

@Wauplin (Contributor, Author) commented Feb 5, 2023

@jpoles1 This PR is only meant to let everyone use the promptify library with models shared on the Hugging Face Hub.

As for your question, there is indeed a template to generate a prompt for an NER task (see ner.jinja). In general, the whole point of promptify is to use (L)LMs to perform any type of NLP task with zero-shot/few-shot learning. So the answer to "is it possible to perform token classification with this lib?" is yes, as long as the underlying language model you use is capable of it, which is not always the case.
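
For illustration, a zero-shot NER call through the same HubModel API could look roughly like the sketch below; the template variable names (e.g. domain) and the suitability of this model id for NER are assumptions, not something verified in this PR:

# Sketch only: reuse the HubModel/Prompter API from the PR description with the NER template.
model = HubModel(model_id_or_url="google/flan-t5-xl")
prompter = Prompter(model)
ner_output = prompter.fit(
    "ner.jinja",
    domain="medical",  # assumed template variable
    text_input="The patient was prescribed 500 mg of paracetamol for the fever.",
)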

@Wauplin (Contributor, Author) commented Feb 5, 2023

Hope this answers your question a bit. For more general questions about promptify, I would advise you to open an issue on GitHub or join the Discord.

@jpoles1 commented Feb 6, 2023

Ah, yep, my question came from a fundamental misunderstanding of what's going on under the hood. Thanks for clarifying @Wauplin.

@eren23 (Collaborator) commented Feb 6, 2023

@Wauplin Hey, this is an emergency: we are trying to use promptify to extract addresses from Turkish earthquake tweets. OpenAI works well, but we need an alternative just in case. Can you join a Discord call to help us work our way around the HF implementation?

@Wauplin (Contributor, Author) commented Feb 6, 2023

@eren23 Replied to your message in DM.

For anyone wanting to test promptify with HF models, you can install the package from my branch:

pip install git+https://github.com/Wauplin/Promptify@add-hf-hub-support

But please be aware that this is still very new and not tested on real cases (i.e. it's not clear which models hosted on the Hub will perform well, on which tasks, on which languages, with which prompts, ...). The PR only enables the possibility of using Hugging Face models; it does not guarantee the effectiveness of the method.
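
A quick sanity check after installing from the branch could be an import one-liner; the top-level import path is an assumption based on the class name used in this PR, not a documented API:

# Assumed import path; adjust if HubModel lives in a submodule on this branch.
python -c "from promptify import HubModel; print(HubModel)"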

@monk1337 merged commit f99c503 into promptslab:main Feb 8, 2023
@monk1337 (Contributor) commented Feb 8, 2023

Thank you @Wauplin for your great contribution; merging it now.

@Wauplin deleted the add-hf-hub-support branch February 8, 2023 13:32
@monk1337 added the "models" label (issue related to models) Mar 26, 2023