feat: Add Evol instruct task #237

plaguss · 2024-01-11T12:55:43Z

Description

This PR adds a new text generation task: Evol-Instruct.

Closes #229

docs/technical-reference/tasks.md

src/distilabel/tasks/_templates/evol-instruct.jinja2

src/distilabel/tasks/text_generation/evol_instruct.py

davidberenstein1957 · 2024-01-15T12:34:54Z

src/distilabel/tasks/text_generation/evol_instruct.py

+        if "sorry" in output.lower() and len(output.split(" ")) < 80:
+            return
+
+        # 3) The output only contains punctuation and stop words


the stopwords comment

I don't think any of our current dependencies supports this.

What dependency do you refer to?

If we happened to already depend on nltk, we could have reused its stop-word logic. Sadly, we don't.
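For reference, a minimal sketch of the two elimination checks discussed in this thread, written without any extra dependency. The function name `should_eliminate`, the `max_sorry_words` parameter, and the tiny `STOP_WORDS` set are assumptions made for illustration only; they are not taken from the PR, and a real check would need a full English stop-word list.

```python
import string

# Illustrative subset only (an assumption); a real check needs a full
# English stop-word list such as the one shipped with nltk.
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "to", "and", "or", "in", "it", "i"}


def should_eliminate(output: str, max_sorry_words: int = 80) -> bool:
    """Return True when the output looks like a failed evolution."""
    # 2) Short apology-style refusal: contains "sorry" and has fewer
    #    than `max_sorry_words` space-separated words.
    if "sorry" in output.lower() and len(output.split(" ")) < max_sorry_words:
        return True

    # 3) Only punctuation and stop words: strip punctuation from each
    #    token and see whether anything meaningful is left.
    tokens = [token.strip(string.punctuation) for token in output.lower().split()]
    meaningful = [token for token in tokens if token and token not in STOP_WORDS]
    return not meaningful
```

Note that an empty output is also eliminated by check 3, since no meaningful token remains.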

davidberenstein1957 · 2024-01-16T11:31:52Z

docs/technical-reference/tasks.md

+
+!!! note
+    The original definition of `EvolInstruct` considers an elimination evolving step with different
+    situations to remove the responses considered as failures. Section 3.2, *Elimination Evolving* in [WizardLM paper](https://arxiv.org/abs/2304.12244) shows these steps. We have implemented steps 2-4 as part of this task, but not step one. If the user wants to tackle this, an example can be seen in the following script: [examples/pipeline-openai-wizardl-equal-prompts.py](../../examples/pipeline-evol-instruct-alpaca.py).


The wrong file is referenced here.

Also, please mention more explicitly that it shows how to implement evaluation step 1 using distilabel.
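To make the request concrete: step 1 of *Elimination Evolving* in the WizardLM paper drops an evolved instruction that adds no information over the original, and the paper delegates that judgement to an LLM. Below is a rough sketch of the non-LLM parts of such a check. The prompt wording, `build_equality_prompt`, and `is_equal_verdict` are hypothetical names and text written for this illustration; they are not copied from the paper, from distilabel, or from the example script.

```python
# Hypothetical prompt wording (an assumption), not the paper's or distilabel's.
EQUALITY_PROMPT = (
    "Here are two instructions. Do they have the same constraints and "
    "requirements, and the same depth and breadth of inquiry?\n"
    "First instruction: {original}\n"
    "Second instruction: {evolved}\n"
    "Answer with exactly 'Equal' or 'Not Equal'."
)


def build_equality_prompt(original: str, evolved: str) -> str:
    """Fill the comparison prompt that would be sent to the judging LLM."""
    return EQUALITY_PROMPT.format(original=original, evolved=evolved)


def is_equal_verdict(llm_answer: str) -> bool:
    """Interpret the LLM answer; True means the evolved prompt should be dropped."""
    answer = llm_answer.strip().lower().rstrip(".")
    return answer == "equal"
```

The prompt would be sent through whichever LLM the pipeline already uses, and rows where `is_equal_verdict` returns `True` would be filtered out before the remaining elimination checks run.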

plaguss added 4 commits January 11, 2024 13:52

Add evol-instruct task

95f8f91

Include docs and example for evol instruct task

8073074

Add tests for evol instruct task

c3249b6

Merge branch 'main' of https://github.com/argilla-io/distilabel into evol-instruct-task

0048324

plaguss requested review from dvsrepo and davidberenstein1957 January 11, 2024 12:56

davidberenstein1957 reviewed Jan 11, 2024

plaguss added 4 commits January 12, 2024 12:07

Align phrasing with self-instruct task

d490aa6

Correct typo

74fa6c7

Add elimination step and include more docstrings

2735e2a

Update test with typo in template

b9cad5e

davidberenstein1957 reviewed Jan 15, 2024

plaguss added 4 commits January 15, 2024 17:27

Merge branch 'main' of https://github.com/argilla-io/distilabel into evol-instruct-task

1e5eeae

Add outline of not implemented step from elimination evolving in the paper

b15378d

Add more information for the elimination evolving in the parse_output method and redirect to the original paper

7c68176

Add english stopwords from nltk for the elimination step

5ffc29d

plaguss requested a review from davidberenstein1957 January 16, 2024 08:33

davidberenstein1957 approved these changes Jan 16, 2024

plaguss added 2 commits January 16, 2024 10:49

Add explicit reference to the paper

0907420

Add example in the docs for wizardlm check of equality of prompts

0d325e5

davidberenstein1957 reviewed Jan 16, 2024

docs: minor rephrasing

4f63e24

davidberenstein1957 merged commit 28b60c1 into main Jan 16, 2024
4 checks passed

davidberenstein1957 deleted the evol-instruct-task branch January 16, 2024 15:56

davidberenstein1957 changed the title ~~Add Evol instruct task~~ feat: Add Evol instruct task Jan 16, 2024

davidberenstein1957 mentioned this pull request Jan 16, 2024

[FEATURE] add to_argilla to EvolInstruct task #259

Closed

davidberenstein1957 added this to the 0.4.0 milestone Jan 17, 2024
