Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add to_argilla method to EvolInstructTask generated datasets #291

Merged
merged 10 commits into from
Jan 24, 2024

Conversation

plaguss
Copy link
Contributor

@plaguss plaguss commented Jan 23, 2024

Description

This PR adds pending functionality to visualise evol instruct datasets in argilla

Example of an evol instruct dataset:
image

Example to reproduce:

from datasets import load_dataset
from distilabel.tasks import EvolInstructTask
from distilabel.dataset import CustomDataset

dataset = load_dataset("argilla/distilabel-sample-evol-instruct", split="train")
dataset = CustomDataset(dataset.data.table)
dataset.task = EvolInstructTask()

# rg.init(...)
rg_dataset = dataset.to_argilla()
rg_dataset.push_to_argilla(name="evol-instruct-test", workspace="admin")

Closes #259

@plaguss plaguss added this to the 0.5.0 milestone Jan 23, 2024
@plaguss plaguss self-assigned this Jan 23, 2024
@davidberenstein1957
Copy link
Member

Also, I think I might be missing some tests.

@davidberenstein1957 davidberenstein1957 changed the title Add to_argilla method to EvolInstructTask generated datasets feat: add to_argilla method to EvolInstructTask generated datasets Jan 24, 2024
@plaguss
Copy link
Contributor Author

plaguss commented Jan 24, 2024

The new InstructTaskMixin contains the common behaviour for the task and new tests have been added for the integration

@plaguss plaguss merged commit 2ec5900 into main Jan 24, 2024
4 checks passed
@plaguss plaguss deleted the feat/evol-instruct-to-argilla branch January 24, 2024 12:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[FEATURE] add to_argilla to EvolInstruct task
2 participants