feat: create responses and suggestions with spans #4623

frascuchon · 2024-03-04T17:27:45Z

Description

This PR defines a basic flow to support responses and suggestions with span values.

import spacy
from datasets import load_dataset

import argilla as rg
from argilla.client.feedback.schemas import SpanValueSchema
from argilla.client.feedback.schemas.responses import ResponseSchema, ResponseBuilder


api_url = ...
api_key = ...

rg.init(api_url, api_key)

nlp = spacy.load("en_core_web_sm")
labels = nlp.get_pipe("ner").labels

gutenberg_config = rg.FeedbackDataset(
    fields=[rg.TextField(name="text")],
    questions=[rg.SpanQuestion(name="spans", title="Name entities recognition", field="text", labels=labels)]
)

ds = gutenberg_config.push_to_argilla("gutenberg_spacy-ner", workspace="argilla")
hf_ds = load_dataset("argilla/gutenberg_spacy-ner", split="train")

user = rg.User.me()

def to_response_spans(doc) -> list[SpanValueSchema]:
    """Converts a spacy doc to a list of response spans."""
    return [
        SpanValueSchema(start=ent.start_char, end=ent.end_char,label=ent.label_)
        for ent in doc.ents
    ]

question = ds.question_by_name("spans")
records = [
    rg.FeedbackRecord(
        fields={"text": r["text"]},
        responses=[
            ResponseSchema(
                status="draft",
                user_id=user.id,
                values=[question.response(to_response_spans(nlp(r["text"])))]
            )
        ],
        suggestions=[
            question.suggestion(value=to_response_spans(nlp(r["text"])), agent="spacy", type="model")
        ]
    )
    for r in hf_ds
]
ds.add_records(records)

Closes #4618

Type of change

(Please delete options that are not relevant. Remember to title the PR according to the type of change)

New feature (non-breaking change which adds functionality)
Refactor (change restructuring the codebase without changing functionality)
Improvement (change adding some improvement to an existing functionality)

How Has This Been Tested

(Please describe the tests that you ran to verify your changes. And ideally, reference tests)

Test A
Test B

Checklist

I added relevant documentation
I followed the style guidelines of this project
I did a self-review of my code
I made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I filled out the contributor form (see text above)
I have added relevant notes to the CHANGELOG.md file (See https://keepachangelog.com/)

for more information, see https://pre-commit.ci

…la-io/argilla into feat/create-span-question-from-sdk

…ecords-with-spans

for more information, see https://pre-commit.ci

…io/argilla into feat/suggest-records-with-spans

…/create-responses-with-spans

…-with-spans

…sponses-with-spans

…-with-spans

# Description This PR adds response and suggestion method helpers to simplify and unify how users may work with responses and suggestions. Instead of using response and suggestion schemas init methods, users may use `*.with_question_value` method to create responses and suggestions given a question and a response value. This way will introduce some extra data validation which is not available using init methods directly. For example, given the following dataset: ```python dataset = rg.FeedbackDataset( fields =[rg.TextField(name="text")], question=[ rg.TextQuestion(name="question-1", required=True), rg.RatingQuestion(name="question-2", values=[1, 2], required=True), rg.LabelQuestion(name="question-3", labels=["a", "b", "c"], required=True), rg.MultiLabelQuestion(name="question-4", labels=["a", "b", "c"], required=True), rg.RankingQuestion(name="question-5", values=["a", "b"], required=True), ] ) ``` users could create responses and suggestions as follows: ```python question_1 = dataset.question_by_name("question-1") question_2 = dataset.question_by_name("question-2") question_3 = dataset.question_by_name("question-3") question_4 = dataset.question_by_name("question-4") question_5 = dataset.question_by_name("question-5") record = rg.FeedbackRecord( fields={ "text": "This is a text value for field"}, responses=[ ResponseSchema(status="submitted") .with_question_value(question_1, value="answer") .with_question_value(question_2, value=0) .with_question_value(question_3, value="a") .with_question_value(question_4, value=["a", "b"]) .with_question_value(question_5, value=[{"rank": 1, "value": "a"}, {"rank": 2, "value": "b"}]) ], suggestions=[ SuggestionSchema.with_question_value(question_1, value="answer"), SuggestionSchema.with_question_value(question_2, value=0), SuggestionSchema.with_question_value(question_3, value="a"), SuggestionSchema.with_question_value(question_4, value=["a", "b"]), SuggestionSchema.with_question_value(question_5, value=[{"rank": 1, "value": "a"}, {"rank":2, "value": "b"}]) ] ) ``` **Type of change** (Please delete options that are not relevant. Remember to title the PR according to the type of change) - [ ] New feature (non-breaking change which adds functionality) - [ ] Refactor (change restructuring the codebase without changing functionality) - [ ] Improvement (change adding some improvement to an existing functionality) **How Has This Been Tested** (Please describe the tests that you ran to verify your changes. And ideally, reference `tests`) - [ ] Test A - [ ] Test B **Checklist** - [ ] I added relevant documentation - [ ] I followed the style guidelines of this project - [ ] I did a self-review of my code - [ ] I made corresponding changes to the documentation - [ ] My changes generate no new warnings - [ ] I have added tests that prove my fix is effective or that my feature works - [ ] I filled out [the contributor form](https://tally.so/r/n9XrxK) (see text above) - [ ] I have added relevant notes to the `CHANGELOG.md` file (See https://keepachangelog.com/) --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

for more information, see https://pre-commit.ci

# Description This PR integrates span questions with HF datasets, allowing exporting/importing datasets with span questions. Closes #4635 **Type of change** (Please delete options that are not relevant. Remember to title the PR according to the type of change) - [ ] New feature (non-breaking change which adds functionality) - [ ] Refactor (change restructuring the codebase without changing functionality) - [X] Improvement (change adding some improvement to an existing functionality) **How Has This Been Tested** (Please describe the tests that you ran to verify your changes. And ideally, reference `tests`) - [ ] Test A - [ ] Test B **Checklist** - [ ] I added relevant documentation - [ ] I followed the style guidelines of this project - [ ] I did a self-review of my code - [ ] I made corresponding changes to the documentation - [ ] My changes generate no new warnings - [ ] I have added tests that prove my fix is effective or that my feature works - [ ] I filled out [the contributor form](https://tally.so/r/n9XrxK) (see text above) - [ ] I have added relevant notes to the `CHANGELOG.md` file (See https://keepachangelog.com/) --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

davidberenstein1957 · 2024-03-11T08:34:19Z

src/argilla/client/feedback/integrations/huggingface/dataset.py

+                                value = [
+                                    {"start": s, "end": e, "label": l}
+                                    for s, e, l in zip(value["start"], value["end"], value["label"])
+                                ]
                            responses[user_id or "user_without_id"]["values"].update({question.name: {"value": value}})


perhaps we should add else statements here to be sure it raises errors/has behaviour when we add question types?

I'm using the current approach to support span values mappings. I wouldn't add cross-cutting solution in the current version of the SDK. This is something that we can add for new SDK implementation

davidberenstein1957 · 2024-03-11T08:34:42Z

src/argilla/client/feedback/integrations/huggingface/dataset.py

+                        value = [
+                            {"start": s, "end": e, "label": l}
+                            for s, e, l in zip(value["start"], value["end"], value["label"])
+                        ]


perhaps we should add else statements here to be sure it raises errors/has behaviour when we add question types?

I'm using the current approach to support span values mappings. I wouldn't add cross-cutting solution in the current version of the SDK. This is something that we can add for new SDK implementation

davidberenstein1957 · 2024-03-11T08:36:33Z

src/argilla/client/feedback/integrations/huggingface/dataset.py

+                    elif question.type == QuestionTypes.span:
+                        value = [
+                            {"start": s, "end": e, "label": l}
+                            for s, e, l in zip(value["start"], value["end"], value["label"])


in terms of human-readability, I also like to add the extracted text normally. Something likevalue["text"][value["start"]:value["end"]] but perhaps this is difficult top map back into the correct format when calling from_huggingface?

This change is not hard to add, if it makes sense.

davidberenstein1957 · 2024-03-11T08:36:38Z

src/argilla/client/feedback/integrations/huggingface/dataset.py

+                            elif question.type == QuestionTypes.span:
+                                value = [
+                                    {"start": s, "end": e, "label": l}
+                                    for s, e, l in zip(value["start"], value["end"], value["label"])


in terms of human-readability, I also like to add the extracted text normally. Something likevalue["text"][value["start"]:value["end"]] but perhaps this is difficult top map back into the correct format when calling from_huggingface?

This change is not hard to add, if it makes sense.

davidberenstein1957 · 2024-03-11T08:41:06Z

src/argilla/client/feedback/schemas/questions.py


    @validator("labels", always=True)
    def normalize_labels(cls, v: List[Union[str, SpanLabelOption]]) -> List[SpanLabelOption]:
        return [SpanLabelOption(value=label, text=label) if isinstance(label, str) else label for label in v]

+    @validator("labels")


wasn't there a max for the labels too which we defined on the server side? perhaps we can use it here as well?

Yes, this is not hard to implement.

Anyway, this is not the current behaviour that exists for single and multi-label settings. In both cases, there is a min validation but not a max one.

Also, if we plan to support a configurable value for this, adding a hard validation here may introduce workflow problems. So, I will let as is.

davidberenstein1957 · 2024-03-11T08:43:05Z

src/argilla/client/feedback/schemas/response_values.py

+    score: Optional[confloat(ge=0.0, le=1.0)] = None
+
+    @root_validator
+    def check_span(cls, values):


I saw we already tested for larger than 0 but do we also validate if the span is within the overall text field, i.e., they are not larger than the used text?

We're covering basic data validation from SDK, adding more complex validations is not quite simple to implement for this SDK. Since there are these kinds of validations from the server, and the rest of the suggestions/responses didn't include validations, I would keep it simple for this version.

davidberenstein1957 · 2024-03-11T08:47:12Z

src/argilla/client/feedback/schemas/responses.py

+                " issue, unless you're planning to log the response in Argilla, as"
+                " it will be automatically set to the active `user_id`.",
+            )
+        return v


perphaps we should return None explicitly here? not 1 and not True also pass the if-statement and perhaps we require an explicit None to set the user_id automatically?

This code was already here. I just moved it. I wouldn't change anything except the span feature. This can be tackled in the new SDK.

davidberenstein1957 · 2024-03-11T08:49:56Z

src/argilla/client/feedback/schemas/suggestions.py

+    def to_server_payload(self, question_name_to_id: Dict[str, UUID]) -> Dict[str, Any]:
+        """Method that will be used to create the payload that will be sent to Argilla
+        to create a `SuggestionSchema` for a `FeedbackRecord`."""
+        payload = self.dict(exclude_unset=True, include={"type", "score", "value", "agent"})


to be clear, all of "type", "score", "value", "agent" are stored but not all of them are used, correct?

'score' is used to filter records at least from UI. The rest are not used yet, even if they're available for searches.

@davidberenstein1957

This PR adds the `text` value for span suggestions and responses when publishing HF datasets. Commented by @davidberenstein1957 [here](#4623 (comment)) --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

frascuchon and others added 30 commits February 29, 2024 15:50

chore: Add new span enum type

d44be46

feat: define new SpanQuestion and SpanLabelOption classes

39c9c6e

feat: Add 'remote' version for span question

28a7ffa

refactor: Using allowed types from questions modules

febd300

chore: Expose new span question classes through feedback module

3b010bb

refactor: simplify adding questions

9759153

chore: Expose span question classes from rg

5819d87

tests: Adding unint tests

d0ffa74

tests: Adding basic integration tests

9776e2d

Using feature branch from argilla server

8cafa01

update CHANGELOG

86f568d

[pre-commit.ci] auto fixes from pre-commit.com hooks

490dec6

for more information, see https://pre-commit.ci

fix import naming

56f360b

Merge branch 'feat/create-span-question-from-sdk' of github.com:argil…

5b638e7

…la-io/argilla into feat/create-span-question-from-sdk

Adding a new test

f337160

fix: Adding label description

5317900

update tests

35f6392

update tests

ceded58

Merge branch 'feat/create-span-question-from-sdk' into feat/suggest-r…

424a5c8

…ecords-with-spans

feat: Suggestions schema module including new SpanSuggestion schema

11a0068

Remove suggestions schemas from records module

f4bea43

update suggestion schemas imports

53af61b

[pre-commit.ci] auto fixes from pre-commit.com hooks

2760348

for more information, see https://pre-commit.ci

change to_server_payload logic

4edf802

Merge branch 'feat/suggest-records-with-spans' of github.com:argilla-…

433f27b

…io/argilla into feat/suggest-records-with-spans

feat: Define responses and responses values modules

540979f

refactor: Align response and suggestion value schemas

0899176

refactor: Remove response schemas from records module

be02eb2

feat: Adding fields attribute for span question

22f6947

Review imports

be13a7d

frascuchon changed the title ~~[WIP] feat: create responses with spans~~ feat: create responses with spans Mar 5, 2024

frascuchon changed the title ~~feat: create responses with spans~~ feat: create responses and suggestions with spans Mar 5, 2024

frascuchon added 8 commits March 5, 2024 17:44

Merge branch 'bugfix/format-suggestions-for-ranking-values' into feat…

2e87977

…/create-responses-with-spans

add field to span question definition

5f14b28

Merge branch 'bugfix/format-suggestions-for-ranking-values' into feat…

ca0b649

…/create-responses-with-spans

Merge branch 'feat/span-questions-support' into feat/create-responses…

816a839

…-with-spans

Merge branch 'bugfix/hf-dataset-remove-rank-list' into feat/create-re…

6a2f636

…sponses-with-spans

Merge branch 'feat/span-questions-support' into feat/create-responses…

fd664f4

…-with-spans

Merge branch 'feat/span-questions-support' into feat/create-responses…

7f43202

…-with-spans

fix: Using schema instances instead of dict for suggestions

d0b1351

frascuchon changed the title ~~feat: create responses and suggestions with spans~~ [WIP]feat: create responses and suggestions with spans Mar 6, 2024

frascuchon and others added 2 commits March 6, 2024 16:30

using argilla-server feature branch

a3def08

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Mar 6, 2024

chore. Update CHANGELOG

5b65e11

frascuchon changed the title ~~[WIP]feat: create responses and suggestions with spans~~ feat: create responses and suggestions with spans Mar 6, 2024

frascuchon and others added 2 commits March 6, 2024 18:02

adding more tests

af035dc

[pre-commit.ci] auto fixes from pre-commit.com hooks

bb7dba4

for more information, see https://pre-commit.ci

frascuchon mentioned this pull request Mar 7, 2024

[WIP]feat: suggest records with spans #4619

Closed

13 tasks

frascuchon and others added 4 commits March 7, 2024 16:07

feat: Accept a dict for labels

b6497ae

fix: add manual validation for min_items

c86b20d

update tests

6136414

frascuchon merged commit dcb8e37 into feat/span-questions-support Mar 11, 2024
16 checks passed

frascuchon deleted the feat/create-responses-with-spans branch March 11, 2024 08:49

davidberenstein1957 reviewed Mar 11, 2024

View reviewed changes

frascuchon mentioned this pull request Mar 11, 2024

chore: apply some of suggested changes #4644

Merged

louisguitton mentioned this pull request Jun 20, 2024

[DOCS] Create a tutorial about using SpanMarker and Argilla #4086

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: create responses and suggestions with spans #4623

feat: create responses and suggestions with spans #4623

frascuchon commented Mar 4, 2024 •

edited

Loading

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024 •

edited

Loading

frascuchon Mar 11, 2024

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024

davidberenstein1957 Mar 11, 2024

frascuchon Mar 11, 2024

feat: create responses and suggestions with spans #4623

feat: create responses and suggestions with spans #4623

Conversation

frascuchon commented Mar 4, 2024 • edited Loading

Description

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

frascuchon Mar 11, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

frascuchon commented Mar 4, 2024 •

edited

Loading

frascuchon Mar 11, 2024 •

edited

Loading