Accepting several predictions/annotations for the same record #1630

frascuchon · 2022-06-27T10:40:32Z

Introduction

Currently, records annotations/predictions only support store annotation info for just one annotator agent. The idea is to support several agents, for both, annotations and predictions. This change will bring several feature enhancements such as annotations agreement flows, weak label materialization, multi-pipeline monitoring, and more.

We could give more annotation/prediction control if we combine this feature with roles and dataset settings. By defining a set of annotators (even expected predictors patterns), we can limit the number of agents that can annotate a dataset.

Design keys

The proposed design keeps the prediction/annotation fields and includes a new predictions/annotations one, a data dictionary where the key corresponds to the annotation agent, and the value includes the annotation information provided by the client.

predictions = { “agent-one” : { “labels”: [“A”], “score”: [“0.3”] } }

This new structure will be enabled for search, providing a mechanism for fine-tuning the searches based on specific annotators/predictors. We can replicate all computed fields per annotation entry, so we could do things like:
annotations.agentA.annotated_as: FALSE or predictions.agent_b.predicted_as: TRUE

Backward compatibility

The new data model must tackle current record concepts, and provide a backward compatibility method to make both modes live.

Current fields such as predicted, predicted_as, and annotated_as could change the behavior since multiple values can be assigned. The only case where we can keep the old behavior should be when only an entry is provided.

Complete list of affected fields:

predicted: computed only when one single agent is defined. It will be deprecated and removed in future versions
predicted_as: computed only when one single agent is defined. It will be deprecated and removed in future versions
annotated_as: computed only when one single agent is defined. It will be deprecated and removed in future versions
predicted_by: showing all record agents
annotated_by: showing all record agents
scores: computed only when one single agent is defined (cc: @dvsrepo). It will be deprecated and removed in future versions
prediction: this field will be deprecated and removed in future versions
annotation: this field will be use as the "final/real annotation" (annotation agreement). Maybe a better naming in future versions.
explanation: (only for text classification) computed only when one single agent is defined. It will be deprecated and removed in future versions. The explanation must be defined at the prediction level.
token classification metrics: there are some metrics defined for annotations and predictions. Maybe does not make sense to build all agent metrics, but these fields will be totally affected by the new data model.

References

See recognai/rubrix-roadmap#59

The text was updated successfully, but these errors were encountered:

frascuchon · 2022-09-30T09:08:22Z

There are some task to finish before close this issue:

Allow log records with several annotations/predictions
Handle multiple annotations from UI (view, selected, remove, change,...)
Adapt related filters (backend and UI)
Adapt the definition of prediction ok/ko when multiple values can be present.

cceyda · 2023-03-08T18:12:52Z

would this also solve the issue for token classification where searching a 'word' with 'annotated_as' returning results where that 'word' is not 'annotated_as' the 'selected tag' but all results that involve that word & tag(on a different word)

frascuchon added the type: enhancement Indicates new feature requests label Jun 27, 2022

frascuchon added Manual Labeling type: community request Indicates a feature requested by someone outside of the Argilla organization and removed type: enhancement Indicates new feature requests labels Jul 12, 2022

frascuchon transferred this issue from another repository Jul 21, 2022

frascuchon assigned dvsrepo and frascuchon Jul 21, 2022

frascuchon mentioned this issue Jul 21, 2022

[Explanation] Should we keep explanation feat. just for text classification #1883

Closed

frascuchon mentioned this issue Aug 3, 2022

feat(API): provide a dict for record annotations/predictions #1658

Merged

frascuchon linked a pull request Aug 3, 2022 that will close this issue

feat(API): provide a dict for record annotations/predictions #1658

Merged

frascuchon removed a link to a pull request Aug 4, 2022

feat(API): provide a dict for record annotations/predictions #1658

Merged

frascuchon linked a pull request Aug 23, 2022 that will close this issue

feat(API): provide a dict for record annotations/predictions #1658

Merged

frascuchon closed this as completed in #1658 Aug 25, 2022

frascuchon added this to the v0.18.0 milestone Sep 13, 2022

frascuchon reopened this Sep 30, 2022

frascuchon removed this from the v0.18.0 milestone Sep 30, 2022

frascuchon modified the milestones: 2023 Q1, 2023 Q2 Nov 11, 2022

frascuchon modified the milestones: 2023 Q2, 2023 Q1 Nov 28, 2022

frascuchon mentioned this issue Nov 28, 2022

prediction agent per prediction in the same log. #1866

Closed

cceyda mentioned this issue Feb 23, 2023

enchancement: annotation_agent should be a list for TokenClassificationRecord #2399

Closed

davidberenstein1957 mentioned this issue Feb 27, 2023

Support for multiple predictions for TokenClassificationRecord #2402

Closed

nataliaElv removed this from the 2023 Q1 milestone Jul 4, 2023

nataliaElv added type: popular request Indicates that several people outside of the Argilla organization are interested in this feature and removed Labeling labels Nov 23, 2023

nataliaElv added the type: enhancement Indicates new feature requests label Nov 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accepting several predictions/annotations for the same record #1630

Accepting several predictions/annotations for the same record #1630

frascuchon commented Jun 27, 2022 •

edited

frascuchon commented Sep 30, 2022

cceyda commented Mar 8, 2023

Accepting several predictions/annotations for the same record #1630

Accepting several predictions/annotations for the same record #1630

Comments

frascuchon commented Jun 27, 2022 • edited

Introduction

Design keys

Backward compatibility

References

frascuchon commented Sep 30, 2022

cceyda commented Mar 8, 2023

frascuchon commented Jun 27, 2022 •

edited