add repr method for Rule, Dataset. #2148

Ankush-Chander · 2023-01-07T16:30:41Z

Description

Please include a summary of the changes and the related issue. Please also include relevant motivation and context. List any dependencies that are required for this change.

Closes #2046

Type of change

Please delete options that are not relevant.

New feature (non-breaking change which adds functionality)

How Has This Been Tested

Please describe the tests that you ran to verify your changes. And ideally reference tests.

import argilla as rg
from argilla.labeling.text_classification.rule import Rule

plz = Rule(query="plz OR please", label="SPAM")
print(repr(plz))
>>> Rule(query='plz OR please', label='SPAM', name='plz OR please')


records = [
        rg.TextClassificationRecord(text="example"),
        rg.TextClassificationRecord(text="another example"),
        rg.TextClassificationRecord(text="another example another example another example another example another example another example"),
    ]
dataset = rg.DatasetForTextClassification(records=records)
print(dataset)
>>>
    	text                          	annotation	prediction
0   	example                       	None      	None      
1   	another example               	None      	None      
2   	another example another exampl	None      	None      
...
3 TextClassificationRecord records

Checklist

I have merged the original branch into my forked branch
I added relevant documentation
follows the style guidelines of this project
I did a self-review of my code
I added comments to my code
I made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works

for more information, see https://pre-commit.ci

codecov · 2023-01-07T16:45:50Z

Codecov Report

Base: 94.30% // Head: 94.34% // Increases project coverage by +0.04% 🎉

Coverage data is based on head (2b6526c) compared to base (3aa0c55).
Patch has no changes to coverable lines.

❗ Current head 2b6526c differs from pull request most recent head 2234032. Consider uploading reports for the commit 2234032 to get more accurate results

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #2148      +/-   ##
===========================================
+ Coverage    94.30%   94.34%   +0.04%     
===========================================
  Files          151      151              
  Lines         7218     7218              
===========================================
+ Hits          6807     6810       +3     
+ Misses         411      408       -3

Flag	Coverage Δ
pytest	`94.34% <ø> (+0.04%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
src/argilla/client/datasets.py	`85.19% <ø> (ø)`
src/argilla/labeling/text_classification/rule.py	`97.05% <ø> (ø)`
...gilla/labeling/text_classification/label_errors.py	`90.36% <0.00%> (+3.61%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

davidberenstein1957

Hi @Ankush-Chander, it looks great:) Could you perhaps also add a __repr__ for the API client, which exposes the active workspace and potentially available workspaces, URL/host and available datasets?

src/argilla/client/datasets.py

davidberenstein1957 · 2023-01-08T07:32:52Z

src/argilla/labeling/text_classification/rule.py

@@ -172,6 +172,14 @@ def __call__(
        else:
            return self._label

+    def __repr__(self):


This looks great:)

davidberenstein1957 · 2023-01-13T12:01:24Z

Also, perhaps the __repr__ of TokenClassificationRecord, TextClassificationRecord, and Text2TexClassificationRecord can be limited a bit by cutting the str of text and the tokens

davidberenstein1957 · 2023-01-30T16:08:22Z

Thanks for the great work @Ankush-Chander 👍

frascuchon

Nice work !! Thanks @Ankush-Chander

@Ankush-Chander

# Changelog All notable changes to this project will be documented in this file. See [standard-version](https://github.com/conventional-changelog/standard-version) for commit guidelines. ## [1.3.0](v1.2.1...v1.3.0) (2023-02-09) ### Features * better log error handling ([#2245](#2245)) ([66e5cce](66e5cce)), closes [#2005](#2005) * Change view mode order in sidebar ([#2215](#2215)) ([dff1ea1](dff1ea1)), closes [#2214](#2214) * **Client:** Expose keywords dataset metrics ([#2290](#2290)) ([a945c5e](a945c5e)), closes [#2135](#2135) * **Client:** relax client constraints for rules management ([#2242](#2242)) ([6e749b7](6e749b7)), closes [#2048](#2048) * Create a multiple contextual help component ([#2255](#2255)) ([a35fae2](a35fae2)), closes [#1926](#1926) * Include record event_timestamp ([#2156](#2156)) ([3992b8f](3992b8f)), closes [#1911](#1911) * updated the `prepare_for_training` methods ([#2225](#2225)) ([e53c201](e53c201)), closes [#2154](#2154) [#2132](#2132) [#2122](#2122) [#2045](#2045) [#1697](#1697) ### Bug Fixes * **Client:** formatting caused offset in prediction ([#2241](#2241)) ([d65db5a](d65db5a)) * **Client:** Log remaining data when shutdown the dataset consumer ([#2269](#2269)) ([d78963e](d78963e)), closes [#2189](#2189) * validate predictions fails on text2text ([#2271](#2271)) ([f68856e](f68856e)), closes [#2252](#2252) ### Visual enhancements * Fine tune menu record card ([#2240](#2240)) ([62148e5](62148e5)), closes [#2224](#2224) * Rely on box-shadow to provide the secondary underline ([#2283](#2283)) ([d786171](d786171)), closes [#2282](#2282) [#2282](#2282) ### Documentation * Add deploy on Spaces buttons ([#2293](#2293)) ([60164a0](60164a0)) * fix typo in documentation ([#2296](#2296)) ([ab8e85e](ab8e85e)) * Improve deployment and quickstart docs and tutorials ([#2201](#2201)) ([075bf94](075bf94)), closes [#2162](#2162) * More spaces! ([#2309](#2309)) ([f02eb60](f02eb60)) * Remove cut-off sentence in docs codeblock ([#2287](#2287)) ([7e87f20](7e87f20)) * Rephrase `to know more` into `to learn more` in Quickstart login page ([#2305](#2305)) ([6082a26](6082a26)) * Replace leftover `rubrix.apikey` with `argilla.apikey` ([#2286](#2286)) ([4871127](4871127)), closes [#2254](#2254) [#2254](#2254) * Simplify token attributions code block ([#2322](#2322)) ([4cb6ae1](4cb6ae1)) * Tutorial buttons ([#2310](#2310)) ([d6e02de](d6e02de)) * Update colab guide ([#2320](#2320)) ([e48a7cc](e48a7cc)) * Update HF Spaces creation image ([#2314](#2314)) ([e4b2a04](e4b2a04)) ## As always, thanks to our amazing contributors! - add repr method for Rule, Dataset. (#2148) by @Ankush-Chander - opensearch docker compose file doesn't run (#2228) by @kayvane1 - Docs: fix typo in documentation (#2296) by @anakin87

Ankush-Chander and others added 2 commits January 7, 2023 21:51

add repr method for Rule, Dataset.

a3f27c6

[pre-commit.ci] auto fixes from pre-commit.com hooks

2b6526c

for more information, see https://pre-commit.ci

davidberenstein1957 reviewed Jan 8, 2023

View reviewed changes

davidberenstein1957 mentioned this pull request Jan 13, 2023

add better support for python client navigation in workspaces and datasets #2195

Closed

davidberenstein1957 mentioned this pull request Jan 16, 2023

too verbose failure logging during failed rg.log() (with too large chunk_size) #2005

Closed

reuse for repr

fa1f742

Ankush-Chander requested a review from davidberenstein1957 January 30, 2023 01:11

move repr/str methods to base class.

2234032

Ankush-Chander requested review from frascuchon and removed request for davidberenstein1957 January 30, 2023 16:20

frascuchon requested a review from davidberenstein1957 January 30, 2023 17:06

frascuchon approved these changes Jan 30, 2023

View reviewed changes

davidberenstein1957 merged commit f52b49c into argilla-io:develop Jan 30, 2023

frascuchon mentioned this pull request Feb 7, 2023

chore: prepare release v1.3.0 #2304

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add repr method for Rule, Dataset. #2148

add repr method for Rule, Dataset. #2148

Ankush-Chander commented Jan 7, 2023 •

edited by frascuchon

codecov bot commented Jan 7, 2023 •

edited

davidberenstein1957 left a comment •

edited

davidberenstein1957 Jan 8, 2023

davidberenstein1957 commented Jan 13, 2023

davidberenstein1957 commented Jan 30, 2023

frascuchon left a comment

add repr method for Rule, Dataset. #2148

add repr method for Rule, Dataset. #2148

Conversation

Ankush-Chander commented Jan 7, 2023 • edited by frascuchon

Description

codecov bot commented Jan 7, 2023 • edited

Codecov Report

davidberenstein1957 left a comment • edited

Choose a reason for hiding this comment

davidberenstein1957 Jan 8, 2023

Choose a reason for hiding this comment

davidberenstein1957 commented Jan 13, 2023

davidberenstein1957 commented Jan 30, 2023

frascuchon left a comment

Choose a reason for hiding this comment

Ankush-Chander commented Jan 7, 2023 •

edited by frascuchon

codecov bot commented Jan 7, 2023 •

edited

davidberenstein1957 left a comment •

edited