
Conversation

Cyber-Machine
Contributor

Partially fixes: #867 (Rework model docstrings for progressive disclosure of complexity)

  • Make sure to update any "custom vocabulary" examples to match the model's actual vocabulary type and special token requirements (these vary per model).
  • Test out all docstring snippets!
    Gist of all docstring snippets.
  • Make sure to follow our code style guidelines regarding indentation, etc.

Member

@mattdangerw mattdangerw left a comment


This looks great! I see a few spots to fix, but can do that as I merge this. Thanks

replaced with a random token from the vocabulary. A selected token
will be left as is with probability
`1 - mask_token_rate - random_token_rate`.
Call arguments:
Member


add newline
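
For context, the selection behavior documented in the hunk above can be exercised directly with the masking layer. A minimal sketch, assuming the `keras_nlp.layers.MaskedLMMaskGenerator` API; the vocabulary size, token ids, and rates here are illustrative:

```python
import keras_nlp

# Each token is selected for masking with probability `mask_selection_rate`.
# A selected token becomes the mask token with probability `mask_token_rate`,
# a random vocabulary token with probability `random_token_rate`, and is left
# as is with probability `1 - mask_token_rate - random_token_rate`.
masker = keras_nlp.layers.MaskedLMMaskGenerator(
    vocabulary_size=100,
    mask_selection_rate=0.5,
    mask_token_id=0,
    mask_token_rate=0.8,
    random_token_rate=0.1,
)
outputs = masker([[11, 12, 13, 14, 15]])
```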

left-to-right manner and fills up the buckets until we run
out of budget. It supports an arbitrary number of segments.
Call arguments:
Member


decrease indent
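
For reference, the left-to-right bucket filling described in the hunk above is the "waterfall" truncation strategy. A minimal sketch, assuming the `keras_nlp.layers.MultiSegmentPacker` API; the sequence length, token ids, and special-token values are placeholders:

```python
import keras_nlp

# With `truncate="waterfall"`, the budget left after special tokens is spent
# on the first segment first; later segments absorb any truncation.
packer = keras_nlp.layers.MultiSegmentPacker(
    sequence_length=8,
    start_value=101,  # e.g. the [CLS] id
    end_value=102,    # e.g. the [SEP] id
    truncate="waterfall",
)
token_ids, segment_ids = packer((
    [1, 2, 3, 4, 5],  # kept in full if it fits within the budget
    [6, 7, 8, 9],     # truncated to whatever budget remains
))
```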

# Load the preprocessor from a preset.
preprocessor = keras_nlp.models.DistilBertPreprocessor.from_preset("distil_bert_base_en_uncased")
preprocessor = keras_nlp.models.DistilBertPreprocessor.from_preset(
"distil_bert_base_en_uncased"
)
Member


indent
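
Once the continuation line is indented and the call closed, the split form behaves exactly like the one-liner. A short usage sketch; the input sentence is illustrative:

```python
import keras_nlp

preprocessor = keras_nlp.models.DistilBertPreprocessor.from_preset(
    "distil_bert_base_en_uncased"
)
# Maps a raw string to the dict of dense tensors the backbone expects
# (token ids plus a padding mask).
features = preprocessor("The quick brown fox jumped.")
```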

Example usage:
Raw string inputs and pretrained backbone.
Raw string data.
Member


This still needs some updates to match the new style.
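
For reference, the "raw string inputs and pretrained backbone" pattern the docstring is moving toward looks roughly like this. A sketch assuming the `DistilBertClassifier.from_preset` API; `num_classes`, the features, and the labels are made up for illustration:

```python
import keras_nlp

features = ["The quick brown fox jumped.", "I forgot my homework."]
labels = [0, 1]

# The preset bundles a matching preprocessor, so raw strings can be
# passed to `fit()` directly.
classifier = keras_nlp.models.DistilBertClassifier.from_preset(
    "distil_bert_base_en_uncased",
    num_classes=2,
)
classifier.fit(x=features, y=labels, batch_size=2)
```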

@mattdangerw mattdangerw merged commit 620d86e into keras-team:master Mar 21, 2023
kanpuriyanawab pushed a commit to kanpuriyanawab/keras-nlp that referenced this pull request Mar 26, 2023 (keras-team#881)

* Reworked distil_bert docstrings.

* Fixed Typos.

* Fixed typo in DistilBERT MaskedLM Preprocessor

* Updated distil_bert_classifier.py

* Added DistilBertPreprocessor to docs.

* Formatted using black.

* A few edits

* Another fix

---------

Co-authored-by: Matt Watson <mattdangerw@gmail.com>