Skip to content

Conversation

abuelnasr0
Copy link
Contributor

Copy link
Contributor

@chenmoneygithub chenmoneygithub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR!

Overall LGTM! Left some minor comments on the style.

[" afternoon sun", " night moon"]
preprocessor("The quick b rown fox jumped.")
# Tokenize a batch of single sentences .
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: remove space between sentences and period.

preprocessor(["The quick brown fox jumped.", "Call me Ishmael."])
# Preprocess a batch of sentence pairs.
# When handling multiple sequences, always convert to tensors first !
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same, remove the extra space

it should be the file path to merge rules. The merge rule file
should have one merge rule per line. Every merge rule contains
merge entities separated by a space.
vocabulary: A dictionary or a string filename path. If passing a
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested edit:

A dictionary mapping tokens to integer ids, or file path to a json file containing the token to id mapping. 

vocabulary: A dictionary or a string filename path. If passing a
filename, the file should be a json file. both json and dictionary
should map a single word piece token string to an integer id.
merges: A list of merge rules or a string filename path, If passing a
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: filename path => file path, and use "." to replace ","

Copy link
Contributor

@chenmoneygithub chenmoneygithub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, LGTM!

One minor thing - in the future after you fix a comment, you may click on "resolve" button so that we know it's been addressed.

@chenmoneygithub chenmoneygithub merged commit 93d85e9 into keras-team:master Mar 30, 2023
@abuelnasr0 abuelnasr0 deleted the Roberta-docstring-reworking branch December 28, 2023 17:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants