mattdangerw
Member

I think this will overall be a nice simplification for maintenance. Push whatever logic we can down onto the base preprocessing classes. Saves a lot of code. To assist with this, I am adding a `special_tokens` property to tokenizers, which I think will be useful anyway.
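
A rough sketch of the idea, not the actual diff: every class name below (`SketchTokenizer`, `SketchPreprocessor`, `SketchBertLikePreprocessor`), the `token_to_id` helper, and the whitespace tokenization are hypothetical stand-ins. It only illustrates how a `special_tokens` property might let a base preprocessing class own the shared start/end-token logic, so a model-specific subclass is reduced to declaring which tokens it uses.

```python
class SketchTokenizer:
    """Hypothetical tokenizer used only for illustration."""

    def __init__(self, vocabulary, special_tokens):
        self.vocabulary = dict(vocabulary)
        self._special_tokens = list(special_tokens)
        # Fail early if a declared special token has no id.
        missing = [t for t in self._special_tokens if t not in self.vocabulary]
        if missing:
            raise ValueError(f"Special tokens missing from vocabulary: {missing}")

    @property
    def special_tokens(self):
        # Return a copy so callers cannot mutate the tokenizer's state.
        return list(self._special_tokens)

    def token_to_id(self, token):
        return self.vocabulary[token]

    def tokenize(self, text):
        # Toy whitespace tokenization; assumes "[UNK]" is in the vocabulary.
        unk_id = self.vocabulary["[UNK]"]
        return [self.vocabulary.get(word, unk_id) for word in text.split()]


class SketchPreprocessor:
    """Hypothetical base class that owns the shared packing logic."""

    start_token = None  # subclasses override these class attributes
    end_token = None

    def __init__(self, tokenizer):
        self.tokenizer = tokenizer
        # The base class validates against the tokenizer's special tokens,
        # so subclasses do not need to repeat this check.
        for token in (self.start_token, self.end_token):
            if token is not None and token not in tokenizer.special_tokens:
                raise ValueError(f"{token!r} is not a special token")

    def __call__(self, text):
        ids = self.tokenizer.tokenize(text)
        if self.start_token is not None:
            ids = [self.tokenizer.token_to_id(self.start_token)] + ids
        if self.end_token is not None:
            ids = ids + [self.tokenizer.token_to_id(self.end_token)]
        return ids


class SketchBertLikePreprocessor(SketchPreprocessor):
    """Model-specific subclass: only declares its tokens."""

    start_token = "[CLS]"
    end_token = "[SEP]"


vocab = {"[CLS]": 0, "[SEP]": 1, "[UNK]": 2, "hello": 3, "world": 4}
tokenizer = SketchTokenizer(vocab, ["[CLS]", "[SEP]", "[UNK]"])
preprocessor = SketchBertLikePreprocessor(tokenizer)
print(preprocessor("hello world"))
```

With this shape, the subclass carries no logic of its own, which is the kind of code saving described above.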

@mattdangerw
Member Author

Probably still some test breakages to work through, not mailing this out quite yet.

@mattdangerw force-pushed the preprocessing-simplify branch 2 times, most recently from ce944da to 9250f79 (September 4, 2024 00:05)
@mattdangerw force-pushed the preprocessing-simplify branch from 9250f79 to ea5e96c (September 4, 2024 01:29)
@mattdangerw
Member Author

Ok! Passing besides the nightly failure (which is unrelated). Mailing out.

@mattdangerw
Member Author

The nightly breakage is unrelated btw.

Collaborator

@divyashreepathihalli divyashreepathihalli left a comment

LGTM!

Member

@SamanehSaadat SamanehSaadat left a comment

Thanks, Matt! It's really nice to move all this common logic to the base classes!
Just left some nit comments!

@mattdangerw
Member Author

Thanks for the review! Will pull this in once tests pass.

@mattdangerw added the kokoro:force-run (Runs Tests on GPU) label Sep 4, 2024
@mattdangerw merged commit 9707bb2 into keras-team:master Sep 5, 2024
9 of 10 checks passed
3 participants