Describe how more tokenizers can be added #11

Open
MarkusSagen opened this issue Mar 12, 2022 · 0 comments
Assignees: MarkusSagen
Labels: documentation (Improvements or additions to documentation)

Comments

@MarkusSagen
Collaborator

To make it easier to understand how new tokenizers can be added, the documentation should describe the process in more detail. This will also help clarify how to approach #2.

@MarkusSagen MarkusSagen added the documentation Improvements or additions to documentation label Mar 12, 2022
@MarkusSagen MarkusSagen self-assigned this Mar 12, 2022
MarkusSagen added a commit that referenced this issue Mar 12, 2022
* 🐛 Save the tokenizers downloaded to script

* 🎨 Apply formatting and standards

* 🐛 Support more BERT tokenizers #5 #13

* 🔥 Cleanup

* Test AST parsing

* 📝 Describe how more tokenizers can be added #11

* 🔖 Support more BERT tokenizers