-
Notifications
You must be signed in to change notification settings - Fork 833
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reproducing Benchmark Results #21
Comments
It turns out the environment used for this setting change into |
Yeah, I was using the environment variable Line 16 in cf385ca
Line 24 in cf385ca
(Note that there is no environment variable that affects thread counts used in tiktoken proper / tiktoken does not use rayon) |
Please let me know if you have any further difficulty reproducing benchmark numbers! |
Hi,
I want to reproduce the result in presented in
README.md
, to the extent my hardware would allow. I am aware of thescripts/benchmark.py
file and could runtiktoken
with different number of threads. But when it comes to setting number of thread for huggingface tokenizers, I could not set it. I tried using environment variableRAYON_RS_NUM_CPUS
, but the number of threads did not change.Any help is appreciated!
The text was updated successfully, but these errors were encountered: