New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rust native tensor conversion #171
Comments
Your transformer library looks pretty amazing, congrats! |
@guillaume-be Oh, that's really nice! I have also ported some stuff from HF transformers: https://github.com/stickeritis/sticker-transformers But I should probably rebase to your implementation since you already support more models ;). So far I have also been Python scripts to convert models to HDF5 (and load the non-finetuned models from HDF5). |
@danieldk very nice work on your transformer port, and the library it integrates into. Happy to provide help wherever I can if you'd like to re-use some portions of this port! |
The solution was to distribute the |
Hello,
I have developed an implementation of transformer-based language models (including BERT, DistilBERT, RoBERTa, GPT, GPT2) and ready-to-use NLP applications such as classification or question answering (https://github.com/guillaume-be/rust-bert). Pretrained models from Hugging Face (https://github.com/huggingface/transformers) are available. In order to use them I created a set of Python utility scripts to download and convert the data (for example https://github.com/guillaume-be/rust-bert/blob/master/utils/download-dependencies_bert.py) following your advice.
An interested user raised an issue asking if it would be possible to avoid using Python all together and perform the download and conversion in Rust (guillaume-be/rust-bert#12).
I checked quickly and believe this would involve opening the Pytorch (pickled) binary files. The Pytorch script for de-serializing is fairly complex - have you evaluated the possibility to open these files directly in Rust - for example using the
serde-pickle
crate?Thank you,
The text was updated successfully, but these errors were encountered: