Rust native tensor conversion #171

guillaume-be · 2020-03-08T21:43:42Z

Hello,

I have developed an implementation of transformer-based language models (including BERT, DistilBERT, RoBERTa, GPT, GPT2) and ready-to-use NLP applications such as classification or question answering (https://github.com/guillaume-be/rust-bert). Pretrained models from Hugging Face (https://github.com/huggingface/transformers) are available. In order to use them I created a set of Python utility scripts to download and convert the data (for example https://github.com/guillaume-be/rust-bert/blob/master/utils/download-dependencies_bert.py) following your advice.

An interested user raised an issue asking if it would be possible to avoid using Python all together and perform the download and conversion in Rust (guillaume-be/rust-bert#12).

I checked quickly and believe this would involve opening the Pytorch (pickled) binary files. The Pytorch script for de-serializing is fairly complex - have you evaluated the possibility to open these files directly in Rust - for example using the serde-pickle crate?

Thank you,

The text was updated successfully, but these errors were encountered:

LaurentMazare · 2020-03-08T21:53:27Z

Your transformer library looks pretty amazing, congrats!
As for the weight files, I see that you're converting from .npz to .ot which seems like the easy way to do it. Did you consider actually distributing the .ot files rather than user having to run the conversion ? That's what I did for the various vision models as it avoids end user needing to have a working pytorch install.
Besides this I don't have much experience with serde-pickle, you would have to check that it works properly on numpy array which may actually be a tricky bit.

danieldk · 2020-04-16T08:51:07Z

@guillaume-be Oh, that's really nice! I have also ported some stuff from HF transformers:

https://github.com/stickeritis/sticker-transformers

But I should probably rebase to your implementation since you already support more models ;). So far I have also been Python scripts to convert models to HDF5 (and load the non-finetuned models from HDF5).

guillaume-be · 2020-04-16T18:27:52Z

@danieldk very nice work on your transformer port, and the library it integrates into. Happy to provide help wherever I can if you'd like to re-use some portions of this port!

guillaume-be · 2020-05-03T12:36:21Z

The solution was to distribute the .ot file for direct use by the users. Thank you for your help!

guillaume-be mentioned this issue Mar 8, 2020

replacing the python utils with rust code? guillaume-be/rust-bert#12

Closed

guillaume-be closed this as completed May 3, 2020

failable mentioned this issue Mar 6, 2021

Failed to run cargo build guillaume-be/rust-bert#127

Closed

karelnagel mentioned this issue Mar 27, 2023

Partially convert pth to ggml rustformers/llm#83

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rust native tensor conversion #171

Rust native tensor conversion #171

guillaume-be commented Mar 8, 2020

LaurentMazare commented Mar 8, 2020

danieldk commented Apr 16, 2020 •

edited

guillaume-be commented Apr 16, 2020

guillaume-be commented May 3, 2020

Rust native tensor conversion #171

Rust native tensor conversion #171

Comments

guillaume-be commented Mar 8, 2020

LaurentMazare commented Mar 8, 2020

danieldk commented Apr 16, 2020 • edited

guillaume-be commented Apr 16, 2020

guillaume-be commented May 3, 2020

danieldk commented Apr 16, 2020 •

edited