Enhancements/Fixes to HF Benchmark Runtime #104
Labels
bug
Something isn't working
enhancement
New feature or request
good first issue
Good for newcomers
help wanted
Extra attention is needed
HF Benchmarker is a module within SHARK that enable easy testing of HF models with ONNX, Torch, TF, and SHARK-RT of course. this work is based of SharkBenchmarker for MLIR part and Microsoft Transformer Benchmark.
EDIT: nightly ORT did not fix GPU nor did it fix TF.
Some issues/Enhancements that need fixing
1. Integrate running of TF in HF-Benchmarker.
Has some Runtime issues wrt
RuntimeError: Intra op parallelism cannot be modified after initialization.
andRuntimeError: Visible devices cannot be modified after being initialized
. See https://github.com/microsoft/onnxruntime/issues/ 11751 for more details.2. Fix up HF Benchmark Runtime with GPU
Currently the only supported device is CPU, since we will get OOM with GPU. The problem lies within importing of onnxruntime causes to load 39GB of data into the GPU, this leaves very little space for us to load our model and even run anything.
The text was updated successfully, but these errors were encountered: