
[WIP] Integrating OpenVINO Runtime in Spark NLP #13947

Closed


@rajatkrishna commented Aug 28, 2023

Reopened as #14200


This PR introduces OpenVINO Runtime support in Spark NLP.

Description

This PR enables Spark NLP to leverage the OpenVINO Runtime API for Java to load and run models in various formats, including ONNX, PaddlePaddle, TensorFlow, TensorFlow Lite, and the OpenVINO IR format. OpenVINO also enables performance improvements when running on supported Intel hardware, with up to 40% improvement over TensorFlow in benchmarks with no further tuning. You can also take advantage of the full optimization and quantization capabilities offered by the OpenVINO toolkit by exporting/converting the model to the OpenVINO format using the Model Conversion API.
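
For illustration, below is a minimal Scala sketch of reading and compiling a model with the OpenVINO Java bindings (org.intel.openvino), which is roughly the kind of call flow a runtime wrapper builds on. The class and method names follow the OpenVINO 2.0 Java API; the model path, device string, and the native-library loading helper are placeholders/assumptions, not code taken from this PR.

```scala
// Minimal sketch (not this PR's actual wrapper): read a model with the
// OpenVINO Java bindings and compile it for CPU inference.
import org.intel.openvino.{CompiledModel, Core, InferRequest, Model}

object OpenVinoLoadSketch {
  def main(args: Array[String]): Unit = {
    // Assumption: recent bindings expose a helper to load the native OpenVINO libraries.
    Core.loadNativeLibs()

    val core = new Core()

    // read_model accepts OpenVINO IR (.xml) directly; ONNX/TensorFlow/TFLite/PaddlePaddle
    // files can also be read through the same call on builds with those frontends enabled.
    val model: Model = core.read_model("/path/to/model.xml")

    // Compile for a target device, e.g. "CPU".
    val compiled: CompiledModel = core.compile_model(model, "CPU")

    // An InferRequest is what an annotator would use per batch of inputs.
    val request: InferRequest = compiled.create_infer_request()
    // ... set input tensors and call request.infer() ...
  }
}
```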

The following annotators have been enabled to work with OpenVINO:

Note: To take advantage of this feature, see these instructions to build the OpenVINO JAR (Linux), and these to build Spark NLP. OpenVINO is cross-platform: refer here for Windows build instructions, and here for other platforms.

Motivation and Context

  • Out-of-the-box optimizations and better performance on supported Intel hardware
  • Capable of reading ONNX, PaddlePaddle, TensorFlow and TensorFlow Lite formats directly
  • This work was completed as part of Google Summer of Code 2023

Screenshots (if appropriate):

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • Code improvements with no or little impact
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING page.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

Commits

  • Add OpenVINO model engine wrapper
  • Add default buffer size for reading weights file
  • Read OpenVINO IR format models
  • Remove redundant classes and update comments
  • Resolve merge conflicts
  • Typo fix
  • Add param to enable OpenVINO through Python API
  • Formatting changes
  • Enable OpenVINO backend for E5 Embeddings
  • Update Python APIs
@rajatkrishna deleted the feature/ov-integration branch March 9, 2024 04:04