540 Release Candidate #14247

Merged: 37 commits into master on Jul 1, 2024
@maziyarpanahi self-assigned this on May 14, 2024
@maziyarpanahi added the labels enhancement, documentation, bug-fix, new-feature (Introducing a new feature), new model, dependencies (Pull requests that update a dependency file), and DON'T MERGE (Do not merge this PR) on May 14, 2024
DevinTDHa and others added 19 commits May 14, 2024 11:04
… ONNX based annotators (#14236)

* [SPARKNLP-1011] Adding changes to transfer ONNX files on executors through Spark files feature

* [SPARKNLP-1011] Adding missing copyright comment

* [SPARKNLP-1011] Adding changes to add prefix for models with onnx_data file

* [SPARKNLP-1037] Adding changes to transfer ONNX files on executors via addFile

* [SPARKNLP-1037] Adding unique suffix to avoid duplication in spark files
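
The last two items describe shipping ONNX files to executors through Spark's file distribution and giving each file a unique suffix so that two models sharing a base file name do not collide. The sketch below illustrates only the naming idea. `unique_spark_file_name` is a hypothetical helper, and the suffix format is an assumption; the PR text does not show the scheme Spark NLP actually uses.

```python
import uuid
from pathlib import PurePath
from typing import Optional


def unique_spark_file_name(file_name: str, suffix: Optional[str] = None) -> str:
    """Return file_name with a unique tag inserted before its extension.

    Hypothetical helper for illustration; not Spark NLP's actual naming scheme.
    """
    path = PurePath(file_name)
    # Use the caller's suffix if given, otherwise a short random hex tag.
    tag = suffix if suffix is not None else uuid.uuid4().hex[:8]
    return f"{path.stem}_{tag}{path.suffix}"


if __name__ == "__main__":
    # Two calls for the same base name yield different distributed names.
    print(unique_spark_file_name("model.onnx"))
    print(unique_spark_file_name("model.onnx"))
```

In PySpark, a file saved under such a name could then be registered with `SparkContext.addFile(path)` on the driver and located on each executor with `SparkFiles.get(name)`.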
* Use OpenVINO model engine for BertEmbeddings

Add OpenVINO model engine wrapper

Add default buffer size for reading weights file

Read OpenVINO IR format models

* Use Long Tensors with XlmRoberta

* Add OpenVINO support for RoBerta and XlmRoBerta embeddings

* Fix data type and formatting

* Add OpenVINO BERT test

- Resolve merge conflicts

- Add test for openvino load model

* Update Python APIs to use OpenVINO

* Add param to enable OpenVINO through Python API
* Formatting changes

* Add OpenVINO support for E5 Embeddings

* Enable OpenVINO backend for E5 Embeddings
* Update Python APIs

* Resolve merge issues

* Add OpenVINO support for T5

* Read and write encoder-decoder models with OpenVINO

* OpenVINO Async Inference

* Refactor and cleanup

* Update comments

* Add config to set OpenVINO inference device

* Add OpenVINO support for BERT Sentence Embeddings

* Formatting

* OpenVINO synchronous inference

* Refactoring: OV Model Conversion

* BertSentenceEmbeddings Python API

* Enable OpenVINO support for Llama2

* Read/write Llama2 Transformer with OpenVINO

* Bugfix: Update saved model filename

* Replace broadcast with addFile for OpenVINO-based annotators

* Add OpenVINO Wrapper tests

* Add suffix to avoid duplication in Spark Files

* Set default OV inference device to CPU

* Use CPU by default for OpenVINO inference due to error loading device config in cluster envs

* Bugfix: Read serialized model from folder
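
The commits above add a setting for the OpenVINO inference device and then default it to CPU because device configuration failed to load in cluster environments. A minimal sketch of that fallback pattern follows. The config key `openvino.device` and the function name are assumptions made for illustration, not names taken from Spark NLP.

```python
DEFAULT_DEVICE = "CPU"


def resolve_inference_device(conf: dict, key: str = "openvino.device") -> str:
    """Pick the inference device from a config mapping, defaulting to CPU.

    The key name is hypothetical; only the fallback behaviour is the point.
    """
    value = conf.get(key)
    # Missing or blank values fall back to the safe default.
    if value is None or not str(value).strip():
        return DEFAULT_DEVICE
    # OpenVINO device names such as CPU, GPU and AUTO are upper case.
    return str(value).strip().upper()


if __name__ == "__main__":
    print(resolve_inference_device({}))
    print(resolve_inference_device({"openvino.device": "gpu"}))
```

Defaulting to CPU keeps inference working on executors where no other device can be resolved, at the cost of requiring an explicit opt-in for accelerators.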
* adding code for spp files and updating notebooks

* fixed code and notebook for xlnet embeddings

* fixing more notebooks

* adding notebook for bert sentence embeddings
…k nlp (#14238)

* Updating file version in XlmRobertaForZeroShotClassification

* updating transformer versions
* Delete examples/demos/streamlit/CLIPForZeroShotClassification/inputs directory

* Add files via upload

* Update streamlit_app.py (39 consecutive entries with this message)

* Delete examples/demos/streamlit/run_demos.ipynb

* Add files via upload
* implementing mpnet for tokenclassification

* changing default pretrained model name

---------

Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>
* added mistral

* Mistral python API
…4319)

* Phi2 scala api

* Phi2 python api

* Phi2 python and scala tests

* Phi2 python and scala tests

* added M2M100 openvino implementation

* added phi2 openvino implementation

* added openvino flag to python

---------

Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>
prabod and others added 9 commits June 10, 2024 14:49
* fixed LLAMA generation bug

* update params
)

* SPARKNLP-1036: Onnx Example notebooks (#14234)

* SPARKNLP-1036: Fix dev python kernel names

* SPARKNLP-1036: Bump transformers version

* SPARKNLP-1036: Fix Colab buttons

* SPARKNLP-1036: Pin onnx version for compatibility

* SPARKNLP-1036: Upgrade Spark version

* SPARKNLP-1036: Minor Fixes

* SPARKNLP-1036: Clean Metadata

* SPARKNLP-1036: Add/Adjust Documentation

- Note for supported Spark Version of Annotators
- added missing Documentation for BGEEmbeddings

* Fixies (#14307)

* adding fix for broken annotators

---------

Co-authored-by: Devin Ha <33089471+DevinTDHa@users.noreply.github.com>
Co-authored-by: Lev <agsfer@gmail.com>
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>
* SPARKNLP-1036: Onnx Example notebooks (#14234)

* SPARKNLP-1036: Fix dev python kernel names

* SPARKNLP-1036: Bump transformers version

* SPARKNLP-1036: Fix Colab buttons

* SPARKNLP-1036: Pin onnx version for compatibility

* SPARKNLP-1036: Upgrade Spark version

* SPARKNLP-1036: Minor Fixes

* SPARKNLP-1036: Clean Metadata

* SPARKNLP-1036: Add/Adjust Documentation

- Note for supported Spark Version of Annotators
- added missing Documentation for BGEEmbeddings

* Fixies (#14307)

* Fix models link

---------

Co-authored-by: Devin Ha <33089471+DevinTDHa@users.noreply.github.com>
Co-authored-by: Lev <agsfer@gmail.com>
* adding onnx support and average pooling

* Delete examples/python/transformers/HuggingFace_in_Spark_NLP_RoBertaSentenceEmbeddings.ipynb.txt

* adding openvino
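
The commit above mentions average pooling without showing it. As a rough illustration, the sketch below implements the common mask-aware mean pooling used to turn token embeddings into one sentence vector. Treating this as the pooling the commit adds is an assumption, and `average_pool` is a hypothetical name.

```python
from typing import List, Sequence


def average_pool(
    token_embeddings: Sequence[Sequence[float]],
    attention_mask: Sequence[int],
) -> List[float]:
    """Average the token vectors whose attention-mask entry is non-zero."""
    if len(token_embeddings) != len(attention_mask):
        raise ValueError("embeddings and mask must have the same length")
    dim = len(token_embeddings[0]) if token_embeddings else 0
    totals = [0.0] * dim
    count = 0
    for vector, mask in zip(token_embeddings, attention_mask):
        if mask:  # skip padding positions
            count += 1
            for i, value in enumerate(vector):
                totals[i] += value
    if count == 0:
        # No real tokens: return the zero vector rather than dividing by zero.
        return totals
    return [total / count for total in totals]


if __name__ == "__main__":
    # The third token is padding and is ignored.
    print(average_pool([[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]], [1, 1, 0]))
```

Ignoring masked positions matters because padded tokens would otherwise pull the sentence vector toward whatever the model emits for padding.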
* SPARKNLP-1036: Onnx Example notebooks (#14234)

* SPARKNLP-1036: Fix dev python kernel names

* SPARKNLP-1036: Bump transformers version

* SPARKNLP-1036: Fix Colab buttons

* SPARKNLP-1036: Pin onnx version for compatibility

* SPARKNLP-1036: Upgrade Spark version

* SPARKNLP-1036: Minor Fixes

* SPARKNLP-1036: Clean Metadata

* SPARKNLP-1036: Add/Adjust Documentation

- Note for supported Spark Version of Annotators
- added missing Documentation for BGEEmbeddings

* Fixies (#14307)

* uploading UAEEmbeddings notebook

Adding UAEEmbeddings example notebook

---------

Co-authored-by: Devin Ha <33089471+DevinTDHa@users.noreply.github.com>
Co-authored-by: Lev <agsfer@gmail.com>
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>
maziyarpanahi and others added 8 commits June 26, 2024 14:31
* SPARKNLP-1036: Onnx Example notebooks (#14234)

* SPARKNLP-1036: Fix dev python kernel names

* SPARKNLP-1036: Bump transformers version

* SPARKNLP-1036: Fix Colab buttons

* SPARKNLP-1036: Pin onnx version for compatibility

* SPARKNLP-1036: Upgrade Spark version

* SPARKNLP-1036: Minor Fixes

* SPARKNLP-1036: Clean Metadata

* SPARKNLP-1036: Add/Adjust Documentation

- Note for supported Spark Version of Annotators
- added missing Documentation for BGEEmbeddings

* Fixies (#14307)

* refactor OpenAIEmbeddings in Scala

* refactor OpenAIEmbeddings in Python

* add pytest.mark.slow and improve doc

---------

Co-authored-by: Devin Ha <33089471+DevinTDHa@users.noreply.github.com>
Co-authored-by: Lev <agsfer@gmail.com>
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>
- ORT 1.18.1 has not been released yet
@maziyarpanahi maziyarpanahi merged commit 6ac2058 into master Jul 1, 2024
9 checks passed
10 participants