Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release 0.3.0 #49

Merged
merged 38 commits into from
May 10, 2022
Merged

Release 0.3.0 #49

merged 38 commits into from
May 10, 2022

Conversation

diegofiori
Copy link
Collaborator

Closes #32, #34, #27. Implement API for quantization and half precision!

@diegofiori diegofiori linked an issue May 10, 2022 that may be closed by this pull request
@diegofiori diegofiori merged commit b2a6a5a into main May 10, 2022
@diegofiori diegofiori deleted the feat/add_half_precision branch May 10, 2022 07:54
kartikeyporwal pushed a commit to kartikeyporwal/nebullvm that referenced this pull request May 11, 2022
* add half precision and transformations logic

* fix bugs

* add support to gpu

* add support to gpu

* fix minor bug in gpu code

* fix minor bug in gpu code

* refactor quantization

* add dataset interface

* fix bug

* fix bugs with dataset api

* fix bug

* update test

* solve minor issues

* fix error in cuda

* fix error with tensorRT

* fix bug in tvm and change name of quantization_ths

* add resources

* Modify readme (nebuly-ai#48)

* Create section - integration with other libraries

* update readme (work-in-progress)

* Minor Readme update

* rename notebook

* update notebook

* rename notebook

* Rename notebook

* Rename notebook

* Rename notebook

* Rename notebook

* Rename notebook

* update version to 0.3.0

* Update readme with latest release information

* Benchmarks

* Update readme

* Update readme with benchmarks

* Update readme, minor changes

* solve api issue with tf

* fix typos in benchmarks

Co-authored-by: morgoth95 <diego.fiori@epfl.ch>
Co-authored-by: Emile Courthoud <59367323+emilecourthoud@users.noreply.github.com>
Co-authored-by: Nebuly <83510798+nebuly-ai@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add optimization from dataset Improve Transformers performance using ONNXRuntime optimization
2 participants