Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[community] add more data types support to ipex-llm llm integration #12635

Merged
merged 9 commits into from Apr 8, 2024

Conversation

shane-huang
Copy link
Contributor

  • Description: by default IpexLLM will load the model in int4 format. This PR adds more data types support such as sym_in5, sym_int8, etc. Data formats like NF3, NF4, FP4 and FP8 are only supported on GPU and will be added in future PR.
  • Fixes # (issue): N/A
  • New Package?: No
  • Examples: New example, example README, and updated notebook.
  • Tests: N/A

add more data types support
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@dosubot dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Apr 7, 2024
Copy link
Collaborator

@logan-markewich logan-markewich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good to me -- can you bump the package version of the integration to v0.1.1. so that it gets published?

@shane-huang
Copy link
Contributor Author

This looks good to me -- can you bump the package version of the integration to v0.1.1. so that it gets published?

bumped version and add some more info in README.

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Apr 8, 2024
@logan-markewich logan-markewich merged commit f9ab57c into run-llama:main Apr 8, 2024
8 checks passed
chrisalexiuk-nvidia pushed a commit to chrisalexiuk-nvidia/llama_index that referenced this pull request Apr 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
lgtm This PR has been approved by a maintainer size:L This PR changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants