Skip to content

Conversation

Ankur-singh
Copy link
Contributor

Working implementation of int4/8 LLM inference using ITREX

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @Ankur-singh, may I ask, who initially requested this sample? There is no such tool as 'Intel® Extension for Transformers' in AI Tools Selector https://www.intel.com/content/www/us/en/developer/tools/oneapi/ai-tools-selector.html , there is no plan to add such a tool for 2024.2

@jimmytwei jimmytwei merged commit eea608b into oneapi-src:development Jun 21, 2024
@Ankur-singh Ankur-singh deleted the quant_itrex branch March 21, 2025 19:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants