issues Search Results · repo:intel/neural-compressor language:Python

215 results

Hi, I am trying to reproduce some of the examples, but it looks like they are outdated. I am not able to load: from neural_compressor.experimental import Quantization, common which appears in multiple ...
  • evgeni-lh
  • 1
  • Opened on Apr 28
  • #2193

Hi, I wonder whether Neural Compressor supports visual language models that accept images and text as inputs?
  • billamiable
  • Opened on Feb 27
  • #2131

As examples/3.x_api/pytorch/nlp/huggingface_models/language-modeling/quantization/mx_quant/Readme suggests, I run python run_clm_no_trainer.py --model ./Qwen2-1.5B-Instruct --quantize --accuracy --tasks ...
  • wjxh950210
  • 2
  • Opened on Feb 12
  • #2120

Is it compatible with FlexAttention from PyTorch 2.6.0?
  • johnnynunez
  • 1
  • Opened on Feb 10
  • #2117

When quantizing Mx fp, the quantization scales of subnormal and normal values should be different. Why does L394 clip to min_exp? I understand that it should clip to 1. Looking forward to your reply ...
  • Jzz24
  • 1
  • Opened on Jan 2
  • #2106

If I define a parameter with the same name as layer_scale in the pytorch nn.Module, as shown in the following code, a ValueError occurs. class ConvEncoder(nn.Module): Implementation of ConvEncoder ...
  • nalnez13
  • 1
  • Opened on Dec 30, 2024
  • #2103

Hi! I tried to prune my model (mistralai/Mistral-7B-v0.1) with the following config: pruning_config = WeightPruningConfig( pruning_type="snip_momentum_progressive", start_step=0, ...
  • tatiana-iazykova
  • 1
  • Opened on Dec 16, 2024
  • #2093

I am encountering an issue while running static PTQ on a PyTorch model. The process uses PyTorch metrics such as ["Accuracy", "F1"] for benchmarking. The workflow is executed within the VS Code Jupyter extension. ...
  • DGP1607
  • Opened on Dec 9, 2024
  • #2087

Aims to clarify the HW deployment capabilities of Neural Compressor optimized models. - Specifically, I'd like to know if these models are optimized for specific architectures such as x86 or if they ...
  • DGP1607
  • Opened on Dec 9, 2024
  • #2086

OS: Ubuntu Hardware: CPU Intel(R) Xeon(R) Platinum 8468V I have installed the required dependencies listed in the GitHub repository using the latest versions, as specific versions were not specified ...
  • Shivani-k16
  • Opened on Nov 5, 2024
  • #2056