support/fix more IQ OPs #6521

Merged
NeoZhangJianyu merged 1 commit into ggml-org:master from NeoZhangJianyu:support_more_IQ_ops
Apr 7, 2024
@NeoZhangJianyu
Contributor

  1. Support/fix more OPs for the following types:
    • GGML_TYPE_IQ4_NL
    • GGML_TYPE_IQ4_XS
    • GGML_TYPE_IQ3_XXS
    • GGML_TYPE_IQ3_S
    • GGML_TYPE_IQ2_XXS
    • GGML_TYPE_IQ2_XS
    • GGML_TYPE_IQ2_S
    • GGML_TYPE_IQ1_S
    • GGML_TYPE_IQ1_M
  2. Fix all failing unit-test cases, except:
    • all MUL_MAT_ID cases, because the interface has changed;
    • MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1]): [MUL_MAT] NMSE = 0.292746658 > 0.000500000 FAIL
    The failing cases listed above will be fixed later.
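
For readers unfamiliar with the `NMSE = ... > ... FAIL` line in the log above, here is a minimal sketch of how such a check could work. It assumes NMSE means the sum of squared differences between a reference result and the result under test, divided by the sum of squared reference values. The helper names `nmse` and `nmse_ok` and the default threshold argument are illustrative, not taken from this PR.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: normalized mean squared error between a reference
// result and the result under test. Assumes both vectors have the same
// length and that the reference is not all zeros.
double nmse(const std::vector<float> & ref, const std::vector<float> & out) {
    double err  = 0.0; // sum of squared differences
    double norm = 0.0; // sum of squared reference values
    for (std::size_t i = 0; i < ref.size(); ++i) {
        const double r = static_cast<double>(ref[i]);
        const double d = r - static_cast<double>(out[i]);
        err  += d * d;
        norm += r * r;
    }
    return err / norm;
}

// Hypothetical pass/fail rule: a case fails when the error exceeds the
// threshold, mirroring the "NMSE = x > 0.0005 FAIL" wording of the log.
bool nmse_ok(const std::vector<float> & ref, const std::vector<float> & out,
             double max_nmse = 5e-4) {
    return nmse(ref, out) <= max_nmse;
}
```

Under this definition, the reported value of 0.292746658 would mean the squared error is roughly 29% of the reference output's energy, far above the 0.0005 limit.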

…, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M
2 participants