
[Feature] Crash: Support old MPT GGUF conversions with duplicated output tensor #2329

Open
dlippold opened this issue May 9, 2024 · 2 comments
Labels
chat (gpt4all-chat issues), enhancement (New feature or request)

Comments


dlippold commented May 9, 2024

Bug Report

The fine-tuned MPT model from https://huggingface.co/maddes8cht/mosaicml-mpt-7b-instruct-gguf/ in quantization Q4_1 was usable in release 2.7.2 but no longer works in 2.7.3 and later. In particular, it is not usable in the current release.

When I try to load the model file I get the following error message:

Could not load model due to invalid model file for mosaicml-mpt-7b-instruct-Q4_1.gguf

The problem may be related to #2006.

Steps to Reproduce

  1. Download the model file from the specified URL
  2. Start GPT4all
  3. Choose the downloaded model file

Expected Behavior

The model file should be loaded.

Your Environment

  • GPT4All version: 2.7.2, 2.7.3, 2.7.5
  • Operating System: Ubuntu Linux 22.04
  • Chat model used (if applicable): see above
dlippold added the bug-unconfirmed and chat (gpt4all-chat issues) labels on May 9, 2024
cebtenzzre (Member) commented

I fixed this upstream in ggerganov/llama.cpp#6139 which should make it into the next release of GPT4All (already included in #2310).
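For context, a minimal sketch (not the actual llama.cpp code) of the tolerant-loading idea: these old MPT conversions list the output tensor twice, and a loader can accept such files by keeping only the first occurrence of each tensor name instead of rejecting the file. The tensor names below are illustrative, not taken from the model file.

```python
# Sketch: tolerate a duplicated tensor entry by keeping the first occurrence
# instead of failing the load. Names are illustrative examples only.
def dedupe_tensor_names(names):
    seen = set()
    kept = []
    for name in names:
        if name in seen:
            continue  # skip the duplicate entry rather than aborting
        seen.add(name)
        kept.append(name)
    return kept

# Old MPT GGUF conversions wrote the output tensor twice (illustrative):
names = [
    "token_embd.weight",
    "blk.0.attn_qkv.weight",
    "output.weight",
    "output.weight",
]
print(dedupe_tensor_names(names))
# → ['token_embd.weight', 'blk.0.attn_qkv.weight', 'output.weight']
```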

cebtenzzre added the enhancement (New feature or request) label and removed the bug-unconfirmed label on May 9, 2024
cebtenzzre changed the title from "Certain version of MPT GGUF model not usable anymore" to "[Feature] Support old MPT GGUF conversions with duplicated output tensor" on May 9, 2024
dlippold (Author) commented

Version 2.8.0 crashes when loading the model named above.

dlippold changed the title from "[Feature] Support old MPT GGUF conversions with duplicated output tensor" to "[Feature] Crash: Support old MPT GGUF conversions with duplicated output tensor" on Jun 29, 2024