Hello,
I am attempting to run the BitNet model on a Windows 11 machine with an Intel N6000 CPU, which does not have AVX/AVX2 support. The installation completes, but inference results in a repeating character output (e.g., "GGGGGG...").
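For triage, the symptom can be checked mechanically. The sketch below is only an illustration: the function names and the threshold of 20 are assumptions of mine, not part of BitNet or llama.cpp. It flags generated text that contains a long run of one repeated character.

```python
def longest_run(text: str) -> int:
    """Return the length of the longest run of one repeated character."""
    best = run = 0
    prev = None
    for ch in text:
        # Extend the current run if the character repeats, else start a new run.
        run = run + 1 if ch == prev else 1
        prev = ch
        best = max(best, run)
    return best


def looks_degenerate(text: str, threshold: int = 20) -> bool:
    """Heuristic: treat output as degenerate if any character repeats
    `threshold` or more times in a row (threshold is an arbitrary choice)."""
    return longest_run(text) >= threshold
```

Passing the captured model output to looks_degenerate() would separate the "GGGGGG..." case from ordinary text.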
Key Findings:
- This behavior is reproducible on my Intel N6000 machine.
- I can successfully compile and run the same model on a Raspberry Pi 4 B (an ARM board with no AVX at all), which shows that the model's logic does not fundamentally require AVX. This suggests the bug is specific to the Windows x86 non-AVX build.
Steps to Reproduce:
- On a Windows machine with a non-AVX CPU (e.g., Intel N6000), follow the standard installation instructions.
- During the build process, a compilation error occurs in 3rdparty/llama.cpp/common/common.cpp due to a missing header. Adding #include <chrono> fixes this initial error.
- The project then compiles successfully.
- Running inference with a command like python run_inference.py -m models/BitNet-b1.58-2B-4T/ggml-model-i2_s.gguf -p "Once upon a time" results in a repeating character output.
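The header fix from the steps above can be made by hand. For anyone who wants to script it, here is a minimal sketch. The helper name ensure_include is hypothetical, and placing the directive after the last existing #include is my assumption, not something the project prescribes.

```python
def ensure_include(source: str, header: str = "chrono") -> str:
    """Return `source` with `#include <header>` added after the last
    existing #include line, or unchanged if it is already present."""
    directive = f"#include <{header}>"
    lines = source.splitlines()
    if any(line.strip() == directive for line in lines):
        return source  # already included, nothing to do

    # Find the index of the last #include line (-1 if there is none).
    last = -1
    for i, line in enumerate(lines):
        if line.lstrip().startswith("#include"):
            last = i

    lines.insert(last + 1, directive)
    return "\n".join(lines) + "\n"


# Possible usage (path taken from the report above):
# from pathlib import Path
# path = Path("3rdparty/llama.cpp/common/common.cpp")
# path.write_text(ensure_include(path.read_text()))
```

The function works on text only and is idempotent, so running it twice leaves the file unchanged.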
What I've Tried:
- Compiling with the default settings.
- Forcing a build with -DLLAMA_SSE4_2=ON.
- Forcing a generic build with no flags.
All of these configurations compile successfully but produce the same incorrect inference output. The system_info log confirms that AVX is disabled.
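To compare flags across the three builds, the system_info line can be parsed rather than read by eye. The sketch below assumes the line consists of "NAME = 0/1" pairs separated by "|". The sample string is made up for illustration and is not copied from my log, and parse_system_info is a hypothetical helper name.

```python
import re


def parse_system_info(line: str) -> dict[str, int]:
    """Extract every `NAME = <integer>` pair from a system_info-style line."""
    return {name: int(value)
            for name, value in re.findall(r"(\w+)\s*=\s*(\d+)", line)}


# Illustrative sample only; the exact field list depends on the build.
sample = "system_info: n_threads = 4 / 4 | AVX = 0 | AVX2 = 0 | FMA = 0 | SSE3 = 1 | SSSE3 = 1 |"
info = parse_system_info(sample)

# List the instruction-set flags the binary reports as enabled.
enabled = sorted(k for k, v in info.items() if v == 1 and k != "n_threads")
print("enabled:", enabled)
```

Running this on the log from each configuration would show whether the build flags actually change what the binary reports.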
This seems to be a bug in the x86 fallback code path when compiled with the Windows toolchain.