This project is an implementation of the bitnet-b1.58-2B-4T model from Microsoft for Apple Silicon using Metal.
Above all, this project is a learning exercise: a way for me to understand how an LLM works end to end, and to deepen my understanding of GPU programming with Metal.
This project is still a work in progress. The model produces coherent output, but inference is currently very slow and accuracy is poor.
- M3 chip (8-core CPU, 10-core GPU, 16 GB RAM): ~1.6 s per token
- Optimize performance
- Add a KV cache (store k and v buffers across decode steps instead of recomputing them)
- Add a real tokenizer
- Write a report on this project
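For context on the KV-cache item above: the idea is to keep each layer's key/value projections from previous decode steps so that attention only computes projections for the newest token. A minimal numpy sketch of the concept (all names here are hypothetical, not this project's actual code):

```python
import numpy as np

# Minimal single-head KV cache sketch (hypothetical, illustration only).
# Each decode step appends the new token's k and v rows; attention then
# runs over the full cached sequence instead of recomputing past k/v.
class KVCache:
    def __init__(self, d):
        self.k = np.zeros((0, d), dtype=np.float32)
        self.v = np.zeros((0, d), dtype=np.float32)

    def append(self, k_new, v_new):
        self.k = np.vstack([self.k, k_new])
        self.v = np.vstack([self.v, v_new])

def attend(q, cache):
    # Scaled dot-product attention of one query over all cached keys/values.
    d = q.shape[-1]
    scores = cache.k @ q / np.sqrt(d)
    w = np.exp(scores - scores.max())
    w /= w.sum()
    return w @ cache.v

# Usage: one decode step appends a single (1, d) row, then attends.
cache = KVCache(4)
cache.append(np.ones((1, 4), dtype=np.float32), np.ones((1, 4), dtype=np.float32))
out = attend(np.ones(4, dtype=np.float32), cache)
```

The payoff is that each new token costs one projection plus one attention pass over the cache, rather than reprojecting the entire sequence every step.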
This project uses the bitnet-b1.58-2B-4T model. Microsoft distributes the weights in .safetensors format, so I wrote a Python script to convert them into raw .bin files.
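The exact .bin layout is not documented here; as a hypothetical illustration of the general approach, each tensor can be dumped as raw little-endian bytes and reloaded by a consumer that knows the shape and dtype out-of-band (file names and layout below are assumptions, not this project's actual format):

```python
import numpy as np
import tempfile, os

# Hypothetical example: write one tensor as a raw little-endian .bin file,
# then read it back. The project's real conversion reads .safetensors input
# and its file layout may differ from this sketch.
w = np.arange(6, dtype=np.float32).reshape(2, 3)

out = os.path.join(tempfile.mkdtemp(), "layer0.weight.bin")
w.astype("<f4").tofile(out)  # raw bytes only, no header

# A loader must know shape and dtype from elsewhere (e.g. a manifest file).
w2 = np.fromfile(out, dtype="<f4").reshape(2, 3)
```

Headerless .bin files like this are convenient on the Metal side, since the raw bytes can be copied straight into an `MTLBuffer` without any parsing.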
The converted model is available on my Hugging Face repository.
- Hugging Face model: Dr-joss/bitnet-b1.58-2B-metal-weight
- License: MIT License (Copyright © FIGUEIRAS Jossua)