llama-70b-chat-4-shards

This repository contains a script that converts the Llama 2 70B (chat) weights from their original 8-shard layout to a 4-shard layout, so the model can be loaded on machines with only 4 GPUs.

For convenience, you can download pre-converted 4-shard weights directly from https://huggingface.co/Jinawei/llama-v2-70b-chat-4-shards
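
If you prefer to script the download, the snippet below is a minimal sketch using the huggingface_hub library; the repo id is the one linked above, and the local directory name is just an example:

from huggingface_hub import snapshot_download

# Download the pre-converted 4-shard checkpoint from the Hugging Face Hub.
# local_dir is an arbitrary destination; adjust as needed.
snapshot_download(
    repo_id="Jinawei/llama-v2-70b-chat-4-shards",
    local_dir="llama-v2-70b-chat-4-shards",
)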

Introduction

Meta released the weights for Llama 2 70B distributed across 8 shards, one per model-parallel GPU. Users whose machines have only 4 GPUs therefore cannot load the checkpoint directly. This repository solves that problem with a script that re-shards the 8-shard Llama 2 70B (chat) weights into a 4-shard configuration, making the model usable on machines with fewer GPUs.
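
At its core, re-sharding means merging neighboring model-parallel shards: tensors that were split across GPUs are concatenated back together along their parallel dimension, while replicated tensors are kept from a single shard. The sketch below illustrates this idea, assuming PyTorch checkpoint dicts and a caller-supplied shard_dim_for function (hypothetical, not part of this repo) that maps each weight name to its sharding dimension, or None if replicated; the actual convert.py may differ in detail.

import torch

def merge_pair(shard_a, shard_b, shard_dim_for):
    """Merge two neighboring 8-way shards into one 4-way shard."""
    merged = {}
    for name, tensor_a in shard_a.items():
        dim = shard_dim_for(name)
        if dim is None:
            # Replicated tensors (e.g. norm weights) are identical
            # across shards, so either copy works.
            merged[name] = tensor_a
        else:
            # Model-parallel tensors are concatenated along the dimension
            # they were split on (0 for column-parallel, 1 for row-parallel).
            merged[name] = torch.cat([tensor_a, shard_b[name]], dim=dim)
    return merged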

Usage

python convert.py \
    --input_llama_path ~/llama-2-70b-chat \
    --input_shards 8 \
    --output_llama_path ~/llama-2-70b-chat-4-shards \
    --output_shards 4
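
After conversion, the 4-shard checkpoint can be launched with one process per shard. For example, assuming Meta's reference example_chat_completion.py from the llama repository, an invocation like the following should work (paths are illustrative):

torchrun --nproc_per_node 4 example_chat_completion.py \
    --ckpt_dir ~/llama-2-70b-chat-4-shards \
    --tokenizer_path ~/llama-2-70b-chat/tokenizer.model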

Show Your Support

If this script proves useful in your work or projects, please consider giving it a star on GitHub. Your support makes the project more visible, encourages future development, and is greatly appreciated.

Acknowledgements

  • Thanks to Meta for releasing the Llama v2 70B weights.

Contact

For any inquiries or to report issues, please open an issue on this repository.
