
Add memory-efficient merging script #608

Merged: 10 commits merged into main from low_mem_merge on Jun 16, 2023
Conversation

@airaria (Contributor) commented on Jun 16, 2023

Description

This PR adds a memory-efficient merging script (merge_llama_with_chinese_lora_low_mem.py) and related notebooks.
Compared to the original script, the new script significantly reduces memory usage when merging LoRA weights into the LLaMA model.
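
For context, the low-memory approach boils down to processing the checkpoint shard by shard: load one shard, fold the LoRA delta W' = W + scaling * (B @ A) into the affected weights, write the shard back out, and free it before loading the next. A minimal sketch of that idea (not the PR's actual code; the tensor naming and the `lora_scaling` argument are illustrative assumptions):

```python
import gc
import torch

def merge_shard_low_mem(shard_path, lora_a, lora_b, lora_scaling, out_path):
    """Fold LoRA weights into a single checkpoint shard on CPU,
    so the full model never has to fit in memory at once (sketch)."""
    shard = torch.load(shard_path, map_location="cpu")
    for name, weight in shard.items():
        if name in lora_a:  # only layers that have a LoRA adapter
            # W' = W + scaling * (B @ A), computed in fp32 for accuracy
            delta = (lora_b[name].float() @ lora_a[name].float()) * lora_scaling
            shard[name] = (weight.float() + delta).to(weight.dtype)
    torch.save(shard, out_path)
    del shard
    gc.collect()  # release the shard before the next one is loaded
```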

Estimated max memory usage when merging the Alpaca-LoRA weights into LLaMA with the new script:

| Model | Max Memory Usage |
|-------|------------------|
| 7B    | 15 GB            |
| 13B   | 18 GB            |
| 33B   | 22 GB            |

@ymcui requested a review from iMountTai on June 16, 2023 at 01:27

Review comment: the error message can easily cause confusion and could be removed.
@ymcui (Owner) commented on Jun 16, 2023

GPT-4 with Code Interpreter made some suggestions.
Please make any changes that seem valid and appropriate.


Here are some suggestions on how to improve the code:

  1. Avoid Hardcoding Values: The model parameters are hardcoded in `params_of_models`, which makes the script less flexible. Consider loading these parameters from a configuration file or letting the user pass them as arguments (see the config-loading sketch at the end of this comment).
  2. Modularize the Code: The code could be more modular. It's generally a good idea to separate logic into functions, which makes the code easier to read, understand, test, and maintain. For instance, you could move the loading of models and tokenizers into its own function.
  3. Error Handling: While the script contains some error-handling logic, more comprehensive handling would improve robustness. For example, what if the base model doesn't exist even after trying to download it? Or what if the LoRA models cannot be loaded for some reason?
  4. Documentation: Consider adding more comments to explain what different parts of the code are doing, especially for complex operations. This is particularly important in the `save_shards` and `merge_shards` functions, which perform non-trivial manipulations of the model weights.
  5. Naming Conventions: Some variable names, such as `tandl` and `ti`, are not clear or descriptive. Choose descriptive names to improve readability.
  6. Code Duplication: There are several places where the same operations are performed multiple times. For instance, the split operation in the `save_shards` function is called repeatedly with different parameters; a helper function would avoid this duplication.
  7. Use Context Managers for File Operations: When handling files, it's good practice to use a context manager (the `with` statement) to ensure resources are properly released. For example, in the last part of the code, where JSON files are loaded and saved, open them with `with` (see the sketch right after this list).
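
For suggestion 7, a minimal before/after sketch (the file name is illustrative):

```python
import json

# Fragile: the handle leaks if json.load raises before close() runs.
#   f = open("config.json"); config = json.load(f); f.close()

# Robust: the context manager closes the file even on error.
with open("config.json", "r", encoding="utf-8") as f:
    config = json.load(f)

with open("config.json", "w", encoding="utf-8") as f:
    json.dump(config, f, indent=2)
```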

Overall, the code seems to be doing a complex task (merging LoRA models with a base model) and appears to be doing it correctly, but these improvements could make the code cleaner, more efficient, and easier to maintain.
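
As a concrete example of suggestion 1, the hardcoded `params_of_models` dict could be replaced by a JSON file selected on the command line. This is only a sketch; the `model_params.json` name and its field layout are assumptions, not the script's actual schema:

```python
import argparse
import json

parser = argparse.ArgumentParser()
parser.add_argument("--model_size", choices=["7B", "13B", "33B"], required=True)
parser.add_argument("--params_file", default="model_params.json",
                    help="JSON mapping model size to architecture parameters")
args = parser.parse_args()

# Example model_params.json:
# {"7B": {"dim": 4096, "n_layers": 32, "n_heads": 32}, ...}
with open(args.params_file, "r", encoding="utf-8") as f:
    params = json.load(f)[args.model_size]
```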

@ymcui requested a review from iMountTai on June 16, 2023 at 02:43
@ymcui merged commit 0c7c092 into main on Jun 16, 2023
@ymcui deleted the low_mem_merge branch on June 20, 2023 at 05:36