[Fix] Reduce convert memory usage. #297

marvin-Yu · 2024-04-07T09:22:54Z

The current Qwen-72B model conversion process consumes approximately 282GB of memory, which far exceeds the configuration of the machines currently used by the client. This PR modify the conversion method to reduce memory usage to around 20~30GB.

[Fix] Reduce convert memory usage.

7ab42b5

marvin-Yu requested a review from pujiang2018 April 7, 2024 09:23

pujiang2018 approved these changes Apr 7, 2024

View reviewed changes

marvin-Yu merged commit c53209d into main Apr 7, 2024
1 check passed

marvin-Yu deleted the fix/reduce_convert_memory branch April 7, 2024 09:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix] Reduce convert memory usage. #297

[Fix] Reduce convert memory usage. #297

marvin-Yu commented Apr 7, 2024

[Fix] Reduce convert memory usage. #297

[Fix] Reduce convert memory usage. #297

Conversation

marvin-Yu commented Apr 7, 2024