[fix] Device and format and implementation optimization #1055

KexinFeng · 2023-09-06T19:47:50Z

Add the space char to the output
Add the test script of "huggyllama/llama-7b" for rolling_batch scheduler
Device setting is fixed. The following way will involve model copying, and cause OOM on huggyllama/llama-7b on 5.12xlarge.

        if self.device:
            self.model.to(self.device)

Efficiency optimization by avoiding list.pop(idx)

engines/python/setup/djl_python/rolling_batch/scheduler_rolling_batch.py

.gitignore

engines/python/setup/djl_python/rolling_batch/rolling_batch.py

…g_batch.py Co-authored-by: Frank Liu <frankfliu2000@gmail.com>

KexinFeng added 4 commits September 6, 2023 12:31

format

8ea860b

fix_device_add_space

98659f4

format

d5d6277

format

ea3513e

KexinFeng requested review from zachgk, frankfliu and a team as code owners September 6, 2023 19:47

KexinFeng requested a review from lanking520 September 6, 2023 19:48

KexinFeng added 2 commits September 6, 2023 12:54

efficiency_optimization

1e0c470

No module named 'lmi_dist

3280cf3

KexinFeng changed the title ~~[fix] Device and format~~ [fix] Device and format and implementation optimization Sep 6, 2023

frankfliu reviewed Sep 6, 2023

View reviewed changes

engines/python/setup/djl_python/rolling_batch/scheduler_rolling_batch.py Outdated Show resolved Hide resolved

lanking520 reviewed Sep 6, 2023

View reviewed changes

.gitignore Show resolved Hide resolved

.gitignore Show resolved Hide resolved

.gitignore Outdated Show resolved Hide resolved

engines/python/setup/djl_python/rolling_batch/rolling_batch.py Show resolved Hide resolved

KexinFeng and others added 2 commits September 6, 2023 14:04

Update engines/python/setup/djl_python/rolling_batch/scheduler_rollin…

040502d

…g_batch.py Co-authored-by: Frank Liu <frankfliu2000@gmail.com>

format

dc2f0d6

lanking520 approved these changes Sep 8, 2023

View reviewed changes

KexinFeng merged commit ed03dac into deepjavalibrary:master Sep 8, 2023
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fix] Device and format and implementation optimization #1055

[fix] Device and format and implementation optimization #1055

KexinFeng commented Sep 6, 2023 •

edited

Loading

[fix] Device and format and implementation optimization #1055

[fix] Device and format and implementation optimization #1055

Conversation

KexinFeng commented Sep 6, 2023 • edited Loading

KexinFeng commented Sep 6, 2023 •

edited

Loading