
FEAT: concurrent generation #417

Merged 4 commits into xorbitsai:main on Sep 13, 2023

Conversation

@codingl2k1 (Contributor) commented Sep 1, 2023

Ten concurrent generate calls on a single torch model yield a ~20% performance boost (tested on an M2 MacBook).
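
A minimal sketch of the kind of measurement described above, assuming a hypothetical `model` handle whose `generate(prompt)` method can safely be called from several threads; the helper name and parameters are illustrative, not taken from this PR:

```python
import time
from concurrent.futures import ThreadPoolExecutor

def benchmark(model, prompt: str, n_requests: int, n_workers: int) -> float:
    """Issue n_requests generate calls using n_workers threads; return elapsed seconds."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        futures = [pool.submit(model.generate, prompt) for _ in range(n_requests)]
        for f in futures:
            f.result()  # block until the call finishes; re-raises any worker exception
    return time.perf_counter() - start

# serial = benchmark(model, "Hello", n_requests=10, n_workers=1)
# concurrent = benchmark(model, "Hello", n_requests=10, n_workers=10)
# print(f"speedup: {serial / concurrent:.2f}x")
```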

@XprobeBot XprobeBot added this to the v0.2.1 milestone Sep 1, 2023
@aresnow1 (Contributor) commented Sep 1, 2023

We need to add tests to check if it works well for GGML models.
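
A hedged sketch of such a test, assuming a pytest fixture `ggml_model` that exposes a `generate(prompt)` method; the fixture name and API are illustrative, not taken from this PR:

```python
from concurrent.futures import ThreadPoolExecutor

def test_concurrent_generate_ggml(ggml_model):
    # Fire ten generate requests at the same GGML model concurrently.
    prompts = [f"Count to {i}." for i in range(10)]
    with ThreadPoolExecutor(max_workers=10) as pool:
        results = list(pool.map(ggml_model.generate, prompts))
    # Every request should complete and return non-empty output.
    assert len(results) == len(prompts)
    assert all(results)
```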

@aresnow1 (Contributor) commented Sep 4, 2023

It seems the CI always hangs. The hang may not be caused by the new tests; we can try running all the tests locally to find the reason.

@XprobeBot XprobeBot modified the milestones: v0.2.1, v0.3.1 Sep 5, 2023
@codingl2k1 (Contributor, Author) replied:

> It seems the CI always hangs. The hang may not be caused by the new tests; we can try running all the tests locally to find the reason.

Fixed. Thanks.

@codingl2k1 codingl2k1 marked this pull request as ready for review September 5, 2023 03:16
@XprobeBot XprobeBot modified the milestones: v0.4.0, v0.4.2 Sep 12, 2023
@UranusSeven UranusSeven changed the title FEAT: Improve torch inference performance FEAT: concurrent generation Sep 13, 2023
@UranusSeven UranusSeven merged commit f26504e into xorbitsai:main Sep 13, 2023
8 of 10 checks passed