
[Disco] Make Session.CallWithPacked public#15720

Merged
junrushao merged 1 commit into
apache:unity from
junrushao:feature/2023-09-10/llama-tp
Sep 12, 2023

Conversation

@junrushao

Member

This PR contains the tweaks needed to run Llama2 with tensor parallelism.

@junrushao junrushao force-pushed the feature/2023-09-10/llama-tp branch 5 times, most recently from eca766e to 53f3645 Compare September 11, 2023 17:56
@junrushao junrushao force-pushed the feature/2023-09-10/llama-tp branch from 53f3645 to ec45117 Compare September 12, 2023 00:39
@junrushao junrushao marked this pull request as ready for review September 12, 2023 00:42
class TIRFuseMutator : public ExprMutator {
 public:
  static IRModule Transform(const IRModule& mod) {
    Map<String, BaseFunc> tir_funcs;

Member Author

This change allows TIR functions with global symbols to be preserved even if they are not called from Relax; this is useful for the shard loader.
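
As a rough illustration of the retention rule described in this comment, here is a minimal, self-contained sketch. It is a toy model only and not the code in this PR: `ToyFunc`, `PruneAfterFusion`, and the function names used with it are hypothetical stand-ins, and plain standard-library containers take the place of TVM's `IRModule` and `Map<String, BaseFunc>`. The assumed rule is that a function survives if Relax still calls it or if it carries a global symbol.

```cpp
#include <map>
#include <optional>
#include <set>
#include <string>

// Toy stand-in for a TIR function (hypothetical; NOT a TVM type).
struct ToyFunc {
  // When set, the function is assumed to be externally visible.
  std::optional<std::string> global_symbol;
};

// Keep a function if Relax code still calls it, or if it has a global symbol
// (for example, a kernel that something outside Relax looks up by name).
std::map<std::string, ToyFunc> PruneAfterFusion(
    const std::map<std::string, ToyFunc>& funcs,
    const std::set<std::string>& called_from_relax) {
  std::map<std::string, ToyFunc> kept;
  for (const auto& [name, func] : funcs) {
    const bool is_called = called_from_relax.count(name) > 0;
    const bool is_exported = func.global_symbol.has_value();
    if (is_called || is_exported) {
      kept.emplace(name, func);
    }
  }
  return kept;
}
```

Under this assumed rule, an uncalled function without a global symbol is dropped, while an uncalled function with one stays in the module.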

@junrushao junrushao merged commit b82c10f into apache:unity Sep 12, 2023
MasterJH5574 pushed a commit to MasterJH5574/tvm that referenced this pull request Sep 13, 2023