make launch_transform_0213 more clear #460

Merged: 1 commit merged into master from gpt-infer on Feb 15, 2023
Conversation

neopro12 (Collaborator)

No description provided.
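
For context (the PR itself carries no description): the "0213" in launch_transform_0213 conventionally names a dimension permutation (0, 1, 2, 3) → (0, 2, 1, 3), e.g. reshaping an activation of shape [batch, seq_len, heads, head_dim] into [batch, heads, seq_len, head_dim] for multi-head attention. The CPU reference below is a minimal sketch under that assumption, not the CUDA kernel touched by this PR:

```cpp
// Sketch of a "0213" permutation on a row-major [d0, d1, d2, d3] tensor,
// producing a [d0, d2, d1, d3] tensor. Assumed semantics; illustrative only.
#include <cstddef>
#include <vector>

std::vector<float> transform_0213(const std::vector<float>& in,
                                  size_t d0, size_t d1, size_t d2, size_t d3) {
  std::vector<float> out(in.size());
  for (size_t i0 = 0; i0 < d0; ++i0)
    for (size_t i1 = 0; i1 < d1; ++i1)
      for (size_t i2 = 0; i2 < d2; ++i2)
        for (size_t i3 = 0; i3 < d3; ++i3)
          // in  index: ((i0 * d1 + i1) * d2 + i2) * d3 + i3
          // out index: ((i0 * d2 + i2) * d1 + i1) * d3 + i3
          out[((i0 * d2 + i2) * d1 + i1) * d3 + i3] =
              in[((i0 * d1 + i1) * d2 + i2) * d3 + i3];
  return out;
}
```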

neopro12 merged commit d8549c9 into master on Feb 15, 2023
neopro12 deleted the gpt-infer branch on Feb 15, 2023 at 09:34
hexisyztem added a commit that referenced this pull request Feb 16, 2023
* Fix new arch context build check (#441)

problem:
    LinearOp::forward gets the cuBLAS handle without checking whether the context is built;
    LinearOp::backward checks that the context is built before getting the handle.

solution:
    Modify LinearOp::forward to check that the context is built before getting the cuBLAS handle.
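
A self-contained toy showing the fix pattern described above (the class and method names are hypothetical stand-ins, not LightSeq's real Context/LinearOp interfaces):

```cpp
// Toy illustration of the #441 fix: forward() now verifies the context is
// built before using the handle, matching what backward() already did.
#include <iostream>

struct Context {
  bool built = false;
  void build() { built = true; }
  int handle() const { return 42; }  // stand-in for a real cublasHandle_t
};

struct LinearOp {
  Context* ctx;
  void forward() {
    if (!ctx->built) ctx->build();  // the guard added by the fix
    std::cout << "forward uses handle " << ctx->handle() << "\n";
  }
  void backward() {
    if (!ctx->built) ctx->build();  // backward already guarded like this
    std::cout << "backward uses handle " << ctx->handle() << "\n";
  }
};

int main() {
  Context ctx;
  LinearOp op{&ctx};
  op.forward();  // before the fix, this path used the handle with ctx unbuilt
  op.backward();
}
```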

* fix config reference bug (#453)

* developing lsflow

* add split_head op and its test (#454)

* lsflow develop

* format note message

* lsflow tune

* add notes for context class

* add note for lsflow

* op example

* Update CODEOWNERS (#457)

* Gpt infer (#456)

* add split head for beam search

* alter checkin

* make launch_transform_0213 more clear (#459)

* fix operator compile

* make launch_transform_0213 more clear (#460)

* change max_shape to max_shape_size

* correct bias shape in split_head (#461)

* add unit test for x86 cpu kernel

---------

Co-authored-by: Kangmo Kim <kangmo.kim@gmail.com>
Co-authored-by: Ying Xiong <xiongying.taka@bytedance.com>
Co-authored-by: Xiaohui Wang <wangxiaohui.neo@bytedance.com>
hexisyztem added a commit that referenced this pull request Mar 2, 2023
* multi device develop

* change build.sh

* cuda tools move into kernels

* multi device develop

* multi device develop

* multi device develop

* solve multi device develop

* solve arm compile error

* arm print_vec

* fix regress error

* solve cuda compile error

* solve multi device print vec

* split node class with its derived classes

* lightseq x86 unit test

* add x86 unit test

* Canonical Namespace

* add pybind compile

* Lsflow develop (#463)

* Fix new arch context build check (#441)

problem:
    LinearOp::forward gets the cuBLAS handle without checking whether the context is built;
    LinearOp::backward checks that the context is built before getting the handle.

solution:
    Modify LinearOp::forward to check that the context is built before getting the cuBLAS handle.

* fix config reference bug (#453)

* developing lsflow

* add split_head op and its test (#454)

* lsflow develop

* format note message

* lsflow tune

* add notes for context class

* add note for lsflow

* op example

* Update CODEOWNERS (#457)

* Gpt infer (#456)

* add split head for beam search

* alter checkin

* make launch_transform_0213 more clear (#459)

* fix operator compile

* make launch_transform_0213 more clear (#460)

* change max_shape to max_shape_size

* correct bias shape in split_head (#461)

* add unit test for x86 cpu kernel

---------

Co-authored-by: Kangmo Kim <kangmo.kim@gmail.com>
Co-authored-by: Ying Xiong <xiongying.taka@bytedance.com>
Co-authored-by: Xiaohui Wang <wangxiaohui.neo@bytedance.com>

* fix sys.path

* fix sys.path (#466)

* jit build support pure cpu machine

* robust builder for x86 and cuda

* fix compile error

* develop test_ls_layer

* format

* fix training compile problem

* add mkl gemm for f32 and s8 (#470)

* test for encoder layer (#471)

* test encoder layer

* fix cuda free error

* lightseq multi device develop

* fix crf op error

* avoid import training directories

* fix strided_batch_gemm config data type

* add debug message

* convert shape from vector<int> to vector<size_t>

* change debug log format

* format

* remove useless dropout

* add sdpa layer into multi head attention layer

* fix conflicting parameters: is_post_ln and pre_or_po...

* fix CMakeLists.txt compile

* multi kernel develop

* lightseq transformer.cu fix

* fix linear col/row major

* fix compile error

* fix lightseq post_ln network structure

* add transformer example & print error message

* add shape message

* format

* fix concat error

* add shape message

* fix beam search bug

---------

Co-authored-by: Kangmo Kim <kangmo.kim@gmail.com>
Co-authored-by: Ying Xiong <xiongying.taka@bytedance.com>
Co-authored-by: Xiaohui Wang <wangxiaohui.neo@bytedance.com>