-
Notifications
You must be signed in to change notification settings - Fork 776
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add oneDNN binary op #7319
Merged
Merged
Add oneDNN binary op #7319
Changes from 155 commits
Commits
Show all changes
170 commits
Select commit
Hold shift + click to select a range
c79f6e8
add
luqiang-guo 63eec78
merge master
luqiang-guo 95311b3
merge master
luqiang-guo d068bbf
Solve the thread pool problem
luqiang-guo 85c0163
merge master
luqiang-guo 4783a91
add device local logical cores
luqiang-guo 26cd0e7
fix error
luqiang-guo 624d7e9
Delete threadpool
luqiang-guo 7b6e4d2
Merge branch 'master' into dev_parallel_loop
luqiang-guo 543c726
fix include file
luqiang-guo 36a755b
Merge branch 'dev_parallel_loop' of https://github.com/Oneflow-Inc/on…
luqiang-guo 2288a11
fix clang -lopm
luqiang-guo 5fec766
fix clang error omp.h
luqiang-guo 3de09b6
fix omp cmake
luqiang-guo 9c05b6c
omp.h
luqiang-guo 17bd1bb
fix #ifdef
luqiang-guo c226521
test clang13 -lomp
luqiang-guo c4a5179
test -fopenmp
luqiang-guo 4b028a6
add fopenmp
luqiang-guo 0eb1059
Merge branch 'master' into dev_parallel_loop
luqiang-guo 784badd
rename OMP_FLAGS
luqiang-guo e500586
Merge branch 'dev_parallel_loop' of https://github.com/Oneflow-Inc/on…
luqiang-guo b689676
Merge branch 'master' into dev_parallel_loop
luqiang-guo bafea64
Merge branch 'master' into dev_parallel_loop
luqiang-guo 3d6c191
Merge branch 'master' into dev_parallel_loop
luqiang-guo 72c39e9
Merge branch 'dev_parallel_loop' of https://github.com/Oneflow-Inc/on…
luqiang-guo d00a1da
static analysis libopm-12-dev
luqiang-guo 18f363f
add tbb
luqiang-guo 71dac72
refien
jackalcooper 6eee93e
refine
jackalcooper 267900b
refine
jackalcooper fbad306
refine
jackalcooper 76c169c
revert
jackalcooper 0f43fd3
Merge branch 'master' of https://github.com/Oneflow-Inc/oneflow into …
jackalcooper 86baed9
add tbb
luqiang-guo 5006973
Merge branch 'master' into dev_test_tbb
luqiang-guo c8740bf
success add tbb
luqiang-guo 9e18b53
tbb onednn ok
luqiang-guo cdac12d
Merge branch 'master' into dev_test_tbb
luqiang-guo 85ed101
fix ninja onednn
luqiang-guo 2cdca83
component
luqiang-guo 5bf716e
install tbb include file
luqiang-guo 9eb33c0
Merge branch 'master' into dev_test_tbb
luqiang-guo 5eed042
Merge branch 'master' into dev_test_tbb
luqiang-guo e0e704a
updata tbb master zip
luqiang-guo f7ad349
Merge branch 'dev_test_tbb' of https://github.com/Oneflow-Inc/oneflow…
luqiang-guo ec8b5e0
Merge branch 'master' into dev_test_tbb
luqiang-guo d4d02c7
fix md5
luqiang-guo 12faccd
Merge branch 'dev_test_tbb' of https://github.com/Oneflow-Inc/oneflow…
luqiang-guo 1b396e5
Merge branch 'master' into dev_test_tbb
luqiang-guo a6d9f0b
refine
jackalcooper bb4141e
refjine
jackalcooper afd426a
fix
jackalcooper 22157d1
Merge branch 'master' into dev_test_tbb
luqiang-guo 4cc7858
Merge branch 'master' into dev_test_tbb
luqiang-guo 53ec863
Merge branch 'master' into dev_parallel_loop
luqiang-guo 35e6105
cmake option
luqiang-guo 134c5f0
Merge branch 'dev_test_tbb' of https://github.com/Oneflow-Inc/oneflow…
luqiang-guo 32623f7
modified clang 10 OMP
luqiang-guo e375b4c
add line
luqiang-guo 33497de
fix add OMP flags
luqiang-guo ef9c444
fix tbb
luqiang-guo 964f0a8
fix
luqiang-guo 6eb32c8
fix
luqiang-guo 689bcd7
fix'
luqiang-guo 9367609
fix
luqiang-guo a8f8bc6
Merge branch 'dev_test_tbb' into dev_parallel_loop
luqiang-guo eb38712
Merge branch 'dev_test_tbb' into dev_parallel_loop
luqiang-guo 34cc993
fix
luqiang-guo bfca0d6
fix OF_RUNTIME_TBB
luqiang-guo 91fb423
fix
luqiang-guo 934b567
Merge branch 'dev_test_tbb' into dev_parallel_loop
luqiang-guo 5b51aa0
modified binary op
luqiang-guo 47040c1
fix
luqiang-guo de6e03a
fix
luqiang-guo 0c9002b
fux error
luqiang-guo 0aad3df
Merge branch 'master' into dev_fix_tbb_error
luqiang-guo 00d5dac
fix
luqiang-guo 68fe7d4
Merge branch 'dev_fix_tbb_error' of https://github.com/Oneflow-Inc/on…
luqiang-guo 707f892
fix
luqiang-guo e535ab1
fix
luqiang-guo 5037f19
Merge branch 'master' into dev_fix_tbb_error
luqiang-guo 3836c29
refine
jackalcooper c8ea8b8
refine
jackalcooper b488247
Merge branch 'master' into dev_fix_tbb_error
luqiang-guo b4dbd08
Merge branch 'master' into dev_fix_tbb_error
oneflow-ci-bot 722d003
fix
luqiang-guo 5307906
Merge branch 'dev_fix_tbb_error' of https://github.com/Oneflow-Inc/on…
luqiang-guo 0ebb4c0
Merge branch 'master' into dev_fix_tbb_error
oneflow-ci-bot 412ba88
Merge branch 'master' into dev_fix_tbb_error
luqiang-guo c355bb2
add seq
luqiang-guo fee0ac2
Merge branch 'dev_fix_tbb_error' of https://github.com/Oneflow-Inc/on…
luqiang-guo 2638976
Merge branch 'master' into dev_fix_tbb_error
luqiang-guo eda5a8d
refine
luqiang-guo ee357df
Merge branch 'dev_parallel_loop' of https://github.com/Oneflow-Inc/on…
luqiang-guo 9c6db4a
merge master
luqiang-guo db21828
Merge branch 'master' into dev_fix_tbb_error
oneflow-ci-bot a8af104
fix
luqiang-guo 68e3cbf
Merge branch 'dev_fix_tbb_error' of https://github.com/Oneflow-Inc/on…
luqiang-guo d1995f2
Merge branch 'dev_fix_tbb_error' into dev_parallel_loop
luqiang-guo e4c760f
fix
luqiang-guo 866c1e7
Merge branch 'master' into dev_parallel_loop
luqiang-guo 1809cb6
fix
luqiang-guo da9e173
add set_num_threads
luqiang-guo 2d52493
Merge branch 'master' into dev_parallel_loop
luqiang-guo 0e8f7cc
merge master
luqiang-guo fe7d975
fix
luqiang-guo 08f76a7
Merge branch 'dev_add_onednn_binary' of https://github.com/Oneflow-In…
luqiang-guo 73a8fe6
fi
luqiang-guo c0010d1
fix error
luqiang-guo 0baa117
Merge branch 'dev_parallel_loop' of https://github.com/Oneflow-Inc/on…
luqiang-guo d3ca51e
Merge branch 'dev_parallel_loop' into dev_add_onednn_binary
luqiang-guo e01fc91
fix
luqiang-guo 8d5b327
refine
luqiang-guo 6719ea1
refine
luqiang-guo a10e6ac
fix
luqiang-guo 494957f
refine
luqiang-guo 62fcd18
fix
luqiang-guo 731c1d9
refine
luqiang-guo 84d8786
refine
luqiang-guo 209216a
refine
luqiang-guo 8d78ccf
refine
luqiang-guo 2494287
refine
luqiang-guo 7ee8afa
Merge branch 'master' into dev_parallel_loop
luqiang-guo 9284d15
fix
luqiang-guo 47b17d6
Merge branch 'master' into dev_parallel_loop
luqiang-guo 624f6d2
Merge branch 'dev_parallel_loop' into dev_add_onednn_binary
luqiang-guo 58e6ad6
Merge branch 'dev_add_onednn_binary' of https://github.com/Oneflow-In…
luqiang-guo 0a568d8
refine
luqiang-guo d37bd5c
fix
luqiang-guo 9d9df5b
Merge branch 'dev_parallel_loop' of https://github.com/Oneflow-Inc/on…
luqiang-guo 3269a43
fix
luqiang-guo c1ed873
fix
luqiang-guo 3d1f687
fix
luqiang-guo f9d3e18
fix
luqiang-guo 16dd419
refine
luqiang-guo d1c6373
refine
luqiang-guo d99dd82
refine
luqiang-guo 8fcd664
refine
luqiang-guo b8cc3f3
refine
luqiang-guo f333d1b
refine
luqiang-guo fe6ca0f
refine
luqiang-guo 1377294
Merge branch 'master' into dev_parallel_loop
luqiang-guo 7cb0cdb
fix
luqiang-guo 42850af
Merge branch 'master' into dev_parallel_loop
luqiang-guo 47cfa64
merge parallel loop
luqiang-guo 06f500a
fix
luqiang-guo 6940844
fix
luqiang-guo b55d21f
merge master
luqiang-guo 7134dd8
refine
luqiang-guo 337ef44
refine
luqiang-guo 26c59cf
Merge branch 'master' into dev_add_onednn_binary
luqiang-guo 543f934
auto format by CI
oneflow-ci-bot 1778912
Merge branch 'dev_add_onednn_binary' of https://github.com/Oneflow-In…
luqiang-guo 7f22f82
fix
luqiang-guo dbdc88d
rename mm_, dynamic_cast
luqiang-guo 38ed4d1
Merge branch 'master' into dev_add_onednn_binary
luqiang-guo 1251b0b
auto format by CI
oneflow-ci-bot af757ce
Merge branch 'master' into dev_add_onednn_binary
luqiang-guo c5a9296
fix MAKE_NEW_ONEDNN_BROADCAST_ELEMENTWISE_BINARY_COMPARASION_AND_LOGI…
luqiang-guo dcc4dd7
Merge branch 'master' into dev_add_onednn_binary
luqiang-guo 040978c
fix 0-dim tensor
luqiang-guo 5ec6d58
Merge branch 'dev_add_onednn_binary' of https://github.com/Oneflow-In…
luqiang-guo 031a81f
Merge branch 'master' into dev_add_onednn_binary
luqiang-guo d58c8bd
fix onednn format tag
luqiang-guo d470a1d
Merge branch 'dev_add_onednn_binary' of https://github.com/Oneflow-In…
luqiang-guo 5e09fd3
Merge branch 'master' into dev_add_onednn_binary
luqiang-guo 9381fb0
auto format by CI
oneflow-ci-bot 533b220
Merge branch 'master' into dev_add_onednn_binary
luqiang-guo 637df74
Merge branch 'master' into dev_add_onednn_binary
oneflow-ci-bot File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dynamic_cast
是在runtime的时候根据对象中vtpr所指向的vtbl中的rtti信息来做动态cast,效率比较低,在你这个场景中,如果已经确定这里是CpuDevice
类型的话,推荐直接用static_cast
从另一方面讲,即使这里是符合使用
dynamic_cast
的条件,它的返回值有可能是null,那么就需要对返回值做判断,否则直接后面又用返回值去调用其它函数,是有问题的There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
我去自习研究下
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
是不是可以先实现上次说的 CpuStream/CudaStream 的 device() 方法直接返回 CpuDevice/CudaDevice 的功能,这样这里就没必要 cast 了