Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update forked PaddlePaddle #1

Merged
merged 427 commits into from
Sep 8, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
427 commits
Select commit Hold shift + click to select a range
0e81626
add div, floor_div, remainder (#26562)
ForFishes Aug 24, 2020
bf4a463
change to use bce_loss op, add shape check for bce_loss
ZHUI Aug 24, 2020
ae4724c
fix type issue (#26500)
baiyfbupt Aug 24, 2020
efee426
support generator seed in related kernals test=develop (#26495)
yaoxuefeng6 Aug 24, 2020
90e6819
fix engilsh document for api paddle.fluid.dygraph.enabled, test=devel…
wanghuancoder Aug 24, 2020
8645591
support fp64 in huber_loss cuda kernel (#26583)
yghstill Aug 24, 2020
317f7ce
[API 2.0] Add transformer apis (#26418)
guoshengCS Aug 24, 2020
39fe0d3
find timeout unittests (#26371)
XieYunshen Aug 24, 2020
79539cf
【2.0 API】Add CUDA kernel and enhance options for grid_sample (#26576)
wanghaoshuang Aug 24, 2020
0d71cff
Add mnist test for post training quantization, test=develop (#26436)
juncaipeng Aug 24, 2020
d6e888c
fix Flatten api test=develop (#26346)
yaoxuefeng6 Aug 24, 2020
dd3df69
fix RandomSampler & BatchSampler. test=develop (#26559)
heavengate Aug 24, 2020
f8dbd7a
fix syncbn, test=develop (#26523)
ceci3 Aug 24, 2020
fa08a83
update op_version_registry, test=develop (#26592)
Shixiaowei02 Aug 24, 2020
6e823cf
add op_function_generator.exe retry in windows, test=develop (#26591)
wanghuancoder Aug 24, 2020
407de03
[2.0API] Reconstruct all API related to LR Scheduler, unify dygraph a…
zhwesky2010 Aug 24, 2020
da1efe2
Throws TypeError When Logical APIs Do Broadcast Operation (#26490)
Joejiong Aug 24, 2020
78ca8cf
Unify the metrics implementation between low-level and high-level API…
qingqing01 Aug 24, 2020
c8e1836
[API 2.0] add pool2d3d API,test=develop (#26331)
LDOUBLEV Aug 24, 2020
422a162
api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear (#26399)
wanghuancoder Aug 24, 2020
36868e8
fix one_hot example doc test=document_fix (#26585)
yukavio Aug 24, 2020
7ff197d
Add fleet dgc amp doc, test=document_fix (#26608)
wangxicoding Aug 24, 2020
924734f
Refine typo of english document of paddle.prod (#26546)
gfwm2013 Aug 24, 2020
1ed74aa
fix msbuild log level (#26607)
zhwesky2010 Aug 24, 2020
1f82c0c
[Api2.0] add pixel shuffle (#26071)
shippingwang Aug 24, 2020
786373b
Use atomicAdd defined in paddle framework (#26628)
wanghaoshuang Aug 25, 2020
6f69fbc
fix elu grad whne alpha less then zero, test=develop (#26543)
qili93 Aug 25, 2020
a065a24
【2.0 API】Enhance affine grid operator (#26385)
wanghaoshuang Aug 25, 2020
a31dbc8
add deprecated for gather api and refine the doc of gather api (#26579)
wangchaochaohu Aug 25, 2020
c80fcf9
reduce_mean error if keepdim=True and reduce_all=True (#26614)
zhupengyang Aug 25, 2020
c1f5df5
optimized transformation form tensor to numpy (#26447)
wanghuancoder Aug 25, 2020
ef317b4
add mac tests failed exitcode (#26611)
lelelelelez Aug 25, 2020
26d6741
Enrich English declarative Doc (#26538)
zhhsplendid Aug 25, 2020
a8b5741
add a few unittests for setting timeout properity (#26630)
XieYunshen Aug 25, 2020
0679678
add norm 2.0 api, test=develop (#26465)
frankwhzhang Aug 25, 2020
02fc1fe
Fix the cmake-function named inference_download_and_uncompress on Win…
Avin0323 Aug 25, 2020
a004dfd
Use atomicAdd defined in paddle fromework (#26631)
wanghaoshuang Aug 25, 2020
0a895bc
improve unique op (#26537)
zhangting2020 Aug 25, 2020
08d736a
【paddle.fleet】add cudnn related strategies to DistributedStrategy (#2…
guru4elephant Aug 25, 2020
ed4a4ea
fix test=develop (#26632)
gongweibao Aug 25, 2020
2f5bdd8
Remove WOBOQ, gen_html() and sphinx (#26128)
randytli Aug 25, 2020
d32beea
Add Checking Type for "multiply" operation (#26508)
Joejiong Aug 25, 2020
761ed17
Update save inference model to support dygraph (#25894)
LiuChiachi Aug 25, 2020
1fcddf5
add white files(#26646)
lelelelelez Aug 25, 2020
32ceacf
update op_version_registry, test=develop (#26644)
Shixiaowei02 Aug 25, 2020
cb3f131
Set timeout properity for a few unitests
chalsliu Aug 25, 2020
32ba860
Enhance py_func error info message. (#26557)
jiweibo Aug 25, 2020
f823841
Fix atomicAdd in grid sample op and affine grid op (#26647)
wanghaoshuang Aug 25, 2020
286eca2
update the code for the topk v2
wawltor Aug 25, 2020
7af5cb9
fix english doc of all lr_scheduler (#26619)
zhwesky2010 Aug 25, 2020
2de034e
fix prelu, test=develop (#26613)
qili93 Aug 26, 2020
c70bc3b
Fix pip2 opencv-python version (#26293)
tianshuo78520a Aug 26, 2020
dbcef73
Fix the formula of SELU/selu (#26675)
hong19860320 Aug 26, 2020
98e057b
remove role maker in test_fleet_graph_execution_meta_optimizer.py (#2…
guru4elephant Aug 26, 2020
8986a82
fix adaptive gpu grad bug, add doc refine (#26660)
baiyfbupt Aug 26, 2020
33afeb3
fix the tanh (#26657)
ForFishes Aug 26, 2020
559e43e
Small change in conv2d and quantize pass (#26671)
wozna Aug 26, 2020
4ba6483
fix infer compile bug (#26641)
MRXLT Aug 26, 2020
cb00d50
../python/CMakeLists.txt (#26531)
pangyoki Aug 26, 2020
67d03be
Fix jit.to_static usage (#26682)
Aurelius84 Aug 26, 2020
1e53088
fix Distribution class problem (#26535)
pangyoki Aug 26, 2020
ada1e12
modified timeout value on windows and mac (#26690)
XieYunshen Aug 26, 2020
958d721
【paddle.fleet】Document refine lars & lamb (#26533)
JZ-LIANG Aug 26, 2020
76313dd
Genarate alias mapping (#26645)
WuHaobo Aug 27, 2020
5c14fdc
fix bug when get_glibc_ver returns None (#26693)
zhiqiu Aug 27, 2020
07973c5
disable deprecated (#26684)
zhiqiu Aug 27, 2020
1c68138
[api 2.0] add collective op for cpu using gloo and paddle.distributed…
Aug 27, 2020
2024ef6
【paddle.fleet】add comments about localsgd in DistributedStrategy (#26…
gavin1332 Aug 27, 2020
84e7be3
add TensorDataset for multiprocess DataLoader (#26332)
heavengate Aug 27, 2020
07e3b9a
api20, Bilinear add deprecated, and fix english doc (#26610)
wanghuancoder Aug 27, 2020
97cebfa
add dtype for unique (#26655)
zhangting2020 Aug 27, 2020
c2c6895
Update Paddle-Lite commit hash. (#26413)
arogowie-intel Aug 27, 2020
32ae8e8
leaky_relu, log_softmax, hardshrink formula format (#26720)
zhupengyang Aug 27, 2020
fc5acdd
Fix fleet ut timeout issue. (#26694)
gongweibao Aug 27, 2020
89d7d86
add intepolte_v2 (#26520)
tink2123 Aug 27, 2020
f056136
[Dy2stat] Support InputSpec and Return callable class instance in @de…
Aurelius84 Aug 27, 2020
dc56c89
Add the option to execute unit tests only at night (#26669)
chalsliu Aug 27, 2020
e4cc6a2
Norm op support 2-axis (#26492)
yongqiangma Aug 27, 2020
f311d3c
Fix pow api type error with python side method, merge elementwise_pow…
Joejiong Aug 27, 2020
f408301
Add unified RNN APIs (#26588)
Aug 27, 2020
8071d23
fix bug that can't print int8_t (#26712)
zhwesky2010 Aug 27, 2020
1c898b6
add bug fix enum. (#26736)
jiweibo Aug 27, 2020
f9066e6
Update the demo code and the doc of varbase.backward. (#26506)
wzzju Aug 27, 2020
f1ae017
update copyright year, test=document_fix (#26586)
Aug 27, 2020
2f75465
fix the call to core.ops.x, test=develop (#26729)
Aug 27, 2020
edf5f31
[2.0 API] add paddle.nn.functional.linear and fix paddle.nn.Linear (#…
donproc Aug 28, 2020
63eef76
Fix clip input check (#26683)
willthefrog Aug 28, 2020
c282db3
add broadcast feature for elementwise logical op
joey12300 Aug 28, 2020
1ec30cb
register cumsum Op version for compatible Op upgrades (#26734)
LutaoChu Aug 28, 2020
c03092b
fix dataloader performace decrease & unittest hanging. test=develop (…
heavengate Aug 28, 2020
52a6ca0
test=develop, improve pad assertion error (#26748)
Aug 28, 2020
5f275aa
fix sample method of Uniform and Normal class (#26713)
pangyoki Aug 28, 2020
02083bd
Add mkldnn bfloat16 option to C-API (#26676)
wozna Aug 28, 2020
eb097d6
Fix int8 performace drop cpu_quantize_placement_pass (#26715)
wozna Aug 28, 2020
4561fc3
Add check point for gather Op (#26696)
wangchaochaohu Aug 28, 2020
5f524ef
modify error report message, test=develop (#26743)
Aug 28, 2020
a7db9ac
Add the constraint for the scale of SELU/selu (#26686)
hong19860320 Aug 28, 2020
7d4e215
add parameters check in static mode for diag op
LutaoChu Aug 28, 2020
d39e789
For normalize op, remove special treatment of one-dimensional input, …
LutaoChu Aug 28, 2020
623a4c2
fix ci coverage build error (#26761)
lelelelelez Aug 28, 2020
29494d7
fix remainder, floor_div (#26732)
ForFishes Aug 28, 2020
30aab17
[2.0API]support 2.0 lr_scheduler for 2.0 optimizer (#26737)
zhwesky2010 Aug 28, 2020
e3f8e5c
trt int8 support conv2d_transpose (#26636)
cryoco Aug 28, 2020
b1f9ed6
[Dy2Stat] Add debugging and logging mechanism for dygraph to static (…
liym27 Aug 28, 2020
f34fb0a
normal: enlarge threshold to reduce random error (#26763)
zhupengyang Aug 28, 2020
2d88b9f
turn on WITH_INFERENCE_API_TEST (#26746)
zhwesky2010 Aug 28, 2020
3390c7e
fix test=develop (#26697)
gongweibao Aug 28, 2020
31f422a
Add interface to launch parallel dygraph by multiprocessing (#26044)
chenwhql Aug 28, 2020
844583c
Refine paddle.manual_seed (#26496)
zhiqiu Aug 28, 2020
7b78bfc
[2.0API]support set_default_dtype for to_tensor (#26432)
zhwesky2010 Aug 28, 2020
f32ae27
Remove `sorted_sum_gradient_` form BasicEngine and PartialGradTask. (…
wzzju Aug 28, 2020
7afb1df
Decouple weights and bias from fc primitive in MKLDNN cache (#26708)
Aug 28, 2020
25dad42
fix sample code for hapi.model.save (#26667)
LiuChiachi Aug 28, 2020
64df9b9
add shell of GPU version (#26589)
iducn Aug 28, 2020
4106e54
Fix hapi transform bug (#26738)
LielinJiang Aug 28, 2020
68e0560
refine paddle inference api (#26774)
jiweibo Aug 28, 2020
42d2915
[2.0 API] add SyncBatchNorm.convert_sync_batch_norm (#26688)
ceci3 Aug 28, 2020
2675cae
add deprecated decorator for old apis expand and expand_as (#26759)
Aug 28, 2020
8bcb1f2
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it …
grygielski Aug 29, 2020
346689c
Register conv_transpose Op version for compatible Op upgrades (#26745)
LielinJiang Aug 29, 2020
a1b99fa
Adadelta Optimizer (#26590)
bjjwwang Aug 29, 2020
db68e08
[API2.0]Unify pooling function and add adaptive max pooling function …
shippingwang Aug 29, 2020
994217e
【paddle.fleet】fix api documents (#26777)
guru4elephant Aug 29, 2020
ac63c7c
fix a skip_layernorm bug, test=develop (#26800)
zlsh80826 Aug 30, 2020
7f2aa2d
【paddle.fleet】Support Heter Parameter Server (#25998)
MrChengmo Aug 30, 2020
0f1ad9b
leaky_relu and hardshrink add checkpoint for behavior changed (#26802)
zhupengyang Aug 31, 2020
4a57880
support default dtype for distribution APIs (#26695)
pangyoki Aug 31, 2020
9a1ea9b
Add support for tensor min/max in dygraph (#26764)
willthefrog Aug 31, 2020
f7fb4c2
Move hapi to python/paddle root dir. (#26442)
qingqing01 Aug 31, 2020
a47d92d
fleet add save with whitelist test=develop (#23376)
yaoxuefeng6 Aug 31, 2020
7ee70a4
update the doc for the some ops
wawltor Aug 31, 2020
1f6df87
fix doc, use to_tensor
ZHUI Aug 31, 2020
df0a22d
[Dy2stat]Fix Python Version compatibility of dict.iteritems (#26778)
Aurelius84 Aug 31, 2020
78a530c
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536)
cryoco Aug 31, 2020
f44420c
test=develop (#26710)
swtkiwi Aug 31, 2020
885c61f
Add use of global flag 'use_mkldnn' to layer_helper (#26497)
arlesniak Aug 31, 2020
11fb8a1
Refine cudnn softmax (#25757)
GaoWei8 Aug 31, 2020
3ce727a
rename Dropout2D/3D to Dropout2d/3d, test=develop (#26796)
huangjun12 Aug 31, 2020
b011177
enhance error message (#26808)
Aurelius84 Aug 31, 2020
9ded756
【paddle.fleet】FleetAPI 2.0 (#26772)
seiriosPlus Aug 31, 2020
28cb653
Remove backend argument of init_parallel_env (#26773)
chenwhql Aug 31, 2020
40378ed
Add the AddCheckpoint macro to softplus op (#26809)
hong19860320 Aug 31, 2020
d854101
Count the time and packet size for Windows monitor (#26678)
zhwesky2010 Aug 31, 2020
7495b28
[CI] Change RD for check_op_desc from (liym27, zhhsplendid, Aurelius8…
liym27 Aug 31, 2020
8e67497
4 scripts of enforce are moved to enforce directory (#26825)
iducn Sep 1, 2020
1f36d3c
update optimizer (#26711)
MRXLT Sep 1, 2020
ebc5f99
add embedding 2.0 (#26649)
seiriosPlus Sep 1, 2020
f866bfe
support mnist and resnet dygraph_to_static test (#25774)
sfraczek Sep 1, 2020
1769d21
fix ut (#26844)
MRXLT Sep 1, 2020
ced6e87
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
Shixiaowei02 Sep 1, 2020
ad5547d
Fix typo util->until. (#26855)
zhhsplendid Sep 1, 2020
a60bbe6
add approve check of the CI task on windows (#26832)
zhwesky2010 Sep 1, 2020
1e50b2a
fix retry error with blank (#26835)
XieYunshen Sep 1, 2020
61107aa
[Dy2Stat-debugging] Fix bug: Returns True if the verbosity level set …
liym27 Sep 1, 2020
1127d0d
fix adaptive_avg_1d outputsize type issue, test=document_fix (#26869)
baiyfbupt Sep 1, 2020
60ffc22
Refine bernoulli and unsqueeze op (#26842)
zhiqiu Sep 1, 2020
ad6e3dd
[Paddle-TRT] Stack op plugin (#25605)
zlsh80826 Sep 1, 2020
d0962ab
supplement bug fix of parameter server (#26217)
MrChengmo Sep 2, 2020
d067e66
Show more possible problems with build_and_check in file paddle_build…
Avin0323 Sep 2, 2020
6129b0e
Revert `no_grad` changes and add new implementation (#26826)
willthefrog Sep 2, 2020
0a29fc8
fix the argmin,argmax op for the paddlepaddle 2.0
wawltor Sep 2, 2020
932bbe9
fix pool trt plugin bug (#26463)
NHZlX Sep 2, 2020
f9d39b4
[Dy2Stat] Transforme api 'to_tensor' to 'assign'. (#26873)
liym27 Sep 2, 2020
5205748
fix eigen in push sparse; fix hadoop command (#26872)
Thunderbrook Sep 2, 2020
2d2c31a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643)
wanghuancoder Sep 2, 2020
435ab2a
Raise RuntimeError if run the callable object decorated by '@paddle.j…
liym27 Sep 2, 2020
030b298
fix sample codes in collective.py (#26787)
Sep 2, 2020
bf6cbbc
remove unused fc_gan unit-test demo (#26889)
luotao1 Sep 2, 2020
e6af53b
Update some used apis in Transformer apis to 2.0 apis. (#26831)
guoshengCS Sep 2, 2020
5e874cc
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891)
jczaja Sep 2, 2020
0627a31
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
wozna Sep 2, 2020
4e1c8f3
Remove psutil dependency. (#26841)
gongweibao Sep 2, 2020
9ee4e3d
Correct 2.0 API usage in hapi.model.load (#26829)
LiuChiachi Sep 2, 2020
c7b5ac4
fix norm bug, test=develop (#26827)
frankwhzhang Sep 2, 2020
1771d9f
fix cache judge more safe (#26910)
zhwesky2010 Sep 2, 2020
d8984a6
limit timeout value setting on linux (#26923)
XieYunshen Sep 2, 2020
2f50aa2
add deprecated information to the top of an API docstring (#26818)
Joejiong Sep 2, 2020
e480168
fix dropout bug in backward when input is 1d tensor (#26837)
huangjun12 Sep 2, 2020
f772540
add time when test failed (#26935)
zhwesky2010 Sep 2, 2020
61fc7a3
Pass version check (#26887)
shangzhizhou Sep 3, 2020
26c698e
Fix catch exit code failed caused by (#26934)
Avin0323 Sep 3, 2020
29b844a
Fix clip op attr (#26924)
willthefrog Sep 3, 2020
b036294
fix uniform CI random fail problem (#26932)
pangyoki Sep 3, 2020
05da5a3
remove matplotlib requirement files (#26948)
shippingwang Sep 3, 2020
3ba7b9b
fix regularizer timeout (#26952)
Sep 3, 2020
95e1434
Add bfloat16 data type (#25402)
wozna Sep 3, 2020
96331f7
fix conv1d padding (#26921)
LielinJiang Sep 3, 2020
1275659
refine paddle.stack (#26886)
zhiqiu Sep 3, 2020
d5f4c39
set_default_type only take effect on python floats or complex (#26939)
zhwesky2010 Sep 3, 2020
559d9f2
Add summary for hapi (#26386)
LielinJiang Sep 3, 2020
6a09b8f
erase Raises and refine doce of random functions (#26901)
zhupengyang Sep 3, 2020
bcdbac1
fix some cast error. (#26884)
wzzju Sep 3, 2020
209273e
Support load state dict form `inference model` format save result (#2…
chenwhql Sep 3, 2020
3eacced
[cuda11 support] add support for cublas load of same function name (p…
wangchaochaohu Sep 3, 2020
3900f66
[API 2.0]Fix adaptive pooling bug (#26922)
shippingwang Sep 3, 2020
9cb57f9
Update set_dict method name & add aliases (#26700)
chenwhql Sep 3, 2020
e35ad3e
【paddle.fleet】support running python train.py for fleet tasks (#26249)
danleifeng Sep 3, 2020
3a2a711
【paddle.fleet】simplify fleetrun log infos (#26888)
danleifeng Sep 3, 2020
c8cc094
add template specialization for bfloat16 for gcc 4.8 compatability (#…
Sep 3, 2020
6b4ca0d
【paddle.fleet】distributed_optimizer supports dygraph (#26541)
danleifeng Sep 3, 2020
9e4fe92
fix strategy example (#26856)
mapingshuo Sep 4, 2020
fcb2ac6
fix alpha dropout bug when p=1, test=develop (#26977)
huangjun12 Sep 4, 2020
35ae102
add shell of CPU's version info (#26937)
iducn Sep 4, 2020
c484619
fix Heter Ps multi thread (#26876)
MrChengmo Sep 4, 2020
7f3e6ca
add cuda generator (#26786)
yaoxuefeng6 Sep 4, 2020
46057dd
change uniform_random to uniform and optimize function names in unitt…
pangyoki Sep 4, 2020
72f6e56
fix sample code (#26962)
MRXLT Sep 4, 2020
352ac14
update doc of paddle.to_tensor (#26820)
zhwesky2010 Sep 4, 2020
ff3dc8a
fix the remainder (#26995)
ForFishes Sep 4, 2020
5866cde
mac tests failed (#26928)
lelelelelez Sep 4, 2020
5a48952
remove rmdir build (#26965)
zhwesky2010 Sep 4, 2020
54427b8
[Api2.0] sum: bug fix - support attr(dtype) is float32 or int32 and a…
liym27 Sep 4, 2020
695bfb0
reduce repeat time of Predictor (#27011)
Aurelius84 Sep 4, 2020
4204cea
kill op_function_generator.exe (#27005)
zhwesky2010 Sep 4, 2020
21565e8
fix batchnorm ,test=develop (#26972)
frankwhzhang Sep 4, 2020
8aebcab
Test for passing global env FLAGS_use_mkldnn (#26938)
arlesniak Sep 4, 2020
0dfe26d
add pool unittest (#26949)
LDOUBLEV Sep 4, 2020
f95e8ff
Fix conv and summary api bug (#27023)
LielinJiang Sep 4, 2020
a0c98e6
fix dtype not matching bug in log_prob and probs method of Distributi…
pangyoki Sep 4, 2020
b6fb979
Fix conv1d when data formate is NLC (#26944)
wanghaoshuang Sep 4, 2020
2660ea3
Fix cuda kernel of affine grid (#27003)
wanghaoshuang Sep 4, 2020
d843706
add shape check for fill_constant OP and remove doc of type error (#…
AshburnLee Sep 4, 2020
a727522
support heter-xpu-ps (#27018)
MrChengmo Sep 4, 2020
8fa3d36
Remove "TypeError" from some files as required (#27019)
wangchaochaohu Sep 4, 2020
f6653c7
[oneDNN] Fix to conv2d grad with groups (#27006)
jczaja Sep 4, 2020
2f97ecd
fix normal ut by set manual_seed (#27028)
zhupengyang Sep 5, 2020
c3b314c
refine the error of message test=develop (#27067)
wangchaochaohu Sep 5, 2020
9aa3958
fix cuda generator hard-coded offset step (#27027)
yaoxuefeng6 Sep 5, 2020
92530ca
Fix clip output var dtype (#27029)
willthefrog Sep 5, 2020
041f4ab
refine linspace Op for dtype setting(#27071)
wangchaochaohu Sep 5, 2020
8857e39
add the dynamic dtype check for the argmin/argma
wawltor Sep 6, 2020
9373cf5
Add TranslatedLayer.program method to get program (#26961)
chenwhql Sep 6, 2020
35f53ec
[Dy2stat] Add descriptor cache for StaticLayer (#26987)
Aurelius84 Sep 7, 2020
4ff16eb
Add padding cudnn interface (#26370)
GaoWei8 Sep 7, 2020
fab8bbf
Modify data download function and support unittests of inference APIs…
Avin0323 Sep 7, 2020
19ca6d9
add .part to speed up compile (#27044)
zhupengyang Sep 7, 2020
b150f2b
disable test_trt_dynamic_shape_ernie_ser_deser,test=document_fix (#27…
XieYunshen Sep 7, 2020
f0b2631
fix _check_values_dtype_in_probs method in Distribution class (#27046)
pangyoki Sep 7, 2020
24ec517
remove the dependence of matplotlib in dataset of uci_housing
wawltor Sep 7, 2020
eb65877
fix dimensions error for mobilenetv1_KL_quant (#26776)
sfraczek Sep 7, 2020
6321254
Refine python inference api (#26958)
jiweibo Sep 7, 2020
3a02424
Refine the error of test_dynrnn_static_input (#27084)
gfwm2013 Sep 7, 2020
9b7692b
update gpu compute archs for cuda11 (#27039)
wangchaochaohu Sep 7, 2020
39d5bb6
udpate the dtype check for the argmin, argmax
wawltor Sep 7, 2020
5af81f8
fix gpu kernel for numel Op (#27085)
wangchaochaohu Sep 7, 2020
0443b48
【paddle.fleet】add auto parallel L1 implementations (#27090)
guru4elephant Sep 7, 2020
c1a8868
Change jit.save/load configs to config & update code examples (#27056)
chenwhql Sep 7, 2020
aca450f
fix the localsgd optimizer (#27094)
ForFishes Sep 7, 2020
92bf0d4
add failed unittests retry on win system (#26823)
XieYunshen Sep 7, 2020
9fd5eae
add failed unittests retry on mac system (#26813)
XieYunshen Sep 7, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
5 changes: 4 additions & 1 deletion CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,10 @@ include(generic) # simplify cmake module
# TODO(Shibo Tao): remove find_package(CUDA) completely.
find_package(CUDA QUIET)
option(WITH_GPU "Compile PaddlePaddle with NVIDIA GPU" ${CUDA_FOUND})

option(WITH_XPU "Compile PaddlePaddle with BAIDU KUNLUN" OFF)
if (WITH_GPU AND WITH_XPU)
message(FATAL_ERROR "Error when compile GPU and XPU at the same time")
endif()
# cmake 3.12, 3.13, 3.14 will append gcc link options to nvcc, and nvcc doesn't recognize them.
if(WITH_GPU AND (${CMAKE_VERSION} VERSION_GREATER_EQUAL 3.12) AND (${CMAKE_VERSION} VERSION_LESS 3.15))
message(FATAL_ERROR "cmake ${CMAKE_VERSION} is not supported when WITH_GPU=ON because of bug https://cmake.org/pipermail/cmake/2018-September/068195.html. "
Expand Down
25 changes: 4 additions & 21 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,6 @@ RUN /bin/bash -c 'if [[ -n ${UBUNTU_MIRROR} ]]; then sed -i 's#http://archive.ub
ARG WITH_GPU
ARG WITH_AVX

ENV WOBOQ OFF
ENV WITH_GPU=${WITH_GPU:-ON}
ENV WITH_AVX=${WITH_AVX:-ON}

Expand Down Expand Up @@ -149,21 +148,11 @@ RUN localedef -i en_US -f UTF-8 en_US.UTF-8
# FIXME: due to temporary ipykernel dependency issue, specify ipykernel jupyter
# version util jupyter fixes this issue.

# specify sphinx version as 1.5.6 and remove -U option for [pip install -U
# sphinx-rtd-theme] since -U option will cause sphinx being updated to newest
# version(1.7.1 for now), which causes building documentation failed.

RUN pip3 --no-cache-dir install -U wheel py-cpuinfo==5.0.0 && \
pip3 --no-cache-dir install -U docopt PyYAML sphinx==1.5.6 && \
pip3 --no-cache-dir install sphinx-rtd-theme==0.1.9 recommonmark && \
pip3.6 --no-cache-dir install -U wheel py-cpuinfo==5.0.0 && \
pip3.6 --no-cache-dir install -U docopt PyYAML sphinx==1.5.6 && \
pip3.6 --no-cache-dir install sphinx-rtd-theme==0.1.9 recommonmark && \
pip3.7 --no-cache-dir install -U wheel py-cpuinfo==5.0.0 && \
pip3.7 --no-cache-dir install -U docopt PyYAML sphinx==1.5.6 && \
pip3.7 --no-cache-dir install sphinx-rtd-theme==0.1.9 recommonmark && \
pip --no-cache-dir install -U wheel py-cpuinfo==5.0.0 && \
pip --no-cache-dir install -U docopt PyYAML sphinx==1.5.6 && \
pip --no-cache-dir install sphinx-rtd-theme==0.1.9 recommonmark

RUN pip3 --no-cache-dir install 'pre-commit==1.10.4' 'ipython==5.3.0' && \
pip3 --no-cache-dir install 'ipykernel==4.6.0' 'jupyter==1.0.0' && \
Expand All @@ -184,9 +173,9 @@ RUN pip3.6 --no-cache-dir install pylint pytest astroid isort
RUN pip3.7 --no-cache-dir install pylint pytest astroid isort
RUN pip --no-cache-dir install pylint pytest astroid isort LinkChecker

RUN pip3 --no-cache-dir install coverage
RUN pip3.6 --no-cache-dir install coverage
RUN pip3.7 --no-cache-dir install coverage
RUN pip3 --no-cache-dir install coverage
RUN pip3.6 --no-cache-dir install coverage
RUN pip3.7 --no-cache-dir install coverage
RUN pip --no-cache-dir install coverage

COPY ./python/requirements.txt /root/
Expand All @@ -204,12 +193,6 @@ RUN pip3.7 --no-cache-dir install certifi urllib3[secure]
RUN pip --no-cache-dir install certifi urllib3[secure]


# Install woboq_codebrowser to /woboq
RUN git clone https://github.com/woboq/woboq_codebrowser /woboq && \
(cd /woboq \
cmake -DLLVM_CONFIG_EXECUTABLE=/usr/bin/llvm-config-3.8 \
-DCMAKE_BUILD_TYPE=Release . \
make)

# ar mishandles 4GB files
# https://sourceware.org/bugzilla/show_bug.cgi?id=14625
Expand Down
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@ pip install paddlepaddle
# Linux GPU cuda10cudnn7
pip install paddlepaddle-gpu
# Linux GPU cuda9cudnn7
pip install paddlepaddle-gpu==1.8.3.post97
pip install paddlepaddle-gpu==1.8.4.post97

```
It is recommended to read [this doc](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) on our website.
Expand Down
2 changes: 1 addition & 1 deletion README_cn.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ pip install paddlepaddle
# Linux GPU cuda10cudnn7
pip install paddlepaddle-gpu
# Linux GPU cuda9cudnn7
pip install paddlepaddle-gpu==1.8.3.post97
pip install paddlepaddle-gpu==1.8.4.post97

```
更多安装信息详见官网 [安装说明](http://www.paddlepaddle.org.cn/documentation/docs/zh/1.8/beginners_guide/install/index_cn.html)
Expand Down
5 changes: 5 additions & 0 deletions cmake/configure.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -63,6 +63,11 @@ if(WITH_BOX_PS)
add_definitions(-DPADDLE_WITH_BOX_PS)
endif()

if(WITH_XPU)
message(STATUS "Compile with XPU!")
add_definitions(-DPADDLE_WITH_XPU)
endif()

if(WITH_GPU)
add_definitions(-DPADDLE_WITH_CUDA)
add_definitions(-DEIGEN_USE_GPU)
Expand Down
9 changes: 9 additions & 0 deletions cmake/cuda.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ else()
set(paddle_known_gpu_archs8 "30 35 50 52 60 61")
set(paddle_known_gpu_archs9 "30 35 50 52 60 61 70")
set(paddle_known_gpu_archs10 "30 35 50 52 60 61 70 75")
set(paddle_known_gpu_archs11 "35 50 52 60 61 70 75 80")
endif()

######################################################################################
Expand Down Expand Up @@ -61,6 +62,10 @@ function(detect_installed_gpus out_variable)
if(NOT CUDA_gpu_detect_output)
message(STATUS "Automatic GPU detection failed. Building for all known architectures.")
set(${out_variable} ${paddle_known_gpu_archs} PARENT_SCOPE)
#Todo: fix Automatic GPU detection failed on windows
if(WIN32)
set(${out_variable} "61 75" PARENT_SCOPE)
endif()
else()
set(${out_variable} ${CUDA_gpu_detect_output} PARENT_SCOPE)
endif()
Expand Down Expand Up @@ -184,6 +189,10 @@ elseif (${CMAKE_CUDA_COMPILER_VERSION} LESS 11.0) # CUDA 10.x
set(paddle_known_gpu_archs ${paddle_known_gpu_archs10})
set(CMAKE_CUDA_FLAGS "${CMAKE_CUDA_FLAGS} -D_MWAITXINTRIN_H_INCLUDED")
set(CMAKE_CUDA_FLAGS "${CMAKE_CUDA_FLAGS} -D__STRICT_ANSI__")
elseif (${CMAKE_CUDA_COMPILER_VERSION} LESS 12.0) # CUDA 11.x
set(paddle_known_gpu_archs ${paddle_known_gpu_archs11})
set(CMAKE_CUDA_FLAGS "${CMAKE_CUDA_FLAGS} -D_MWAITXINTRIN_H_INCLUDED")
set(CMAKE_CUDA_FLAGS "${CMAKE_CUDA_FLAGS} -D__STRICT_ANSI__")
endif()

add_definitions("-DPADDLE_CUDA_BINVER=\"${CUDA_VERSION_MAJOR}${CUDA_VERSION_MINOR}\"")
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/cub.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ include(ExternalProject)
set(CUB_PREFIX_DIR ${THIRD_PARTY_PATH}/cub)
set(CUB_SOURCE_DIR ${THIRD_PARTY_PATH}/cub/src/extern_cub)
set(CUB_REPOSITORY https://github.com/NVlabs/cub.git)
set(CUB_TAG 1.9.8)
set(CUB_TAG 1.8.0)

cache_third_party(extern_cub
REPOSITORY ${CUB_REPOSITORY}
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/lite.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -34,7 +34,7 @@ if (NOT LITE_SOURCE_DIR OR NOT LITE_BINARY_DIR)
set(LITE_INSTALL_DIR ${THIRD_PARTY_PATH}/install/lite)

if(NOT LITE_GIT_TAG)
set(LITE_GIT_TAG 42ab4d559f6659edfc35040fb30fdcec3dc3f8aa)
set(LITE_GIT_TAG dfdfa6440c83bf0b415f9f5a9ff84842ce0bb0fa)
endif()

if(NOT CUDA_ARCH_NAME)
Expand Down
54 changes: 54 additions & 0 deletions cmake/external/xpu.cmake
Original file line number Diff line number Diff line change
@@ -0,0 +1,54 @@
if (NOT WITH_XPU)
return()
endif()

INCLUDE(ExternalProject)
SET(XPU_PROJECT "extern_xpu")
SET(XPU_URL "https://kunlun1.su.bcebos.com/xpu.tar.gz" CACHE STRING "" FORCE)
SET(XPU_SOURCE_DIR "${THIRD_PARTY_PATH}/xpu")
SET(XPU_DOWNLOAD_DIR "${XPU_SOURCE_DIR}/src/${XPU_PROJECT}")
SET(XPU_INSTALL_DIR "${THIRD_PARTY_PATH}/install/xpu")
SET(XPU_API_INC_DIR "${THIRD_PARTY_PATH}/install/xpu/api/include")
SET(XPU_RUNTIME_INC_DIR "${THIRD_PARTY_PATH}/install/xpu/runtime/include")
SET(XPU_LIB_DIR "${THIRD_PARTY_PATH}/install/xpu/lib")

SET(XPU_API_LIB_NAME "libxpuapi.so")
SET(XPU_RT_LIB_NAME "libxpurt.so")
SET(XPU_SIM_LIB_NAME "libxpusim.so")
SET(XPU_API_LIB "${XPU_LIB_DIR}/${XPU_API_LIB_NAME}")
SET(XPU_RT_LIB "${XPU_LIB_DIR}/${XPU_RT_LIB_NAME}")
SET(XPU_SIM_LIB "${XPU_LIB_DIR}/${XPU_SIM_LIB_NAME}")

SET(CMAKE_INSTALL_RPATH "${CMAKE_INSTALL_RPATH}" "${XPU_INSTALL_DIR}/lib")

INCLUDE_DIRECTORIES(${XPU_API_INC_DIR})
INCLUDE_DIRECTORIES(${XPU_RUNTIME_INC_DIR})

FILE(WRITE ${XPU_DOWNLOAD_DIR}/CMakeLists.txt
"PROJECT(XPU)\n"
"cmake_minimum_required(VERSION 3.0)\n"
"install(DIRECTORY xpu/api xpu/runtime xpu/lib \n"
" DESTINATION ${XPU_INSTALL_DIR})\n")

ExternalProject_Add(
${XPU_PROJECT}
${EXTERNAL_PROJECT_LOG_ARGS}
PREFIX ${XPU_SOURCE_DIR}
DOWNLOAD_DIR ${XPU_DOWNLOAD_DIR}
DOWNLOAD_COMMAND wget --no-check-certificate ${XPU_URL} -c -q -O xpu.tar.gz
&& tar xvf xpu.tar.gz
DOWNLOAD_NO_PROGRESS 1
UPDATE_COMMAND ""
CMAKE_ARGS -DCMAKE_INSTALL_PREFIX=${XPU_INSTALL_ROOT}
CMAKE_CACHE_ARGS -DCMAKE_INSTALL_PREFIX:PATH=${XPU_INSTALL_ROOT}
)

ADD_LIBRARY(shared_xpuapi SHARED IMPORTED GLOBAL)
set_property(TARGET shared_xpuapi PROPERTY IMPORTED_LOCATION "${XPU_API_LIB}")

# generate a static dummy target to track xpulib dependencies
# for cc_library(xxx SRCS xxx.c DEPS xpulib)
generate_dummy_static_lib(LIB_NAME "xpulib" GENERATOR "xpu.cmake")

TARGET_LINK_LIBRARIES(xpulib ${XPU_API_LIB} ${XPU_RT_LIB} ${XPU_SIM_LIB})
ADD_DEPENDENCIES(xpulib ${XPU_PROJECT})
17 changes: 13 additions & 4 deletions cmake/generic.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -384,8 +384,12 @@ function(cc_test_run TARGET_NAME)
set_property(TEST ${TARGET_NAME} PROPERTY ENVIRONMENT FLAGS_cpu_deterministic=true)
set_property(TEST ${TARGET_NAME} PROPERTY ENVIRONMENT FLAGS_init_allocated_mem=true)
set_property(TEST ${TARGET_NAME} PROPERTY ENVIRONMENT FLAGS_cudnn_deterministic=true)
# No unit test should exceed 10 minutes.
set_tests_properties(${TARGET_NAME} PROPERTIES TIMEOUT 600)
# No unit test should exceed 2 minutes.
if (APPLE OR WIN32)
set_tests_properties(${TARGET_NAME} PROPERTIES TIMEOUT 600)
else()
set_tests_properties(${TARGET_NAME} PROPERTIES TIMEOUT 120)
endif()
endif()
endfunction()

Expand Down Expand Up @@ -742,9 +746,14 @@ function(py_test TARGET_NAME)
${PYTHON_EXECUTABLE} -u ${py_test_SRCS} ${py_test_ARGS}
WORKING_DIRECTORY ${CMAKE_CURRENT_BINARY_DIR})
endif()

if (APPLE OR WIN32)
set_tests_properties(${TARGET_NAME} PROPERTIES TIMEOUT 600)
else()
# No unit test should exceed 2 minutes in Linux.
set_tests_properties(${TARGET_NAME} PROPERTIES TIMEOUT 120)
endif()

# No unit test should exceed 10 minutes.
set_tests_properties(${TARGET_NAME} PROPERTIES TIMEOUT 600)
endif()
endfunction()

Expand Down
8 changes: 5 additions & 3 deletions cmake/inference_lib.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -110,10 +110,12 @@ function(copy_part_of_thrid_party TARGET DST)
SRCS ${GLOG_INCLUDE_DIR} ${GLOG_LIBRARIES}
DSTS ${dst_dir} ${dst_dir}/lib)

if (WITH_CRYPTO)
set(dst_dir "${DST}/third_party/install/cryptopp")
copy(${TARGET}
SRCS ${CRYPTOPP_INCLUDE_DIR} ${CRYPTOPP_LIBRARIES}
DSTS ${dst_dir} ${dst_dir}/lib)
SRCS ${CRYPTOPP_INCLUDE_DIR} ${CRYPTOPP_LIBRARIES}
DSTS ${dst_dir} ${dst_dir}/lib)
endif()

set(dst_dir "${DST}/third_party/install/xxhash")
copy(${TARGET}
Expand Down Expand Up @@ -187,7 +189,7 @@ copy(inference_lib_dist
SRCS ${CMAKE_BINARY_DIR}/paddle/fluid/framework/framework.pb.h
DSTS ${FLUID_INFERENCE_INSTALL_DIR}/paddle/include/internal)
copy(inference_lib_dist
SRCS ${CMAKE_BINARY_DIR}/../paddle/fluid/framework/io/crypto/cipher.h
SRCS ${PADDLE_SOURCE_DIR}/paddle/fluid/framework/io/crypto/cipher.h
DSTS ${FLUID_INFERENCE_INSTALL_DIR}/paddle/include/crypto/)
include_directories(${CMAKE_BINARY_DIR}/../paddle/fluid/framework/io)

Expand Down
18 changes: 16 additions & 2 deletions cmake/operators.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,7 @@ function(op_library TARGET)
set(hip_cu_srcs)
set(miopen_hip_cc_srcs)
set(cu_cc_srcs)
set(xpu_cc_srcs)
set(cudnn_cu_cc_srcs)
set(cudnn_cu_srcs)
set(CUDNN_FILE)
Expand Down Expand Up @@ -60,6 +61,12 @@ function(op_library TARGET)
list(APPEND mkldnn_cc_srcs mkldnn/${MKLDNN_FILE}.cc)
endif()
endif()
if(WITH_XPU)
string(REPLACE "_op" "_xpu_op" XPU_FILE "${TARGET}")
if (EXISTS ${CMAKE_CURRENT_SOURCE_DIR}/xpu/${XPU_FILE}.cc)
list(APPEND xpu_cc_srcs xpu/${XPU_FILE}.cc)
endif()
endif()
else()
foreach(src ${op_library_SRCS})
if (${src} MATCHES ".*\\.hip.cu$")
Expand All @@ -76,6 +83,8 @@ function(op_library TARGET)
list(APPEND mkldnn_cc_srcs ${src})
elseif(${src} MATCHES ".*\\.cu.cc$")
list(APPEND cu_cc_srcs ${src})
elseif(WITH_XPU AND ${src} MATCHES ".*_xpu_op.cc$")
list(APPEND xpu_cc_srcs ${src})
elseif(${src} MATCHES ".*\\.cc$")
list(APPEND cc_srcs ${src})
else()
Expand Down Expand Up @@ -109,7 +118,7 @@ function(op_library TARGET)
hip_library(${TARGET} SRCS ${cc_srcs} ${hip_cu_srcs} ${miopen_hip_cc_srcs} ${mkldnn_cc_srcs} DEPS ${op_library_DEPS}
${op_common_deps})
else()
cc_library(${TARGET} SRCS ${cc_srcs} ${mkldnn_cc_srcs} DEPS ${op_library_DEPS}
cc_library(${TARGET} SRCS ${cc_srcs} ${mkldnn_cc_srcs} ${xpu_cc_srcs} DEPS ${op_library_DEPS}
${op_common_deps})
endif()

Expand Down Expand Up @@ -150,10 +159,11 @@ function(op_library TARGET)
list(LENGTH cu_srcs cu_srcs_len)
list(LENGTH cu_cc_srcs cu_cc_srcs_len)
list(LENGTH mkldnn_cc_srcs mkldnn_cc_srcs_len)
list(LENGTH xpu_cc_srcs xpu_cc_srcs_len)
list(LENGTH hip_cu_srcs hip_cu_srcs_len)
list(LENGTH miopen_hip_cc_srcs miopen_hip_cc_srcs_len)
if (${pybind_flag} EQUAL 0 AND ${mkldnn_cc_srcs_len} EQUAL 0 AND ${cu_srcs_len} EQUAL 0 AND ${cu_cc_srcs_len} EQUAL 0 AND
${hip_cu_srcs_len} EQUAL 0 AND ${miopen_hip_cc_srcs_len} EQUAL 0)
${hip_cu_srcs_len} EQUAL 0 AND ${miopen_hip_cc_srcs_len} EQUAL 0 AND ${xpu_cc_srcs_len} EQUAL 0)
file(APPEND ${pybind_file} "USE_CPU_ONLY_OP(${TARGET});\n")
set(pybind_flag 1)
endif()
Expand All @@ -179,6 +189,9 @@ function(op_library TARGET)
file(APPEND ${pybind_file} "USE_OP_DEVICE_KERNEL(${TARGET}, MIOPEN);\n")
endif()

if (WITH_XPU AND ${xpu_cc_srcs_len} GREATER 0)
file(APPEND ${pybind_file} "USE_OP_DEVICE_KERNEL(${TARGET}, XPU);\n")
endif()
# pybind USE_OP_DEVICE_KERNEL for MKLDNN
if (WITH_MKLDNN AND ${mkldnn_cc_srcs_len} GREATER 0)
# Append first implemented MKLDNN activation operator
Expand Down Expand Up @@ -228,6 +241,7 @@ function(register_operators)

file(GLOB OPS RELATIVE "${CMAKE_CURRENT_SOURCE_DIR}" "*_op.cc")
string(REPLACE "_mkldnn" "" OPS "${OPS}")
string(REPLACE "_xpu" "" OPS "${OPS}")
string(REPLACE ".cc" "" OPS "${OPS}")
list(REMOVE_DUPLICATES OPS)
list(LENGTH register_operators_DEPS register_operators_DEPS_len)
Expand Down
5 changes: 5 additions & 0 deletions cmake/third_party.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -250,6 +250,11 @@ if(WITH_GPU)
file_download_and_uncompress(${CUDAERROR_URL} "cudaerror") # download file cudaErrorMessage
endif(WITH_GPU)

if(WITH_XPU)
include(external/xpu) # download, build, install xpu
list(APPEND third_party_deps extern_xpu)
endif(WITH_XPU)

if(WITH_PSLIB)
include(external/pslib) # download, build, install pslib
list(APPEND third_party_deps extern_pslib)
Expand Down
7 changes: 7 additions & 0 deletions go/paddle/config.go
Original file line number Diff line number Diff line change
Expand Up @@ -154,10 +154,17 @@ func (config *AnalysisConfig) EnableMkldnnQuantizer() {
C.PD_EnableMkldnnQuantizer(config.c)
}

func (config *AnalysisConfig) EnableMkldnnBfloat16() {
C.PD_EnableMkldnnBfloat16(config.c)
}

func (config *AnalysisConfig) MkldnnQuantizerEnabled() bool {
return ConvertCBooleanToGo(C.PD_MkldnnQuantizerEnabled(config.c))
}

func (config *AnalysisConfig) MkldnnBfloat16Enabled() bool {
return ConvertCBooleanToGo(C.PD_MkldnnBfloat16Enabled(config.c))
}
// SetModelBuffer
// ModelFromMemory

Expand Down
7 changes: 6 additions & 1 deletion paddle/fluid/framework/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -119,9 +119,13 @@ cc_test(data_layout_transform_test SRCS data_layout_transform_test.cc DEPS data_
cc_library(data_transform SRCS data_transform.cc DEPS math_function tensor
framework_proto selected_rows data_device_transform data_type_transform data_layout_transform)

cc_library(attribute SRCS attribute.cc DEPS framework_proto boost)
cc_library(attribute SRCS attribute.cc DEPS framework_proto boost enforce)
cc_test(program_desc_test SRCS program_desc_test.cc DEPS proto_desc
device_context)

cc_library(op_version_registry SRCS op_version_registry.cc DEPS framework_proto boost)
cc_test(op_version_registry_test SRCS op_version_registry_test.cc DEPS op_version_registry)

cc_library(op_proto_maker SRCS op_proto_maker.cc DEPS framework_proto attribute glog)
cc_test(op_proto_maker_test SRCS op_proto_maker_test.cc DEPS op_proto_maker)
cc_library(no_need_buffer_vars_inference SRCS no_need_buffer_vars_inference.cc DEPS attribute device_context)
Expand Down Expand Up @@ -268,6 +272,7 @@ cc_test(op_compatible_info_test SRCS op_compatible_info_test.cc DEPS op_compatib

cc_library(save_load_util SRCS save_load_util DEPS tensor scope layer)
cc_test(save_load_util_test SRCS save_load_util_test.cc DEPS save_load_util tensor scope layer)
cc_library(generator SRCS generator.cc DEPS enforce place)

# Get the current working branch
execute_process(
Expand Down
2 changes: 2 additions & 0 deletions paddle/fluid/framework/data_layout_transform.cc
Original file line number Diff line number Diff line change
Expand Up @@ -116,6 +116,8 @@ void* GetDataFromTensor(const Tensor& tensor, mkldnn::memory::data_type type) {
return platform::to_void_cast(tensor.data<unsigned char>());
case mkldnn::memory::data_type::s32:
return platform::to_void_cast(tensor.data<int32_t>());
case mkldnn::memory::data_type::bf16:
return platform::to_void_cast(tensor.data<paddle::platform::bfloat16>());
default:
PADDLE_THROW(
platform::errors::InvalidArgument("Wrong mkldnn type provided."));
Expand Down
Loading