In [1]:
# 04_Train_TransR
#
# created by LuYF-Lemon-love <luyanfeng_nlp@qq.com> on February 27, 2023
# updated by LuYF-Lemon-love <luyanfeng_nlp@qq.com> on February 27, 2023
#
# This script shows how to train a model (TransR) on DRKG and use grid search to find the best hyperparameters.
#
# Required packages:
#          torch
#          dgl, version: 0.4.3
#          dglke
#          numpy
#
# Required files:
#          ./dataset
#
# Source tutorial: https://github.com/gnn4dr/DRKG/blob/master/embedding_analysis/Train_embeddings.ipynb

# Training DRKG Using TransR

This notebook shows how to train a model (TransR) on DRKG and use grid search to find the best hyperparameters.

## Import the required libraries

In [2]:
import numpy as np

## Grid search parameters

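We can train the TransR model with the DGL-KE command-line tool. For more information on how to use DGL-KE, see https://github.com/awslabs/dgl-ke.

Here we train the model on two GPUs.

Estimated training time per run, at roughly 2.7 s per step (as reported in the training log): 100000 * 2.7 / 3600 = 75.0 h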
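The estimate above can be reproduced with a few lines of Python. This is only a sketch: `estimate_training_hours` is a hypothetical helper name, and the 2.7 s per step figure is an approximate average read off the "1 steps take ... seconds" lines in the training log, not a measured constant.

```python
def estimate_training_hours(max_step: int, seconds_per_step: float) -> float:
    """Rough wall-clock estimate: steps * seconds per step, converted to hours."""
    return max_step * seconds_per_step / 3600


# max_step comes from the --max_step flag; 2.7 s/step is an assumed average
# taken from the training log.
hours = estimate_training_hours(max_step=100_000, seconds_per_step=2.7)
print(f"Estimated training time: {hours:.1f} h")
```

Since the grid contains several configurations, the total search time would be roughly this figure multiplied by the number of runs, assuming they are executed one after another.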

### 1

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: **6**, 12, 18

- lr: **0.01**, 0.05, 0.1
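The candidate values listed above can be enumerated programmatically. The sketch below is a minimal illustration, not part of the original tutorial: `iter_grid` and `build_command` are hypothetical helpers, and the command template simply copies the flags used in the training cell of this notebook. The bold values in the list appear to be the ones used in run 1.

```python
from itertools import product

# Fixed settings and search space, copied from the list above.
FIXED = {"batch_size": 4096, "neg_sample_size": 256}
SEARCH_SPACE = {
    "hidden_dim": [200, 400],
    "gamma": [6.0, 12.0, 18.0],
    "lr": [0.01, 0.05, 0.1],
}


def iter_grid(space):
    """Yield one dict per combination of the values in `space`."""
    keys = list(space)
    for values in product(*(space[k] for k in keys)):
        yield dict(zip(keys, values))


def build_command(params):
    """Hypothetical helper: format a dglke_train command for one configuration."""
    p = {**FIXED, **params}
    return (
        "DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset "
        "--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' "
        "--model_name TransR "
        f"--batch_size {p['batch_size']} --neg_sample_size {p['neg_sample_size']} "
        f"--hidden_dim {p['hidden_dim']} --gamma {p['gamma']} --lr {p['lr']} "
        "--max_step 100000 -adv --regularization_coef 1.00E-07 "
        "--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 "
        "--valid --test "
        "--batch_size_eval 128 --neg_sample_size_eval 10000 "
        "--log_interval 1 --eval_interval 50000 --num_thread 32"
    )


configs = list(iter_grid(SEARCH_SPACE))
print(f"{len(configs)} configurations in the grid")
print(build_command(configs[0]))
```

The first configuration produced this way (hidden_dim 200, gamma 6.0, lr 0.01) matches the bold values, so each numbered subsection can be read as one point of this grid.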

In [3]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 6.0 --lr 0.01 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 1 --eval_interval 50000 --num_thread 32

Reading train triples....
Finished. Read 5286834 train triples.
Reading valid triples....
Finished. Read 293713 valid triples.
Reading test triples....
Finished. Read 293714 test triples.
|Train|: 5286834
random partition 5286834 edges into 2 parts
part 0 has 2643417 edges
part 1 has 2643417 edges
|valid|: 293713
|test|: 293714
Total initialize time 16.306 seconds
[proc 1][Train](1/100000) average pos_loss: 36.63789749145508
[proc 1][Train](1/100000) average neg_loss: 0.006224163342267275
[proc 1][Train](1/100000) average loss: 18.32206153869629
[proc 1][Train](1/100000) average regularization: 4.492153038881952e-06
[proc 1][Train] 1 steps take 5.332 seconds
[proc 1]sample: 0.235, forward: 2.697, backward: 0.097, update: 2.302
[proc 0][Train](1/100000) average pos_loss: 36.601043701171875
[proc 0][Train](1/100000) average neg_loss: 0.0046770223416388035
[proc 0][Train](1/100000) average loss: 18.302860260009766
[proc 0][Train](1/100000) average regularization: 4.475013611227041e-06
[pr

[proc 0][Train](11/100000) average pos_loss: 9.713321685791016
[proc 0][Train](11/100000) average neg_loss: 0.09249696135520935
[proc 0][Train](11/100000) average loss: 4.902909278869629
[proc 0][Train](11/100000) average regularization: 1.2937773590238066e-06
[proc 0][Train] 1 steps take 2.837 seconds
[proc 0]sample: 0.002, forward: 0.434, backward: 0.070, update: 2.331
[proc 1][Train](12/100000) average pos_loss: 9.355192184448242
[proc 1][Train](12/100000) average neg_loss: 0.08135785162448883
[proc 1][Train](12/100000) average loss: 4.71827507019043
[proc 1][Train](12/100000) average regularization: 1.2754463796227355e-06
[proc 1][Train] 1 steps take 2.964 seconds
[proc 1]sample: 0.003, forward: 0.540, backward: 0.071, update: 2.350
[proc 0][Train](12/100000) average pos_loss: 9.171796798706055
[proc 0][Train](12/100000) average neg_loss: 0.09549665451049805
[proc 0][Train](12/100000) average loss: 4.6336469650268555
[proc 0][Train](12/100000) average regularization: 1.300097324019

[proc 1][Train](22/100000) average pos_loss: 5.2835798263549805
[proc 1][Train](22/100000) average neg_loss: 0.26237356662750244
[proc 1][Train](22/100000) average loss: 2.7729766368865967
[proc 1][Train](22/100000) average regularization: 1.26339728012681e-06
[proc 1][Train] 1 steps take 2.971 seconds
[proc 1]sample: 0.002, forward: 0.494, backward: 0.070, update: 2.404
[proc 1][Train](23/100000) average pos_loss: 5.034053802490234
[proc 1][Train](23/100000) average neg_loss: 0.3313691318035126
[proc 1][Train](23/100000) average loss: 2.682711362838745
[proc 1][Train](23/100000) average regularization: 1.2029570370941656e-06
[proc 1][Train] 1 steps take 2.857 seconds
[proc 1]sample: 0.004, forward: 0.471, backward: 0.069, update: 2.313
[proc 0][Train](23/100000) average pos_loss: 4.892105579376221
[proc 0][Train](23/100000) average neg_loss: 0.3143060505390167
[proc 0][Train](23/100000) average loss: 2.603205919265747
[proc 0][Train](23/100000) average regularization: 1.21890468562924

[proc 1][Train](33/100000) average pos_loss: 3.417573928833008
[proc 1][Train](33/100000) average neg_loss: 0.4230961203575134
[proc 1][Train](33/100000) average loss: 1.920335054397583
[proc 1][Train](33/100000) average regularization: 1.5333789633587003e-06
[proc 1][Train] 1 steps take 2.895 seconds
[proc 1]sample: 0.026, forward: 0.468, backward: 0.069, update: 2.331
[proc 0][Train](34/100000) average pos_loss: 3.2738168239593506
[proc 0][Train](34/100000) average neg_loss: 0.38912004232406616
[proc 0][Train](34/100000) average loss: 1.8314684629440308
[proc 0][Train](34/100000) average regularization: 1.713571236905409e-06
[proc 0][Train] 1 steps take 2.820 seconds
[proc 0]sample: 0.017, forward: 0.486, backward: 0.069, update: 2.248
[proc 1][Train](34/100000) average pos_loss: 3.443295478820801
[proc 1][Train](34/100000) average neg_loss: 0.3749896287918091
[proc 1][Train](34/100000) average loss: 1.9091424942016602
[proc 1][Train](34/100000) average regularization: 1.784023083928

[proc 1][Train](44/100000) average pos_loss: 2.655526638031006
[proc 1][Train](44/100000) average neg_loss: 0.4615135192871094
[proc 1][Train](44/100000) average loss: 1.5585200786590576
[proc 1][Train](44/100000) average regularization: 2.0203976873744978e-06
[proc 1][Train] 1 steps take 2.803 seconds
[proc 1]sample: 0.002, forward: 0.400, backward: 0.070, update: 2.331
[proc 0][Train](45/100000) average pos_loss: 2.6914446353912354
[proc 0][Train](45/100000) average neg_loss: 0.4003305435180664
[proc 0][Train](45/100000) average loss: 1.5458875894546509
[proc 0][Train](45/100000) average regularization: 1.942932158272015e-06
[proc 0][Train] 1 steps take 2.567 seconds
[proc 0]sample: 0.002, forward: 0.420, backward: 0.071, update: 2.074
[proc 1][Train](45/100000) average pos_loss: 2.4930062294006348
[proc 1][Train](45/100000) average neg_loss: 0.38489073514938354
[proc 1][Train](45/100000) average loss: 1.4389485120773315
[proc 1][Train](45/100000) average regularization: 2.1035025383

[proc 1][Train](55/100000) average pos_loss: 2.03725004196167
[proc 1][Train](55/100000) average neg_loss: 0.38079530000686646
[proc 1][Train](55/100000) average loss: 1.2090226411819458
[proc 1][Train](55/100000) average regularization: 2.316233121746336e-06
[proc 1][Train] 1 steps take 2.814 seconds
[proc 1]sample: 0.001, forward: 0.458, backward: 0.070, update: 2.284
[proc 0][Train](56/100000) average pos_loss: 2.102231979370117
[proc 0][Train](56/100000) average neg_loss: 0.44959554076194763
[proc 0][Train](56/100000) average loss: 1.2759137153625488
[proc 0][Train](56/100000) average regularization: 2.4225089418905554e-06
[proc 0][Train] 1 steps take 2.807 seconds
[proc 0]sample: 0.001, forward: 0.435, backward: 0.070, update: 2.301
[proc 1][Train](56/100000) average pos_loss: 2.027817726135254
[proc 1][Train](56/100000) average neg_loss: 0.49231860041618347
[proc 1][Train](56/100000) average loss: 1.2600681781768799
[proc 1][Train](56/100000) average regularization: 2.45439605350

[proc 0][Train](66/100000) average pos_loss: 1.8157953023910522
[proc 0][Train](66/100000) average neg_loss: 0.5005790591239929
[proc 0][Train](66/100000) average loss: 1.1581871509552002
[proc 0][Train](66/100000) average regularization: 2.588224106148118e-06
[proc 0][Train] 1 steps take 2.735 seconds
[proc 0]sample: 0.017, forward: 0.431, backward: 0.069, update: 2.217
[proc 1][Train](67/100000) average pos_loss: 1.7446918487548828
[proc 1][Train](67/100000) average neg_loss: 0.4065309762954712
[proc 1][Train](67/100000) average loss: 1.0756113529205322
[proc 1][Train](67/100000) average regularization: 2.6292200345778838e-06
[proc 1][Train] 1 steps take 2.674 seconds
[proc 1]sample: 0.002, forward: 0.427, backward: 0.071, update: 2.174
[proc 0][Train](67/100000) average pos_loss: 1.7378671169281006
[proc 0][Train](67/100000) average neg_loss: 0.39394572377204895
[proc 0][Train](67/100000) average loss: 1.0659064054489136
[proc 0][Train](67/100000) average regularization: 2.728880417

[proc 0][Train](77/100000) average pos_loss: 1.5509196519851685
[proc 0][Train](77/100000) average neg_loss: 0.3982469439506531
[proc 0][Train](77/100000) average loss: 0.9745832681655884
[proc 0][Train](77/100000) average regularization: 2.941318143712124e-06
[proc 0][Train] 1 steps take 2.655 seconds
[proc 0]sample: 0.002, forward: 0.444, backward: 0.072, update: 2.137
[proc 1][Train](78/100000) average pos_loss: 1.5200607776641846
[proc 1][Train](78/100000) average neg_loss: 0.5272951126098633
[proc 1][Train](78/100000) average loss: 1.023677945137024
[proc 1][Train](78/100000) average regularization: 3.008342901011929e-06
[proc 1][Train] 1 steps take 2.672 seconds
[proc 1]sample: 0.001, forward: 0.432, backward: 0.070, update: 2.168
[proc 0][Train](78/100000) average pos_loss: 1.5037167072296143
[proc 0][Train](78/100000) average neg_loss: 0.4786224365234375
[proc 0][Train](78/100000) average loss: 0.9911695718765259
[proc 0][Train](78/100000) average regularization: 2.960464144052

[proc 0][Train](88/100000) average pos_loss: 1.443807601928711
[proc 0][Train](88/100000) average neg_loss: 0.5417848825454712
[proc 0][Train](88/100000) average loss: 0.9927962422370911
[proc 0][Train](88/100000) average regularization: 3.228076138839242e-06
[proc 0][Train] 1 steps take 2.578 seconds
[proc 0]sample: 0.001, forward: 0.436, backward: 0.069, update: 2.071
[proc 1][Train](89/100000) average pos_loss: 1.3767192363739014
[proc 1][Train](89/100000) average neg_loss: 0.3275604844093323
[proc 1][Train](89/100000) average loss: 0.8521398305892944
[proc 1][Train](89/100000) average regularization: 3.282636271251249e-06
[proc 1][Train] 1 steps take 2.774 seconds
[proc 1]sample: 0.002, forward: 0.426, backward: 0.070, update: 2.276
[proc 0][Train](89/100000) average pos_loss: 1.3940485715866089
[proc 0][Train](89/100000) average neg_loss: 0.3984299302101135
[proc 0][Train](89/100000) average loss: 0.8962392807006836
[proc 0][Train](89/100000) average regularization: 3.287250592620

[proc 0][Train](99/100000) average pos_loss: 1.2455143928527832
[proc 0][Train](99/100000) average neg_loss: 0.36318397521972656
[proc 0][Train](99/100000) average loss: 0.8043491840362549
[proc 0][Train](99/100000) average regularization: 3.6083683880860917e-06
[proc 0][Train] 1 steps take 2.670 seconds
[proc 0]sample: 0.002, forward: 0.404, backward: 0.069, update: 2.195
[proc 1][Train](100/100000) average pos_loss: 1.2287588119506836
[proc 1][Train](100/100000) average neg_loss: 0.5226110219955444
[proc 1][Train](100/100000) average loss: 0.875684916973114
[proc 1][Train](100/100000) average regularization: 3.691867959787487e-06
[proc 1][Train] 1 steps take 2.655 seconds
[proc 1]sample: 0.001, forward: 0.424, backward: 0.071, update: 2.158
[proc 0][Train](100/100000) average pos_loss: 1.2744104862213135
[proc 0][Train](100/100000) average neg_loss: 0.5199477076530457
[proc 0][Train](100/100000) average loss: 0.897179126739502
[proc 0][Train](100/100000) average regularization: 3.574

[proc 0][Train](110/100000) average pos_loss: 1.1528018712997437
[proc 0][Train](110/100000) average neg_loss: 0.5700418949127197
[proc 0][Train](110/100000) average loss: 0.8614218831062317
[proc 0][Train](110/100000) average regularization: 3.903267497662455e-06
[proc 0][Train] 1 steps take 2.727 seconds
[proc 0]sample: 0.001, forward: 0.492, backward: 0.070, update: 2.164
[proc 1][Train](111/100000) average pos_loss: 1.173447608947754
[proc 1][Train](111/100000) average neg_loss: 0.3729130029678345
[proc 1][Train](111/100000) average loss: 0.7731803059577942
[proc 1][Train](111/100000) average regularization: 3.871577064273879e-06
[proc 1][Train] 1 steps take 2.730 seconds
[proc 1]sample: 0.001, forward: 0.405, backward: 0.069, update: 2.254
[proc 0][Train](111/100000) average pos_loss: 1.1785063743591309
[proc 0][Train](111/100000) average neg_loss: 0.3667331039905548
[proc 0][Train](111/100000) average loss: 0.7726197242736816
[proc 0][Train](111/100000) average regularization: 3.

[proc 0][Train](121/100000) average pos_loss: 1.1143460273742676
[proc 0][Train](121/100000) average neg_loss: 0.3600374460220337
[proc 0][Train](121/100000) average loss: 0.7371917366981506
[proc 0][Train](121/100000) average regularization: 4.106740561837796e-06
[proc 0][Train] 1 steps take 2.645 seconds
[proc 0]sample: 0.001, forward: 0.443, backward: 0.069, update: 2.132
[proc 1][Train](122/100000) average pos_loss: 1.135267734527588
[proc 1][Train](122/100000) average neg_loss: 0.5426846742630005
[proc 1][Train](122/100000) average loss: 0.8389762043952942
[proc 1][Train](122/100000) average regularization: 4.016516868432518e-06
[proc 1][Train] 1 steps take 2.632 seconds
[proc 1]sample: 0.002, forward: 0.424, backward: 0.070, update: 2.137
[proc 0][Train](122/100000) average pos_loss: 1.1176319122314453
[proc 0][Train](122/100000) average neg_loss: 0.5417845845222473
[proc 0][Train](122/100000) average loss: 0.8297082185745239
[proc 0][Train](122/100000) average regularization: 4.

[proc 0][Train](132/100000) average pos_loss: 0.9551223516464233
[proc 0][Train](132/100000) average neg_loss: 0.5561748147010803
[proc 0][Train](132/100000) average loss: 0.7556486129760742
[proc 0][Train](132/100000) average regularization: 4.388291472423589e-06
[proc 0][Train] 1 steps take 2.692 seconds
[proc 0]sample: 0.001, forward: 0.429, backward: 0.070, update: 2.191
[proc 1][Train](133/100000) average pos_loss: 1.0083160400390625
[proc 1][Train](133/100000) average neg_loss: 0.3248260021209717
[proc 1][Train](133/100000) average loss: 0.6665710210800171
[proc 1][Train](133/100000) average regularization: 4.421291578182718e-06
[proc 1][Train] 1 steps take 2.680 seconds
[proc 1]sample: 0.002, forward: 0.432, backward: 0.070, update: 2.176
[proc 0][Train](133/100000) average pos_loss: 1.0957579612731934
[proc 0][Train](133/100000) average neg_loss: 0.34142327308654785
[proc 0][Train](133/100000) average loss: 0.7185906171798706
[proc 0][Train](133/100000) average regularization: 

[proc 0][Train](143/100000) average pos_loss: 0.9675946235656738
[proc 0][Train](143/100000) average neg_loss: 0.352138876914978
[proc 0][Train](143/100000) average loss: 0.6598667502403259
[proc 0][Train](143/100000) average regularization: 4.434364655026002e-06
[proc 0][Train] 1 steps take 2.659 seconds
[proc 0]sample: 0.001, forward: 0.414, backward: 0.071, update: 2.174
[proc 1][Train](144/100000) average pos_loss: 0.9976010322570801
[proc 1][Train](144/100000) average neg_loss: 0.5486695170402527
[proc 1][Train](144/100000) average loss: 0.7731353044509888
[proc 1][Train](144/100000) average regularization: 4.556133717414923e-06
[proc 1][Train] 1 steps take 2.668 seconds
[proc 1]sample: 0.001, forward: 0.427, backward: 0.069, update: 2.171
[proc 0][Train](144/100000) average pos_loss: 0.9982150793075562
[proc 0][Train](144/100000) average neg_loss: 0.5439677238464355
[proc 0][Train](144/100000) average loss: 0.7710914015769958
[proc 0][Train](144/100000) average regularization: 4.

[proc 0][Train](154/100000) average pos_loss: 0.9659552574157715
[proc 0][Train](154/100000) average neg_loss: 0.5303310751914978
[proc 0][Train](154/100000) average loss: 0.748143196105957
[proc 0][Train](154/100000) average regularization: 4.762612661579624e-06
[proc 0][Train] 1 steps take 2.741 seconds
[proc 0]sample: 0.001, forward: 0.435, backward: 0.069, update: 2.236
[proc 1][Train](155/100000) average pos_loss: 0.9691283702850342
[proc 1][Train](155/100000) average neg_loss: 0.34755581617355347
[proc 1][Train](155/100000) average loss: 0.6583421230316162
[proc 1][Train](155/100000) average regularization: 4.9209043027076405e-06
[proc 1][Train] 1 steps take 2.587 seconds
[proc 1]sample: 0.001, forward: 0.421, backward: 0.069, update: 2.095
[proc 0][Train](155/100000) average pos_loss: 0.9421359300613403
[proc 0][Train](155/100000) average neg_loss: 0.34409099817276
[proc 0][Train](155/100000) average loss: 0.6431134939193726
[proc 0][Train](155/100000) average regularization: 4.

[proc 0][Train](165/100000) average pos_loss: 0.9140968322753906
[proc 0][Train](165/100000) average neg_loss: 0.3183598518371582
[proc 0][Train](165/100000) average loss: 0.6162283420562744
[proc 0][Train](165/100000) average regularization: 5.017063358536689e-06
[proc 0][Train] 1 steps take 2.686 seconds
[proc 0]sample: 0.001, forward: 0.431, backward: 0.069, update: 2.184
[proc 1][Train](166/100000) average pos_loss: 0.9356465339660645
[proc 1][Train](166/100000) average neg_loss: 0.5482965707778931
[proc 1][Train](166/100000) average loss: 0.7419715523719788
[proc 1][Train](166/100000) average regularization: 4.974972398485988e-06
[proc 1][Train] 1 steps take 2.692 seconds
[proc 1]sample: 0.001, forward: 0.437, backward: 0.070, update: 2.184
[proc 0][Train](166/100000) average pos_loss: 0.961479663848877
[proc 0][Train](166/100000) average neg_loss: 0.5540453791618347
[proc 0][Train](166/100000) average loss: 0.7577625513076782
[proc 0][Train](166/100000) average regularization: 5.

[proc 0][Train](176/100000) average pos_loss: 0.8827727437019348
[proc 0][Train](176/100000) average neg_loss: 0.56690913438797
[proc 0][Train](176/100000) average loss: 0.7248409390449524
[proc 0][Train](176/100000) average regularization: 5.23364860782749e-06
[proc 0][Train] 1 steps take 2.625 seconds
[proc 0]sample: 0.001, forward: 0.433, backward: 0.070, update: 2.120
[proc 1][Train](177/100000) average pos_loss: 0.9066434502601624
[proc 1][Train](177/100000) average neg_loss: 0.30344367027282715
[proc 1][Train](177/100000) average loss: 0.6050435304641724
[proc 1][Train](177/100000) average regularization: 5.256356871541357e-06
[proc 1][Train] 1 steps take 2.611 seconds
[proc 1]sample: 0.017, forward: 0.426, backward: 0.069, update: 2.099
[proc 0][Train](177/100000) average pos_loss: 0.8900147676467896
[proc 0][Train](177/100000) average neg_loss: 0.30464738607406616
[proc 0][Train](177/100000) average loss: 0.5973310470581055
[proc 0][Train](177/100000) average regularization: 5.

[proc 0][Train](187/100000) average pos_loss: 0.8975009918212891
[proc 0][Train](187/100000) average neg_loss: 0.3395116329193115
[proc 0][Train](187/100000) average loss: 0.6185063123703003
[proc 0][Train](187/100000) average regularization: 5.292592049954692e-06
[proc 0][Train] 1 steps take 2.623 seconds
[proc 0]sample: 0.001, forward: 0.427, backward: 0.070, update: 2.125
[proc 1][Train](188/100000) average pos_loss: 0.9098901748657227
[proc 1][Train](188/100000) average neg_loss: 0.5590108633041382
[proc 1][Train](188/100000) average loss: 0.7344505190849304
[proc 1][Train](188/100000) average regularization: 5.41566259926185e-06
[proc 1][Train] 1 steps take 2.653 seconds
[proc 1]sample: 0.001, forward: 0.427, backward: 0.071, update: 2.154
[proc 0][Train](188/100000) average pos_loss: 0.8687083125114441
[proc 0][Train](188/100000) average neg_loss: 0.6011697053909302
[proc 0][Train](188/100000) average loss: 0.7349389791488647
[proc 0][Train](188/100000) average regularization: 5.

[proc 0][Train](198/100000) average pos_loss: 0.7827413082122803
[proc 0][Train](198/100000) average neg_loss: 0.5266755223274231
[proc 0][Train](198/100000) average loss: 0.6547083854675293
[proc 0][Train](198/100000) average regularization: 5.451427568914369e-06
[proc 0][Train] 1 steps take 2.641 seconds
[proc 0]sample: 0.001, forward: 0.431, backward: 0.071, update: 2.138
[proc 1][Train](199/100000) average pos_loss: 0.7952890396118164
[proc 1][Train](199/100000) average neg_loss: 0.36233651638031006
[proc 1][Train](199/100000) average loss: 0.5788127779960632
[proc 1][Train](199/100000) average regularization: 5.5439472816942725e-06
[proc 1][Train] 1 steps take 2.650 seconds
[proc 1]sample: 0.001, forward: 0.428, backward: 0.069, update: 2.150
[proc 0][Train](199/100000) average pos_loss: 0.8056408166885376
[proc 0][Train](199/100000) average neg_loss: 0.34826964139938354
[proc 0][Train](199/100000) average loss: 0.5769551992416382
[proc 0][Train](199/100000) average regularization

[proc 0][Train](209/100000) average pos_loss: 0.837128221988678
[proc 0][Train](209/100000) average neg_loss: 0.3421283960342407
[proc 0][Train](209/100000) average loss: 0.5896283388137817
[proc 0][Train](209/100000) average regularization: 5.6147282521124e-06
[proc 0][Train] 1 steps take 2.692 seconds
[proc 0]sample: 0.015, forward: 0.442, backward: 0.070, update: 2.165
[proc 1][Train](210/100000) average pos_loss: 0.8224406838417053
[proc 1][Train](210/100000) average neg_loss: 0.5582776069641113
[proc 1][Train](210/100000) average loss: 0.6903591156005859
[proc 1][Train](210/100000) average regularization: 5.740109827456763e-06
[proc 1][Train] 1 steps take 2.666 seconds
[proc 1]sample: 0.015, forward: 0.433, backward: 0.070, update: 2.148
[proc 0][Train](210/100000) average pos_loss: 0.8413062691688538
[proc 0][Train](210/100000) average neg_loss: 0.5268802642822266
[proc 0][Train](210/100000) average loss: 0.6840932369232178
[proc 0][Train](210/100000) average regularization: 5.66

[proc 0][Train](220/100000) average pos_loss: 0.7794954776763916
[proc 0][Train](220/100000) average neg_loss: 0.5654606223106384
[proc 0][Train](220/100000) average loss: 0.6724780797958374
[proc 0][Train](220/100000) average regularization: 5.951173534413101e-06
[proc 0][Train] 1 steps take 2.784 seconds
[proc 0]sample: 0.002, forward: 0.439, backward: 0.070, update: 2.273
[proc 1][Train](221/100000) average pos_loss: 0.8082103729248047
[proc 1][Train](221/100000) average neg_loss: 0.31316250562667847
[proc 1][Train](221/100000) average loss: 0.560686469078064
[proc 1][Train](221/100000) average regularization: 5.835905540152453e-06
[proc 1][Train] 1 steps take 2.853 seconds
[proc 1]sample: 0.002, forward: 0.476, backward: 0.069, update: 2.306
[proc 0][Train](221/100000) average pos_loss: 0.7710275650024414
[proc 0][Train](221/100000) average neg_loss: 0.3137434720993042
[proc 0][Train](221/100000) average loss: 0.5423855185508728
[proc 0][Train](221/100000) average regularization: 5

[proc 0][Train](231/100000) average pos_loss: 0.7872241735458374
[proc 0][Train](231/100000) average neg_loss: 0.3200227916240692
[proc 0][Train](231/100000) average loss: 0.5536234974861145
[proc 0][Train](231/100000) average regularization: 5.793119271402247e-06
[proc 0][Train] 1 steps take 2.635 seconds
[proc 0]sample: 0.001, forward: 0.435, backward: 0.070, update: 2.129
[proc 1][Train](232/100000) average pos_loss: 0.7785314917564392
[proc 1][Train](232/100000) average neg_loss: 0.5747613906860352
[proc 1][Train](232/100000) average loss: 0.6766464710235596
[proc 1][Train](232/100000) average regularization: 5.941182280366775e-06
[proc 1][Train] 1 steps take 2.636 seconds
[proc 1]sample: 0.001, forward: 0.423, backward: 0.070, update: 2.142
[proc 0][Train](232/100000) average pos_loss: 0.7963588237762451
[proc 0][Train](232/100000) average neg_loss: 0.6397329568862915
[proc 0][Train](232/100000) average loss: 0.7180458903312683
[proc 0][Train](232/100000) average regularization: 5

[proc 1][Train](242/100000) average pos_loss: 0.7411348819732666
[proc 1][Train](242/100000) average neg_loss: 0.5570770502090454
[proc 1][Train](242/100000) average loss: 0.649105966091156
[proc 1][Train](242/100000) average regularization: 5.980419700790662e-06
[proc 1][Train] 1 steps take 2.659 seconds
[proc 1]sample: 0.019, forward: 0.436, backward: 0.072, update: 2.133
[proc 0][Train](243/100000) average pos_loss: 0.7649781703948975
[proc 0][Train](243/100000) average neg_loss: 0.29810529947280884
[proc 0][Train](243/100000) average loss: 0.5315417051315308
[proc 0][Train](243/100000) average regularization: 6.049957846698817e-06
[proc 0][Train] 1 steps take 2.547 seconds
[proc 0]sample: 0.002, forward: 0.425, backward: 0.070, update: 2.051
[proc 1][Train](243/100000) average pos_loss: 0.8019920587539673
[proc 1][Train](243/100000) average neg_loss: 0.3242569863796234
[proc 1][Train](243/100000) average loss: 0.5631245374679565
[proc 1][Train](243/100000) average regularization: 6

[proc 1][Train](253/100000) average pos_loss: 0.6882216930389404
[proc 1][Train](253/100000) average neg_loss: 0.3222900927066803
[proc 1][Train](253/100000) average loss: 0.5052558779716492
[proc 1][Train](253/100000) average regularization: 6.329639745672466e-06
[proc 1][Train] 1 steps take 2.684 seconds
[proc 1]sample: 0.001, forward: 0.432, backward: 0.071, update: 2.180
[proc 0][Train](254/100000) average pos_loss: 0.7438844442367554
[proc 0][Train](254/100000) average neg_loss: 0.5476478934288025
[proc 0][Train](254/100000) average loss: 0.6457661390304565
[proc 0][Train](254/100000) average regularization: 6.0983634284639265e-06
[proc 0][Train] 1 steps take 2.635 seconds
[proc 0]sample: 0.001, forward: 0.433, backward: 0.070, update: 2.130
[proc 1][Train](254/100000) average pos_loss: 0.7857645750045776
[proc 1][Train](254/100000) average neg_loss: 0.5963612794876099
[proc 1][Train](254/100000) average loss: 0.6910629272460938
[proc 1][Train](254/100000) average regularization: 

[proc 1][Train](264/100000) average pos_loss: 0.7385340332984924
[proc 1][Train](264/100000) average neg_loss: 0.5141216516494751
[proc 1][Train](264/100000) average loss: 0.6263278722763062
[proc 1][Train](264/100000) average regularization: 6.313614903774578e-06
[proc 1][Train] 1 steps take 2.705 seconds
[proc 1]sample: 0.001, forward: 0.423, backward: 0.070, update: 2.211
[proc 0][Train](265/100000) average pos_loss: 0.7220503091812134
[proc 0][Train](265/100000) average neg_loss: 0.28344234824180603
[proc 0][Train](265/100000) average loss: 0.5027463436126709
[proc 0][Train](265/100000) average regularization: 6.424694220186211e-06
[proc 0][Train] 1 steps take 2.689 seconds
[proc 0]sample: 0.001, forward: 0.444, backward: 0.070, update: 2.173
[proc 1][Train](265/100000) average pos_loss: 0.6900089979171753
[proc 1][Train](265/100000) average neg_loss: 0.3384358286857605
[proc 1][Train](265/100000) average loss: 0.5142223834991455
[proc 1][Train](265/100000) average regularization: 

[proc 1][Train](275/100000) average pos_loss: 0.6881613731384277
[proc 1][Train](275/100000) average neg_loss: 0.3154081106185913
[proc 1][Train](275/100000) average loss: 0.5017847418785095
[proc 1][Train](275/100000) average regularization: 6.25842312729219e-06
[proc 1][Train] 1 steps take 2.711 seconds
[proc 1]sample: 0.002, forward: 0.426, backward: 0.070, update: 2.213
[proc 0][Train](276/100000) average pos_loss: 0.7072272896766663
[proc 0][Train](276/100000) average neg_loss: 0.533738374710083
[proc 0][Train](276/100000) average loss: 0.6204828023910522
[proc 0][Train](276/100000) average regularization: 6.695125648548128e-06
[proc 0][Train] 1 steps take 2.737 seconds
[proc 0]sample: 0.001, forward: 0.447, backward: 0.069, update: 2.219
[proc 1][Train](276/100000) average pos_loss: 0.7142603397369385
[proc 1][Train](276/100000) average neg_loss: 0.5543946623802185
[proc 1][Train](276/100000) average loss: 0.6343275308609009
[proc 1][Train](276/100000) average regularization: 6.3

[proc 1][Train](286/100000) average pos_loss: 0.719488263130188
[proc 1][Train](286/100000) average neg_loss: 0.5863946676254272
[proc 1][Train](286/100000) average loss: 0.6529414653778076
[proc 1][Train](286/100000) average regularization: 6.52147446089657e-06
[proc 1][Train] 1 steps take 2.786 seconds
[proc 1]sample: 0.001, forward: 0.430, backward: 0.070, update: 2.284
[proc 0][Train](287/100000) average pos_loss: 0.7056901454925537
[proc 0][Train](287/100000) average neg_loss: 0.28247150778770447
[proc 0][Train](287/100000) average loss: 0.4940808415412903
[proc 0][Train](287/100000) average regularization: 7.027064384601545e-06
[proc 0][Train] 1 steps take 2.797 seconds
[proc 0]sample: 0.002, forward: 0.383, backward: 0.069, update: 2.343
[proc 1][Train](287/100000) average pos_loss: 0.6964645385742188
[proc 1][Train](287/100000) average neg_loss: 0.29380637407302856
[proc 1][Train](287/100000) average loss: 0.49513545632362366
[proc 1][Train](287/100000) average regularization: 

[proc 1][Train](297/100000) average pos_loss: 0.7052925229072571
[proc 1][Train](297/100000) average neg_loss: 0.3155503273010254
[proc 1][Train](297/100000) average loss: 0.5104213953018188
[proc 1][Train](297/100000) average regularization: 6.583666163351154e-06
[proc 1][Train] 1 steps take 2.601 seconds
[proc 1]sample: 0.002, forward: 0.439, backward: 0.071, update: 2.090
[proc 0][Train](298/100000) average pos_loss: 0.7079145908355713
[proc 0][Train](298/100000) average neg_loss: 0.5688315033912659
[proc 0][Train](298/100000) average loss: 0.6383730173110962
[proc 0][Train](298/100000) average regularization: 6.648529051744845e-06
[proc 0][Train] 1 steps take 2.680 seconds
[proc 0]sample: 0.002, forward: 0.428, backward: 0.070, update: 2.180
[proc 1][Train](298/100000) average pos_loss: 0.6960655450820923
[proc 1][Train](298/100000) average neg_loss: 0.5648239850997925
[proc 1][Train](298/100000) average loss: 0.6304447650909424
[proc 1][Train](298/100000) average regularization: 6

[proc 1][Train](308/100000) average pos_loss: 0.6853450536727905
[proc 1][Train](308/100000) average neg_loss: 0.5402975082397461
[proc 1][Train](308/100000) average loss: 0.6128212809562683
[proc 1][Train](308/100000) average regularization: 6.7955093072669115e-06
[proc 1][Train] 1 steps take 2.739 seconds
[proc 1]sample: 0.002, forward: 0.437, backward: 0.070, update: 2.230
[proc 0][Train](309/100000) average pos_loss: 0.6931213736534119
[proc 0][Train](309/100000) average neg_loss: 0.2882606089115143
[proc 0][Train](309/100000) average loss: 0.49069100618362427
[proc 0][Train](309/100000) average regularization: 6.768087132513756e-06
[proc 0][Train] 1 steps take 2.816 seconds
[proc 0]sample: 0.002, forward: 0.468, backward: 0.070, update: 2.277
[proc 1][Train](309/100000) average pos_loss: 0.655471920967102
[proc 1][Train](309/100000) average neg_loss: 0.30078062415122986
[proc 1][Train](309/100000) average loss: 0.47812628746032715
[proc 1][Train](309/100000) average regularization

[proc 1][Train](319/100000) average pos_loss: 0.6891223192214966
[proc 1][Train](319/100000) average neg_loss: 0.2946910262107849
[proc 1][Train](319/100000) average loss: 0.49190667271614075
[proc 1][Train](319/100000) average regularization: 6.973408744670451e-06
[proc 1][Train] 1 steps take 2.674 seconds
[proc 1]sample: 0.002, forward: 0.431, backward: 0.070, update: 2.171
[proc 0][Train](320/100000) average pos_loss: 0.6582446694374084
[proc 0][Train](320/100000) average neg_loss: 0.5468236207962036
[proc 0][Train](320/100000) average loss: 0.6025341749191284
[proc 0][Train](320/100000) average regularization: 6.891878456372069e-06
[proc 0][Train] 1 steps take 2.706 seconds
[proc 0]sample: 0.001, forward: 0.431, backward: 0.070, update: 2.204
[proc 1][Train](320/100000) average pos_loss: 0.6810446381568909
[proc 1][Train](320/100000) average neg_loss: 0.5204712152481079
[proc 1][Train](320/100000) average loss: 0.6007579565048218
[proc 1][Train](320/100000) average regularization: 

[proc 1][Train](330/100000) average pos_loss: 0.651321530342102
[proc 1][Train](330/100000) average neg_loss: 0.5750472545623779
[proc 1][Train](330/100000) average loss: 0.61318439245224
[proc 1][Train](330/100000) average regularization: 7.134442967071664e-06
[proc 1][Train] 1 steps take 2.679 seconds
[proc 1]sample: 0.001, forward: 0.425, backward: 0.069, update: 2.183
[proc 0][Train](331/100000) average pos_loss: 0.6400346159934998
[proc 0][Train](331/100000) average neg_loss: 0.2744755744934082
[proc 0][Train](331/100000) average loss: 0.457255095243454
[proc 0][Train](331/100000) average regularization: 7.078707767504966e-06
[proc 0][Train] 1 steps take 2.686 seconds
[proc 0]sample: 0.001, forward: 0.431, backward: 0.069, update: 2.184
[proc 1][Train](331/100000) average pos_loss: 0.6649390459060669
[proc 1][Train](331/100000) average neg_loss: 0.3178738057613373
[proc 1][Train](331/100000) average loss: 0.4914064407348633
[proc 1][Train](331/100000) average regularization: 7.026

[proc 1][Train](341/100000) average pos_loss: 0.6734422445297241
[proc 1][Train](341/100000) average neg_loss: 0.28627800941467285
[proc 1][Train](341/100000) average loss: 0.4798601269721985
[proc 1][Train](341/100000) average regularization: 7.113193532859441e-06
[proc 1][Train] 1 steps take 2.689 seconds
[proc 1]sample: 0.001, forward: 0.431, backward: 0.070, update: 2.187
[proc 0][Train](342/100000) average pos_loss: 0.6727498769760132
[proc 0][Train](342/100000) average neg_loss: 0.5751352310180664
[proc 0][Train](342/100000) average loss: 0.6239425539970398
[proc 0][Train](342/100000) average regularization: 6.991429017944029e-06
[proc 0][Train] 1 steps take 2.655 seconds
[proc 0]sample: 0.001, forward: 0.443, backward: 0.070, update: 2.141
[proc 1][Train](342/100000) average pos_loss: 0.6501933336257935
[proc 1][Train](342/100000) average neg_loss: 0.5584733486175537
[proc 1][Train](342/100000) average loss: 0.6043333411216736
[proc 1][Train](342/100000) average regularization: 

[proc 1][Train](352/100000) average pos_loss: 0.6267571449279785
[proc 1][Train](352/100000) average neg_loss: 0.5724413394927979
[proc 1][Train](352/100000) average loss: 0.5995992422103882
[proc 1][Train](352/100000) average regularization: 7.0443315962620545e-06
[proc 1][Train] 1 steps take 2.865 seconds
[proc 1]sample: 0.001, forward: 0.470, backward: 0.070, update: 2.323
[proc 0][Train](353/100000) average pos_loss: 0.6505265235900879
[proc 0][Train](353/100000) average neg_loss: 0.304477721452713
[proc 0][Train](353/100000) average loss: 0.47750210762023926
[proc 0][Train](353/100000) average regularization: 6.9922784859954845e-06
[proc 0][Train] 1 steps take 2.920 seconds
[proc 0]sample: 0.016, forward: 0.456, backward: 0.069, update: 2.380
[proc 1][Train](353/100000) average pos_loss: 0.6833118200302124
[proc 1][Train](353/100000) average neg_loss: 0.2970021963119507
[proc 1][Train](353/100000) average loss: 0.49015700817108154
[proc 1][Train](353/100000) average regularization

[proc 1][Train](363/100000) average pos_loss: 0.6725837588310242
[proc 1][Train](363/100000) average neg_loss: 0.28247883915901184
[proc 1][Train](363/100000) average loss: 0.4775313138961792
[proc 1][Train](363/100000) average regularization: 7.22412187315058e-06
[proc 1][Train] 1 steps take 2.642 seconds
[proc 1]sample: 0.001, forward: 0.432, backward: 0.070, update: 2.139
[proc 0][Train](364/100000) average pos_loss: 0.6172258257865906
[proc 0][Train](364/100000) average neg_loss: 0.5630843639373779
[proc 0][Train](364/100000) average loss: 0.5901551246643066
[proc 0][Train](364/100000) average regularization: 6.816256245656405e-06
[proc 0][Train] 1 steps take 2.681 seconds
[proc 0]sample: 0.001, forward: 0.433, backward: 0.070, update: 2.176
[proc 1][Train](364/100000) average pos_loss: 0.5968049168586731
[proc 1][Train](364/100000) average neg_loss: 0.561972975730896
[proc 1][Train](364/100000) average loss: 0.5793889760971069
[proc 1][Train](364/100000) average regularization: 7.

[proc 1][Train](374/100000) average pos_loss: 0.6063521504402161
[proc 1][Train](374/100000) average neg_loss: 0.5654639005661011
[proc 1][Train](374/100000) average loss: 0.585908055305481
[proc 1][Train](374/100000) average regularization: 7.056480626488337e-06
[proc 1][Train] 1 steps take 2.723 seconds
[proc 1]sample: 0.001, forward: 0.433, backward: 0.069, update: 2.220
[proc 0][Train](375/100000) average pos_loss: 0.6789630055427551
[proc 0][Train](375/100000) average neg_loss: 0.30679166316986084
[proc 0][Train](375/100000) average loss: 0.492877334356308
[proc 0][Train](375/100000) average regularization: 7.163875125115737e-06
[proc 0][Train] 1 steps take 2.646 seconds
[proc 0]sample: 0.001, forward: 0.432, backward: 0.071, update: 2.142
[proc 1][Train](375/100000) average pos_loss: 0.6232752799987793
[proc 1][Train](375/100000) average neg_loss: 0.29547813534736633
[proc 1][Train](375/100000) average loss: 0.4593766927719116
[proc 1][Train](375/100000) average regularization: 7

[proc 1][Train](385/100000) average pos_loss: 0.6040066480636597
[proc 1][Train](385/100000) average neg_loss: 0.31398820877075195
[proc 1][Train](385/100000) average loss: 0.4589974284172058
[proc 1][Train](385/100000) average regularization: 7.438658940372989e-06
[proc 1][Train] 1 steps take 2.852 seconds
[proc 1]sample: 0.017, forward: 0.433, backward: 0.069, update: 2.332
[proc 0][Train](386/100000) average pos_loss: 0.6193631887435913
[proc 0][Train](386/100000) average neg_loss: 0.6014052629470825
[proc 0][Train](386/100000) average loss: 0.6103842258453369
[proc 0][Train](386/100000) average regularization: 7.563005965494085e-06
[proc 0][Train] 1 steps take 2.739 seconds
[proc 0]sample: 0.018, forward: 0.446, backward: 0.070, update: 2.205
[proc 1][Train](386/100000) average pos_loss: 0.6185507774353027
[proc 1][Train](386/100000) average neg_loss: 0.5772465467453003
[proc 1][Train](386/100000) average loss: 0.5978986620903015
[proc 1][Train](386/100000) average regularization: 

[proc 1][Train](396/100000) average pos_loss: 0.6240627765655518
[proc 1][Train](396/100000) average neg_loss: 0.5802360773086548
[proc 1][Train](396/100000) average loss: 0.6021494269371033
[proc 1][Train](396/100000) average regularization: 7.480491603928385e-06
[proc 1][Train] 1 steps take 2.736 seconds
[proc 1]sample: 0.001, forward: 0.435, backward: 0.070, update: 2.229
[proc 0][Train](397/100000) average pos_loss: 0.6604918837547302
[proc 0][Train](397/100000) average neg_loss: 0.2610642611980438
[proc 0][Train](397/100000) average loss: 0.46077805757522583
[proc 0][Train](397/100000) average regularization: 7.804143933753949e-06
[proc 0][Train] 1 steps take 2.717 seconds
[proc 0]sample: 0.002, forward: 0.435, backward: 0.070, update: 2.209
[proc 1][Train](397/100000) average pos_loss: 0.6018226146697998
[proc 1][Train](397/100000) average neg_loss: 0.29126453399658203
[proc 1][Train](397/100000) average loss: 0.4465435743331909
[proc 1][Train](397/100000) average regularization:

[proc 1][Train](407/100000) average pos_loss: 0.6166107654571533
[proc 1][Train](407/100000) average neg_loss: 0.29671019315719604
[proc 1][Train](407/100000) average loss: 0.4566604793071747
[proc 1][Train](407/100000) average regularization: 7.211967385956086e-06
[proc 1][Train] 1 steps take 2.668 seconds
[proc 1]sample: 0.001, forward: 0.425, backward: 0.069, update: 2.172
[proc 0][Train](408/100000) average pos_loss: 0.6410422325134277
[proc 0][Train](408/100000) average neg_loss: 0.5511144399642944
[proc 0][Train](408/100000) average loss: 0.5960783362388611
[proc 0][Train](408/100000) average regularization: 7.563546660094289e-06
[proc 0][Train] 1 steps take 2.638 seconds
[proc 0]sample: 0.002, forward: 0.440, backward: 0.070, update: 2.126
[proc 1][Train](408/100000) average pos_loss: 0.605036735534668
[proc 1][Train](408/100000) average neg_loss: 0.5816142559051514
[proc 1][Train](408/100000) average loss: 0.5933254957199097
[proc 1][Train](408/100000) average regularization: 7

[proc 1][Train](418/100000) average pos_loss: 0.5988584756851196
[proc 1][Train](418/100000) average neg_loss: 0.6242907047271729
[proc 1][Train](418/100000) average loss: 0.6115745902061462
[proc 1][Train](418/100000) average regularization: 7.324012130993651e-06
[proc 1][Train] 1 steps take 2.695 seconds
[proc 1]sample: 0.017, forward: 0.431, backward: 0.070, update: 2.178
[proc 0][Train](419/100000) average pos_loss: 0.6199166774749756
[proc 0][Train](419/100000) average neg_loss: 0.2694475054740906
[proc 0][Train](419/100000) average loss: 0.4446820914745331
[proc 0][Train](419/100000) average regularization: 7.71273971622577e-06
[proc 0][Train] 1 steps take 2.680 seconds
[proc 0]sample: 0.001, forward: 0.432, backward: 0.071, update: 2.175
[proc 1][Train](419/100000) average pos_loss: 0.6110307574272156
[proc 1][Train](419/100000) average neg_loss: 0.2961500883102417
[proc 1][Train](419/100000) average loss: 0.45359042286872864
[proc 1][Train](419/100000) average regularization: 7
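
The log above reports a wall-clock time for every step and splits it into `sample`, `forward`, `backward`, and `update` phases. The sketch below shows one way to turn those lines into a per-step average and a rough run-time estimate, in the same spirit as the `100000 * 2.7 / 3600` estimate given earlier in this notebook. The `summarize` helper and the two regular expressions are illustrative assumptions written against the log format shown here, not part of DGL-KE; the sample text is three step records copied from the output above.

```python
import re
from statistics import mean

# Three step records copied verbatim from the training log above.
LOG = """\
[proc 0][Train] 1 steps take 2.816 seconds
[proc 0]sample: 0.002, forward: 0.468, backward: 0.070, update: 2.277
[proc 1][Train] 1 steps take 2.674 seconds
[proc 1]sample: 0.002, forward: 0.431, backward: 0.070, update: 2.171
[proc 0][Train] 1 steps take 2.706 seconds
[proc 0]sample: 0.001, forward: 0.431, backward: 0.070, update: 2.204
"""

# Hypothetical patterns matching the two timing line shapes seen in the log.
STEP_RE = re.compile(r"\[proc (\d+)\]\[Train\] (\d+) steps take ([\d.]+) seconds")
PHASE_RE = re.compile(
    r"\[proc \d+\]sample: ([\d.]+), forward: ([\d.]+), "
    r"backward: ([\d.]+), update: ([\d.]+)"
)


def summarize(log_text, max_step=100000):
    """Average the per-step time and the per-phase times found in log_text."""
    step_times = []
    phases = {"sample": [], "forward": [], "backward": [], "update": []}
    for line in log_text.splitlines():
        m = STEP_RE.match(line)
        if m:
            # "N steps take T seconds" -> seconds per single step.
            step_times.append(float(m.group(3)) / int(m.group(2)))
            continue
        m = PHASE_RE.match(line)
        if m:
            for name, value in zip(phases, m.groups()):
                phases[name].append(float(value))
    sec_per_step = mean(step_times)
    return {
        "sec_per_step": sec_per_step,
        # Assumes every remaining step costs about the same as the sampled ones.
        "est_hours": sec_per_step * max_step / 3600,
        "phase_mean": {name: mean(values) for name, values in phases.items()},
    }


summary = summarize(LOG)
print(f"{summary['sec_per_step']:.3f} s/step, "
      f"about {summary['est_hours']:.1f} h for 100000 steps")
for name, value in summary["phase_mean"].items():
    print(f"{name:>8}: {value:.3f} s")
```

On these sampled lines the `update` phase accounts for most of each step, which is consistent with what the full log shows. To analyze a whole run, the captured output could be saved to a file and passed to `summarize` instead of the inline `LOG` string.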

### 2

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: **6**, 12, 18

- lr: 0.01, **0.05**, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 6.0 --lr 0.05 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

Reading train triples....
Finished. Read 5286834 train triples.
Reading valid triples....
Finished. Read 293713 valid triples.
Reading test triples....
Finished. Read 293714 test triples.
|Train|: 5286834
random partition 5286834 edges into 2 parts
part 0 has 2643417 edges
part 1 has 2643417 edges
|valid|: 293713
|test|: 293714
Total initialize time 16.447 seconds


### 3

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: **6**, 12, 18

- lr: 0.01, 0.05, **0.1**

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 6.0 --lr 0.1 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 4

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: 6, **12**, 18

- lr: **0.01**, 0.05, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 12.0 --lr 0.01 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 5

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: 6, **12**, 18

- lr: 0.01, **0.05**, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 12.0 --lr 0.05 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 6

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: 6, **12**, 18

- lr: 0.01, 0.05, **0.1**

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 12.0 --lr 0.1 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 7

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: 6, 12, **18**

- lr: **0.01**, 0.05, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 18.0 --lr 0.01 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 8

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: 6, 12, **18**

- lr: 0.01, **0.05**, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 18.0 --lr 0.05 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 9

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: **200**, 400

- gamma: 6, 12, **18**

- lr: 0.01, 0.05, **0.1**

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 200 \
--gamma 18.0 --lr 0.1 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 10

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: **6**, 12, 18

- lr: **0.01**, 0.05, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 6.0 --lr 0.01 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 11

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: **6**, 12, 18

- lr: 0.01, **0.05**, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 6.0 --lr 0.05 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 12

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: **6**, 12, 18

- lr: 0.01, 0.05, **0.1**

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 6.0 --lr 0.1 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 13

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: 6, **12**, 18

- lr: **0.01**, 0.05, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 12.0 --lr 0.01 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 14

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: 6, **12**, 18

- lr: 0.01, **0.05**, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 12.0 --lr 0.05 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 15

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: 6, **12**, 18

- lr: 0.01, 0.05, **0.1**

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 12.0 --lr 0.1 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 16

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: 6, 12, **18**

- lr: **0.01**, 0.05, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 18.0 --lr 0.01 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 17

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: 6, 12, **18**

- lr: 0.01, **0.05**, 0.1

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 18.0 --lr 0.05 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32

### 18

- batch_size: **4096**

- neg_sample_size: **256**

- hidden_dim: 200, **400**

- gamma: 6, 12, **18**

- lr: 0.01, 0.05, **0.1**

In [None]:
!DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset \
--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' \
--model_name TransR \
--batch_size 4096 --neg_sample_size 256 --hidden_dim 400 \
--gamma 18.0 --lr 0.1 --max_step 100000 -adv --regularization_coef 1.00E-07 \
--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 \
--valid --test \
--batch_size_eval 128 --neg_sample_size_eval 10000 \
--log_interval 20000 --eval_interval 50000 --num_thread 32
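
The 18 runs above are the Cartesian product of 2 `hidden_dim` values, 3 `gamma` values, and 3 `lr` values, with `hidden_dim` varying slowest and `lr` fastest. As a minimal sketch, the same commands can be generated rather than copied by hand. The `TEMPLATE` string and the `build_commands` helper are illustrative names, not part of DGL-KE, and the flags are copied unchanged from the cells above (using `--log_interval 20000` as in runs 2 to 18; run 1 used `--log_interval 1`).

```python
from itertools import product

# Values searched in this notebook; batch_size and neg_sample_size stay fixed.
HIDDEN_DIMS = [200, 400]
GAMMAS = [6.0, 12.0, 18.0]
LRS = [0.01, 0.05, 0.1]

# Command template copied from the cells above, with the three searched
# hyperparameters left as placeholders.
TEMPLATE = (
    "DGLBACKEND=pytorch dglke_train --dataset DRKG --data_path ./dataset "
    "--data_files drkg_train.tsv drkg_valid.tsv drkg_test.tsv --format 'raw_udd_hrt' "
    "--model_name TransR "
    "--batch_size 4096 --neg_sample_size 256 --hidden_dim {hidden_dim} "
    "--gamma {gamma} --lr {lr} --max_step 100000 -adv --regularization_coef 1.00E-07 "
    "--gpu 0 1 --num_proc 2 --mix_cpu_gpu --async_update --force_sync_interval 1000 "
    "--valid --test "
    "--batch_size_eval 128 --neg_sample_size_eval 10000 "
    "--log_interval 20000 --eval_interval 50000 --num_thread 32"
)


def build_commands():
    """Return one dglke_train command per grid point, in the notebook's order."""
    commands = []
    # product() iterates its last argument fastest, so lr changes on every run,
    # gamma every 3 runs, and hidden_dim every 9 runs.
    for hidden_dim, gamma, lr in product(HIDDEN_DIMS, GAMMAS, LRS):
        commands.append(TEMPLATE.format(hidden_dim=hidden_dim, gamma=gamma, lr=lr))
    return commands


commands = build_commands()
for run_id, cmd in enumerate(commands, start=1):
    print(f"### {run_id}\n{cmd}\n")
```

Each printed command could then be pasted into a cell with a leading `!`, or launched with `subprocess.run(cmd, shell=True)`. If every run took roughly the 75 h estimated earlier, running all 18 sequentially on the same two GPUs would take on the order of 1350 h, so it may be worth shortening `--max_step` for a first pass over the grid.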