说明

感谢 ymcui/Chinese-PreTrained-XLNet 提供的中文预训练的模型。

训练起来，非常慢，非常慢！！

修改的地方

新增 component_xlnet_data_processor.py

数据预处理文件，将训练文件转化为 tf_record 格式。

ZC （Git Test）

新增 component_xlnet_multi_class_train.py

多分类组件，接入 xl-net 的模型输出，后面可以自己添加层（一般不需要）

有些日志错误懒得改过来了。

训练效果

训练的日志见：xlnet.log

收敛也比较慢，相比于 bert，估计是层数太多了，要慢慢学，-_-

附录

注意， 使用 use_bfloat16 为 true 时，出现

tensorflow.python.framework.errors_impl.NotFoundError: No registered 'Reciprocal' OpKernel for CPU devices compatible with node {{node ConstantFolding/foo/gradients/foo/Mean_1_grad/truediv_recip}} = Reciprocal[T=DT_BFLOAT16, _device="/job:localhost/replica:0/task:0/device:CPU:0"](foo/gradients/foo/Mean_1_grad/Const_1)
     (OpKernel was found, but attributes didn't match)
    .  Registered:  device='XLA_GPU'; T in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_COMPLEX64, DT_INT64, DT_BFLOAT16, DT_HALF]
  device='XLA_CPU'; T in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_COMPLEX64, DT_INT64, DT_HALF]
  device='XLA_CPU_JIT'; T in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_COMPLEX64, DT_INT64, DT_HALF]
  device='XLA_GPU_JIT'; T in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_COMPLEX64, DT_INT64, DT_BFLOAT16, DT_HALF]
  device='GPU'; T in [DT_INT64]
  device='GPU'; T in [DT_DOUBLE]
  device='GPU'; T in [DT_HALF]
  device='GPU'; T in [DT_FLOAT]
  device='CPU'; T in [DT_COMPLEX128]
  device='CPU'; T in [DT_COMPLEX64]
  device='CPU'; T in [DT_DOUBLE]
  device='CPU'; T in [DT_HALF]
  device='CPU'; T in [DT_FLOAT]
  
  stackoverflow 解释说，不支持(还以为可以使用 半精度加速，-_-)
  估计需要自己动手魔改了，转化类型
  
  [v for v in tf.global_variables() if 'adam_v' not in v.name and 'adam_m' not in v.name]

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
misc		misc
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_office.md		README_office.md
classifier_utils.py		classifier_utils.py
component_xlnet_data_processor.py		component_xlnet_data_processor.py
component_xlnet_multi_class_train.py		component_xlnet_multi_class_train.py
data_utils.py		data_utils.py
function_builder.py		function_builder.py
gpu_utils.py		gpu_utils.py
model_utils.py		model_utils.py
modeling.py		modeling.py
prepro_utils.py		prepro_utils.py
run_classifier.py		run_classifier.py
run_race.py		run_race.py
run_squad.py		run_squad.py
squad_utils.py		squad_utils.py
tpu_estimator.py		tpu_estimator.py
train.py		train.py
train_gpu.py		train_gpu.py
xlnet.log		xlnet.log
xlnet.py		xlnet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

说明

修改的地方

ZC （Git Test）

训练效果

附录

About

Releases

Packages

Contributors 7

Languages

License

YC-wind/XL-NET-Chinese-task

Folders and files

Latest commit

History

Repository files navigation

说明

修改的地方

ZC （Git Test）

训练效果

附录

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 7

Languages

Packages