Skip to content

The Structural and Semantic Pre-trained Language Model of PGPSNet

License

Notifications You must be signed in to change notification settings

mingliangzhang2018/PGPS-Pretraining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PGPS-Pretraining

The structural and semantic pre-training language model of PGPSNet.

Figure 1. Pipeline of structural and semantic pre-training.

Environmental Settings

They are the same as PGPSNet.

PGPS9K Dataset

You could download the dataset from Dataset Homepage.

In default, unzip the dataset file to the fold ./datasets.

Pre-training

The default parameter configurations are set in the config file ./config/config_default.py and the default training modes are displayed in ./sh_files/train.sh, for example,

CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch \
--nproc_per_node=1 \
--master_port=$((RANDOM + 10000)) \
start.py

The training records of pre-training are saved in the folder ./log. We choose the model of last epoch as the pre-trained language model.

About

The Structural and Semantic Pre-trained Language Model of PGPSNet

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published