Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to find training data #45

Open
boxiangliu opened this issue Dec 20, 2021 · 2 comments
Open

Unable to find training data #45

boxiangliu opened this issue Dec 20, 2021 · 2 comments

Comments

@boxiangliu
Copy link

Dear Max,

Thank you so much for making your code available.
I am running your stacked pointer network but I cannot find the train/dev/test datasets

CUDA_VISIBLE_DEVICES=0 OMP_NUM_THREADS=4 python -u parsing.py --mode train --config configs/parsing/stackptr.json --num_epochs 600 --batch_size 32
--opt adam --learning_rate 0.001 --lr_decay 0.999997 --beta1 0.9 --beta2 0.9 --eps 1e-4 --grad_clip 5.0
--loss_type token --warmup_steps 40 --reset 20 --weight_decay 0.0 --unk_replace 0.5 --beam 10
--word_embedding sskip --word_path "data/sskip/sskip.eng.100.gz" --char_embedding random
--punctuation '.' '``' "''" ':' ','
--train "data/PTB3.0/PTB3.0-Stanford_dep/ptb3.0-stanford.auto.cpos.train.conll"
--dev "data/PTB3.0/PTB3.0-Stanford_dep/ptb3.0-stanford.auto.cpos.dev.conll"
--test "data/PTB3.0/PTB3.0-Stanford_dep/ptb3.0-stanford.auto.cpos.test.conll"
--model_path "models/parsing/stackptr/"

Could you tell me where to find the dataset? Thank you!

@boxiangliu
Copy link
Author

@XuezheMax I should add that I have the Penn tree bank 3.0 data, but I am not sure how to convert it to the required format. Is there a straightforward way to do that?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants