
Problem for article #27

Closed
Wanghn95 opened this issue May 12, 2019 · 14 comments

Comments

@Wanghn95

Hi, sorry to trouble you.
How can I get the original article?
For example, for the reference summary ref.83.txt and the candidate can.83.txt, I cannot find the original article.

Also, the results I get are different from yours. My pyrouge configuration is correct, so what am I doing wrong? Can you help me? Thanks!
This is the classifier log file:
![classifier log screenshot](https://user-images.githubusercontent.com/37919100/57581779-f7cdaa00-74ee-11e9-9c3c-15637f60b679.png)

@nlpyang
Owner

nlpyang commented May 12, 2019

I cannot help you with the information you provided. Please always paste your commands.

@Wanghn95
Author

Problem 1:
The command for training the classifier is:
python train.py -mode train -encoder classifier -dropout 0.1 -bert_data_path /home/test/WangHN/BertSum-master/bert_data/cnndm -model_path /home/test/WangHN/BertSum-master/models/bert_classifier -lr 2e-3 -visible_gpus 0,1,2 -gpu_ranks 0,1,2 -world_size 3 -report_every 50 -save_checkpoint_steps 1000 -batch_size 3000 -decay_method noam -train_steps 50000 -accum_count 2 -log_file /home/test/WangHN/BertSum-master/logs/bert_classifier -use_interval true -warmup_steps 10000
The command for testing the classifier is:
python train.py -mode validate -bert_data_path /home/test/WangHN/BertSum-master/bert_data/cnndm -model_path /home/test/WangHN/BertSum-master/models/bert_classifier -visible_gpus 1 -gpu_ranks 0 -batch_size 30000 -log_file /home/test/WangHN/BertSum-master/logs/Evaluation/bert_classifier -result_path /home/test/WangHN/BertSum-master/results/classifier/cnndm -test_all -block_trigram true

The results in the log file are the ones I provided above. The ROUGE score is low, and the same is true for the other encoders (rnn and transformer).

Problem 2:
For the summary results, how can I find the original article, for example the original article for ref.83.txt and can.83.txt?
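
One possible way to trace a reference summary back to its article is to match the summary text against the preprocessed test examples. The sketch below is only an illustration under stated assumptions: it assumes each example keeps the article sentences under a key named `src_txt` and the gold summary under `tgt_txt` with sentences joined by `<q>`. Those key names, the separator, and the helper names are assumptions for this sketch and are not confirmed anywhere in this thread.

```python
from typing import Any, Dict, List, Optional


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so small formatting differences do not block a match."""
    return " ".join(text.lower().split())


def find_source_article(reference: str, examples: List[Dict[str, Any]]) -> Optional[int]:
    """Return the index of the example whose gold summary matches `reference`, or None."""
    target = normalize(reference.replace("<q>", " "))
    for idx, example in enumerate(examples):
        # Assumption: 'tgt_txt' holds the gold summary, sentences joined by '<q>'.
        gold = normalize(str(example["tgt_txt"]).replace("<q>", " "))
        if gold == target:
            return idx
    return None


# Toy data standing in for preprocessed test examples (hypothetical contents).
examples = [
    {"src_txt": ["the cat sat on the mat .", "it purred ."],
     "tgt_txt": "a cat sat ."},
    {"src_txt": ["stocks fell on monday .", "analysts were surprised ."],
     "tgt_txt": "stocks fell .<q>analysts surprised ."},
]

idx = find_source_article("stocks fell . analysts surprised .", examples)
if idx is not None:
    # Assumption: 'src_txt' holds the article as a list of sentence strings.
    print(" ".join(examples[idx]["src_txt"]))
```

Matching on text rather than on the file number avoids relying on the evaluation order, which this thread does not describe.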

@nlpyang
Owner

nlpyang commented May 12, 2019

Could you run -mode lead to check whether the data is correct?

@Wanghn95
Author

How do I do this?

@nlpyang
Owner

nlpyang commented May 12, 2019

python train.py -mode lead -bert_data_path ../../../bertabs/bert_data/cnndm -visible_gpus -1 -batch_size 30000 -log_file cnndm_lead -result_path ../results/cnndm_lead -block_trigram false
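
For context, a lead baseline uses the first few sentences of each article as its summary and needs no trained model, so a very low lead score usually points at the data rather than at training. The sketch below illustrates the idea with a rough unigram-recall proxy. It is not the repository's implementation and not ROUGE as computed by pyrouge; the choice of three sentences and the function names are assumptions made for illustration.

```python
from collections import Counter
from typing import List


def lead_k(sentences: List[str], k: int = 3) -> List[str]:
    """Return the first k sentences of an article (k=3 is an assumption)."""
    return sentences[:k]


def unigram_recall(candidate: str, reference: str) -> float:
    """Fraction of reference tokens that also appear in the candidate (clipped counts)."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    if not ref:
        return 0.0
    overlap = sum(min(count, cand[token]) for token, count in ref.items())
    return overlap / sum(ref.values())


# Toy article and reference summary (hypothetical contents).
article = [
    "police arrested a man on friday .",
    "he was charged with theft .",
    "the trial starts next month .",
    "the weather was sunny .",
]
reference = "a man was arrested and charged with theft ."

summary = " ".join(lead_k(article))
print(f"lead unigram recall: {unigram_recall(summary, reference):.3f}")
```

If a check like this gives near-zero overlap on real examples, the articles and summaries are probably misaligned or empty after preprocessing.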

@Wanghn95
Author

Wanghn95 commented May 13, 2019

[screenshot]
The lead mode does not exist.
I also think the problem may be in the model training. Do I need to retrain the model?

@nlpyang
Owner

nlpyang commented May 13, 2019

Please check out the dev branch and run lead.

@Wanghn95
Author

Sorry, where is the dev branch?

@nlpyang
Owner

nlpyang commented May 13, 2019

Please google "check out branches on github".

@Wanghn95
Author

[screenshot]
I ran -mode lead and got the results shown in the screenshot.

@nlpyang
Owner

nlpyang commented May 13, 2019

Apparently the data is wrong. How did you process the data?

@Wanghn95
Author

I processed the data with the commands you provided.
I put the CNN story files and the Daily Mail story files in one directory and used Stanford CoreNLP version 3.7.0
to get the sentence splitting and tokenization. Then I just followed steps 3, 4 and 5 and got the bert_data files.

@nlpyang
Owner

nlpyang commented May 13, 2019

Please paste some parts of the JSON file you got from format_to_lines.
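
A short excerpt of that file can be produced with a few lines of Python. The sketch below assumes the file is a JSON list of examples, each with tokenized `src` and `tgt` fields (lists of sentences, each a list of tokens). That layout and the helper name `preview_examples` are assumptions for illustration, not something confirmed in this thread.

```python
import json
import os
import tempfile
from typing import Any, Dict, List


def preview_examples(path: str, n_examples: int = 1, n_sents: int = 3) -> List[Dict[str, Any]]:
    """Load a JSON file and return the first sentences of the first examples."""
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    preview = []
    for example in data[:n_examples]:
        preview.append({
            # Assumption: 'src' and 'tgt' are lists of token lists.
            "src": [" ".join(tokens) for tokens in example["src"][:n_sents]],
            "tgt": [" ".join(tokens) for tokens in example["tgt"][:n_sents]],
        })
    return preview


# Demo on a small temporary file with hypothetical contents.
toy = [{"src": [["stocks", "fell", "."], ["analysts", "were", "surprised", "."]],
        "tgt": [["stocks", "fell", "."]]}]
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False, encoding="utf-8") as tmp:
    json.dump(toy, tmp)
    toy_path = tmp.name

toy_preview = preview_examples(toy_path)
print(json.dumps(toy_preview, indent=2))
os.remove(toy_path)
```

Empty `src` or `tgt` lists in the preview would suggest that sentence splitting or the story and highlight separation went wrong during preprocessing.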

@nlpyang
Owner

nlpyang commented May 13, 2019

Or could you rerun the preprocessing and calculate lead?

nlpyang closed this as completed Jun 16, 2019