Could you please provide the processed data for cnn/dailymail? #9

Closed
taineleau-zz opened this issue Jul 30, 2017 · 11 comments

@taineleau-zz

Hi, thanks for your hard work! Could you provide the processed data via Google Drive or Dropbox?

@JafferWilson

You can get it from my repository: https://github.com/JafferWilson/Process-Data-of-CNN-DailyMail
Hope this helps.

@taineleau-zz
Author

@JafferWilson Thanks for your help! It's so kind of you to share the data.

Would it be more user-friendly if the author put a link to the ready-to-use data in the project's README? @abisee

@JafferWilson

@taineleau Sure. If the author would like to put the link in the README, it will surely help others.

@abisee abisee added the question label Aug 5, 2017
@abisee
Owner

abisee commented Aug 5, 2017

Hi @taineleau,

We would have liked to provide the processed version of the dataset, but were advised not to for legal reasons. This is why we have instead provided code to download and process it.

However, we can provide a link to @JafferWilson's repository. Thanks @JafferWilson!

Edit: The README now points to JafferWilson's repo.

@JafferWilson

@abisee Thank you for your consideration. It will certainly help people who do not want to deal with the code and the processing steps. :)

@abisee abisee closed this as completed Aug 5, 2017
@VikasNS

VikasNS commented Jul 8, 2018

How can I get the data in text format?

@JafferWilson

@VikasNS Please explain which data you are talking about. Your query isn't clear.

@VikasNS

VikasNS commented Jul 9, 2018

I meant: how do I convert .story files to .txt files?
But PyCharm can read .story files, so it's no problem.
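
For anyone else who lands here with the same question: the .story files are already plain text, so renaming them is enough if you only want to open them. If you also want the article body separated from the summary sentences, here is a minimal sketch. It assumes the usual layout of these files, where the article comes first and each summary sentence follows a line containing `@highlight`. The names `split_story` and `convert_story_file` are made up for this example and are not part of the repository.

```python
from pathlib import Path

# Assumption: summary sentences in a .story file each follow a line
# that starts with this marker, and everything before the first marker
# is the article body.
HIGHLIGHT_MARKER = "@highlight"


def split_story(text):
    """Split raw .story text into (article_lines, highlight_lines)."""
    article, highlights = [], []
    in_highlights = False
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank separator lines
        if line.startswith(HIGHLIGHT_MARKER):
            in_highlights = True  # everything after this is summary text
        elif in_highlights:
            highlights.append(line)
        else:
            article.append(line)
    return article, highlights


def convert_story_file(src, dst):
    """Write a .txt file with the article first, then the highlights."""
    text = Path(src).read_text(encoding="utf-8")
    article, highlights = split_story(text)
    body = "\n".join(article) + "\n\nHIGHLIGHTS:\n" + "\n".join(highlights) + "\n"
    Path(dst).write_text(body, encoding="utf-8")
```

To convert a whole folder, something like `for p in Path("stories").glob("*.story"): convert_story_file(p, p.with_suffix(".txt"))` should work, where `stories` is a placeholder for wherever your files are.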

@JafferWilson

Then I guess you should close the issue you have already raised.

@chaine09

chaine09 commented Feb 5, 2020

@JafferWilson

IndexError: list index (0) out of range

when running the .bin inputs.

It seems that the code used here https://github.com/HsuWanTing/unified-summarization cannot detect the abstracts and article sections.
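
One way to narrow this down without TensorFlow is to check that the .bin file itself is framed as expected. The sketch below is only a diagnostic and rests on an assumption: that each record is written as an 8-byte native integer length followed by that many bytes of a serialized example, which is how the processing scripts for this dataset are commonly written. The function names `iter_bin_records` and `missing_keys` are hypothetical, and the key check is a crude byte-substring search, not a real protobuf parse.

```python
import struct


def iter_bin_records(path):
    """Yield the raw payload of each length-prefixed record in a .bin file.

    Assumed framing: an 8-byte native integer (struct format 'q') giving
    the payload length, followed by the payload bytes.
    """
    with open(path, "rb") as reader:
        while True:
            len_bytes = reader.read(8)
            if not len_bytes:
                break  # clean end of file
            if len(len_bytes) < 8:
                raise ValueError("truncated length header")
            (str_len,) = struct.unpack("q", len_bytes)
            if str_len < 0:
                raise ValueError("negative record length")
            payload = reader.read(str_len)
            if len(payload) < str_len:
                raise ValueError("truncated record payload")
            yield payload


def missing_keys(payload, keys=(b"article", b"abstract")):
    """Return the keys whose bytes do not appear anywhere in the payload.

    Crude heuristic: a key can also occur inside the text itself, so an
    empty result does not prove the feature exists, but a non-empty
    result means that feature name is certainly absent.
    """
    return [key for key in keys if key not in payload]
```

If `missing_keys` reports `b"article"` or `b"abstract"` for the records in your file, the data was probably written with different feature names than the reading code expects, which could explain an empty list being indexed at position 0.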

@JafferWilson

JafferWilson commented Feb 6, 2020

@chaine09 Please read the README of the repository. You can try the example .bin file that I have created; the README links to it.
Please let me know whether it helped and, if not, what the issue is.
