Dear Maciej,
Thanks a lot for making this amazing dataset available! :D I have one quick question and a comment.
I found that this dataset includes 1.5M NYTimes articles. Could you elaborate a little more on how you collected them?
I'd love to use this dataset for research, but the lack of detail on the data collection procedure (e.g., when the collection started and ended, and what time range the collected news articles cover) makes it really hard to use this data for academic purposes. If you could describe how you collected this data, it would be greatly helpful!
Thanks,
Jisun
NYTimes articles were scraped directly from their website. The URLs that were scraped were found through their developer API: https://developer.nytimes.com/
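For anyone trying to reproduce the URL-discovery step, here is a rough sketch of what it might look like. This is not the code used for this dataset: the thread does not say which endpoint was queried, so the Archive API endpoint, the `response.docs[].web_url` layout, and the helper names `archive_url` and `extract_article_urls` are all assumptions for illustration. It only builds the request URL and pulls article URLs out of an already-decoded JSON response; fetching and scraping the pages is left out.

```python
from urllib.parse import urlencode

# Assumption: the NYT Archive API, which returns one month of article
# metadata per request. The thread does not confirm this endpoint was used.
ARCHIVE_ENDPOINT = "https://api.nytimes.com/svc/archive/v1/{year}/{month}.json"


def archive_url(year, month, api_key):
    """Build the request URL for one month of article metadata."""
    base = ARCHIVE_ENDPOINT.format(year=year, month=month)
    return base + "?" + urlencode({"api-key": api_key})


def extract_article_urls(payload):
    """Collect article URLs from a decoded JSON response.

    Assumes the layout {"response": {"docs": [{"web_url": ...}, ...]}}
    and skips entries that have no usable "web_url".
    """
    docs = payload.get("response", {}).get("docs", [])
    return [doc["web_url"] for doc in docs if doc.get("web_url")]


# Hypothetical, hand-written payload standing in for a real API response.
sample = {
    "response": {
        "docs": [
            {"web_url": "https://www.nytimes.com/example-article.html"},
            {"headline": {"main": "entry without a URL"}},
        ]
    }
}
print(archive_url(2016, 1, "YOUR_API_KEY"))
print(extract_article_urls(sample))
```

Recording the year/month range you iterate over would also answer the time-range question raised above.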