Skip to content
This repository has been archived by the owner on Oct 6, 2021. It is now read-only.

Script to extract content from html pages and then merge pages for each story #13

Merged
merged 2 commits into from
Aug 4, 2018

Conversation

arnabbiswas1
Copy link
Contributor

In this script I have used pandas to extract content from html pages (Issue #1 ) and then merge multiple pages for every story (#2 ), so that in the resultant csv, each row consists of complete content of a story.

@githubssn @deepakshankar94 @dnithinraj @SahilKuchlous Please review.

@arnabbiswas1 arnabbiswas1 changed the title Script two extract content from html pages and then merge pages for each story Script to extract content from html pages and then merge pages for each story Jul 1, 2018
@arnabbiswas1 arnabbiswas1 added the bug Something isn't working label Aug 4, 2018
@arnabbiswas1 arnabbiswas1 merged commit 9517cf3 into master Aug 4, 2018
@arnabbiswas1
Copy link
Contributor Author

Thanks @ramyaragupathy !

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants