A script to scrape blog posts from weebly sites into markdown for Hugo & Jekyl
Branch: master
Clone or download
Pull request Compare This branch is 1 commit ahead of rechelon:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
content
.gitignore
README.md
requirements.txt
weebly-scrapy.py

README.md

Weebly Blog Post Scraper

Weebly provides the option to "back up" a site, but not the actual posts or content made to it. Which is predatory bullshit designed to prey upon clients who don't have any technical skills or understanding and then lock them into their service.

This is a very simple script to scrape a weebly site's blog posts into markdown files that can be used in things like Hugo or Jekyll, or just be viewed by hand. To import markdown files to Wordpress see this link.

To use run this script with python on the command line with the first argument being the website address (the weebly.com version) and the second being the target folder:

python3 weebly-scraper.py http://example.weebly.com ./content/

Requirements: Python 3, Beautiful Soup

pip3 install -r requirements.txt