This script can extract Blogs from any Wordpress website into an XML file just by providing the link of blog(built on wordpress).
- Python's latest version
- requirements.txt (required packages with compatible versions)
Please follow these steps to run the Script locally on your machine:
git clone https://github.com/hpx07/Web-Scraping-Python-Script.git
pip install -r requirements.txt
pip install requests
pip install lxml html5lib
pip install beautifulsoup4
pip install webdriver-manager
pip install selenium
pip install playwright
//before running follow step 2
python blog-extractor.py
- Copy the URL of Blog and open blog-extractor.py file in any editor and paste it here
# ✅ List your blog URLs here
urls = [
# can also use multiple links
"https://url1.com/", ---enter your urls here
"https://url2.com/",
]
- ✅Make sure that from which site you are copying blogs may not have blocked the use of API on their website.
- ✅Check URL is correct or not.
- ✅For any other Reason use ChatGPT
GitHub: @hpx07