Skip to content

KushxKalsi/Web-Scraping-Python-Script

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🚀Web Scraping Python Script Build HTML

This script can extract Blogs from any Wordpress website into an XML file just by providing the link of blog(built on wordpress).

Make sure before running this script in your device you should have installed:

  • Python's latest version
  • requirements.txt (required packages with compatible versions)

📦 Installation

Please follow these steps to run the Script locally on your machine:

1. Clone the Repository

git clone https://github.com/hpx07/Web-Scraping-Python-Script.git
pip install -r requirements.txt
pip install requests
pip install lxml html5lib
pip install beautifulsoup4
pip install webdriver-manager
pip install selenium
pip install playwright
//before running follow step 2
python blog-extractor.py

2. How to Use

  • Copy the URL of Blog and open blog-extractor.py file in any editor and paste it here
# ✅ List your blog URLs here
urls = [
    # can also use multiple links
    "https://url1.com/", ---enter your urls here
    "https://url2.com/", 
]

3. If Script is not working

  • ✅Make sure that from which site you are copying blogs may not have blocked the use of API on their website.
  • ✅Check URL is correct or not.
  • ✅For any other Reason use ChatGPT

🙋 Author

GitHub: @hpx07


About

Web Scraping Python Script used to extract existing Blogs from any website built on WordPress

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%