🌐 Website_Cloner

A Python tool that recursively clones websites using Selenium with undetected-chromedriver. It is designed to get past basic bot protections such as Cloudflare challenges, lets you solve CAPTCHAs manually in the browser, and saves pages and assets for offline browsing.

🚀 Features

🕵️‍♂️ Uses undetected-chromedriver to evade bot detection
🧠 Opens a real Chrome browser to manually solve CAPTCHAs
🔄 Recursively clones pages within the same domain
🎨 Downloads all assets (images, CSS, JS)
🔗 Rewrites internal links for seamless offline browsing
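
The same-domain crawl and link rewriting listed above can be sketched with the standard library alone. This is an illustrative sketch, not the code in site_cloner.py: the helper names (`same_domain`, `internal_links`, `local_path`, `relative_href`) and the rule of saving extensionless URLs as `.html` files are assumptions made for the example.

```python
import posixpath
from pathlib import PurePosixPath
from urllib.parse import urldefrag, urljoin, urlparse


def same_domain(url, base_url):
    """True when both URLs share the same host (and port, if any)."""
    return urlparse(url).netloc == urlparse(base_url).netloc


def internal_links(page_url, hrefs):
    """Resolve raw href values against page_url and keep unique same-domain pages."""
    found = []
    for href in hrefs:
        # Make the link absolute and drop any #fragment.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        if urlparse(absolute).scheme not in ("http", "https"):
            continue  # skips mailto:, javascript:, and similar links
        if same_domain(absolute, page_url) and absolute not in found:
            found.append(absolute)
    return found


def local_path(url):
    """Map a URL to a relative file path inside the output folder (assumed layout)."""
    path = urlparse(url).path
    if path == "" or path.endswith("/"):
        path += "index.html"
    elif not PurePosixPath(path).suffix:
        path += ".html"
    return path.lstrip("/")


def relative_href(from_url, to_url):
    """Compute the href that links one saved page to another on disk."""
    start = posixpath.dirname(local_path(from_url)) or "."
    return posixpath.relpath(local_path(to_url), start=start)
```

For example, a page saved from `https://example.com/docs/` that links to `/about` would have that link rewritten to `../about.html` under this layout.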

📦 Requirements

Python 3.8+
Google Chrome (latest version)
Python Packages:
- undetected-chromedriver
- requests
- beautifulsoup4
- webdriver-manager

🛠️ Installation

1. Clone the repository

git clone https://github.com/chromeheartbeat/Website_Cloner.git
cd Website_Cloner

2. (Optional) Create and activate a virtual environment
Linux/macOS:

python3 -m venv .venv
source .venv/bin/activate

3. Install dependencies
Using requirements file:

pip install -r requirements.txt

Or install individually:

pip install undetected-chromedriver requests beautifulsoup4 webdriver-manager

🧪 Usage

python site_cloner.py

A Chrome browser will open and load the target website.

If a CAPTCHA appears, solve it manually.

Return to the terminal and press Enter to continue.

The website will be saved to the cloned_selenium_site folder.
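
The flow above (load the page, pause for a manual CAPTCHA solve, then save) might look roughly like the following. It is a minimal sketch under stated assumptions, not the actual contents of site_cloner.py: the function names `fetch_after_manual_check` and `main` are hypothetical, and only the landing page is written, as `index.html`.

```python
from pathlib import Path


def fetch_after_manual_check(driver, url, out_dir="cloned_selenium_site", wait=input):
    """Load url, pause until the user confirms, then save the rendered HTML."""
    driver.get(url)
    # Blocks until the user presses Enter in the terminal.
    wait("Solve any CAPTCHA in the browser, then press Enter here... ")
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    target = out / "index.html"
    target.write_text(driver.page_source, encoding="utf-8")
    return target


def main():
    # Imported here so the helper above can be exercised without Chrome installed.
    import undetected_chromedriver as uc

    driver = uc.Chrome()
    try:
        saved = fetch_after_manual_check(driver, "https://example.com")
        print(f"Saved {saved}")
    finally:
        driver.quit()
```

Calling `main()` opens a real Chrome window. Passing the driver and the `wait` callable in as parameters keeps the pause-then-save step easy to try with a stand-in driver.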

⚙️ Customization
Change target URL:
Edit this line in site_cloner.py:
clone_website("https://example.com")
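
If editing the file for every run becomes tedious, one possible alternative is to read the target URL from the command line. The sketch below is an assumption, not part of the current script: `parse_target` is a hypothetical helper, and `https://example.com` is kept as the fallback when no argument is given.

```python
import argparse


def parse_target(argv=None):
    """Return the URL given on the command line, or the default when omitted."""
    parser = argparse.ArgumentParser(description="Clone a website for offline browsing.")
    parser.add_argument(
        "url",
        nargs="?",
        default="https://example.com",
        help="start URL to clone (hypothetical option, not in the original script)",
    )
    return parser.parse_args(argv).url
```

With this in place, the script could call `clone_website(parse_target())` and be run as `python site_cloner.py https://example.org`.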

⚠️ Disclaimer
This tool is intended for educational and ethical use only.
Do not use it to clone or scrape websites without explicit permission.
The author is not responsible for any misuse or legal issues.
Always respect websites' Terms of Service and copyright laws.

📁 License
MIT License © [Solution]

🙌 Contribute
Feel free to submit issues, fork the repo, and send pull requests!
We welcome improvements, bug fixes, and new features 🚀
