A GUI application for scraping media files from websites with customizable parameters.
- Easy-to-use GUI with real-time log output
- Customizable file types (mp4, webm, avi, mov, etc.)
- Pagination support with configurable next-page patterns (automatically follows "Next" buttons to scrape multiple pages)
- CAPTCHA handling for sites requiring manual intervention
- Multi-threaded to keep UI responsive during scraping
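
The file-type filter and next-page detection listed above can be sketched as follows. This is a minimal illustration using only the standard library, not the application's actual code: `scan_page`, `LinkCollector`, `FILE_TYPES`, and `NEXT_PATTERNS` are hypothetical names, and the defaults shown are assumptions. In the real application the HTML would presumably come from the Selenium-driven browser (for example `driver.page_source`).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Hypothetical defaults; the application's real defaults may differ.
FILE_TYPES = ("mp4", "webm", "avi", "mov")
NEXT_PATTERNS = ("next",)


class LinkCollector(HTMLParser):
    """Collect [href, text] pairs for <a> tags and src URLs for media tags."""

    def __init__(self):
        super().__init__()
        self.anchors = []   # list of [href, link text]
        self.sources = []   # src attributes of <video>/<source> tags
        self._open = None   # anchor currently being read, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self._open = [attrs["href"], ""]
            self.anchors.append(self._open)
        elif tag in ("video", "source") and attrs.get("src"):
            self.sources.append(attrs["src"])

    def handle_data(self, data):
        if self._open is not None:
            self._open[1] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._open = None


def scan_page(html, base_url, file_types=FILE_TYPES, next_patterns=NEXT_PATTERNS):
    """Return (media_urls, next_page_url_or_None) for one page of HTML."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()

    suffixes = tuple("." + ext.lower() for ext in file_types)
    media = []
    for raw in parser.sources + [href for href, _ in parser.anchors]:
        url = urljoin(base_url, raw)
        # Compare against the URL path only, so query strings are ignored.
        if urlparse(url).path.lower().endswith(suffixes) and url not in media:
            media.append(url)

    next_url = None
    for href, text in parser.anchors:
        label = text.strip().lower()
        if any(pattern in label for pattern in next_patterns):
            next_url = urljoin(base_url, href)
            break
    return media, next_url
```

A scraper built this way would call `scan_page` in a loop, following the returned next-page URL until it is `None`.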
- Clone the repository and navigate to the folder
- Set up virtual environment and install dependencies:
    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
    pip install -r requirements.txt
- Install Chrome browser (required for Selenium)
Run the application:
    python main.py
- Enter the starting URL
- Configure file types and next-page patterns (defaults provided)
- Enable CAPTCHA mode if needed
- Click "Start Scraping" and monitor progress
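
One common way to keep a GUI responsive while monitoring progress is to run the scrape on a worker thread and pass log lines back through a queue. The sketch below illustrates that pattern under stated assumptions; `run_in_background` and `drain` are hypothetical helpers, not functions from this project, and the GUI toolkit is left out entirely.

```python
import queue
import threading


def run_in_background(task, log_queue):
    """Run task(log) on a daemon thread; log lines are put on log_queue."""

    def worker():
        try:
            task(log_queue.put)
        except Exception as exc:
            log_queue.put(f"error: {exc}")
        finally:
            log_queue.put(None)  # sentinel: the task has finished

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    return thread


def drain(log_queue):
    """Return (lines, done) for everything queued so far, without blocking."""
    lines = []
    done = False
    while True:
        try:
            item = log_queue.get_nowait()
        except queue.Empty:
            break
        if item is None:
            done = True
        else:
            lines.append(item)
    return lines, done
```

The UI thread would call `drain` from a periodic timer callback (for example Tkinter's `after`) and append the returned lines to the log view, so the scraping work never blocks the event loop.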
- Python 3.7+
- Chrome/Chromium browser
- Dependencies listed in requirements.txt
This project is licensed under the GNU General Public License v3.0 (GPL-3.0); see the LICENSE.md file for details.
