A beginner's practice repository for learning Python web scraping — small exercises, notes, and example code.
This repository contains small exercises and learning notes for Python web scraping. It is a personal practice project for learning and sharing.
- Learn the basics of Python web scraping
- Keep a record of practice code and examples
- Share learning progress and findings
- Basic web page fetching 🕷️
- Data parsing practice
- Simple data processing examples
- Improve the functionality of
webpage.py - Complete
main.py - Keep the repository English-only (Chinese content archived or removed)
(Instructions will be added as the project develops — the commands below are examples.)
Prerequisites:
- Python 3.8+ (recommended)
- Optional: create a virtual environment
Clone and install:
git clone https://github.com/yifeng-bws/python-spider.git
cd python-spider
python -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
pip install -r requirements.txtRun (example):
python main.pyThis project is in an early stage; main.py is not yet complete.
Implementation is currently pure Python.
Feel free to open an Issue to discuss or ask questions.
Contributions and feedback are welcome. Suggested workflow:
- Fork the repository.
- Create a branch (e.g.,
feat/readme-en). - Open a pull request describing your changes.
Please include tests or examples when adding new functionality.
This project is licensed under the MIT License — see the LICENSE file for details.
Maintainer — yifeng-bws
Repo: https://github.com/yifeng-bws/python-spider