House Cat is a Python Crawler which collect house information from House for Sale Website in Taiwan.
git clone https://github.com/Yidti/house-cat.git
cd house-cat
python3.10 main.py
Introduction - Web Crawler
網路爬蟲,一種用來自動瀏覽全球資訊網的機器人,其目的一般為編纂網路索引使用。
- crawler target for real estate (
https://buy.housefun.com.tw/region
) - select city and district in Taiwan
- save url for district into txt file (
district_url.txt
) - save house information into csv file (
house_properties.csv
)