Scrape 99acres.com with Python using a curl command and export the data as CSV.

arv-anshul/99acres-scrape

Scrape 99Acres

Caution

Message from Author: I (Anshul Raj Verma) am not able to scrape the 99acres website using this Streamlit app due to an authorization issue.

You are still welcome to try scraping the website yourself. If you manage to scrape it, please raise an issue to discuss the problem. Thanks!

Important

DISCLAIMER: This project is for educational purposes only.

Why this project?

I am working on a real estate project that needs real-time data, so I wrote a program to gather that data and turned it into a web app using Streamlit.

Check out my Data Analysis on the scraped data here.

Datasets

I have scraped and uploaded data for several cities to my Kaggle profile; you can download it and practice with it. It contains details of more than 40k properties from different Indian cities.

Dataset on Kaggle: Indian Real Estate - 99acres.com

Documentation

Documentation for this project is available in 🗒️ wiki section.

To see the EDA process on the data fetched by this app, go to 📁 campusx-project-notebooks.

Techs

  1. Python>=3.11
  2. Asynchronous Programming
  3. Streamlit
  4. HTTP Requests
  5. Pydantic
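
As a rough illustration of how asynchronous programming and HTTP requests can fit together in a scraper, here is a minimal sketch that requests several result pages concurrently. It is not the project's actual code: `fetch_page`, `fetch_all`, and the shape of the returned dict are hypothetical, and the network call is replaced by a stub so the example runs offline.

```python
import asyncio


async def fetch_page(city: str, page: int) -> dict:
    """Hypothetical stand-in for one HTTP request to a listing page."""
    # A real version would await an HTTP client call here.
    await asyncio.sleep(0)  # yield control, as a network call would
    return {"city": city, "page": page, "listings": []}


async def fetch_all(city: str, pages: int) -> list[dict]:
    """Request pages 1..pages concurrently and return results in page order."""
    tasks = [fetch_page(city, p) for p in range(1, pages + 1)]
    # gather() runs the coroutines concurrently and preserves input order.
    return await asyncio.gather(*tasks)


results = asyncio.run(fetch_all("gurgaon", 3))
print([r["page"] for r in results])
```

With a real HTTP client in place of the stub, the requests would overlap while waiting on the network, which is where the async approach is expected to save time.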

Setup

  1. Install all the required packages.
pip install -r requirements.txt

Usage

  1. Run the Streamlit app.
streamlit run app.py
  2. Go to http://localhost:8501/.
  3. Fill in the form: select the city you want to scrape and submit.
  4. After some backend processing, a download button will appear. Click it to download the scraped data.
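
The last step, turning scraped records into a downloadable CSV, could look roughly like the sketch below. It uses only the standard library and assumes the records are flat dicts. The function name `records_to_csv` and the sample columns are hypothetical, not taken from the project.

```python
import csv
import io


def records_to_csv(records: list[dict]) -> str:
    """Serialize flat dicts to CSV text, using the union of keys as the header."""
    # Collect column names in first-seen order so the header is stable.
    fieldnames: list[str] = []
    for record in records:
        for key in record:
            if key not in fieldnames:
                fieldnames.append(key)

    buffer = io.StringIO()
    # restval fills in columns that a given record is missing.
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, restval="", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Hypothetical sample rows; the real app's columns may differ.
sample = [
    {"city": "Gurgaon", "price": 9500000, "bedrooms": 3},
    {"city": "Mumbai", "price": 21000000},
]
csv_text = records_to_csv(sample)
print(csv_text)
```

In a Streamlit app, a string like `csv_text` can be passed as the `data` argument of `st.download_button` to offer it as a file.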

Disclaimer

I wanted to scrape data from the 99Acres website, and I have taken care not to perform any illegal activity with this data. I used it in my project to build ML models and perform data analysis.