This project is designed to scrape data from the FSRA Public Register in Abu Dhabi Global Market.
The scraped data is parsed and then stored in a DataFrame, and finally exported to a CSV file.
Make sure that you have the latest version of Python installed.
git clone https://github.com/cr2007/adgm-web-scraping
cd adgm-web-scrapingOptionally, you can also download the repository as a ZIP file.
To keep dependencies isolated, it's recommended to create a virtual environment:
python3 -m venv .venv
source .venv/bin/activatepython -m venv .venv
.venv\Scripts\activateOnce the virtual environment is activated, install the required Python packages by running:
pip install -r requirements.txtCreate a .env file in the root directory and add the necessary environment variables.
Check out sample.env for the variables required.
To run the scraping script, execute the following command:
# For macOS/Linux
python3 main.py
# For Windows
python main.pyAfter you are done, deactivate the virtual environment by running:
decativate- Modify the
COMPANY_NAME_SPECIAL_CASESdictionary to handle any specific company names that need special formatting.