- Include a short paragraph about the project and the code artefacts in this project.
- Provide any relevant external links, e.g. to Notion, your S3 bucket, etc.
- Link to WhitePaper
- (Team Lead) Onidajo_anu Fikayo Wale-Olaitan
- (Data Scientist) Busayo Iman
- (Data Scientist) Nonso Azih
- (Data Scientist) Pharis Ihaki
- (Data Scientist) Karabo Eugene
- (Data Engineer) Haji Rufai
It's highly recommended to use a virtual environment for your project, there are many ways to do this, below we have provided one example of how this can be achieved. Ensure when working on your project to keep this section up-to-date so if anyone needs to run your code they know the exact steps needed to get the appropriate environment ready. A person should be able to clone your repo and get up and running with the instructions provided here.
# make sure your pip in your base Python installation is up-to-date
python3 -m pip install -U pip
# install the virtualenv package
python3 -m pip install virtualenv
# create a virtual environment in this directory called '.venv'
python3 -m venv .venv
# you should now see the folder `.venv` in your repo
# activate the virtual environment
source .venv/bin/activate
# install the requirements for this project
pip install -r requirements.txt
It's highly recommended to get in the habit of writing tests, especially once you've identified something concrete that you can refactor into a reusable bit of code. This project has been seeded with a minimal working example of a refactored function, a notebook that can import this code and a unit test to test the code behaves as expected.
You can execute tests from the terminal by running pytest
. IDEs like VSCode have built-in support for
executing and debugging tests through the "Testing" menu.
├── LICENSE <- Contains information about the legal terms and conditions under which
| the code can be used.
|
├── README.md <- The top-level README for developers using this project.
│
├── docs <- Use markdown. If/when the project becomes more complex consider migrating
| to something like pdoc or sphinx depending on the nature of the project.
| Think carefully about what documentation should sit alongside the code
| and what you can rather include in docstrings.
│
├── notebooks <- Jupyter notebooks. Naming convention is a number (for ordering), and a
│ short `-` delimited description, e.g. `1.0-initial-data-exploration`.
| Refactor the good parts. Don't write code to do the same task in multiple
| notebooks. If it's a data preprocessing task, put it in a file like
| `src/data/make_dataset.py`. If it's useful utility code, refactor it to
| `src` and import accordingly. Only commit clean notebooks (clear all cell outputs).
│
├── requirements.txt <- The requirements file for reproducing the environment.
|
├── src <- Source code. Refactor clean, re-usable code here.
│ │
│ ├── data <- Separate your code base into logical groups, e.g. data, features, models, visualisation, etc.
│ │ └── make_dataset.py
│ └── ...
|
├── tests <- Unit tests. Filename should start with "test_".
└── test_make_dataset.py