Run this project using Docker and Dev Containers or use Poetry.
Here are the combined instructions for setting up and running a devcontainer in Visual Studio Code on both Windows (with WSL2 backend) and MacOS:
-
Install Docker Desktop:
- For Windows/Mac: Download Docker Desktop from Docker official website and install it. If you're using Windows, make sure to enable WSL 2 backend.
-
Install Visual Studio Code:
- Download Visual Studio Code from the official website and install it.
-
Login into GitHub:
- Open Visual Studio Code.
- Click on the
Accountsbutton that appears in the sidebar. - Click on the
Sign in to GitHub.combutton. - Follow the prompts in the browser to complete the sign in process.
-
Install the Remote - Containers extension:
- Open Visual Studio Code.
- Click on the Extensions view icon on the Sidebar (or press
Ctrl+Shift+Xon Windows,Cmd+Shift+Xon MacOS). - Search for
Remote - Containers. - Click on the Install button.
-
Open the project in a devcontainer:
- Open the command palette (
Ctrl+Shift+Pon Windows,Cmd+Shift+Pon MacOS) and run theRemote-Containers: Reopen in Container...command. - Select the project folder and click on the
Openbutton. - A new window will open, and the devcontainer will start building. This might take a few minutes.
- Open the command palette (
Please note that starting up the container for the first time may take a while as it needs to download and install all the necessary components.
Now, you're running your project inside a devcontainer in Visual Studio Code!
Adding packages with Poetry:
- Open the terminal in Visual Studio Code.
- Run
poetry add <package-name>to add a new package to your project. Replace<package-name>with the name of the package you want to add.
Using the Poetry shell:
- Open the terminal in Visual Studio Code.
- Run
poetry shellto start a new shell session with the virtual environment activated.
You can find and create Jupyter notebooks in the src/notebooks/ directory.
To work with a notebook inside Visual Studio Code:
- Navigate to the
src/notebooks/directory in the Explorer pane. - Click on the notebook file you want to open, or create a new one by right-clicking in the Explorer pane and selecting
New File. Name your file with a.ipynbextension. - The notebook will open in a new tab, and you can start working with it directly in Visual Studio Code.
- To select the Poetry kernel, go to the 'Kernel' menu in the toolbar, click on 'Change kernel', then under 'Python Environments', select the kernel named something like
big-data-<someweird-code>.
Note: To work with Jupyter notebooks in Visual Studio Code, you may need to install the Python and Jupyter extensions. You can do this by clicking on the Extensions view icon on the Sidebar (or press Ctrl+Shift+X on Windows, Cmd+Shift+X on MacOS), then search for and install the Python and Jupyter extensions.
![]() |
![]() |
![]() |
![]() |



