ENSF 444 LO3 Lab Materials

Installation:

Fork this repo!
Install conda: https://conda.io/projects/conda/en/latest/user-guide/install/index.html
If you already have conda installed, make sure your base env is up-to-date by running: conda update -n base conda
Create conda environment: conda env create --file ./environment.yaml
Register for Kaggle is you do not already have an account: https://www.kaggle.com/
If you want to use kaggle API, download your kaggle.json by following instructions here: https://www.kaggle.com/docs/api?utm_me= and sign up to the competition on Kaggle website to accept competition rules.

Repo tour:

Labs will be placed in subdirectories name as lab-# where # is the lab number. There will be 9 labs in total.
Solutions to labs will be placed in ./solutions as well as on D2L.
Participation marks are awarded for labs. Attend the lab as scheduled and write your name and UCID on google document that will be provided.
./data is used to store datasets we will be using for the labs. Note that files contained in this directory will not be synced with remote repo.

Integration with Google Collab:

Feel free to use google collab instead of local development. You can load the notebook directly into collab and load the datasets through the GUI or with API calls, but you'll need to authenticate Kaggle on the collab instance if you want to use API.

Merging issues with notebooks:

Jupyter notebooks are stored as json formatted files. Each cell contains metadata such as execution_count, outputs, etc. Git doesn't know what is "important" to track, so re-running the notebook even without changing the source code can result in nasty merge conflicts.

The best work-around for this is to use jupytext to convert .ipynb files to .py modules. Then we can push the .py files to the remote repository and convert back to notebook format on our local systems. This way only the source code is actually tracked by git.

However, since we do not want to introduce any additional complexity to using this repo, we can use the following strategy to avoid merge conflicts:

Fork the repo
Add the upstream remote repo with the command: git remote add upstream git@github.com:mklasby/ensf-444.git.
Whenever you want to work on a jupyter notebook that originates from the upstream repo (mklasby/ensf-444), copy that notebook and rename it. For instance, add your initials as a suffix to the file name.
Now, you can pull in new updates from the repo by using the CLI command git fetch --upstream
Make sure you checkout the main branch if you're using different branches. This can be done with the command git checkout main.
Now we are finally ready to move our local commits to the tip of the upstream remote. This is known as "rebasing". Use the command git rebase upstream/master.
Finally, we push this new git history (our local commits move on top of remote updates) using the command git push -f. Note that the "force" option -f is required as your local history will differ from the history on the remote origin (your github fork).

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
data		data
lab-1		lab-1
lab-2		lab-2
lab-3		lab-3
lab-4		lab-4
lab-5		lab-5
lab-6		lab-6
lab-7		lab-7
lab-8		lab-8
lab-9		lab-9
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ENSF 444 LO3 Lab Materials

Installation:

Repo tour:

Integration with Google Collab:

Merging issues with notebooks:

About

Releases

Packages

Languages

License

bmsmcgee/ensf-444

Folders and files

Latest commit

History

Repository files navigation

ENSF 444 LO3 Lab Materials

Installation:

Repo tour:

Integration with Google Collab:

Merging issues with notebooks:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages