Challenge-titanic

This repository is for training future data-scientists in "industry-like" environment.

Instructions

Export the notebook to python script and push the notebooks and python script to GitHub.
When having good results, create a pull request.
I will comment on the changes.
We reiterate with the comments until we're good to move forward to the next challenge.

The idea is to write good code which theoretically could be used for future deployments.
This project is about training, not just results.
Work with branches, not on the master in Github.
Use Python, Jupyter, and Turi
Always start by splitting the data into three parts: train, validations and test. You can use the test dataset only once! to prevent overfitting.
The example code already have issues in it - good luck!
Try to coomit every small change to github, instead of big uploads of a lot of code.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
notebooks		notebooks
scripts		scripts
README.md		README.md