Tutorial notebooks for hands-on data science, following along with the course topics.

Type Name Latest commit message Commit time
19-Geospatial
files NLP Mar 2, 2018
img Update images Jan 24, 2018
.gitignore Fix up gitignore Apr 9, 2017
00-Introduction.ipynb updates for datahub Mar 29, 2019
01-JupyterNotebooks.ipynb updates to Jupyter Apr 5, 2019
02-DataAnalysis.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
03-Python.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
04-DataSciencePython.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
05-DataGathering.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
06-DataWrangling.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
07-DataCleaning.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
08-DataPrivacy&Anonymization.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
09-DataVisualization.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
10-Distributions.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
11-TestingDistributions.ipynb Revert "update tutorial file ordering" Jul 24, 2018
13-OrdinaryLeastSquares.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
14-LinearModels.ipynb
15-Clustering.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
16-DimensionalityReduction.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
17-Classification.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
18-NaturalLanguageProcessing.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
A1-PythonPackages.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
A2-Git.ipynb update all links with alert-link, and put in quotes Sep 27, 2018
LICENSE.txt Create LICENSE.txt Jul 18, 2019
README.md Add website link button. Jul 18, 2019

README.md

Tutorials

Site Binder License: CC BY-NC 4.0

This repository is a set of tutorials for Hands-On Data Science. It is designed to accompany the UC San Diego course COGS108 - Data Science in Practice. These tutorials presume some knowledge of the Python programming language.

Approach / Background

These tutorials are designed to be a minimal introduction to what you need to know to get working with data science: how to start getting and examining data, building up to working on data science projects. They cover the hands-on, coding components of the material.

Conceptual and background material is covered in the Lectures. Practice with these ideas comes through the Assignments and the materials for Projects.

These tutorials also try to interface with the vast world of existing tutorials, materials, and documentation. They are explicitly designed to give a quick introduction to a topic of interest, and then link out to more comprehensive resources. In that sense, they are designed to be more like a yellow pages than an encyclopedia.

Requirements

The code and materials in this repository are created with Jupyter notebooks and require the Anaconda distribution. Any other dependencies for specific tutorials are addressed in the relevant notebooks.
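
As a quick sanity check before opening the notebooks, something like the following sketch can report whether a few commonly used packages are importable in your environment. The package list here is an assumption for illustration (the repository does not enumerate its dependencies), and `check_packages` is a hypothetical helper, not part of the tutorials.

```python
from importlib.util import find_spec

# Assumed list: packages that ship with Anaconda and are commonly used for
# data science. The repository itself does not enumerate its dependencies.
ASSUMED_PACKAGES = ["numpy", "pandas", "matplotlib", "scipy", "sklearn"]


def check_packages(names):
    """Return a dict mapping each top-level module name to True if it can be found."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Print one status line per package in the assumed list.
    for name, available in check_packages(ASSUMED_PACKAGES).items():
        print(f"{name}: {'found' if available else 'MISSING'}")
```

If a package is reported as missing, the notebook that needs it should say how to install it.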

Development

This repository is under active development, and is primarily developed and maintained by TomDonoghue, as well as by the COGS108 staff.

Contributions to this resource are welcome and encouraged! If you have suggestions for new links or materials, or fixes for any issues you spot, please open an Issue or submit a Pull Request.
