Skip to content
Intro to Python for Data Science
Jupyter Notebook Other
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data Create Case Study Part 2; add related data. May 15, 2019
notebooks Vastly simply packages/functions section. Aug 5, 2019
.gitignore
LICENSE Initial commit Feb 18, 2019
README.md
environment.yml
postBuild

README.md

Introduction to Python for Data Science

Binder

Course Description

Welcome to Introduction to Python for Data Science! This short course provides a gentle, hands-on introduction to the Python programming language for data science applications. You will learn the fundamentals of Python as a language and how to work with data using the pandas library.

Objectives

The following are the primary learning objectives of students:

  1. Develop comprehensive skills in the importing/exporting, wrangling, aggregating and joining of data using Python.

  2. Establish a mental model of the Python programming language to enable future self-learning.

  3. Build awareness and basic skills in the core data science area of data visualization.

Agenda

Day 1

Topic Time
Breakfast / Social Time 8:00 - 9:00
Introductions 9:00 - 9:15
Python and Jupyter Overview 9:15 - 9:45
Fundamentals 9:45 - 10:30
Break 10:30 - 10:45
Packages, Modules, Methods, Functions 10:45 - 11:30
Importing Data 11:30 - 12:00
Lunch 12:00 - 1:00
Selecting and Filtering Data 1:00 - 2:00
Working with Columns 2:00 - 2:45
Break 2:45 - 3:00
Case Study, pt. 1 3:00 - 4:00
Q&A 4:00 - 4:30

Day 2

Topic Time
Breakfast / Social Time 8:00 - 9:00
Review 9:00 - 10:00
Summarizing Data 9:30 - 10:30
Break 10:30 - 10:45
Summarizing Grouped Data 10:45 - 11:15
Joining Data 11:15 - 12:00
Lunch 12:00 - 1:00
Exporting Data 1:00 - 1:30
Visualizing Data 1:30 - 2:30
Break 2:30 - 2:45
Case Study, pt. 2 2:45 - 4:00
Q&A 4:00 - 4:30

Course Preparation

In an effort to simplify the setup for this class, we are using Binder for all materials (slides, worksheets, etc.). In result, there is no pre-requisite installation required for the in-class material.

With that being said, it's smart to install the appropriate technologies and download the materials anyways. This will provide you a backup in case there are network issues, and it will also be required to apply your learnings outside of class.

Follow these steps to download the technologies and materials:

1. Python, Jupyter and package installation.

These easiest way to install Python, Jupyter, and the necessary packages is through Anaconda. To download and install Anaconda:

  1. Visit the Anaconda download page
  2. Select your appropriate operating system
  3. Click the "Download" button for Python 3.7 - this will begin to download the Anaconda installer
  4. Open the installer when the download completes, and then follow the prompts. If you are prompted about installing PyCharm, elect not to do so.
  5. Once installed, open the Anaconda Navigator and launch a Jupyter Notebook to ensure it works.
  6. Follow the package installation instructions to ensure pandas and seaborn packages are installed.

2. Download class materials

There are two ways to download the class materials:

  1. Clone it - If you're familiar with how to do so, you can clone this repository.
  2. Download the files as a zip - use this link

Questions

If you have any specific questions prior to the class you can reach out to us directly via GitHub or email:

You can’t perform that action at this time.