four week intro to python: basic syntax, data manipulation, data viz
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

Introduction to Python


Description: This four-week course is designed to introduce attendees to Python programming and its broad applications. Each two hour session will include brief tutorials interspersed with challenge exercises, and assumes attendees have no prior computer coding experience. At the end of this course, you will be able to use Python to import, manipulate, and visualize data.

This repository is adapted from content originally appearing in Python for Ecologists, Copyright (c) Data Carpentry.

Software requirements for this course can be found on's Software page.


  • Week 1: Intro to python, jupyter notebooks, and data types
  • Week 2: Using pandas to explore data frames
  • Week 3: Extracting data from data frames
  • Week 4: Data visualization with ggplot


  • Each week's materials are described in the python script prefaced with the number of the week.
  • Data used for this lesson are identical to that used in Introduction to R; details on obtaining these data from the National Cancer Institute's Genomic Data Commons can be found in that lesson repository.
  • exercises/ includes a file for each week representing both the aggregated in-class exercises as well as additional supplemental exercises for practice
  • solutions/ includes the solutions for all files in exercises/
  • includes useful links mentioned during lessons; additional information about continued learning in Python as well as Hutch-specific resources can be found on the Data Science Wiki
  • hackmdio.txt is an archive of the interactive webpage used during lessons