Skip to content

Talk on Intro To Data Analysis Using Python and Pandas

Notifications You must be signed in to change notification settings

GusSand/itp_talk_2016

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ITP 2016

Using Python and Pandas for data analysis

Python is a general purpose programming language that is easy to learn.

Pandas is a Python library for doing data analysis. It's really fast and lets you do exploratory work incredibly quickly.

The goal of this cookbook is to give you some concrete examples for getting started with pandas.

I'm working with these datasets right now

  • 311 calls in New York
  • Movie data

It comes with batteries (data) included, so you can try out all the examples right away.

You'll need an up-to-date version of IPython Notebook (>= 3.0) and pandas (>=0.13) for this to work properly. It's set up to work with Python 2.7.

Installation

You need to install all the packages for data analysis. You can install all these using the Anaconda distribution unless you really know what you are doing.

Installation page: Anaconda

Instructions:

  • Download the version that suits your needs at Anaconda. I use 2.7
  • Once it's downloaded double click on the package and wait until it installs.
  • Close and reopen all Terminal windows
  • To make sure it installed correctly:
    • open a terminal
    • type "python"
    • type "import pandas"
    • you should see ** no errors **
    • Done !

Once you have pandas and IPython, you can get going!

Download the Tutorial Materials

I would highly recommend using git. Once git is installed, you can clone the material in this tutorial by using the git address shown below:

git clone https://github.com/GusSand/itp_talk_2016

Using the Tutorial materials

Once you have cloned the git repo, you can change to the cookbook directory and run ipython using the following commands:

cd pandas-cookbook/cookbook

ipython notebook

A tab should open up in your browser at http://localhost:8888



Table of Contents

Contribute!

If you see something wrong, or there's something you'd like to learn that I haven't explained here, or there's something you know about that you would like to share, create an issue! Send me email! Send a pull request!

TODO

  • Add more about matplotlib

License

Creative Commons License

Some of this content adapted from: Pandas-Cookbook

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License

About

Talk on Intro To Data Analysis Using Python and Pandas

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published