A simple guide with examples for working with Pandas. This series will expand in the future to include more examples of functions applied to data frames but for now it's a good basic guide to using Pandas for data analysis. Explanations are not real deep, it's for my reference but anyone is welcome to use it, clone it or do whatever you want with it. I will be updating as I go.
- Series Intro
- Creating a Series from a list, dictionary and csv
- Attributes
- Methods
- Parameters and arguements
- Sorting by value and index
- Extracting by label and index position
- Using apply method with custom functions
- Using the map method
-
- Selecting, Assigning new columns and using insert.
-
Data Cleaning (Basic Null value handling)
- Dealing with Nulls, NaN using isna( )
- Replacing with fillna( )
- Changing parameters
-
Operations, dropping and filling NA's & Conerting data types
-
- Setting the index, using .loc and .iloc
-
- Random Samples
- nlargest/smallest method
- using the where( ) method
- using the apply( ) method with a function for rows and columns
-
- Python date/time module
- Timestamps
- Pandas to_datetime method
- Handling errors
- Time delta Object
-
Example of grouping and using a max date value
- Time delta is also applied to create new columns based on a date
-
Merge, Join & Concatenate DataFrames
- Concatenating
- Joins - inner, outer, left
- Join paramters
- Merging
-
- Common methods - lower, upper, title and lrn
- Replacing values in a string
- Strip, lstrip, rstrip
- Spliting strings
-
- Basic Pandas charts and graphs for quick use
- Time Series, bar, scatter (Unfinished for now)
- Modifying aesthetics
-
Matplotlib Basic Visualizations
- Basic bar, line, hist, scatter plots.
- Sub plots, multiple plots
-
More Matplotlib - Modern and Most Complete Version
- Additional customization for plots - This is the most technical version with a lot of details for plots and styling.