# 10 Minutes to pandas

In [11]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

## Object Creation

<p>Creating a Series by passing a list of values, letting pandas create a default integer index:</p>

In [12]:
s = pd.Series([1, 3, 5, np.nan, 6, 8])
s

0    1.0
1    3.0
2    5.0
3    NaN
4    6.0
5    8.0
dtype: float64
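The `float64` dtype above follows from the missing value: `np.nan` is a float, so the integers in the list are upcast. A minimal sketch of that behavior, with illustrative variable names that are not part of the original notebook:

```python
import numpy as np
import pandas as pd

ints = pd.Series([1, 3, 5])               # no missing values: stays an integer dtype
with_nan = pd.Series([1, 3, 5, np.nan])   # np.nan forces an upcast to float

print(ints.dtype)
print(with_nan.dtype)
print(with_nan.isna().sum())  # number of missing entries
```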

<p>Creating a DataFrame by passing a NumPy array, with a datetime index and labeled columns:</p>

In [13]:
dates = pd.date_range('20161112', periods=6)
dates

DatetimeIndex(['2016-11-12', '2016-11-13', '2016-11-14', '2016-11-15',
               '2016-11-16', '2016-11-17'],
              dtype='datetime64[ns]', freq='D')

In [14]:
df = pd.DataFrame(np.random.randn(6, 4), index=dates, columns=list('ABCD'))
df

                   A         B         C         D
2016-11-12 -1.515173 -0.212960  0.522402  0.327538
2016-11-13  1.265509  0.040801  0.263857  0.168280
2016-11-14 -0.897176  0.666018 -0.395420 -0.252789
2016-11-15 -2.156417  1.297175 -0.639900 -0.882191
2016-11-16 -1.245339 -1.100172 -0.421729 -0.421062
2016-11-17 -0.610857  0.307268  0.712611 -0.605391
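Because `np.random.randn` draws fresh values on every run, the numbers in this frame will differ between sessions. A minimal sketch of a reproducible variant, assuming NumPy's `default_rng` generator with an arbitrary seed of 0 (the seed and the name `rng` are illustrative choices, not taken from the original):

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)  # seed 0 is an arbitrary, illustrative choice
dates = pd.date_range('20161112', periods=6)

# Same shape, index, and column labels as the frame above, but repeatable values.
df = pd.DataFrame(rng.standard_normal((6, 4)), index=dates, columns=list('ABCD'))

print(df.shape)
print(df.index[0], df.index[-1])
```

Re-running the block with the same seed should yield the same values, which can make later examples easier to compare.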


<p>Creating a DataFrame by passing a dict of objects, each of which can be converted to a Series-like structure:</p>

In [15]:
# Pass the date as a string; an integer would be read as nanoseconds since the epoch.
df2 = pd.DataFrame({ 'A' : 1.,
                     'B' : pd.Timestamp('20161112'),
                     'C' : pd.Series(1,index=list(range(4)),dtype='float32'),
                     'D' : np.array([3] * 4,dtype='int32'),
                     'E' : pd.Categorical(["test","train","test","train"]),
                     'F' : 'foo'})
df2

     A          B    C  D      E    F
0  1.0 2016-11-12  1.0  3   test  foo
1  1.0 2016-11-12  1.0  3  train  foo
2  1.0 2016-11-12  1.0  3   test  foo
3  1.0 2016-11-12  1.0  3  train  foo
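Two details of the dict constructor may be worth illustrating: `pd.Timestamp` treats a string and an integer differently, and scalar values are broadcast to the length of the array-like values. The sketch below uses a hypothetical three-row frame named `small` purely for illustration:

```python
import numpy as np
import pandas as pd

as_string = pd.Timestamp('20161112')   # parsed as the calendar date 2016-11-12
as_integer = pd.Timestamp(20161112)    # read as nanoseconds since 1970-01-01

# Scalars ('A', 'B') are repeated to match the length of the array in 'D'.
small = pd.DataFrame({'A': 1.0,
                      'B': as_string,
                      'D': np.array([3] * 3, dtype='int32')})

print(as_string)
print(as_integer)
print(small)
```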


<p>The columns of the resulting DataFrame have different dtypes:</p>

In [16]:
df2.dtypes

A           float64
B    datetime64[ns]
C           float32
D             int32
E          category
F            object
dtype: object
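Per-column dtypes can also be used to filter or convert columns. A minimal sketch, assuming a cut-down frame named `frame` (a hypothetical stand-in for `df2`, rebuilt here so the block is self-contained):

```python
import numpy as np
import pandas as pd

frame = pd.DataFrame({'C': pd.Series(1, index=range(4), dtype='float32'),
                      'D': np.array([3] * 4, dtype='int32'),
                      'E': pd.Categorical(['test', 'train', 'test', 'train']),
                      'F': 'foo'})

# Keep only the numeric columns (the categorical and string columns are excluded).
numeric = frame.select_dtypes(include='number')

# Convert selected columns to wider types; astype returns a new frame.
widened = frame.astype({'C': 'float64', 'D': 'int64'})

print(list(numeric.columns))
print(widened.dtypes)
print(list(frame['E'].cat.categories))
```

`astype` leaves `frame` itself unchanged, so the original `float32` and `int32` columns remain available.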

## Viewing Data