**Pandas is a core Python library for data analysis and manipulation. It simplifies working with structured (tabular) data and is widely used in data science, machine learning, and finance.**

In [1]:
import pandas as pd
import numpy as np

In [3]:
s1 = pd.Series(data=[10,20,30,40,50])
s1

0    10
1    20
2    30
3    40
4    50
dtype: int64

In [5]:
s2 = pd.Series(data=[100,200,300,400,500])
s2

0    100
1    200
2    300
3    400
4    500
dtype: int64

**A Series can also be given a custom index through the `index` argument.**

In [7]:
s3 = pd.Series(data=[1000,2000,3000,4000,5000],index=list('vwxyz'))
s3

v    1000
w    2000
x    3000
y    4000
z    5000
dtype: int64
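
**With a custom index, values can be looked up either by label or by position. A minimal sketch, reusing the same `s3` values as above; the variable names `by_label`, `by_position`, and `label_slice` are illustrative only.**

```python
import pandas as pd

s3 = pd.Series(data=[1000, 2000, 3000, 4000, 5000], index=list('vwxyz'))

# Label-based access uses the custom index
by_label = s3.loc['x']

# Position-based access ignores the labels and counts from 0
by_position = s3.iloc[0]

# Unlike positional slices, label slices include both endpoints
label_slice = s3.loc['w':'y']

print(by_label, by_position)
print(label_slice)
```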

In [9]:
# Each Series becomes a column; rows are matched on the shared index 0..4
df = pd.DataFrame({'C1':s1, 'C2':s2})
df

,C1,C2
0,10,100
1,20,200
2,30,300
3,40,400
4,50,500
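
**When a DataFrame is built from a dict of Series, rows are aligned on index labels rather than on position. The sketch below uses two small hypothetical Series (`a` and `b`, not part of the notebook) whose indexes only partly overlap, so the missing label is filled with `NaN`.**

```python
import pandas as pd

# Hypothetical Series with partly overlapping string indexes
a = pd.Series([1, 2, 3], index=list('xyz'))
b = pd.Series([10, 20], index=list('yz'))

# The result uses the union of both indexes; 'x' has no value in b
aligned = pd.DataFrame({'A': a, 'B': b})
print(aligned)

# Column B is stored as float because NaN cannot live in an int64 column
print(aligned.dtypes)
```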


**NumPy's `np.random.randint(low, high, size)` generates random integers from `low` (inclusive) up to `high` (exclusive). It is useful for simulations, testing, and initializing values. Because the values are random, the output below will differ on every run.**

In [11]:
# 1000 rows x 5 columns of random integers between 1 and 99
df = pd.DataFrame(data=np.random.randint(1,100,size=(1000,5)), columns=list('abcde'))
df

,a,b,c,d,e
0,19,39,9,91,56
1,43,36,88,60,41
2,38,26,14,87,50
3,46,79,42,23,19
4,49,21,33,55,13
...,...,...,...,...,...
995,87,80,21,2,57
996,25,83,6,23,68
997,38,18,31,35,93
998,61,46,99,71,92
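
**To get the same random table on every run, the generator can be seeded first. A minimal sketch; the seed value `0` and the names `first`, `second`, and `df_demo` are arbitrary choices for illustration.**

```python
import numpy as np
import pandas as pd

# Seeding makes the "random" draws repeatable
np.random.seed(0)
first = np.random.randint(1, 100, size=(1000, 5))

# Re-seeding with the same value reproduces the same numbers
np.random.seed(0)
second = np.random.randint(1, 100, size=(1000, 5))

df_demo = pd.DataFrame(data=first, columns=list('abcde'))

print((first == second).all())      # both draws are identical
print(first.min(), first.max())     # values stay within 1..99
```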


In [15]:
dataset = pd.read_csv(r'D:\data science\python\DATASETS\Car.csv')
dataset

,Age,Income,Car
0,28,37000,0
1,27,88000,0
2,28,59000,0
3,32,86000,0
4,33,149000,1
...,...,...,...
95,28,89000,0
96,34,43000,0
97,30,79000,0
98,20,36000,0
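
**`pd.read_csv` also accepts a file-like object, which makes it possible to try it without the file on disk. The sketch below assumes the CSV has a header row `Age,Income,Car` (the actual layout of `Car.csv` is not shown here) and reuses the first five rows from the output above.**

```python
import io
import pandas as pd

# Assumed file layout: one header row, then comma-separated integers
csv_text = """Age,Income,Car
28,37000,0
27,88000,0
28,59000,0
32,86000,0
33,149000,1
"""

# StringIO wraps the text so read_csv can treat it like an open file
sample = pd.read_csv(io.StringIO(csv_text))
print(sample)
print(sample.dtypes)
```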


In [17]:
dataset.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 100 entries, 0 to 99
Data columns (total 3 columns):
 #   Column  Non-Null Count  Dtype
---  ------  --------------  -----
 0   Age     100 non-null    int64
 1   Income  100 non-null    int64
 2   Car     100 non-null    int64
dtypes: int64(3)
memory usage: 2.5 KB


In [19]:
dataset.head()

,Age,Income,Car
0,28,37000,0
1,27,88000,0
2,28,59000,0
3,32,86000,0
4,33,149000,1


In [21]:
dataset.tail()

,Age,Income,Car
95,28,89000,0
96,34,43000,0
97,30,79000,0
98,20,36000,0
99,26,80000,0
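
**Both `head()` and `tail()` return 5 rows by default and accept an optional row count. A short sketch on a hypothetical 100-row frame (`frame` is not the car dataset):**

```python
import pandas as pd

# Hypothetical frame with a single column holding 0..99
frame = pd.DataFrame({'value': range(100)})

first_three = frame.head(3)   # first 3 rows
last_two = frame.tail(2)      # last 2 rows

print(first_three)
print(last_two)
```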


In [23]:
dataset.Car.value_counts()

Car
0    93
1     7
Name: count, dtype: int64
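
**Passing `normalize=True` to `value_counts()` returns shares instead of raw counts, which makes the class imbalance easier to read. The sketch rebuilds a stand-in column from the counts shown above (93 zeros, 7 ones) rather than loading the real file.**

```python
import pandas as pd

# Stand-in for the Car column, built from the counts above
car = pd.Series([0] * 93 + [1] * 7, name='Car')

counts = car.value_counts()
shares = car.value_counts(normalize=True)  # fractions that sum to 1

print(counts)
print(shares)
```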

In [25]:
# rename() returns a renamed copy; dataset itself is left unchanged
dataset.rename(columns={'Car':'Target'})

,Age,Income,Target
0,28,37000,0
1,27,88000,0
2,28,59000,0
3,32,86000,0
4,33,149000,1
...,...,...,...
95,28,89000,0
96,34,43000,0
97,30,79000,0
98,20,36000,0


In [27]:
dataset

,Age,Income,Car
0,28,37000,0
1,27,88000,0
2,28,59000,0
3,32,86000,0
4,33,149000,1
...,...,...,...
95,28,89000,0
96,34,43000,0
97,30,79000,0
98,20,36000,0


In [29]:
# inplace=True modifies dataset directly (the call itself returns None)
dataset.rename(columns={'Car':'Target'},inplace=True)
dataset

,Age,Income,Target
0,28,37000,0
1,27,88000,0
2,28,59000,0
3,32,86000,0
4,33,149000,1
...,...,...,...
95,28,89000,0
96,34,43000,0
97,30,79000,0
98,20,36000,0


In [31]:
dataset.columns

Index(['Age', 'Income', 'Target'], dtype='object')
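
**The cells above show that `rename()` leaves the original DataFrame untouched unless `inplace=True` is passed. Assigning the result back to the same name is a common alternative with the same end result. A small sketch on a hypothetical two-row frame (`demo`):**

```python
import pandas as pd

# Hypothetical frame standing in for the car dataset
demo = pd.DataFrame({'Age': [28, 27], 'Car': [0, 0]})

# Without inplace=True the original keeps its column names
renamed = demo.rename(columns={'Car': 'Target'})
columns_before = list(demo.columns)

# Reassigning the result replaces the old frame with the renamed one
demo = demo.rename(columns={'Car': 'Target'})
columns_after = list(demo.columns)

print(columns_before)
print(columns_after)
```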