# Creating a DataFrame from Storage Format Files


There are two ways to create a pandas DataFrame:

1. By typing the values in Python itself to create the DataFrame
2. By importing the values from a file (such as an Excel file), and then creating the DataFrame in Python based on the values imported

Method 1: typing values in Python to create a pandas DataFrame

To start, let’s say that you have the following data about cars, and that you want to capture that data in Python using a pandas DataFrame:

    Brand             Price
    Honda Civic       22000
    Toyota Corolla    25000
    Ford Focus        27000
    Audi A4           35000

In [1]:
import pandas as pd

cars = {'Brand': ['Honda Civic','Toyota Corolla','Ford Focus','Audi A4'],
        'Price': [22000,25000,27000,35000]
        }

df = pd.DataFrame(cars, columns = ['Brand', 'Price'])

print(df)

            Brand  Price
0     Honda Civic  22000
1  Toyota Corolla  25000
2      Ford Focus  27000
3         Audi A4  35000


You may have noticed that each row is represented by a number (also known as the index) starting from 0. Alternatively, you may assign another value/name to represent each row.

For example, in the code below, index=['Car_1', 'Car_2', 'Car_3', 'Car_4'] was added:

In [2]:
import pandas as pd

cars = {'Brand': ['Honda Civic','Toyota Corolla','Ford Focus','Audi A4'],
        'Price': [22000,25000,27000,35000]
        }

df = pd.DataFrame(cars, columns = ['Brand','Price'], index=['Car_1','Car_2','Car_3','Car_4'])

print(df)

                Brand  Price
Car_1     Honda Civic  22000
Car_2  Toyota Corolla  25000
Car_3      Ford Focus  27000
Car_4         Audi A4  35000
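
With a custom index in place, rows can be looked up by label. The following is a minimal sketch that reuses the cars data from above; the variable names `row` and `price` are only illustrative:

```python
import pandas as pd

cars = {'Brand': ['Honda Civic', 'Toyota Corolla', 'Ford Focus', 'Audi A4'],
        'Price': [22000, 25000, 27000, 35000]}

df = pd.DataFrame(cars, index=['Car_1', 'Car_2', 'Car_3', 'Car_4'])

# .loc selects by index label; a single label returns the row as a Series
row = df.loc['Car_2']
print(row['Brand'])

# A row label plus a column name returns a single value
price = df.loc['Car_4', 'Price']
print(price)

# .iloc still selects by position, regardless of the labels
print(df.iloc[0]['Brand'])
```

Label-based lookup tends to be easier to read than positional lookup when the labels carry meaning, which is the main reason to replace the default 0-based index.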


Method 2: importing values from an Excel file to create a pandas DataFrame

Make sure that the column names specified in the code exactly match the column names in the Excel file.
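
To see why the names must match: when `pd.DataFrame` is given an existing DataFrame together with a `columns` list, a name that is not present in the data does not raise an error; it produces a column with no values. The sketch below uses a small in-memory DataFrame as a stand-in for the imported Excel data, and the misspelled name `'price'` and the list `missing` are purely illustrative:

```python
import pandas as pd

# Stand-in for data that has been read from a file
imported = pd.DataFrame({'Brand': ['Honda Civic', 'Audi A4'],
                         'Price': [22000, 35000]})

# 'price' (lowercase) does not match the 'Price' column in the data
df = pd.DataFrame(imported, columns=['Brand', 'price'])
print(df['price'].isna().all())  # checks whether the unmatched column is entirely missing

# A simple guard: compare the wanted names with the names actually present
wanted = ['Brand', 'price']
missing = [name for name in wanted if name not in imported.columns]
print(missing)
```

Running a check like this before selecting columns makes a naming mismatch visible right away instead of surfacing later as empty data.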

Let’s say that you have the following table stored in an Excel file (where the Excel file name is ‘Cars’):

    Brand             Price
    Honda Civic       22000
    Toyota Corolla    25000
    Ford Focus        27000
    Audi A4           35000

In the Python code below, you’ll need to change the path name to reflect the location where the Excel file is stored on your computer.

In my case, the Excel file is saved under the following path:

‘D:\Cars.xlsx’

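Once you have imported the data into Python, you can assign it to a DataFrame. Note that pd.read_excel already returns a DataFrame; passing it to pd.DataFrame with a columns list keeps only the listed columns.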

In [1]:
import pandas as pd

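# read_excel returns a DataFrame; the raw string (r'...') keeps the backslash in the path literal
cars = pd.read_excel(r'D:\Cars.xlsx')
# Keep only the listed columns, in this order
df = pd.DataFrame(cars, columns=['Brand', 'Price'])

print(df)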

            Brand  Price
0     Honda Civic  22000
1  Toyota Corolla  25000
2      Ford Focus  27000
3         Audi A4  35000
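
The same import pattern applies to other storage formats. As a sketch only, here is the CSV equivalent using `pd.read_csv`; the data is held in a string so that the example runs without a file on disk, and the file name `Cars.csv` mentioned in the comment is hypothetical:

```python
import io
import pandas as pd

# In-memory stand-in for a file such as Cars.csv (hypothetical file name)
csv_text = """Brand,Price
Honda Civic,22000
Toyota Corolla,25000
Ford Focus,27000
Audi A4,35000
"""

# read_csv accepts a path or a file-like object; usecols keeps only the named columns
df = pd.read_csv(io.StringIO(csv_text), usecols=['Brand', 'Price'])

print(df)
```

To read from a real file, the `io.StringIO(...)` argument would be replaced with the path to the file.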


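Note: pandas relies on a separate package to read Excel files. With older versions of pandas, you may get the following error when running the code:

ImportError: Install xlrd >= 1.0.0 for Excel support

You can then use pip to install xlrd as follows:

pip install xlrd

Recent versions of pandas read .xlsx files with openpyxl instead (xlrd 2.0 and later supports only the older .xls format). If the error message names openpyxl, install that package:

pip install openpyxl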
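
Before installing anything, it can help to check which Excel reader packages are already available. The following is a small sketch using only the standard library; `openpyxl` is included on the assumption that the file is an .xlsx workbook, which recent pandas versions read with openpyxl rather than xlrd:

```python
import importlib.util

# Check which of the common Excel engines can be imported in this environment
engines = {name: importlib.util.find_spec(name) is not None
           for name in ('openpyxl', 'xlrd')}

for name, installed in engines.items():
    print(name, 'is installed' if installed else 'is NOT installed')
```

Only the package reported as missing needs to be installed with pip.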

Reference: https://datatofish.com/create-pandas-dataframe