## Pandas: Check

In pandas, every change we make to the data should be checked. There are several functions and attributes for this; let's look briefly at them.

After reading in the data, it is always good to check that everything went well. We can inspect any output with the print() function. The challenge is that large data files do not print nicely on screen this way.

In [1]:
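import pandas as pd
# Raw string, so the backslashes in the Windows path are not treated as escapes
fp = r'C:\Users\Gokul G\Desktop\WORK\GISISH\GIS-ish\data\Kanpur.csv'
# Skip the first 6 lines of the file, use "Month" as the index and read -999 as NaN
data = pd.read_csv(fp, sep=',', index_col="Month", skiprows=6, na_values=[-999.0, -999])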

In [2]:
print(data)

          AOD_1640nm  AOD_1020nm  AOD_870nm  AOD_865nm  AOD_779nm  AOD_675nm  \
Month                                                                          
2001-JAN         NaN    0.176252   0.224173        NaN        NaN   0.313830   
2001-FEB         NaN    0.207033   0.249738        NaN        NaN   0.327059   
2001-MAR         NaN    0.222493   0.249700        NaN        NaN   0.294628   
2001-APR         NaN    0.317698   0.338976        NaN        NaN   0.373772   
2001-MAY         NaN    0.671964   0.702189        NaN        NaN   0.752185   
...              ...         ...        ...        ...        ...        ...   
2022-AUG         NaN         NaN        NaN        NaN        NaN        NaN   
2022-SEP         NaN         NaN        NaN        NaN        NaN        NaN   
2022-OCT         NaN         NaN        NaN        NaN        NaN        NaN   
2022-NOV         NaN         NaN        NaN        NaN        NaN        NaN   
2022-DEC         NaN         NaN        

To display the data more neatly, it is enough to enter the variable name on its own as the last line of a cell; the notebook then renders it as a table.

In [3]:
data

Month,AOD_1640nm,AOD_1020nm,AOD_870nm,AOD_865nm,AOD_779nm,AOD_675nm,AOD_667nm,AOD_620nm,AOD_560nm,AOD_555nm,...,NUM_POINTS[440-870_Angstrom_Exponent],NUM_POINTS[380-500_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent],NUM_POINTS[500-870_Angstrom_Exponent],NUM_POINTS[340-440_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent[Polar]],Data_Quality_Level,Latitude(degrees),Longitude(degrees),Elevation(meters)
2001-JAN,,0.176252,0.224173,,,0.313830,,,,,...,174,171,172,174,169,0,lev20,26.512778,80.231639,123
2001-FEB,,0.207033,0.249738,,,0.327059,,,,,...,988,986,987,988,976,0,lev20,26.512778,80.231639,123
2001-MAR,,0.222493,0.249700,,,0.294628,,,,,...,1169,1169,1169,1169,1165,0,lev20,26.512778,80.231639,123
2001-APR,,0.317698,0.338976,,,0.373772,,,,,...,1207,1207,1207,1207,1200,0,lev20,26.512778,80.231639,123
2001-MAY,,0.671964,0.702189,,,0.752185,,,,,...,691,684,691,691,633,0,lev20,26.512778,80.231639,123
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
2022-AUG,,,,,,,,,,,...,0,0,0,0,0,0,lev20,26.512778,80.231639,123
2022-SEP,,,,,,,,,,,...,0,0,0,0,0,0,lev20,26.512778,80.231639,123
2022-OCT,,,,,,,,,,,...,0,0,0,0,0,0,lev20,26.512778,80.231639,123
2022-NOV,,,,,,,,,,,...,0,0,0,0,0,0,lev20,26.512778,80.231639,123


This still includes the entire data, but the display is truncated so that only some of the columns are shown. To change this, we can set a display option so that all columns are shown. Rows can be handled the same way with the display.max_rows option.

In [4]:
pd.set_option('display.max_columns', 112)
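
Note that pd.set_option() changes the setting for the rest of the session. If the wider display is only needed for one cell, a temporary setting may be more convenient. The following is a minimal sketch using pd.option_context(); the small DataFrame named wide is a hypothetical stand-in for the real data, not the Kanpur file.

```python
import pandas as pd

# Hypothetical stand-in for the wide AOD table: 3 rows, 30 columns.
wide = pd.DataFrame(
    [[i + j for j in range(30)] for i in range(3)],
    columns=[f"col_{j}" for j in range(30)],
)

# Remember the current limit so we can confirm it is restored afterwards.
before = pd.get_option("display.max_columns")

# Inside the block, None means "no column limit"; the previous values
# are restored automatically when the block ends.
with pd.option_context("display.max_columns", None, "display.max_rows", 10):
    inside = pd.get_option("display.max_columns")
    print(wide)

after = pd.get_option("display.max_columns")
print(before, inside, after)
```

A setting changed with pd.set_option() can also be undone later with pd.reset_option('display.max_columns').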

- head()

Compared to the previous cases, it is often better to look at only the top 5–10 rows rather than displaying the entire DataFrame.

For this we can use data.head() to quickly check the contents of the DataFrame. This method returns the first n rows of the object based on position (n is 5 by default). It is useful for quickly testing whether your object has the right type of data in it.

In [5]:
data.head()

Month,AOD_1640nm,AOD_1020nm,AOD_870nm,AOD_865nm,AOD_779nm,AOD_675nm,AOD_667nm,AOD_620nm,AOD_560nm,AOD_555nm,AOD_551nm,AOD_532nm,AOD_531nm,AOD_510nm,AOD_500nm,AOD_490nm,AOD_443nm,AOD_440nm,AOD_412nm,AOD_400nm,AOD_380nm,AOD_340nm,Precipitable_Water(cm),AOD_681nm,AOD_709nm,AOD_Empty,AOD_Empty.1,AOD_Empty.2,AOD_Empty.3,AOD_Empty.4,440-870_Angstrom_Exponent,380-500_Angstrom_Exponent,440-675_Angstrom_Exponent,500-870_Angstrom_Exponent,340-440_Angstrom_Exponent,440-675_Angstrom_Exponent[Polar],NUM_DAYS[AOD_1640nm],NUM_DAYS[AOD_1020nm],NUM_DAYS[AOD_870nm],NUM_DAYS[AOD_865nm],NUM_DAYS[AOD_779nm],NUM_DAYS[AOD_675nm],NUM_DAYS[AOD_667nm],NUM_DAYS[AOD_620nm],NUM_DAYS[AOD_560nm],NUM_DAYS[AOD_555nm],NUM_DAYS[AOD_551nm],NUM_DAYS[AOD_532nm],NUM_DAYS[AOD_531nm],NUM_DAYS[AOD_510nm],NUM_DAYS[AOD_500nm],NUM_DAYS[AOD_490nm],NUM_DAYS[AOD_443nm],NUM_DAYS[AOD_440nm],NUM_DAYS[AOD_412nm],NUM_DAYS[AOD_400nm],NUM_DAYS[AOD_380nm],NUM_DAYS[AOD_340nm],NUM_DAYS[Precipitable_Water(cm)],NUM_DAYS[AOD_681nm],NUM_DAYS[AOD_709nm],NUM_DAYS[AOD_Empty],NUM_DAYS[AOD_Empty].1,NUM_DAYS[AOD_Empty].2,NUM_DAYS[AOD_Empty].3,NUM_DAYS[AOD_Empty].4,NUM_DAYS[440-870_Angstrom_Exponent],NUM_DAYS[380-500_Angstrom_Exponent],NUM_DAYS[440-675_Angstrom_Exponent],NUM_DAYS[500-870_Angstrom_Exponent],NUM_DAYS[340-440_Angstrom_Exponent],NUM_DAYS[440-675_Angstrom_Exponent[Polar]],NUM_POINTS[AOD_1640nm],NUM_POINTS[AOD_1020nm],NUM_POINTS[AOD_870nm],NUM_POINTS[AOD_865nm],NUM_POINTS[AOD_779nm],NUM_POINTS[AOD_675nm],NUM_POINTS[AOD_667nm],NUM_POINTS[AOD_620nm],NUM_POINTS[AOD_560nm],NUM_POINTS[AOD_555nm],NUM_POINTS[AOD_551nm],NUM_POINTS[AOD_532nm],NUM_POINTS[AOD_531nm],NUM_POINTS[AOD_510nm],NUM_POINTS[AOD_500nm],NUM_POINTS[AOD_490nm],NUM_POINTS[AOD_443nm],NUM_POINTS[AOD_440nm],NUM_POINTS[AOD_412nm],NUM_POINTS[AOD_400nm],NUM_POINTS[AOD_380nm],NUM_POINTS[AOD_340nm],NUM_POINTS[Precipitable_Water(cm)],NUM_POINTS[AOD_681nm],NUM_POINTS[AOD_709nm],NUM_POINTS[AOD_Empty],NUM_POINTS[AOD_Empty].1,NUM_POINTS[AOD_Empty].2,NUM_POINTS[AOD_Empty].3,NUM_POINTS[AOD_Empty].4,NUM_POINTS[440-870_Angstrom_Exponent],NUM_POINTS[380-500_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent],NUM_POINTS[500-870_Angstrom_Exponent],NUM_POINTS[340-440_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent[Polar]],Data_Quality_Level,Latitude(degrees),Longitude(degrees),Elevation(meters)
2001-JAN,,0.176252,0.224173,,,0.31383,,,,,,,,,0.465554,,,0.526493,,,0.607128,0.669815,0.987951,,,,,,,,1.291169,1.022942,1.273495,1.331534,1.036138,,0,6,6,0,0,6,0,0,0,0,0,0,0,0,6,0,0,6,0,0,6,6,6,0,0,0,0,0,0,0,6,6,6,6,6,0,0,172,174,0,0,174,0,0,0,0,0,0,0,0,172,0,0,171,0,0,169,163,174,0,0,0,0,0,0,0,174,171,172,174,169,0,lev20,26.512778,80.231639,123
2001-FEB,,0.207033,0.249738,,,0.327059,,,,,,,,,0.467158,,,0.524082,,,0.608011,0.681116,1.19202,,,,,,,,1.095703,0.953232,1.112246,1.123224,1.032452,,0,27,27,0,0,27,0,0,0,0,0,0,0,0,27,0,0,27,0,0,27,27,27,0,0,0,0,0,0,0,27,27,27,27,27,0,0,986,988,0,0,988,0,0,0,0,0,0,0,0,987,0,0,986,0,0,976,942,988,0,0,0,0,0,0,0,988,986,987,988,976,0,lev20,26.512778,80.231639,123
2001-MAR,,0.222493,0.2497,,,0.294628,,,,,,,,,0.384415,,,0.417067,,,0.482666,0.547509,1.33534,,,,,,,,0.79365,0.831398,0.838956,0.810105,1.058402,,0,29,29,0,0,29,0,0,0,0,0,0,0,0,29,0,0,29,0,0,29,28,29,0,0,0,0,0,0,0,29,29,29,29,29,0,0,1169,1169,0,0,1169,0,0,0,0,0,0,0,0,1169,0,0,1168,0,0,1165,1108,1169,0,0,0,0,0,0,0,1169,1169,1169,1169,1165,0,lev20,26.512778,80.231639,123
2001-APR,,0.317698,0.338976,,,0.373772,,,,,,,,,0.446749,,,0.463579,,,0.532277,0.585415,1.748747,,,,,,,,0.537987,0.671229,0.58184,0.556626,0.93195,,0,27,27,0,0,27,0,0,0,0,0,0,0,0,27,0,0,26,0,0,27,27,27,0,0,0,0,0,0,0,27,27,27,27,27,0,0,1207,1207,0,0,1207,0,0,0,0,0,0,0,0,1207,0,0,1102,0,0,1204,1132,1206,0,0,0,0,0,0,0,1207,1207,1207,1207,1200,0,lev20,26.512778,80.231639,123
2001-MAY,,0.671964,0.702189,,,0.752185,,,,,,,,,0.853487,,,0.920185,,,0.974761,1.03932,3.218051,,,,,,,,0.387213,0.492042,0.426325,0.395823,0.63106,,0,15,15,0,0,15,0,0,0,0,0,0,0,0,15,0,0,12,0,0,15,15,15,0,0,0,0,0,0,0,15,15,15,15,15,0,0,690,691,0,0,691,0,0,0,0,0,0,0,0,691,0,0,505,0,0,644,594,652,0,0,0,0,0,0,0,691,684,691,691,633,0,lev20,26.512778,80.231639,123
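
The n argument of head() can be passed explicitly. The sketch below uses a small DataFrame with illustrative values (a hypothetical stand-in, not the real file) to show the default, a custom n, and a negative n, which returns everything except the last rows.

```python
import pandas as pd

# Hypothetical monthly series with illustrative values (not the real Kanpur data).
months = ["2001-JAN", "2001-FEB", "2001-MAR", "2001-APR",
          "2001-MAY", "2001-JUN", "2001-JUL"]
demo = pd.DataFrame(
    {"AOD_500nm": [0.47, 0.47, 0.38, 0.45, 0.85, 0.90, 0.70]},
    index=pd.Index(months, name="Month"),
)

first_five = demo.head()        # default: n=5
first_two = demo.head(2)        # only the first two rows
all_but_last_two = demo.head(-2)  # negative n: drop the last two rows

print(first_two)
print(len(first_five), len(all_but_last_two))
```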


- tail()

We can also check the last rows of the data using data.tail(), which returns the last n rows (5 by default). It is useful for quickly verifying data, for example after sorting or appending rows. If n is larger than the number of rows, this method returns all rows.

In [6]:
data.tail()

Month,AOD_1640nm,AOD_1020nm,AOD_870nm,AOD_865nm,AOD_779nm,AOD_675nm,AOD_667nm,AOD_620nm,AOD_560nm,AOD_555nm,AOD_551nm,AOD_532nm,AOD_531nm,AOD_510nm,AOD_500nm,AOD_490nm,AOD_443nm,AOD_440nm,AOD_412nm,AOD_400nm,AOD_380nm,AOD_340nm,Precipitable_Water(cm),AOD_681nm,AOD_709nm,AOD_Empty,AOD_Empty.1,AOD_Empty.2,AOD_Empty.3,AOD_Empty.4,440-870_Angstrom_Exponent,380-500_Angstrom_Exponent,440-675_Angstrom_Exponent,500-870_Angstrom_Exponent,340-440_Angstrom_Exponent,440-675_Angstrom_Exponent[Polar],NUM_DAYS[AOD_1640nm],NUM_DAYS[AOD_1020nm],NUM_DAYS[AOD_870nm],NUM_DAYS[AOD_865nm],NUM_DAYS[AOD_779nm],NUM_DAYS[AOD_675nm],NUM_DAYS[AOD_667nm],NUM_DAYS[AOD_620nm],NUM_DAYS[AOD_560nm],NUM_DAYS[AOD_555nm],NUM_DAYS[AOD_551nm],NUM_DAYS[AOD_532nm],NUM_DAYS[AOD_531nm],NUM_DAYS[AOD_510nm],NUM_DAYS[AOD_500nm],NUM_DAYS[AOD_490nm],NUM_DAYS[AOD_443nm],NUM_DAYS[AOD_440nm],NUM_DAYS[AOD_412nm],NUM_DAYS[AOD_400nm],NUM_DAYS[AOD_380nm],NUM_DAYS[AOD_340nm],NUM_DAYS[Precipitable_Water(cm)],NUM_DAYS[AOD_681nm],NUM_DAYS[AOD_709nm],NUM_DAYS[AOD_Empty],NUM_DAYS[AOD_Empty].1,NUM_DAYS[AOD_Empty].2,NUM_DAYS[AOD_Empty].3,NUM_DAYS[AOD_Empty].4,NUM_DAYS[440-870_Angstrom_Exponent],NUM_DAYS[380-500_Angstrom_Exponent],NUM_DAYS[440-675_Angstrom_Exponent],NUM_DAYS[500-870_Angstrom_Exponent],NUM_DAYS[340-440_Angstrom_Exponent],NUM_DAYS[440-675_Angstrom_Exponent[Polar]],NUM_POINTS[AOD_1640nm],NUM_POINTS[AOD_1020nm],NUM_POINTS[AOD_870nm],NUM_POINTS[AOD_865nm],NUM_POINTS[AOD_779nm],NUM_POINTS[AOD_675nm],NUM_POINTS[AOD_667nm],NUM_POINTS[AOD_620nm],NUM_POINTS[AOD_560nm],NUM_POINTS[AOD_555nm],NUM_POINTS[AOD_551nm],NUM_POINTS[AOD_532nm],NUM_POINTS[AOD_531nm],NUM_POINTS[AOD_510nm],NUM_POINTS[AOD_500nm],NUM_POINTS[AOD_490nm],NUM_POINTS[AOD_443nm],NUM_POINTS[AOD_440nm],NUM_POINTS[AOD_412nm],NUM_POINTS[AOD_400nm],NUM_POINTS[AOD_380nm],NUM_POINTS[AOD_340nm],NUM_POINTS[Precipitable_Water(cm)],NUM_POINTS[AOD_681nm],NUM_POINTS[AOD_709nm],NUM_POINTS[AOD_Empty],NUM_POINTS[AOD_Empty].1,NUM_POINTS[AOD_Empty].2,NUM_POINTS[AOD_Empty].3,NUM_POINTS[AOD_Empty].4,NUM_POINTS[440-870_Angstrom_Exponent],NUM_POINTS[380-500_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent],NUM_POINTS[500-870_Angstrom_Exponent],NUM_POINTS[340-440_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent[Polar]],Data_Quality_Level,Latitude(degrees),Longitude(degrees),Elevation(meters)
2022-AUG,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,lev20,26.512778,80.231639,123
2022-SEP,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,lev20,26.512778,80.231639,123
2022-OCT,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,lev20,26.512778,80.231639,123
2022-NOV,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,lev20,26.512778,80.231639,123
2022-DEC,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,lev20,26.512778,80.231639,123
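
As a small illustration of the points above, the sketch below applies tail() to a hypothetical six-row DataFrame with made-up values. It shows the default n, an n larger than the number of rows, and the "after sorting" use case, where the last row of a sorted frame holds the largest value.

```python
import pandas as pd

# Hypothetical data with made-up values (not the real Kanpur data).
months = ["JAN", "FEB", "MAR", "APR", "MAY", "JUN"]
demo = pd.DataFrame(
    {"aod": [0.31, 0.33, 0.29, 0.37, 0.75, 0.52]},
    index=pd.Index(months, name="Month"),
)

last_five = demo.tail()       # default: n=5
last_two = demo.tail(2)       # only the last two rows
everything = demo.tail(100)   # n larger than the row count: all 6 rows

# After an ascending sort, the last row is the one with the largest value.
largest = demo.sort_values("aod").tail(1)

print(last_two)
print(largest)
```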


- shape

This returns a tuple with the number of rows and columns of the DataFrame. No parentheses are needed for shape because it is a data attribute, whereas head() is a method.

The simplest explanation I found is that a data attribute describes an object, whilst a method acts on an object. A method may change the object, but it does not have to: head(), for example, returns a new DataFrame and leaves the original untouched.

In [7]:
data.shape

(264, 112)
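
The difference between an attribute and a method can be checked directly. This is a minimal sketch on a tiny hypothetical DataFrame: shape is read without parentheses and can be unpacked into two variables, while head is callable and leaves the original object unchanged.

```python
import pandas as pd

# Tiny hypothetical DataFrame: 3 rows, 2 columns.
demo = pd.DataFrame({"a": [1, 2, 3], "b": [4.0, 5.0, 6.0]})

# shape is an attribute: no parentheses, and the tuple can be unpacked.
n_rows, n_cols = demo.shape

# head is a method: it must be called, and it returns a new DataFrame.
top = demo.head(2)

print(n_rows, n_cols)                            # rows and columns of demo
print(callable(demo.head), callable(demo.shape))  # method vs. plain tuple
print(top.shape, demo.shape)                      # demo itself is unchanged
```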

- info()

This method prints information about a DataFrame, including the index dtype, the column dtypes, non-null counts and memory usage.

In [8]:
data.info()

<class 'pandas.core.frame.DataFrame'>
Index: 264 entries, 2001-JAN to 2022-DEC
Columns: 112 entries, AOD_1640nm to Elevation(meters)
dtypes: float64(38), int64(73), object(1)
memory usage: 233.1+ KB
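
info() prints its report rather than returning it. If the text is needed, for example to write it to a log, it can be captured through the buf parameter. The sketch below assumes a small hypothetical DataFrame with one missing value; the per-column non-null counts are also computed separately with notna().sum().

```python
import io

import pandas as pd

# Hypothetical DataFrame with one missing value in the "aod" column.
demo = pd.DataFrame(
    {
        "aod": [0.2, None, 0.4],
        "level": ["lev20", "lev20", "lev20"],
        "days": [6, 27, 29],
    }
)

# Print the report to the screen, as in the cell above.
demo.info()

# Capture the same report as a string instead of printing it.
buffer = io.StringIO()
demo.info(buf=buffer)
report = buffer.getvalue()

# Non-null counts per column, as numbers that can be used in code.
non_null = demo.notna().sum()
print(non_null)
```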


- size

This attribute returns the number of elements in the object: the number of rows for a Series, and the number of rows times the number of columns for a DataFrame.

In [9]:
data.size

29568
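
The value above is simply the product of the two numbers in shape: 264 × 112 = 29568. The sketch below checks the same relation on a small hypothetical DataFrame and also shows that size counts missing cells, unlike count().

```python
import pandas as pd

# Hypothetical DataFrame: 3 rows, 2 columns, one missing value.
demo = pd.DataFrame({"a": [1.0, None, 3.0], "b": [4, 5, 6]})

rows, cols = demo.shape
total_cells = demo.size               # rows * cols, NaN cells included
series_cells = demo["a"].size         # for a Series: number of rows
non_missing = int(demo.count().sum())  # count() skips NaN values

print(total_cells, series_cells, non_missing)
print(264 * 112)  # shape of the data in this tutorial
```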

- describe()

This method generates descriptive statistics that summarize the central tendency, dispersion and shape of a dataset's distribution, excluding NaN values. By default only numeric columns are included, which is why Data_Quality_Level is missing from the output.

In [10]:
data.describe()

,AOD_1640nm,AOD_1020nm,AOD_870nm,AOD_865nm,AOD_779nm,AOD_675nm,AOD_667nm,AOD_620nm,AOD_560nm,AOD_555nm,AOD_551nm,AOD_532nm,AOD_531nm,AOD_510nm,AOD_500nm,AOD_490nm,AOD_443nm,AOD_440nm,AOD_412nm,AOD_400nm,AOD_380nm,AOD_340nm,Precipitable_Water(cm),AOD_681nm,AOD_709nm,AOD_Empty,AOD_Empty.1,AOD_Empty.2,AOD_Empty.3,AOD_Empty.4,440-870_Angstrom_Exponent,380-500_Angstrom_Exponent,440-675_Angstrom_Exponent,500-870_Angstrom_Exponent,340-440_Angstrom_Exponent,440-675_Angstrom_Exponent[Polar],NUM_DAYS[AOD_1640nm],NUM_DAYS[AOD_1020nm],NUM_DAYS[AOD_870nm],NUM_DAYS[AOD_865nm],NUM_DAYS[AOD_779nm],NUM_DAYS[AOD_675nm],NUM_DAYS[AOD_667nm],NUM_DAYS[AOD_620nm],NUM_DAYS[AOD_560nm],NUM_DAYS[AOD_555nm],NUM_DAYS[AOD_551nm],NUM_DAYS[AOD_532nm],NUM_DAYS[AOD_531nm],NUM_DAYS[AOD_510nm],NUM_DAYS[AOD_500nm],NUM_DAYS[AOD_490nm],NUM_DAYS[AOD_443nm],NUM_DAYS[AOD_440nm],NUM_DAYS[AOD_412nm],NUM_DAYS[AOD_400nm],NUM_DAYS[AOD_380nm],NUM_DAYS[AOD_340nm],NUM_DAYS[Precipitable_Water(cm)],NUM_DAYS[AOD_681nm],NUM_DAYS[AOD_709nm],NUM_DAYS[AOD_Empty],NUM_DAYS[AOD_Empty].1,NUM_DAYS[AOD_Empty].2,NUM_DAYS[AOD_Empty].3,NUM_DAYS[AOD_Empty].4,NUM_DAYS[440-870_Angstrom_Exponent],NUM_DAYS[380-500_Angstrom_Exponent],NUM_DAYS[440-675_Angstrom_Exponent],NUM_DAYS[500-870_Angstrom_Exponent],NUM_DAYS[340-440_Angstrom_Exponent],NUM_DAYS[440-675_Angstrom_Exponent[Polar]],NUM_POINTS[AOD_1640nm],NUM_POINTS[AOD_1020nm],NUM_POINTS[AOD_870nm],NUM_POINTS[AOD_865nm],NUM_POINTS[AOD_779nm],NUM_POINTS[AOD_675nm],NUM_POINTS[AOD_667nm],NUM_POINTS[AOD_620nm],NUM_POINTS[AOD_560nm],NUM_POINTS[AOD_555nm],NUM_POINTS[AOD_551nm],NUM_POINTS[AOD_532nm],NUM_POINTS[AOD_531nm],NUM_POINTS[AOD_510nm],NUM_POINTS[AOD_500nm],NUM_POINTS[AOD_490nm],NUM_POINTS[AOD_443nm],NUM_POINTS[AOD_440nm],NUM_POINTS[AOD_412nm],NUM_POINTS[AOD_400nm],NUM_POINTS[AOD_380nm],NUM_POINTS[AOD_340nm],NUM_POINTS[Precipitable_Water(cm)],NUM_POINTS[AOD_681nm],NUM_POINTS[AOD_709nm],NUM_POINTS[AOD_Empty],NUM_POINTS[AOD_Empty].1,NUM_POINTS[AOD_Empty].2,NUM_POINTS[AOD_Empty].3,NUM_POINTS[AOD_Empty].4,NUM_POINTS[440-870_Angstrom_Exponent],NUM_POINTS[380-500_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent],NUM_POINTS[500-870_Angstrom_Exponent],NUM_POINTS[340-440_Angstrom_Exponent],NUM_POINTS[440-675_Angstrom_Exponent[Polar]],Latitude(degrees),Longitude(degrees),Elevation(meters)
count,113.0,225.0,228.0,0.0,0.0,231.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,231.0,0.0,0.0,231.0,0.0,0.0,228.0,230.0,230.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,231.0,231.0,231.0,231.0,231.0,0.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0,264.0
mean,0.214897,0.335258,0.386183,,,0.493446,,,,,,,,,0.663366,,,0.737413,,,0.82727,0.892741,2.718355,,,,,,,,1.023243,0.877037,1.004167,1.050596,0.828608,,8.67803,16.875,17.382576,0.0,0.0,17.333333,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,17.511364,0.0,0.0,17.424242,0.0,0.0,17.181818,17.079545,17.5,0.0,0.0,0.0,0.0,0.0,0.0,0.0,17.534091,17.492424,17.511364,17.522727,17.333333,0.0,464.772727,710.700758,732.503788,0.0,0.0,729.659091,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,729.287879,0.0,0.0,718.397727,0.0,0.0,691.465909,644.348485,729.227273,0.0,0.0,0.0,0.0,0.0,0.0,0.0,734.787879,721.268939,729.344697,734.465909,694.064394,0.0,26.51278,80.23164,123.0
std,0.104191,0.136913,0.134328,,,0.146812,,,,,,,,,0.180585,,,0.198788,,,0.216097,0.224253,1.555244,,,,,,,,0.331208,0.194201,0.289591,0.365931,0.173024,,11.481781,10.206029,10.099573,0.0,0.0,9.979574,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,9.965333,0.0,0.0,9.985633,0.0,0.0,10.123059,10.002534,9.991251,0.0,0.0,0.0,0.0,0.0,0.0,0.0,9.968714,9.94968,9.965333,9.966458,9.990617,0.0,761.864778,708.212567,705.607594,0.0,0.0,704.533849,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,698.437066,0.0,0.0,692.830573,0.0,0.0,674.458977,637.266053,701.450128,0.0,0.0,0.0,0.0,0.0,0.0,0.0,703.080717,692.070168,698.416628,703.125396,672.550098,0.0,5.339192e-14,1.281406e-13,0.0
min,0.069623,0.128439,0.155948,,,0.091856,,,,,,,,,0.306863,,,0.338676,,,0.404804,0.409203,0.800253,,,,,,,,0.159543,0.161845,0.158152,0.160183,0.346314,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,26.51278,80.23164,123.0
25%,0.149434,0.244783,0.291455,,,0.38612,,,,,,,,,0.527648,,,0.587554,,,0.67069,0.728307,1.403306,,,,,,,,0.757561,0.799643,0.82228,0.751995,0.735857,,0.0,8.0,9.0,0.0,0.0,10.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,10.0,0.0,0.0,9.75,0.0,0.0,8.75,8.75,10.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,10.0,10.0,10.0,10.0,9.0,0.0,0.0,177.25,202.75,0.0,0.0,201.25,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,203.5,0.0,0.0,198.0,0.0,0.0,172.5,160.75,203.75,0.0,0.0,0.0,0.0,0.0,0.0,0.0,203.75,202.75,203.5,203.75,196.25,0.0,26.51278,80.23164,123.0
50%,0.172581,0.307655,0.370896,,,0.479536,,,,,,,,,0.632836,,,0.711554,,,0.798324,0.864513,2.062614,,,,,,,,1.148034,0.925909,1.105296,1.1696,0.821976,,0.0,18.0,18.5,0.0,0.0,18.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,19.0,0.0,0.0,19.0,0.0,0.0,18.5,18.0,19.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,19.0,19.0,19.0,19.0,19.0,0.0,0.0,549.0,608.0,0.0,0.0,579.5,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,612.0,0.0,0.0,585.0,0.0,0.0,563.5,521.5,614.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,614.5,600.0,612.0,608.0,563.5,0.0,26.51278,80.23164,123.0
75%,0.271055,0.386865,0.450169,,,0.587685,,,,,,,,,0.781024,,,0.875255,,,0.978104,1.042066,4.154134,,,,,,,,1.295319,0.995624,1.223059,1.344492,0.928319,,20.25,26.0,26.0,0.0,0.0,26.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,26.0,0.0,0.0,26.0,0.0,0.0,26.0,26.0,26.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,26.0,26.0,26.0,26.0,26.0,0.0,753.25,1033.25,1100.0,0.0,0.0,1099.25,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1067.5,0.0,0.0,1030.0,0.0,0.0,980.75,900.25,1092.25,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1099.25,1033.25,1067.5,1099.25,980.75,0.0,26.51278,80.23164,123.0
max,0.591585,0.991452,1.013257,,,1.049267,,,,,,,,,1.125929,,,1.246422,,,1.371127,1.437079,5.908376,,,,,,,,1.508481,1.243165,1.463844,1.788171,1.430991,,31.0,31.0,31.0,0.0,0.0,31.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,31.0,0.0,0.0,31.0,0.0,0.0,31.0,31.0,31.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,31.0,31.0,31.0,31.0,31.0,0.0,3419.0,3419.0,3419.0,0.0,0.0,3419.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3419.0,0.0,0.0,3411.0,0.0,0.0,3357.0,3250.0,3413.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3419.0,3411.0,3419.0,3419.0,3361.0,0.0,26.51278,80.23164,123.0
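
With 112 columns the full describe() table is hard to read, so it may help to describe one column at a time or to choose the percentiles. The sketch below uses a hypothetical two-column DataFrame to show that NaN values are excluded from count, that text columns are skipped unless include="all" is passed, and that custom percentiles can be requested.

```python
import pandas as pd

# Hypothetical DataFrame: one numeric column with a NaN, one text column.
demo = pd.DataFrame(
    {
        "aod": [0.2, 0.4, None, 0.6],
        "level": ["lev20", "lev20", "lev20", "lev20"],
    }
)

# Default: numeric columns only, NaN values ignored.
stats = demo.describe()

# include="all" also summarizes non-numeric columns.
stats_all = demo.describe(include="all")

# A single column with custom percentiles (10% and 90%).
aod_stats = demo["aod"].describe(percentiles=[0.1, 0.9])

print(stats)
print(aod_stats)
```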



These are the methods and attributes you should be familiar with for checking the data during analysis.