## Overview

This notebook shows how to create and query a DataFrame from a file that you uploaded to DBFS. [DBFS](https://docs.databricks.com/user-guide/dbfs-databricks-file-system.html), the Databricks File System, lets you store data for querying inside Databricks. The notebook assumes that the file you want to read is already in DBFS.

This notebook is written in **Python**, so the default cell type is Python. However, you can switch languages in a cell with the `%LANGUAGE` syntax. Python, Scala, SQL, and R are all supported.

In [0]:
# File location and type
file_location = "/FileStore/tables/Book1-4.csv"
file_type = "csv"

# Read the CSV: use the first row as column names and infer the column types
df_pyspark = spark.read.csv(file_location, header=True, inferSchema=True)

In [0]:

df_pyspark.columns

Out[29]: ['Name', 'Age', 'salary']

In [0]:
df_pyspark.show()

+------+----+------+
|  Name| Age|salary|
+------+----+------+
|   sai|  20|  1000|
|nikhil|  24|  2000|
|juluri|  23|   158|
|sumith|  21|   965|
|kushal|  12|   124|
|  null|  15|   131|
|   hey|null|  null|
|   sai|  20| 12585|
|nikhil|  24|  2455|
+------+----+------+



In [0]:
# subset
# If a row has a null in any of the subset columns, the whole row is dropped
df_pyspark.na.drop(how="any", subset=["salary"]).show()

+------+---+------+
|  Name|Age|salary|
+------+---+------+
|   sai| 20|  1000|
|nikhil| 24|  2000|
|juluri| 23|   158|
|sumith| 21|   965|
|kushal| 12|   124|
|  null| 15|   131|
|   sai| 20| 12585|
|nikhil| 24|  2455|
+------+---+------+



In [0]:
# Filling values
# A numeric fill value replaces nulls only in numeric columns,
# so the null in the string column Name is left unchanged
df_pyspark.na.fill(value=0).show()

+------+---+------+
|  Name|Age|salary|
+------+---+------+
|   sai| 20|  1000|
|nikhil| 24|  2000|
|juluri| 23|   158|
|sumith| 21|   965|
|kushal| 12|   124|
|  null| 15|   131|
|   hey|  0|     0|
|   sai| 20| 12585|
|nikhil| 24|  2455|
+------+---+------+



In [0]:
# To fill nulls only in selected columns, pass the column name(s) as the second argument (subset)
df_pyspark.na.fill("value", "Name").show()

+------+----+------+
|  Name| Age|salary|
+------+----+------+
|   sai|  20|  1000|
|nikhil|  24|  2000|
|juluri|  23|   158|
|sumith|  21|   965|
|kushal|  12|   124|
| value|  15|   131|
|   hey|null|  null|
|   sai|  20| 12585|
|nikhil|  24|  2455|
+------+----+------+

