In this project, Airbnb and HR Datasets are used for the visualization with the help of python libraries : Matplotlib and Seaborn. And also using Excel for analyzing and cleaning both the Datasets. Showing 10 different visualization by using these 2 datasets after loading into 2 different dataframes. In each Visualisation, describing about the columns that are being used to visualise and the forms of representation.
Data visualization is an essential part of data analysis and communication, as it allows us to understand and interpret data more effectively through visual representations such as charts, graphs, and maps. It can be used by teachers to display student test results, by computer scientists exploring advancements in artificial intelligence (AI) or by executives looking to share information with stakeholders and also plays an important role in big data projects. With the help of Data visualization, business users gain insight into their vast amount of data and get benefits from them to recognize new patterns and errors in the data. Making sense of these patterns help the users to pay attention to the areas that indicate red flags and progress.
Before begin, ensure that we have the following prerequisite:
- Jupyter Notebook
- Excel
Airbnb dataset from kaggle https://www.kaggle.com/datasets/arianazmoudeh/airbnbopendata
Human Resources dataset from kaggle https://www.kaggle.com/datasets/rhuebner/human-resources-data-set
To get started with data visualization using Python:
-
Two extensively used Python libraries for data visualisation are Matplotlib and Seaborn.
- Matplotlib: A widely used library for creating static, animated, and interactive visualizations in Python. It provides a comprehensive set of functions for creating line plots, bar charts, scatter plots, histograms, and more.
- Seaborn: Built on top of Matplotlib, Seaborn provides a high-level interface for creating visually appealing statistical graphics. It simplifies the process of creating complex visualizations such as heatmaps, pair plots, and categorical plots.
-
In 1st Visualization: The columns using to visualize are 'neighbourhood_group' and 'price' from 'Airbnb dataset' with 'Bar chart' representation.
From this above chart we can analyze that variation in price depends on area by area.
-
In 2nd Visualization: The columns using to visualize are 'number_of_reviews' and 'price' from 'Airbnb dataset' with 'Scatter chart' representation.
From this above chart we can analyze that most number of people like to stay in less price and their reviews are higher in those areas.
-
In 3rd Visualization: Here using all the columns for visualization from 'Airbnb dataset' with 'Heatmap' representation.
From this above visualization we have seen all the correlation between different features or variables.
-
In 4th Visualization: The columns using to visualize are 'neighbourhood_group' and 'room_type' from 'Airbnb dataset' with 'countplot chart' representation.
From this above chart we can analyze that different types of rooms count grouped by neighbourhood_group.
-
In 5th Visualization: The column using to visualize is 'neighbourhood' from 'Airbnb dataset' with 'Histogram chart' representation.
From this above chart we can analyze that the frequency or the occurance of neighbourhood.
-
In 6th Visualization: The column using to visualize are 'Performance Score' and 'Salary' from 'HRDataset' with 'boxplot chart' representation.
From this above chart we can analyze that the Employees having salary according to their performance score.
-
In 7th Visualization: The column using to visualize is 'Salary' from 'HRDataset' with 'Distplot chart' representation.
From this above chart we can analyze that the maximum number of Employees have salary range between 50 thousand to 1 lakhs.
-
In 8th Visualization: The column using to visualize are 'MaritalDesc' and 'Salary' from 'HRDataset' with 'Stripplot chart' representation.
From this above chart we can analyze that the Salary distribution among the Marital Status.
-
In 9th Visualization: The column using to visualize are 'Department' and 'Absences' from 'HRDataset' with 'violinplot chart' representation.
From this above chart we can analyze that the number of Employees absent in each department.
-
In 10th Visualization: The column using to visualize is 'Department' from 'HRDataset' with 'Pie chart' representation.
From this above chart we can analyze that the number of Employees in each department.
This project is licensed under the MIT License. Feel free to modify and use it as per your requirements.
Python offers a wide range of powerful libraries and tools that enable us to create stunning visualizations, uncover insights, and tell compelling stories with our data. The provided examples have demonstrated how to create different types of visualizations, including bar chart, scatter plot, heatmap, box plot and interactive plots.
I hope this repository serves as a valuable resource for data visualization journey using Python.









