- Project Description
- Tech Stack Used
- Libraries Used
- Loading Data
- Head of Each Data
- Describe Data
- Creating New Data by Merging
- List of Columns
- Questions Solved
- Screenshots
- License
- Contributing
- Pdf file of analysis
This project focuses on analyzing data related to school projects and donations. The goal is to gain insights into various aspects such as the number of schools opening projects, the amount of donations received, the relationship between projects and donations, and the performance of different states in terms of donations.
The analysis involves loading the required datasets, merging them into a comprehensive dataset, and performing descriptive analysis to extract meaningful information. Key statistical measures like minimum, maximum, mean, median, and percentiles of donations are analyzed. Additionally, visualizations such as bar plots are created to present the findings effectively.
- Python
- Pandas: Used for data manipulation and analysis.
- NumPy: Used for numerical computations and array operations.
- Matplotlib: Used for data visualization and plotting.
- Scikit-learn: Used for machine learning and statistical modeling in Python.
- Load the required datasets into the analysis environment.
- Use Python and appropriate libraries (e.g., Pandas) to read and import the data.
- Display the first few rows of each dataset to get a quick overview of the data structure and contents.
- Utilize Pandas'
head()
function to retrieve the desired information.
- Perform a descriptive analysis of the datasets to gain insights into their statistical properties.
- Use Pandas'
describe()
function to compute statistical measures such as mean, standard deviation, quartiles, etc.
- Combine the separate datasets into a single comprehensive dataset for further analysis.
- Utilize appropriate merging techniques (e.g., Pandas'
merge()
function) based on common columns or keys.
- Generate a list of columns present in the merged dataset.
- Use Pandas'
columns
attribute orhead()
function to obtain the desired information.
- Which 10 states have the most number of schools that opened projects to gather donations? Plot the data using a bar plot.
- What are the top 10 states in which schools gathered the most amount of average donations?
- Analyze the minimum, maximum, mean, median, 25 and 75 percentiles of donations.
- In which states are there the most donations done by donors?
- Is there any relationship between the number of projects offered and the number of donations made by donors?
- Which states are performing better in terms of the number of donations per project? How many of them respond to project requests below average?
- We need to find the number of projects per state and the number of donations per state and then merge both. Fit a linear model to indicate the relationship between projects and donations.
- What are the different types of projects that exist? What is the total amount of donation for each type?
This project is licensed under the MIT License. Feel free to use, modify, and distribute the code for personal or commercial purposes.
Contributions are welcome! If you encounter any issues or have suggestions for improvement, please open an issue or submit a pull request. We appreciate your feedback and collaboration.
Please note that the tech stack and libraries mentioned above are just samples
Check the analysis here !! 👉 analysis pdf file