Skip to content

MercyMburu/Python-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

11 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Weekly Learning Summary: NumPy, Pandas, Data Visualization & Storytelling

This weekโ€™s class sessions were all about building a strong foundation in data analysis and storytelling using Python. I got hands-on experience with powerful libraries โ€” NumPy, Pandas, Matplotlib, and Seaborn โ€” and learned how to transform raw data into clear, meaningful insights. Below is a breakdown of what I learned in each section:

WEEK 1:NumPy โ€“ Working with Numerical Data

NumPy was the starting point of the week. I learned that itโ€™s the backbone of data science in Python, providing efficient ways to handle large numerical datasets. Some of the key takeaways and exercises included:

-Creating arrays using np.array().

-Understanding the difference between lists and arrays โ€” arrays are faster and more memory-efficient.

-Performing mathematical operations directly on arrays without loops.

-Using NumPy functions like:

*np.mean(), np.median(), np.std() for statistical calculations.

*np.reshape() and slicing to manipulate data structures.

*Exploring the concept of broadcasting, which allows operations between arrays of different shapes.

Overall, NumPy helped me understand how Python handles numerical computation at scale โ€” efficiently and cleanly.

WEEK 2:Pandas โ€“ Data Cleaning and Analysis

Next, we moved to Pandas, which is all about data manipulation and analysis. I realized that Pandas makes working with structured data (like Excel files) much easier.

Key concepts and lessons I practiced:

-Importing and reading datasets using pd.read_csv().

-Exploring data using:

*df.head(), df.tail(), and df.info().

*df.describe() for quick statistical summaries.

Cleaning data:

--Handling missing values using dropna() and fillna().

--Removing duplicates and renaming columns.

--Performing analysis:

--Grouping data with groupby().

--Sorting and filtering data based on conditions.

--Creating new calculated columns using simple operations or apply().

--Exporting clean datasets using df.to_csv().

By the end of the session, I was comfortable performing data wrangling, transformation, and summarization โ€” just like in Excel but much more efficiently.

WEEK 3:Data Visualization โ€“ Matplotlib & Seaborn

This part of the week was about making the data come alive through visuals. I learned how to use Matplotlib for basic plots and Seaborn for more advanced, aesthetically pleasing ones.

๐Ÿ”น Matplotlib

Creating simple charts like line plots, bar charts, histograms, and scatter plots using plt.plot(), plt.bar(), and plt.scatter().

Customizing visuals with:

Titles, labels, legends, and color options.

Figure size adjustments using plt.figure(figsize=(width, height)).

Adding grids and annotations to make visuals more informative.

๐Ÿ”น Seaborn

Seaborn made it easier to create clean, beautiful charts with less code. I practiced:

Plotting with sns.barplot(), sns.countplot(), sns.scatterplot(), and sns.heatmap().

Visualizing distributions using sns.histplot() and sns.boxplot().

Exploring relationships between variables through pairplots and correlation heatmaps.

These tools helped me communicate data patterns more effectively โ€” especially when comparing multiple variables.

WEEK 4:Data Storytelling โ€“ Turning Analysis into Insights

The final part of the week focused on data storytelling โ€” the art of explaining insights clearly and persuasively.

What I Learned:

Always start with a question or problem: what am I trying to find or prove?

Use visuals that support the narrative, not overwhelm it.

Focus on structure:

Problem or question

Analysis and visuals

Insights and recommendations

Keep charts clean and consistent with relevant colors, labels, and legends.

End every analysis with a clear takeaway that can guide action.

Example:

Instead of just showing a sales decline, tell the story โ€” โ€œSales dropped by 15% in Q2, mainly driven by lower performance in the Western region. Targeted marketing in this region could help recover the loss.โ€

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published