solomontessema/Data-Analytics-and-AI-with-Python

🧠 Data Analytics & AI with Python

Welcome to the official repository for Data Analytics and AI with Python, a hands-on course designed by Solomon Tessema to bridge foundational analytics with modern agentic AI workflows. This repo blends classical machine learning, PySpark-based data engineering, and LLM integration into reproducible, real-world pipelines.


🚀 Course Objectives

  • Master Python for data analysis, visualization, and automation
  • Build scalable ETL pipelines using PySpark
  • Apply classical ML techniques: regression, classification, clustering
  • Integrate LLMs and vector search for intelligent data workflows
  • Design modular, observable systems using n8n, LangChain, and custom APIs

🧰 Tech Stack

| Category | Tools & Libraries |
| --- | --- |
| Data Wrangling | Pandas, PySpark, SQL |
| Visualization | Matplotlib, Seaborn, Plotly |
| Machine Learning | Scikit-learn, TensorFlow, XGBoost |
| Workflow Design | n8n, FastAPI, RESTful APIs |

πŸ“ Repository Structure

├── notebooks/              # Jupyter & Colab notebooks
├── datasets/               # Sample CSVs and Parquet files
├── modules/                # Reusable Python scripts
├── pipelines/              # End-to-end ETL and ML flows
├── visualizations/         # Charts and dashboards
├── README.md               # This file
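
As a quick orientation to the layout above, the following is a minimal sketch of how a notebook or script might discover the sample files under `datasets/`. The helper name `list_datasets` and its `repo_root` parameter are hypothetical and not part of this repository; the sketch assumes only that `datasets/` holds CSV and Parquet files, as the tree describes.

```python
from pathlib import Path


def list_datasets(repo_root="."):
    """Return sorted, relative paths of CSV and Parquet files under datasets/.

    Hypothetical helper for illustration; not part of the repository.
    """
    data_dir = Path(repo_root) / "datasets"
    if not data_dir.is_dir():
        # No datasets/ folder at this root: nothing to report.
        return []
    return sorted(
        p.relative_to(data_dir).as_posix()
        for p in data_dir.rglob("*")
        if p.is_file() and p.suffix.lower() in {".csv", ".parquet"}
    )


if __name__ == "__main__":
    # Run from the repository root to print the available sample files.
    for name in list_datasets():
        print(name)
```

It uses only the standard library, so it runs before any of the tech stack is installed. The returned paths can then be passed to whichever reader suits the file type.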
