Skip to content
View paulha042's full-sized avatar

Block or report paulha042

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
paulha042/README.md

Hi, I'm Paul 👋

About Me

🤖 An Artificial Intelligence/ Deep Learning / Machine Learning Enthusiast

👨‍🎓 Bachelor of IT in Data Science (2022 - 2025)

🏫 Master of IT in Artificial Intelligence (2025 - 2026)

Deep Learning - AI Project

  • Dataset: CIFAR-10 dataset used for generative image synthesis consisting of 60,000 color images across 10 categories (airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck).

  • Model Implemented:

    • Denoising Diffusion Implicit Model (DDIM) for image generation.
    • Custom U-Net architecture with attention mechanisms.
    • Implemented both forward (noise addition) and reverse (denoising) diffusion processes.
  • Achievement:

    • Demonstrated stable convergence and interpretable attention patterns over 100 training epochs.
    • Evaluated generated samples with Fréchet Inception Distance (FID = 102.02) and Inception Score (IS = 1.21), showing trade-offs between sampling speed, image fidelity, and diversity.
  • Future Improvement:

    • Implement conditional diffusion guidance for class-controlled generation.
    • Refine noise scheduling for improved efficiency and sample quality.
    • Experiment with deeper U-Net or transformer-based architectures to enhance robustness and image realism.

Machine Learning Project

  • Dataset: A private car sales dataset from a Kaggle competition containing vehicle attributes such as year, mileage, brand, fuel type, and engine specifications to predict selling prices.

  • Model Implemented:

    • Applied supervised regression models including RandomForest, XGBoost, LightGBM, and Gradient Boosting.
    • Focused on data preprocessing, feature engineering, and hyperparameter optimization using GridSearchCV.
    • Utilized Recursive Feature Elimination (RFE) to identify the most significant predictors of price.
  • Achievement:

    • Developed a car price prediction model achieving 10% Mean Absolute Percentage Error (MAPE).
    • Compare the capabilities and the peformance of each model to provide the highest accuracy.
  • Applied K-Means++ and Agglomerative Clustering to segment customers based on purchasing behavior.
  • Enabled businesses to design personalized marketing strategies, improve customer retention, and optimize product recommendations.
  • Conducted feature scaling and elbow/silhouette analysis to determine optimal cluster numbers.
  • Compared the clusters generated from two algorithms (see how they perform differently, we don't measure accuracy here 🙂).
  • Recommend strategies to target each customer segment generated from the two algorithms.

Data Visualization

Popular repositories Loading

  1. Customer_Segmentation_Analysis Customer_Segmentation_Analysis Public

    Jupyter Notebook

  2. Car_Sales_Prediction Car_Sales_Prediction Public

    Jupyter Notebook

  3. MNIST-Image-Classification MNIST-Image-Classification Public

    Jupyter Notebook

  4. Image_Classfication_CNN Image_Classfication_CNN Public

    Jupyter Notebook

  5. Football-PowerBI-Dashboard Football-PowerBI-Dashboard Public

  6. Diffusion-Model-DDIM-Approach-on-CIFAR-10 Diffusion-Model-DDIM-Approach-on-CIFAR-10 Public

    Jupyter Notebook