Unsupervised Learning for Clustering Car and Truck Images
This GitHub repository contains the code and documentation for a project applying unsupervised learning techniques, particularly clustering, on a Kaggle dataset containing images of cars and trucks. The primary goal is to explore and implement clustering algorithms to group similar images without using labeled examples. This project serves as the author's first venture into unsupervised learning, with a focus on clustering as an introductory task.
- Clustering
- Unsupervised Learning
- AI
- Computer Vision
-
Introduction
- Brief overview of unsupervised learning and its applications.
- Related studies highlighting the importance of clustering in various domains.
-
State of the Art
- Overview of current advancements in machine learning, with a focus on agglomerative clustering.
- Detailed explanation of agglomerative clustering, including key aspects and considerations.
- Introduction to Random Forests and Isolation Forests, discussing their applications in regression, classification, and anomaly detection.
-
Experiment
- Motivation for the theme, choice of programming language, and libraries used.
- Detailed information on the dataset, data preprocessing steps, and initial data analysis.
- Implementation details for agglomerative clustering and isolation forests.
- Visualization of the clustering results and anomaly detection.
-
Conclusions
- Summary of findings.
- Challenges encountered during the experiment.
-
Future Work
- Proposed ideas for future experiments and improvements.
-
References
- Citations for relevant studies and tools used in the project.