$\Large{\color{Purple}\textbf{Index}}$

- Clustering
  - K-Means
  - DBSCAN
  - Hierarchical Cluster Analysis (HCA)
- Anomaly detection and novelty detection
  - One-class SVM
  - Isolation Forest
- Visualization and dimensionality reduction
  - Principal Component Analysis (PCA)
  - Kernel PCA
  - Locally-Linear Embedding (LLE)
  - t-distributed Stochastic Neighbor Embedding (t-SNE)
- Association rule learning
  - Apriori
  - Eclat

⬛ $\Large{\color{Blue}\mathcal{Unsupervised\ Machine\ Learning}}$

♠️ What is Unsupervised Machine Learning?

$\Large Answer:$ Unsupervised learning, also known as unsupervised machine learning, uses machine learning algorithms to analyze and cluster unlabeled datasets. These algorithms discover hidden patterns or data groupings without the need for human intervention. Its ability to discover similarities and differences in information makes it well suited to exploratory data analysis, cross-selling strategies, customer segmentation, and image recognition.

♠️ How does unsupervised learning work?

$\Large Answer:$ Unsupervised learning works by analyzing unlabeled data to identify patterns and relationships. The data is not labeled with any predefined categories or outcomes, so the algorithm must find these patterns and relationships on its own. This is harder than learning from labeled examples, because there is no ground truth to check the result against, but it can reveal structure in the data that predefined labels would not capture.

The input to unsupervised learning models is typically:

- $\large{\color{Purple}\textrm{Unstructured data:}}$ May contain noisy (meaningless) data, missing values, or unknown data. Examples: digital photos, audio, and video files.
- $\large{\color{Purple}\textrm{Unlabeled data:}}$ Contains only values for the input parameters (features); there is no target value (label, output, or Y). It is easier to collect than the labeled data required by the supervised approach.

Unsupervised Learning Algorithm Types

There are four main types of algorithms used on unlabeled datasets:

- $\large{\color{Purple}\textrm{Clustering}}$
- $\large{\color{Purple}\textrm{Association Rule Learning}}$
- $\large{\color{Purple}\textrm{Dimensionality Reduction}}$
- $\large{\color{Purple}\textrm{Anomaly detection and novelty detection}}$

🔲 $\large{\color{Purple}\underline{\textrm{Clustering}}}$

Clustering in unsupervised machine learning is the process of grouping unlabeled data into clusters based on their similarities. The goal of clustering is to identify patterns and relationships in the data without any prior knowledge of the data’s meaning.

Broadly, this technique groups data according to the patterns, such as similarities or differences, that the model finds. These algorithms process raw, unclassified data objects into groups.

Some common clustering algorithms:

- K-means Clustering: Partitioning Data into K Clusters
- Hierarchical Clustering: Building a Hierarchical Structure of Clusters
- Density-Based Clustering (DBSCAN): Identifying Clusters Based on Density
- Mean-Shift Clustering: Finding Clusters Based on Mode Seeking
- Spectral Clustering: Utilizing Spectral Graph Theory for Clustering
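
To make the K-means entry above more concrete, here is a minimal sketch of its usual assign-then-update loop (Lloyd's algorithm) using only the Python standard library. The function names, the toy points, and the explicitly passed initial centroids are illustrative assumptions; practical implementations usually pick the starting centroids randomly or with a scheme such as k-means++.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def k_means(points, initial_centroids, max_iter=100):
    """Alternate assignment and centroid update until labels stop changing."""
    centroids = list(initial_centroids)
    k = len(centroids)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # converged: no point changed cluster
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = mean_point(members)
    return labels, centroids


# Hypothetical toy data: two visibly separated groups, no labels given.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1),
          (8.0, 8.0), (8.2, 7.9), (7.9, 8.3)]
labels, centroids = k_means(points, initial_centroids=[points[0], points[3]])
print(labels)
print(centroids)
```

Note that the algorithm never sees a target value: the grouping comes only from distances between the points themselves.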

🔲 $\large{\color{Purple}\underline{\textrm{Association Rule Learning}}}$

Association rule learning, also known as association rule mining, is a common technique for discovering associations in unsupervised machine learning. It is a rule-based ML technique that finds useful relations between the variables of a large dataset. It is mainly used for market basket analysis, which helps to better understand the relationship between different products. For example, stores use algorithms based on this technique to find out how the sales of one product relate to the sales of another, based on customer behavior: if a customer buys milk, they may also buy bread, eggs, or butter. Once the rules have been mined, a store can use them to increase sales by planning targeted offers.

- Apriori Algorithm: A Classic Method for Rule Induction
- FP-Growth Algorithm: An Efficient Alternative to Apriori
- Eclat Algorithm: Using a Vertical Data Layout and Transaction-ID Set Intersections for Efficient Itemset Mining
- Efficient Tree-based Algorithms: Handling Large Datasets with Scalability
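
The milk-and-bread example above can be expressed with the two standard measures behind association rules: support (how often an itemset appears) and confidence (how often the consequent appears when the antecedent does). The sketch below is a brute-force illustration using only the Python standard library; the transactions and function names are made up for this example, and it does not include the candidate pruning that Apriori, FP-Growth, or Eclat add for large datasets.

```python
from itertools import combinations

# Hypothetical market baskets, one set of items per transaction.
transactions = [
    {"milk", "bread", "butter"},
    {"milk", "bread"},
    {"milk", "eggs"},
    {"bread", "butter"},
    {"milk", "bread", "eggs"},
]


def count(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)


def support(itemset, transactions):
    """Fraction of transactions that contain the itemset."""
    return count(itemset, transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    both = count(set(antecedent) | set(consequent), transactions)
    return both / count(antecedent, transactions)


def frequent_pairs(transactions, min_support):
    """Brute force: test every pair of items against the support threshold."""
    items = sorted(set().union(*transactions))
    result = {}
    for pair in combinations(items, 2):
        s = support(pair, transactions)
        if s >= min_support:
            result[pair] = s
    return result


pairs = frequent_pairs(transactions, min_support=0.4)
rule_confidence = confidence({"milk"}, {"bread"}, transactions)
print(pairs)
print(rule_confidence)
```

A rule such as "milk implies bread" is usually kept only if both its support and its confidence exceed thresholds chosen by the analyst.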

🔲 $\large{\color{Purple}\underline{\textrm{Dimensionality Reduction}}}$

Dimensionality reduction is the process of reducing the number of features in a dataset while preserving as much information as possible. This technique is useful for improving the performance of machine learning algorithms and for data visualization. Examples of dimensionality reduction algorithms include:

- Principal Component Analysis (PCA): Linear Transformation for Reduced Dimensions
- Linear Discriminant Analysis (LDA): Dimensionality Reduction for Discrimination (note that LDA requires class labels, so it is a supervised technique)
- Non-negative Matrix Factorization (NMF): Decomposing Data into Non-negative Components
- Locally Linear Embedding (LLE): Preserving Local Geometry in Reduced Dimensions
- Isomap: Capturing Global Relationships in Reduced Dimensions
- Autoencoders: Neural networks that compress data and then reconstruct the original input from the compressed representation. The hidden layer acts as a bottleneck that compresses the input layer before it is reconstructed in the output layer. The stage from the input layer to the hidden layer is referred to as “encoding”, while the stage from the hidden layer to the output layer is known as “decoding”.
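
As a rough illustration of the PCA entry above, the sketch below reduces two correlated features to a single coordinate. It centers the data, builds the covariance matrix, and estimates the first principal component with power iteration, using only the Python standard library. The data and function names are assumptions made for this example; numerical libraries normally compute all components with an eigendecomposition or SVD instead.

```python
import math


def center(data):
    """Subtract the per-feature mean from every row."""
    n = len(data)
    means = [sum(col) / n for col in zip(*data)]
    return [[x - m for x, m in zip(row, means)] for row in data]


def covariance_matrix(centered):
    """Sample covariance matrix of already-centered rows."""
    n = len(centered)
    d = len(centered[0])
    return [
        [sum(row[i] * row[j] for row in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]


def first_principal_component(cov, iterations=200):
    """Power iteration: repeatedly multiply by cov and renormalize."""
    d = len(cov)
    v = [1.0 / math.sqrt(d)] * d  # arbitrary unit-length starting vector
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


# Hypothetical data: two features that rise together almost linearly.
data = [[2.0, 1.9], [4.0, 4.1], [6.0, 6.2], [8.0, 7.8], [10.0, 10.0]]
centered = center(data)
component = first_principal_component(covariance_matrix(centered))

# Project each 2-D row onto the component to get a 1-D representation.
projected = [sum(x * c for x, c in zip(row, component)) for row in centered]
print(component)
print(projected)
```

Because the two features are nearly redundant, the single projected coordinate keeps most of the variation in the original data.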

🔲 $\large{\color{Purple}\underline{\textrm{Anomaly detection and novelty detection}}}$
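
As a minimal sketch of what detecting atypical data points can look like, the example below flags values that lie far from the bulk of the data using a modified z-score based on the median and the median absolute deviation (MAD). This is a simple statistical stand-in chosen for brevity; it is not One-class SVM or Isolation Forest, which are the algorithms listed in the index. The readings, the function name, and the conventional 3.5 threshold are assumptions for illustration.

```python
from statistics import median


def mad_outliers(values, threshold=3.5):
    """Return the values whose modified z-score exceeds the threshold."""
    med = median(values)
    # Median absolute deviation: a spread estimate that outliers barely affect.
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        return []  # at least half the values equal the median; scores are undefined
    # 0.6745 rescales the MAD to be comparable to a standard deviation
    # under a normal distribution.
    return [v for v in values if 0.6745 * abs(v - med) / mad > threshold]


# Hypothetical sensor readings with one faulty measurement.
readings = [10.1, 9.8, 10.0, 10.3, 9.9, 10.2, 25.0, 10.0]
outliers = mad_outliers(readings)
print(outliers)
```

No reading is labeled as faulty in advance; the unusual value stands out only because it differs from the rest of the data.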

🔲 $\large{\color{Purple}\underline{\textrm{Applications of Unsupervised Machine Learning}}}$

Machine learning techniques have become a common method to improve a product's user experience and to test systems for quality assurance. Unsupervised learning provides an exploratory path to view data, allowing businesses to identify patterns in large volumes of data more quickly than manual observation would. Some of the most common real-world applications of unsupervised learning are:

- News Sections: Google News uses unsupervised learning to categorize articles on the same story from various online news outlets. For example, the results of a presidential election could be categorized under their label for “US” news.
- Computer vision: Unsupervised learning algorithms are used for visual perception tasks, such as object recognition.
- Medical imaging: Unsupervised machine learning provides essential features to medical imaging devices, such as image detection, classification and segmentation, used in radiology and pathology to diagnose patients quickly and accurately.
- Anomaly detection: Unsupervised learning models can comb through large amounts of data and discover atypical data points within a dataset. These anomalies can raise awareness around faulty equipment, human error, or breaches in security.
- Customer personas: Defining customer personas makes it easier to understand common traits and business clients' purchasing habits. Unsupervised learning allows businesses to build better buyer persona profiles, enabling organizations to align their product messaging more appropriately.
- Recommendation Engines: Using past purchase behavior data, unsupervised learning can help to discover data trends that can be used to develop more effective cross-selling strategies. This is used to make relevant add-on recommendations to customers during the checkout process for online retailers.
- Natural language processing (NLP): Unsupervised learning is used in a variety of NLP tasks, including topic modeling, document clustering, and part-of-speech tagging.

Challenges of unsupervised learning

While unsupervised learning has many benefits, letting machine learning models run without any human intervention also brings challenges. These can include:

- Computational complexity due to a high volume of training data
- Longer training times
- Higher risk of inaccurate results
- Need for human intervention to validate the output
- Lack of transparency into the basis on which data was clustered

References:

IBM
Geeks for geeks
