<center><h1 style="color:green">Gated Recurrent Unit (GRU)</h1></center>

## 🔹 Introduction
The Gated Recurrent Unit (GRU) is a type of Recurrent Neural Network (RNN) architecture designed to mitigate the vanishing gradient problem. It is similar to Long Short-Term Memory (LSTM) but has a **simpler structure** with fewer parameters, which makes it cheaper to train and run.

---

## 1️⃣ GRU Architecture
A GRU unit consists of:

### **🔹 Hidden State ($h_t$)**
- Acts as the memory that carries information across time steps.
- Updated using two gates: **Reset Gate** and **Update Gate**.

### **🔹 Gates and State Update in GRU**
1. **Reset Gate ($r_t$)**
   - Determines how much of the previous hidden state is used when computing the candidate activation; values near 0 make the unit forget the past state.
   - Formula:
     
     $ r_t = \sigma(W_r \cdot [h_{t-1}, x_t] + b_r) $
     
   - Uses a sigmoid activation function ($\sigma$), so every entry of $r_t$ lies between 0 and 1.

2. **Update Gate ($z_t$)**
   - Controls how much of the previous hidden state is carried forward: values near 1 keep $h_{t-1}$, values near 0 replace it with the candidate activation.
   - Formula:
     
     $ z_t = \sigma(W_z \cdot [h_{t-1}, x_t] + b_z) $
     
3. **Candidate Activation ($\tilde{h}_t$)**
   - Represents a candidate for the new hidden state, computed from the current input and the reset-scaled previous state ($\odot$ denotes element-wise multiplication).
   - Formula:
     
     $ \tilde{h}_t = \tanh(W_h \cdot [r_t \odot h_{t-1}, x_t] + b_h) $
     
4. **Final Hidden State ($h_t$)**
   - The final hidden state is an element-wise interpolation between the previous state and the candidate activation, weighted by the update gate.
   - Formula:
     
     $ h_t = (1 - z_t) \odot \tilde{h}_t + z_t \odot h_{t-1} $
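
The four formulas above can be sketched as a single time step in NumPy. This is a minimal illustration, not a reference implementation: the function names (`gru_step`, `init_params`), the dictionary layout of the weights, the sizes, and the random initialization are all assumptions made for the example.

```python
import numpy as np


def sigmoid(a: np.ndarray) -> np.ndarray:
    """Logistic function, applied element-wise."""
    return 1.0 / (1.0 + np.exp(-a))


def gru_step(x_t: np.ndarray, h_prev: np.ndarray, params: dict) -> np.ndarray:
    """One GRU time step following the formulas above.

    x_t:    input vector, shape (input_size,)
    h_prev: previous hidden state h_{t-1}, shape (hidden_size,)
    params: W_r, W_z, W_h of shape (hidden_size, hidden_size + input_size)
            and b_r, b_z, b_h of shape (hidden_size,)
    """
    concat = np.concatenate([h_prev, x_t])                     # [h_{t-1}, x_t]
    r_t = sigmoid(params["W_r"] @ concat + params["b_r"])      # reset gate
    z_t = sigmoid(params["W_z"] @ concat + params["b_z"])      # update gate
    gated = np.concatenate([r_t * h_prev, x_t])                # [r_t * h_{t-1}, x_t]
    h_tilde = np.tanh(params["W_h"] @ gated + params["b_h"])   # candidate activation
    return (1.0 - z_t) * h_tilde + z_t * h_prev                # new hidden state h_t


def init_params(input_size: int, hidden_size: int, seed: int = 0) -> dict:
    """Small random weights and zero biases (an arbitrary choice for this demo)."""
    rng = np.random.default_rng(seed)
    shape = (hidden_size, hidden_size + input_size)
    params = {f"W_{g}": rng.normal(0.0, 0.1, size=shape) for g in "rzh"}
    params.update({f"b_{g}": np.zeros(hidden_size) for g in "rzh"})
    return params


# Run the cell over a short random sequence, starting from h_0 = 0.
params = init_params(input_size=3, hidden_size=4)
h = np.zeros(4)
for x_t in np.random.default_rng(1).normal(size=(5, 3)):
    h = gru_step(x_t, h, params)
print(h.shape)  # (4,)
```

Because each new state is a weighted average of the previous state and a `tanh` output, every entry of `h` stays strictly between -1 and 1 when the sequence starts from zeros.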
     
---

## 2️⃣ Why GRU Over Traditional RNNs and LSTMs?
### ✅ **Simpler than LSTM**
- GRU has only **two gates** instead of the LSTM's three and keeps no separate cell state, so it has fewer parameters and is cheaper to compute per time step.
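
To make the size difference concrete, the sketch below counts parameters under the formulation used in this document: each gate or candidate block has one weight matrix over the concatenated $[h_{t-1}, x_t]$ and one bias vector. The function names and the layer sizes are illustrative assumptions, and library implementations may lay out their biases differently, so their exact counts can differ.

```python
def block_params(input_size: int, hidden_size: int) -> int:
    """Parameters in one block: W of shape (hidden, hidden + input) plus a bias."""
    return hidden_size * (hidden_size + input_size) + hidden_size


def gru_params(input_size: int, hidden_size: int) -> int:
    # Three blocks: reset gate, update gate, candidate activation.
    return 3 * block_params(input_size, hidden_size)


def lstm_params(input_size: int, hidden_size: int) -> int:
    # Four blocks: input, forget, and output gates plus the cell candidate.
    return 4 * block_params(input_size, hidden_size)


print(gru_params(128, 256))   # 295680
print(lstm_params(128, 256))  # 394240
```

Under these assumptions a GRU layer always has three quarters of the parameters of an LSTM layer of the same size.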

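### ✅ **Mitigates the Vanishing Gradient Problem**
- When the update gate stays close to 1, the previous hidden state is copied forward almost unchanged, which gives gradients a more direct path through time and helps with **long-term dependency learning**.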
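
A rough scalar illustration of this effect: the sketch below applies the final hidden state formula repeatedly with a constant update gate and measures how much of the initial state survives. The constant gate, the scalar state, and the candidate fixed at zero are simplifying assumptions for the example; in a trained GRU all of these vary per step and per unit.

```python
def carried_fraction(z: float, steps: int) -> float:
    """Fraction of h_0 left after `steps` updates with a constant update gate z.

    Simplifying assumptions: scalar state, constant gate, candidate fixed at 0.
    """
    h = 1.0  # start from h_0 = 1 so the result reads as a fraction
    for _ in range(steps):
        h_candidate = 0.0
        h = (1.0 - z) * h_candidate + z * h  # h_t = (1 - z) * h~_t + z * h_{t-1}
    return h


for z in (0.5, 0.9, 0.99):
    print(f"z = {z}: {carried_fraction(z, 100):.3g}")
```

After 100 steps, roughly 37% of the initial state remains with a gate value of 0.99, while a gate value of 0.9 leaves only about 0.003%. A gate that learns to stay near 1 is what lets information, and the gradients that depend on it, persist over long sequences.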

### ✅ **Efficiently Captures Sequential Data Patterns**
- Works well for **speech recognition, language modeling, and time-series forecasting**.

---

## 3️⃣ Applications of GRU
- ✔ **Text and Speech Processing** (e.g., Machine Translation, Chatbots)
- ✔ **Time-Series Forecasting**
- ✔ **Stock Market Prediction**
- ✔ **Anomaly Detection**
- ✔ **Gesture and Handwriting Recognition**

---

## 🔥 Conclusion
GRUs are a simplified alternative to LSTMs, providing similar advantages in handling long-term dependencies but with fewer parameters. Their efficiency and effectiveness make them widely used in NLP, speech processing, and time-series tasks.
