# Averaged Word2Vec Embedding — How It Works

## What is Averaged Word2Vec?

Averaged Word2Vec is a simple technique for building **sentence or document embeddings**: look up the Word2Vec vector of each word in the text and take the **element-wise average** of those vectors.

---

## Sample Sentence

Let's use the sentence:

`"The cat sits on the mat"`

Suppose our vocabulary includes:

`["the", "cat", "sits", "on", "mat"]`

---

## Step 1: Get Word2Vec Embeddings

Assume each word maps to a **2-dimensional Word2Vec vector**. Real models typically use a few hundred dimensions; two keeps the arithmetic readable.

| Word   | Word2Vec Vector         |
|--------|-------------------------|
| "the"  | $[0.1,\ -0.3]$          |
| "cat"  | $[0.7,\ 0.4]$           |
| "sits" | $[0.6,\ 0.5]$           |
| "on"   | $[0.0,\ 0.2]$           |
| "mat"  | $[-0.1,\ 0.3]$          |
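
The lookup table above can be sketched as a plain Python dictionary. This is a minimal illustration: the name `embeddings` and the helper `lookup` are hypothetical, and the values are the toy numbers from the table, not output from a trained model.

```python
# Toy 2-dimensional vectors copied from the table above (illustrative values only).
embeddings = {
    "the": [0.1, -0.3],
    "cat": [0.7, 0.4],
    "sits": [0.6, 0.5],
    "on": [0.0, 0.2],
    "mat": [-0.1, 0.3],
}


def lookup(word):
    """Return the vector for a word, or None if the word is not in the vocabulary."""
    return embeddings.get(word)


print(lookup("cat"))  # [0.7, 0.4]
print(lookup("dog"))  # None
```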

---

## Step 2: Tokenize the Sentence

Lowercasing `"The cat sits on the mat"` and splitting it on whitespace gives the tokens:

`["the", "cat", "sits", "on", "the", "mat"]`

Note that "the" appears **twice**.
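
A minimal sketch of this step, assuming a deliberately naive tokenizer that lowercases and splits on whitespace. The function name `tokenize` is hypothetical, and real pipelines usually also handle punctuation.

```python
from collections import Counter


def tokenize(sentence):
    """Lowercase the sentence and split it on whitespace (naive on purpose)."""
    return sentence.lower().split()


tokens = tokenize("The cat sits on the mat")
print(tokens)                   # ['the', 'cat', 'sits', 'on', 'the', 'mat']
print(Counter(tokens)["the"])   # 2
```

Repeated tokens are kept rather than deduplicated, so each occurrence contributes to the average.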

---

## Step 3: Compute Average Vector

Let the sentence have $n$ tokens (counting repeats), each represented by a Word2Vec vector $\vec{v}_i$.

The **averaged sentence embedding** is calculated as:

$$
\vec{v}_{\text{sentence}} = \frac{1}{n} \sum_{i=1}^{n} \vec{v}_i
$$

For our example:

- "the" appears twice: $2 \times [0.1,\ -0.3] = [0.2,\ -0.6]$
- Add all vectors:

$$
\vec{v}_{\text{sum}} = [0.2, -0.6] + [0.7, 0.4] + [0.6, 0.5] + [0.0, 0.2] + [-0.1, 0.3]
= [1.4, 0.8]
$$

- Total number of tokens: $n = 6$ (the repeated "the" counts twice)

$$
\vec{v}_{\text{sentence}} = \frac{1}{6}\,[1.4,\ 0.8] \approx [0.2333,\ 0.1333]
$$
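
The arithmetic above can be checked with a short sketch in plain Python. The function name `average_embedding` is hypothetical, and skipping out-of-vocabulary tokens is an assumption of this sketch, not something the walkthrough specifies.

```python
# Toy vectors from Step 1 (illustrative values only).
embeddings = {
    "the": [0.1, -0.3],
    "cat": [0.7, 0.4],
    "sits": [0.6, 0.5],
    "on": [0.0, 0.2],
    "mat": [-0.1, 0.3],
}


def average_embedding(tokens, embeddings):
    """Element-wise mean of the vectors of all in-vocabulary tokens."""
    # Assumption: tokens without a vector are skipped.
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        raise ValueError("no token has an embedding")
    n = len(vectors)
    # zip(*vectors) groups the values of each dimension together.
    return [sum(dim_values) / n for dim_values in zip(*vectors)]


tokens = ["the", "cat", "sits", "on", "the", "mat"]
sentence_vector = average_embedding(tokens, embeddings)
print([round(x, 4) for x in sentence_vector])  # [0.2333, 0.1333]
```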

---

## 🔍 Final Sentence Embedding

The averaged Word2Vec vector for the sentence `"The cat sits on the mat"` is:

$$
\vec{v}_{\text{sentence}} \approx [0.2333,\ 0.1333]
$$

---

## Why Use Averaged Word2Vec?

| Trade-off            | Description                                                     |
|----------------------|-----------------------------------------------------------------|
| ✅ Simple            | A vector lookup and a mean; easy to implement and compute       |
| ✅ Fast              | No extra training needed when pretrained Word2Vec vectors are used |
| ✅ Captures meaning  | The mean aggregates the semantics of the words present          |
| ❌ Loses structure   | Ignores word order and syntax                                   |
| ❌ Equal weighting   | Every token counts the same, including stopwords such as "the"  |
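
The "loses structure" row can be illustrated with a small sketch: two sentences with the same words in a different order receive the same averaged vector. The toy vectors and the helper `average_embedding` are hypothetical, reused from the worked example.

```python
import math

# Toy vectors from the worked example (illustrative values only).
embeddings = {
    "the": [0.1, -0.3],
    "cat": [0.7, 0.4],
    "sits": [0.6, 0.5],
    "on": [0.0, 0.2],
    "mat": [-0.1, 0.3],
}


def average_embedding(tokens, embeddings):
    """Element-wise mean of the vectors of all in-vocabulary tokens."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    n = len(vectors)
    return [sum(dim_values) / n for dim_values in zip(*vectors)]


original = average_embedding("the cat sits on the mat".split(), embeddings)
swapped = average_embedding("the mat sits on the cat".split(), embeddings)

# Compare with a tolerance because floating-point sums depend on addition order.
same = all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(original, swapped))
print(same)  # True
```

The two sentences mean different things, yet the mean cannot tell them apart because addition is order-independent.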

---

## When to Use

- As a **baseline** for sentence/document representation
- For tasks like:
  - Text classification
  - Document clustering
  - Semantic similarity
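
For the semantic similarity use case, a common choice is cosine similarity between two averaged vectors. The sketch below assumes the toy vectors from the worked example; the second sentence `"the cat sits"` and the function names are hypothetical, chosen only for illustration.

```python
import math

# Toy vectors from the worked example (illustrative values only).
embeddings = {
    "the": [0.1, -0.3],
    "cat": [0.7, 0.4],
    "sits": [0.6, 0.5],
    "on": [0.0, 0.2],
    "mat": [-0.1, 0.3],
}


def average_embedding(tokens, embeddings):
    """Element-wise mean of the vectors of all in-vocabulary tokens."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    n = len(vectors)
    return [sum(dim_values) / n for dim_values in zip(*vectors)]


def cosine_similarity(u, v):
    """Cosine of the angle between u and v; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


first = average_embedding("the cat sits on the mat".split(), embeddings)
second = average_embedding("the cat sits".split(), embeddings)
similarity = cosine_similarity(first, second)
print(round(similarity, 3))  # 0.993
```

With only two dimensions and toy values the score is not meaningful in itself; the point is the shape of the computation.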

> 💡 Averaged Word2Vec is a lightweight method that captures **overall meaning** without needing complex architectures.
