# Simple Word2Vec Project
This notebook demonstrates how to load and query pre-trained word embeddings with the Gensim library. Despite the title, the vectors used here are GloVe rather than Word2Vec: we'll use the small `glove-wiki-gigaword-50` model so that loading and similarity lookups stay fast.

## 1. Installation and Setup

In [1]:
# Install required packages
!pip install gensim numpy



## 2. Importing Libraries

In [2]:
import gensim.downloader as api
import numpy as np

## 3. Loading a Small Pre-trained Model
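We'll use `glove-wiki-gigaword-50`, a set of 50-dimensional GloVe vectors trained on Wikipedia and Gigaword. It is much smaller and faster to load than the 300-dimensional Google News Word2Vec vectors (`word2vec-google-news-300`). The first call to `api.load` downloads the model and caches it locally, so later runs load it from disk.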

In [4]:
print("Loading pre-trained GloVe model...")
wv = api.load('glove-wiki-gigaword-50')
print("Model loaded successfully!")

Loading pre-trained GloVe model...
Model loaded successfully!


## 4. Getting Word Vectors

In [6]:
# Get vector representation of a word
word = 'king'
vec = wv[word]
print(f"Vector for '{word}': {vec[:10]}")  # Display only first 10 values

Vector for 'king': [ 0.50451   0.68607  -0.59517  -0.022801  0.60046  -0.13498  -0.08813
  0.47377  -0.61798  -0.31012 ]
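Indexing with a word that is not in the model's vocabulary raises a `KeyError`, so it can help to guard the lookup. The sketch below is one way to do that. `lookup_vector` is a hypothetical helper, not part of Gensim, and the toy dictionary with made-up numbers stands in for `wv` so the example runs without downloading the model. It relies only on `in` and `[]`, which `KeyedVectors` also supports.

```python
def lookup_vector(vectors, word, default=None):
    """Return the vector for `word`, or `default` if the word is unknown.

    `vectors` can be any object that supports `in` and `[]`,
    such as a dict or a Gensim KeyedVectors instance.
    """
    if word in vectors:
        return vectors[word]
    return default


# Toy stand-in for `wv`; the numbers are made up for illustration.
toy_vectors = {
    "king": [0.5, 0.7, -0.6],
    "queen": [0.4, 0.8, -0.5],
}

print(lookup_vector(toy_vectors, "king"))   # known word: prints its vector
print(lookup_vector(toy_vectors, "kingg"))  # unknown word: falls back to None
```

With the real model, the equivalent call would be `lookup_vector(wv, 'king')`.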


## 5. Finding Similar Words

In [7]:
# Find words similar to a given word
word = 'happy'
similar_words = wv.most_similar(word, topn=5)
print(f"Words similar to '{word}':")
for similar_word, score in similar_words:
    print(f"{similar_word}: {score:.4f}")

Words similar to 'happy':
'm: 0.9142
everyone: 0.8976
everybody: 0.8965
really: 0.8840
me: 0.8785
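The top hit above, `'m`, is a contraction fragment from tokenization rather than a standalone word. If such tokens are unwanted, one option is to filter the results after the query. The sketch below assumes a simple "letters only" rule. `keep_alphabetic` is a hypothetical helper, and `example_results` reproduces the output printed above so the code runs without the model.

```python
def keep_alphabetic(results):
    """Keep only (word, score) pairs whose word consists entirely of letters."""
    return [(word, score) for word, score in results if word.isalpha()]


# The (word, score) pairs printed above for 'happy'.
example_results = [
    ("'m", 0.9142),
    ("everyone", 0.8976),
    ("everybody", 0.8965),
    ("really", 0.8840),
    ("me", 0.8785),
]

for word, score in keep_alphabetic(example_results):
    print(f"{word}: {score:.4f}")
```

Note that `str.isalpha` also drops hyphenated words and numbers, so the rule may be too strict for some uses. Since filtering shortens the list, requesting a larger `topn` and trimming afterwards is a reasonable workaround.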


## 6. Measuring Word Similarity

In [8]:
# Compare similarity between two words
word1, word2 = 'apple', 'orange'
similarity = wv.similarity(word1, word2)
print(f"Similarity between '{word1}' and '{word2}': {similarity:.4f}")

Similarity between 'apple' and 'orange': 0.5388
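The score returned by `wv.similarity` is the cosine similarity of the two word vectors: their dot product divided by the product of their lengths. The sketch below spells out that computation in plain Python on small made-up vectors. `cosine_similarity` is an illustrative helper, not Gensim's own implementation.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Made-up 2-dimensional vectors for illustration.
print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))  # same direction: 1.0
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))  # orthogonal: 0.0
print(cosine_similarity([1.0, 2.0], [2.0, 4.0]))  # same direction, different length: close to 1.0
```

Because only the direction matters, scaling a vector does not change the score. With the real model, `cosine_similarity(wv['apple'], wv['orange'])` should agree with `wv.similarity('apple', 'orange')` up to floating-point rounding.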


## 7. Saving and Loading Vectors

In [9]:
# Save vectors for later use
output_file = "word_vectors.kv"
wv.save(output_file)
print(f"Vectors saved to {output_file}")

Vectors saved to word_vectors.kv
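To complete the round trip, the saved file can be read back with `KeyedVectors.load`. The sketch below wraps that call in a hypothetical helper, `load_vectors`, which checks that the file exists first. The `memory_map` parameter is a name introduced here for illustration; it maps to Gensim's `mmap='r'` option.

```python
from pathlib import Path


def load_vectors(path, memory_map=False):
    """Load vectors previously written with `wv.save(path)`.

    Raises FileNotFoundError early if `path` does not exist.
    """
    if not Path(path).is_file():
        raise FileNotFoundError(f"No saved vectors at {path!r}")

    # Imported here so the missing-file check above still runs
    # in an environment where gensim is not installed yet.
    from gensim.models import KeyedVectors

    # mmap='r' asks Gensim to memory-map the stored arrays read-only
    # instead of copying them into RAM.
    return KeyedVectors.load(path, mmap="r" if memory_map else None)
```

In the notebook, `wv_loaded = load_vectors("word_vectors.kv")` should give an object that answers the same queries as `wv`, for example `wv_loaded.similarity('apple', 'orange')`.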
