# Topic 05 - Problem 06: Counting Word Occurrences in Text

---

## 1. About the Problem

In many text classification tasks, **the frequency of certain words** in the text plays a key role in analysis.

For example:
- How often a word like "free" appears in customer reviews can hint at sentiment.
- How often specific terms occur in news articles can help with topic classification.

In this problem, I will count how many times the word **“premium”** appears in each product description.

---




## 2. Solution Code

```python
import pandas as pd

# Sample dataset
data = {
    "product_description": [
        "This is a premium quality product preMium",
        "Budget friendly option",
        "Premium design with advanced features",
        "Standard model"
    ]
}

df = pd.DataFrame(data)

# Counting occurrences of the word "premium" (case-insensitive)
df['premium_count'] = df['product_description'].str.lower().str.count('premium')

print(df)
```


```
                         product_description  premium_count
0  This is a premium quality product preMium              2
1                     Budget friendly option              0
2      Premium design with advanced features              1
3                             Standard model              0
```


---

## 3. Explanation (What is happening)

- **str.lower()**  
  → Ensures case-insensitive counting of the word

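- **str.count("premium")**  
  → Counts non-overlapping matches of the pattern `premium` in each text entry. The argument is treated as a regular expression and matches substrings, so a longer word such as "premiums" would also be counted.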

This will give:
- `2` for `"This is a premium quality product preMium"`  
- `1` for `"Premium design with advanced features"`  
- `0` for `"Standard model"`
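
Because `str.count` matches substrings, a stricter whole-word count may be useful. The following is a minimal sketch, not part of the original solution: it assumes a regex word boundary (`\b`) is an acceptable definition of a "word", and the row containing "premiums" is a hypothetical example added only to show the difference.

```python
import re
import pandas as pd

df = pd.DataFrame({
    "product_description": [
        "This is a premium quality product preMium",
        "Low premiums on every plan",  # hypothetical row: "premiums" contains "premium"
        "Standard model",
    ]
})

text = df["product_description"]

# Substring counting, as in the solution code: "premiums" also matches.
df["substring_count"] = text.str.lower().str.count("premium")

# Whole-word counting: \b marks a word boundary, and re.IGNORECASE
# takes the place of the explicit str.lower() step.
df["word_count"] = text.str.count(r"\bpremium\b", flags=re.IGNORECASE)

print(df[["substring_count", "word_count"]])
```

Here the second row gets a substring count of `1` but a whole-word count of `0`, while the first row gets `2` under both approaches.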

---

## 4. Summary / Takeaways

By solving this problem, I learned:

1. How to count word occurrences in text
2. The power of counting terms for feature engineering
3. How to transform qualitative data (text) into quantitative features
4. How these features can feed text-based models such as sentiment analysis

This problem demonstrates a fundamental **NLP preprocessing technique** and is great for GitHub.
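
As a rough illustration of the feature-engineering idea, the same count can be repeated for several terms to build one numeric column per keyword. This is only a sketch under assumptions: the `keywords` list is hypothetical and chosen to fit the sample data, and whole-word matching with `\b` is my choice here rather than something the solution code does.

```python
import re
import pandas as pd

df = pd.DataFrame({
    "product_description": [
        "This is a premium quality product preMium",
        "Budget friendly option",
        "Premium design with advanced features",
        "Standard model",
    ]
})

# Hypothetical keyword list, chosen only for illustration.
keywords = ["premium", "budget", "standard"]

for word in keywords:
    # re.escape guards against keywords that contain regex metacharacters.
    pattern = rf"\b{re.escape(word)}\b"
    df[f"{word}_count"] = df["product_description"].str.count(
        pattern, flags=re.IGNORECASE
    )

# One numeric column per keyword, ready to be used as model features.
features = df[[f"{w}_count" for w in keywords]]
print(features)
```

Each description is now represented by a small vector of counts, which is the qualitative-to-quantitative transformation described in the list above.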

---

Next, I’ll move toward:
- Tokenizing text
- Creating more complex features from words

