### Step 1: Setting Up MongoDB Connection
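First, connect to a local MongoDB instance, select the database and collection, and create a unique index on `product_id` so that duplicate products are rejected.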

In [1]:
from pymongo import MongoClient, errors

# Connect to the MongoDB server running locally
client = MongoClient('mongodb://localhost:27017/')

# Access the 'product_catalog_db' database
db = client['product_catalog_db']

# Access the 'products' collection
products_collection = db['products']

# Create a unique index on 'product_id'
products_collection.create_index("product_id", unique=True)

print("Unique index on 'product_id' created.")


Unique index on 'product_id' created.


### Step 2: CRUD Operations
#### 1. Add a New Product

In [2]:
# Define a function to add a new product
def add_product(product_id, name, description, price):
    try:
        product = {
            "product_id": product_id,
            "name": name,
            "description": description,
            "price": price
        }
        result = products_collection.insert_one(product)
        print(f"Product added with id: {result.inserted_id}")
    except errors.DuplicateKeyError:
        print(f"Error: Product with id '{product_id}' already exists!")

In [3]:
# Example usage
add_product("P001", "hp pavilion", "A powerful laptop.", 100000)

Product added with id: 66d49564decffaded0c2a9d4


In [6]:
# Example usage: 'P002' was already inserted in an earlier run,
# so the unique index rejects this insert
add_product("P002", "dell", "A good laptop.", 80000)

Error: Product with id 'P002' already exists!


#### 2. Retrieve Product Details by product_id

In [4]:
# Define a function to retrieve product details by product_id
def get_product_by_id(product_id):
    product = products_collection.find_one({"product_id": product_id})
    if product:
        print(f"Product found: {product}")
    else:
        print("Product not found!")

In [5]:
# Example usage
get_product_by_id("P001")

Product found: {'_id': ObjectId('66d49564decffaded0c2a9d4'), 'product_id': 'P001', 'name': 'hp pavilion', 'description': 'A powerful laptop.', 'price': 100000}


#### 3. Update Product Description

In [7]:
# Define a function to update product description
def update_product_description(product_id, new_description):
    result = products_collection.update_one(
        {"product_id": product_id},
        {"$set": {"description": new_description}}
    )
    if result.matched_count > 0:
        print("Product description updated successfully!")
    else:
        print("Product not found!")

In [8]:
# Example usage
update_product_description("P001", "An ultra-powerful laptop with the latest features.")


Product description updated successfully!


#### 4. Remove a Product by product_id

In [9]:
# Define a function to remove a product by product_id
def remove_product_by_id(product_id):
    result = products_collection.delete_one({"product_id": product_id})
    if result.deleted_count > 0:
        print("Product removed successfully!")
    else:
        print("Product not found!")

In [10]:
# Example usage
remove_product_by_id("P001")

Product removed successfully!


### Step 3: Justification for Choosing MongoDB


- **Flexible Schema**: MongoDB is a document-oriented database, which means it stores data in a flexible, JSON-like format (BSON). This flexibility allows for easy updates to the data model without requiring extensive migrations.
- **Scalability**: MongoDB is designed to scale horizontally by distributing data across multiple nodes in a cluster. It can handle large volumes of unstructured or semi-structured data, making it ideal for a product catalog where each product might have different attributes.
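- **Rich Query Language**: MongoDB queries are expressed as documents and support filtering on nested fields, range and array operators, sorting, and multi-stage aggregation pipelines, which covers the filtering and reporting a product catalog typically needs.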
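
To make the flexible-schema and query-language points concrete, here is a minimal sketch that needs no running server. The two documents, the `specs` and `colors` fields, the IDs `P100`/`P101`, and the helper `build_price_filter` are all hypothetical and are not part of the notebook above; only the `$gte`/`$lte` operators and dot notation for nested fields are standard MongoDB query syntax.

```python
# Two products with different attribute sets; both could live in the same collection.
laptop = {
    "product_id": "P100",  # hypothetical ID
    "name": "Example Laptop",
    "price": 100000,
    "specs": {"ram_gb": 16, "storage_gb": 512},  # nested sub-document
}
phone = {
    "product_id": "P101",  # hypothetical ID
    "name": "Example Phone",
    "price": 699.99,
    "colors": ["black", "blue"],  # array field the laptop does not have
}


def build_price_filter(min_price=None, max_price=None):
    """Return a MongoDB query filter dict for an optional price range."""
    price_conditions = {}
    if min_price is not None:
        price_conditions["$gte"] = min_price
    if max_price is not None:
        price_conditions["$lte"] = max_price
    return {"price": price_conditions} if price_conditions else {}


# Combine a range condition with a condition on a nested field (dot notation).
query = {**build_price_filter(min_price=500), "specs.ram_gb": {"$gte": 16}}
print(query)
```

With a live connection, a filter like this could be passed to `products_collection.find(query)`.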


### Step 4: Scaling and Distributing Data Across Multiple Nodes
- **Sharding**: MongoDB supports sharding, which distributes a collection's data across multiple servers (shards) according to a shard key. This is particularly useful for datasets that exceed the capacity of a single server. A background balancer migrates data between shards to keep the distribution even, and the `mongos` router directs each query to the shards that hold the relevant data.
- **Replica Sets**: MongoDB uses replica sets to ensure high availability. A replica set is a group of MongoDB servers that maintain the same data: one primary accepts writes, the secondaries replicate them, and if the primary fails the remaining members elect a new one.
- **Indexing**: To improve query performance, MongoDB allows you to create indexes on fields that are frequently queried. This is especially important for a product catalog, where you might need to filter or sort products by different attributes.
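
The idea behind distributing documents by a hashed key can be illustrated with a toy model in plain Python. This is only a conceptual sketch: MongoDB's hashed sharding assigns ranges of hashed values (chunks) to shards and rebalances them, rather than taking a modulo as done here. The function `shard_for` and the choice of three shards are assumptions made for illustration.

```python
import hashlib
from collections import Counter


def shard_for(product_id, num_shards):
    """Map a shard-key value to a shard index using a stable hash (toy model)."""
    digest = hashlib.md5(product_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


# Route 1000 hypothetical product IDs (P001 ... P1000) across 3 shards.
product_ids = [f"P{n:03d}" for n in range(1, 1001)]
distribution = Counter(shard_for(pid, 3) for pid in product_ids)

# Each shard should receive a roughly similar share of the documents.
print(dict(distribution))
```

Because the hash is deterministic, the same `product_id` always maps to the same shard, which is what lets a router send a lookup by `product_id` to a single shard instead of querying all of them.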


### Step 5: Testing the Implementation

In [13]:
# Define test cases to validate the CRUD operations
def test_crud_operations():
    # Add a new product
    add_product("P003", "Samsung", "A high-end smartphone.", 699.99)
    
    # Retrieve the product
    get_product_by_id("P003")
    
    # Update the product description
    update_product_description("P003", "A high-end smartphone with advanced features.")
    
    # Retrieve the updated product
    get_product_by_id("P003")
    
    # Remove the product
    remove_product_by_id("P003")
    
    # Try to retrieve the removed product
    get_product_by_id("P003")

# Run the test cases
test_crud_operations()

Product added with id: 66d49955decffaded0c2a9d8
Product found: {'_id': ObjectId('66d49955decffaded0c2a9d8'), 'product_id': 'P003', 'name': 'Samsung', 'description': 'A high-end smartphone.', 'price': 699.99}
Product description updated successfully!
Product found: {'_id': ObjectId('66d49955decffaded0c2a9d8'), 'product_id': 'P003', 'name': 'Samsung', 'description': 'A high-end smartphone with advanced features.', 'price': 699.99}
Product removed successfully!
Product not found!
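
The run above is verified by reading the printed output. Since the CRUD functions print rather than return values, one way to turn those checks into assertions is to capture standard output. The sketch below assumes a hypothetical helper `captured_output` and uses a stand-in function `fake_lookup` in place of `get_product_by_id`, so it runs without a database.

```python
import io
from contextlib import redirect_stdout


def captured_output(func, *args):
    """Call func(*args) and return everything it printed, stripped of whitespace."""
    buffer = io.StringIO()
    with redirect_stdout(buffer):
        func(*args)
    return buffer.getvalue().strip()


def fake_lookup(product_id):
    """Hypothetical stand-in for get_product_by_id when the product is missing."""
    print("Product not found!")


# The printed message becomes a value that can be asserted on.
message = captured_output(fake_lookup, "P003")
assert message == "Product not found!"
```

Against a live database, the same helper could wrap the notebook's functions, for example `captured_output(get_product_by_id, "P003")` after the product has been removed.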


### Conclusion
This notebook demonstrates the implementation of CRUD operations for a product catalog using MongoDB. The operations include adding, retrieving, updating, and removing products based on a unique product ID. Additionally, it discusses the justification for choosing MongoDB as the document database for this use case, as well as strategies for scaling and distributing data across multiple nodes.