
# MongoDB Assignment

[Assignment Link](https://drive.google.com/file/d/1LzTUwURsuTLcKHK1Y58vji2Uk-1klRdI/view)


# Q1. What is MongoDB? Explain non-relational databases in short. In which scenarios is it preferred to use MongoDB over SQL databases?

**MongoDB:**
MongoDB is a popular open-source NoSQL (non-relational) database management system that is designed to handle unstructured or semi-structured data. It falls under the category of document-oriented databases, where data is stored in flexible, JSON-like documents instead of traditional rows and columns. MongoDB is known for its scalability, flexibility, and the ability to handle large volumes of data.

**Non-Relational Databases:**
Non-relational databases, also known as NoSQL databases, are a type of database management system that provides a mechanism for storage and retrieval of data that is modeled in ways other than the traditional relational databases. Unlike relational databases, non-relational databases do not require a fixed schema, and they can handle a variety of data types and structures, making them well-suited for handling unstructured or semi-structured data.

**Scenarios to Prefer MongoDB over SQL Databases:**
MongoDB is often preferred over traditional SQL databases in specific scenarios:

1. **Flexible Schema:**
   - MongoDB's document-oriented structure allows for a flexible schema. Each document in a collection can have a different structure, which is particularly useful when dealing with evolving or dynamic data models.

2. **Unstructured or Semi-Structured Data:**
   - MongoDB excels at handling unstructured or semi-structured data, such as JSON-like documents. It is well-suited for scenarios where the data does not fit neatly into tables with fixed columns.

3. **Horizontal Scalability:**
   - MongoDB is designed to scale horizontally, allowing you to distribute data across multiple servers or nodes. This makes it suitable for applications that require high scalability and can benefit from distributed databases.

4. **Agile Development and Rapid Prototyping:**
   - MongoDB's flexible schema and dynamic nature make it conducive to agile development practices and rapid prototyping. Developers can quickly adapt to changing requirements without the need for extensive schema modifications.

5. **Complex Data Structures:**
   - MongoDB supports complex data structures, including nested arrays and documents. This is beneficial for scenarios where data relationships are hierarchical or nested.

6. **Real-time Applications:**
   - MongoDB is often used in real-time applications, such as IoT (Internet of Things) platforms and certain types of analytics, where quick and flexible access to data is crucial.

7. **Geospatial Data:**
   - MongoDB has built-in support for geospatial indexing and queries, making it well-suited for applications that involve location-based data.

It's important to note that the choice between MongoDB and SQL databases depends on the specific requirements of the application. While MongoDB is a good fit for certain use cases, traditional SQL databases may be more suitable for applications with complex relationships, transactions, and well-defined schemas.
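
The flexible-schema, nested-structure, and geospatial points above can be illustrated with plain Python dictionaries, which is how PyMongo represents documents. This is a minimal sketch that needs no server; the field names (`address`, `orders`, `location`) and the idea of a `users` collection are assumptions made up for the illustration.

```python
import json

# Two documents destined for the same (hypothetical) "users" collection.
# They deliberately have different fields: MongoDB does not require
# documents in one collection to share a structure.
user_basic = {"name": "Alice", "age": 28}

user_detailed = {
    "name": "Bob",
    "age": 35,
    # Nested document
    "address": {"city": "Another City", "zip": "12345"},
    # Array of nested documents
    "orders": [
        {"item": "book", "qty": 2},
        {"item": "pen", "qty": 10},
    ],
    # GeoJSON point, written as [longitude, latitude]
    "location": {"type": "Point", "coordinates": [77.2, 28.6]},
}

for doc in (user_basic, user_detailed):
    print(json.dumps(doc, indent=2))

# Fields present in the detailed document but not in the basic one
extra_fields = sorted(set(user_detailed) - set(user_basic))
print("Extra fields:", extra_fields)
```

In a relational design the nested `address` and `orders` data would typically live in separate tables joined by keys; here they travel inside the document itself.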

# Q2. State and Explain the features of MongoDB.

MongoDB is a NoSQL, document-oriented database management system that provides a flexible and scalable approach to handling data. Here are some key features of MongoDB:

1. **Document-Oriented:**
   - MongoDB stores data as BSON (Binary JSON) documents, a binary-encoded, JSON-like format. Each document can have a different structure, allowing for a dynamic, flexible data model.

2. **Dynamic Schema:**
   - MongoDB does not enforce a fixed schema for documents in a collection. This flexibility is advantageous when dealing with evolving data models, as new fields can be added to documents without requiring a predefined schema.

3. **Scalability:**
   - MongoDB is designed to scale horizontally, allowing for the distribution of data across multiple servers or nodes. This horizontal scaling is achieved through sharding, enabling MongoDB to handle large volumes of data and high traffic.

4. **High Performance:**
   - MongoDB's architecture and indexing capabilities contribute to high performance in read and write operations. The storage engine caches frequently accessed data in memory, and indexes can be created on any field, allowing for efficient query execution.

5. **Rich Query Language:**
   - MongoDB provides a powerful and expressive query language that supports a wide range of queries, including field, range, and regular expression queries. The query language allows for complex data retrieval and manipulation.

6. **Indexing:**
   - MongoDB supports various types of indexes, including single-field, compound, and geospatial indexes. Indexing enhances query performance by allowing the database to quickly locate and retrieve specific documents.

7. **Aggregation Framework:**
   - MongoDB's aggregation framework provides a powerful set of tools for data transformation and analysis. It supports operations such as filtering, grouping, sorting, and projecting, allowing for complex data manipulations within the database.

8. **Geospatial Capabilities:**
   - MongoDB includes built-in support for geospatial indexing and queries. This feature is useful for applications that involve location-based data, such as mapping and location-based services.

9. **Automatic Sharding:**
   - MongoDB can automatically partition data across multiple nodes through sharding, enabling horizontal scaling and improved performance. Sharding helps distribute the load and allows for the storage of large datasets.

10. **Agile Development:**
    - MongoDB's dynamic schema and flexible data model make it well-suited for agile development practices. Developers can quickly adapt to changing requirements without the need for extensive schema modifications.

11. **Document Validation:**
    - MongoDB allows the definition of document validation rules to ensure that data adheres to specific criteria. This helps maintain data integrity and consistency within the database.

12. **Transactions (as of MongoDB 4.0):**
    - MongoDB introduced multi-document transactions in version 4.0 (for replica sets; support for sharded clusters followed in version 4.2), allowing multiple operations on multiple documents to be executed as a single atomic unit. This feature ensures data consistency in complex transactional scenarios.

MongoDB's features make it a popular choice for a variety of applications, including those with dynamic and evolving data requirements, as well as those that demand high scalability and performance.
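
Two of the features above, the aggregation framework and document validation, are expressed in PyMongo as ordinary Python data structures. The sketch below only builds those structures, so it runs without a server; the helper name `build_city_age_pipeline`, the `people` collection, and the field names are assumptions chosen for illustration.

```python
def build_city_age_pipeline(min_age):
    """Return an aggregation pipeline: filter, group by city, sort by count."""
    return [
        # Stage 1: keep only documents with age >= min_age
        {"$match": {"age": {"$gte": min_age}}},
        # Stage 2: one output row per city, with a count and an average age
        {"$group": {
            "_id": "$city",
            "people": {"$sum": 1},
            "average_age": {"$avg": "$age"},
        }},
        # Stage 3: cities with the most people first
        {"$sort": {"people": -1}},
    ]


# A $jsonSchema validator requiring a string name and a non-negative integer age
person_validator = {
    "$jsonSchema": {
        "bsonType": "object",
        "required": ["name", "age"],
        "properties": {
            "name": {"bsonType": "string"},
            "age": {"bsonType": "int", "minimum": 0},
        },
    }
}

pipeline = build_city_age_pipeline(30)
print(pipeline)

# With a live connection these would be used roughly as follows:
#   db.create_collection("people", validator=person_validator)
#   for row in db["people"].aggregate(pipeline):
#       print(row)
```

Keeping the pipeline in a small function makes it easy to inspect or reuse before it is sent to the server.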

# Q3. Write a code to connect MongoDB to Python. Also, create a database and a collection in MongoDB.


To connect MongoDB to Python, you can use the `pymongo` library, which is the official MongoDB driver for Python. If you haven't installed it yet, you can do so with:

```bash
pip install pymongo
```

Here's a simple example code to connect to MongoDB, create a database, and a collection:

```python
import pymongo

# Replace these with your MongoDB server details
host = "your_host"
port = 27017  # default MongoDB port
username = "your_username"
password = "your_password"

# Connect to MongoDB
client = pymongo.MongoClient(host=host, port=port, username=username, password=password)

# Create or access a database
database_name = "your_database"
db = client[database_name]

# Create or access a collection
collection_name = "your_collection"
collection = db[collection_name]

# Insert a document into the collection
document = {"name": "John Doe", "age": 30, "city": "Example City"}
result = collection.insert_one(document)

# Print the inserted document's ID
print("Inserted document ID:", result.inserted_id)

# Close the connection
client.close()
```

Explanation of the code:

1. **Connect to MongoDB:**
   - Use `pymongo.MongoClient` to connect to your MongoDB server. Replace `"your_host"`, `"your_username"`, and `"your_password"` with your MongoDB server details.

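2. **Create or Access a Database:**
   - Use `client[database_name]` to create or access a database. Replace `"your_database"` with the desired database name. MongoDB creates the database lazily: it does not exist on the server until data is first written to it.

3. **Create or Access a Collection:**
   - Use `db[collection_name]` to create or access a collection within the database. Replace `"your_collection"` with the desired collection name. Like the database, the collection is only created when the first document is inserted.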

4. **Insert a Document:**
   - Use `collection.insert_one(document)` to insert a document into the collection. In this example, a document with "name," "age," and "city" fields is inserted.

5. **Print the Inserted Document's ID:**
   - Print the ID of the inserted document. When the document does not supply an `_id`, one is generated automatically as an `ObjectId`.

6. **Close the Connection:**
   - Close the connection to MongoDB when you're done.

Remember to replace the placeholder values with your actual MongoDB server details, database name, and collection name.

Note: Make sure your MongoDB server is running and accessible from your Python environment. Adjust the connection details based on your MongoDB server configuration.
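
As an alternative to passing `host`, `username`, and `password` separately, `MongoClient` also accepts a single connection URI. When credentials contain characters such as `@`, `:`, or `/`, they need to be percent-escaped first. The sketch below shows one way to do that with the standard library; the helper name `build_mongo_uri` and the sample credentials are made up for illustration.

```python
from urllib.parse import quote_plus


def build_mongo_uri(username, password, host, port=27017):
    """Build a mongodb:// URI, percent-escaping the credentials."""
    return "mongodb://{}:{}@{}:{}/".format(
        quote_plus(username), quote_plus(password), host, port
    )


# Sample credentials (placeholders) containing characters that must be escaped
uri = build_mongo_uri("app_user", "p@ss:word/1", "localhost")
print(uri)

# The URI can then be passed to the client:
#   client = pymongo.MongoClient(uri)
```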

# Q4. Using the database and the collection created in question number 3, write a code to insert one record, and insert many records. Use the find() and find_one() methods to print the inserted record.

The example below extends the previous one to insert one record, insert many records, and use the `find()` and `find_one()` methods to retrieve and print the inserted records:

```python
import pymongo

# Replace these with your MongoDB server details
host = "your_host"
port = 27017  # default MongoDB port
username = "your_username"
password = "your_password"

# Connect to MongoDB
client = pymongo.MongoClient(host=host, port=port, username=username, password=password)

# Create or access a database
database_name = "your_database"
db = client[database_name]

# Create or access a collection
collection_name = "your_collection"
collection = db[collection_name]

# Insert one record
record_one = {"name": "Jane Doe", "age": 25, "city": "Example City"}
result_one = collection.insert_one(record_one)

# Insert many records
records_many = [
    {"name": "Alice", "age": 28, "city": "Example City"},
    {"name": "Bob", "age": 35, "city": "Another City"},
    {"name": "Charlie", "age": 40, "city": "Yet Another City"}
]
result_many = collection.insert_many(records_many)

# Print the IDs of the inserted records
print("Inserted record ID (one):", result_one.inserted_id)
print("Inserted records IDs (many):", result_many.inserted_ids)

# Find and print one record
found_one = collection.find_one({"name": "Jane Doe"})
print("Found one record:", found_one)

# Find and print all records
found_all = collection.find()
print("\nFound all records:")
for record in found_all:
    print(record)

# Close the connection
client.close()
```

Explanation of the extended code:

1. **Insert One Record:**
   - Use `collection.insert_one(record_one)` to insert a single record.

2. **Insert Many Records:**
   - Use `collection.insert_many(records_many)` to insert multiple records.

3. **Print Inserted Record IDs:**
   - Print the IDs of the inserted records using `result_one.inserted_id` and `result_many.inserted_ids`.

4. **Find and Print One Record:**
   - Use `collection.find_one({"name": "Jane Doe"})` to find and print a single record.

5. **Find and Print All Records:**
   - Use `collection.find()` to find all records and print each record in a loop.

Remember to replace placeholder values and adjust the connection details based on your MongoDB server configuration.
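
Two details of the code above are easy to miss: the `inserted_id` returned by `insert_one()` can be used to read the same document back, and `find_one()` returns `None` when nothing matches. The helpers below are a hedged sketch of both points; `insert_and_fetch` and `describe_person` are hypothetical names, and each expects a PyMongo collection object such as the `collection` variable used earlier.

```python
def insert_and_fetch(collection, document):
    """Insert one document, then read it back by its generated _id."""
    result = collection.insert_one(document)
    return collection.find_one({"_id": result.inserted_id})


def describe_person(collection, name):
    """Return a short description, handling the no-match case."""
    doc = collection.find_one({"name": name})
    if doc is None:  # find_one() returns None when nothing matches
        return f"No record found for {name}"
    return f"{doc['name']} is {doc['age']} years old"


# Usage with the collection from the example above (requires a running server):
#   print(insert_and_fetch(collection, {"name": "Dana", "age": 31}))
#   print(describe_person(collection, "Dana"))
#   print(describe_person(collection, "Nobody"))
```

Checking for `None` avoids a `TypeError` when the code later indexes into a result that does not exist.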

# Q5. Explain how you can use the find() method to query the MongoDB database. Write a simple code to demonstrate this.

The `find()` method in MongoDB is used to query a collection and retrieve the documents that match the specified criteria. It returns a cursor, an iterable object that can be looped over to access the matching documents.

Here's a simple code example demonstrating the use of the `find()` method:

```python
import pymongo

# Replace these with your MongoDB server details
host = "your_host"
port = 27017  # default MongoDB port
username = "your_username"
password = "your_password"

# Connect to MongoDB
client = pymongo.MongoClient(host=host, port=port, username=username, password=password)

# Create or access a database
database_name = "your_database"
db = client[database_name]

# Create or access a collection
collection_name = "your_collection"
collection = db[collection_name]

# Insert some sample data (note: re-running this script inserts the same documents again)
sample_data = [
    {"name": "Alice", "age": 28, "city": "Example City"},
    {"name": "Bob", "age": 35, "city": "Another City"},
    {"name": "Charlie", "age": 40, "city": "Yet Another City"}
]
collection.insert_many(sample_data)

# Use find() to query the database
query = {"age": {"$gte": 30}}  # Find documents where age is greater than or equal to 30
result = collection.find(query)

# Print the matching documents
print("Matching Documents:")
for document in result:
    print(document)

# Close the connection
client.close()
```

Explanation of the code:

1. **Connect to MongoDB:**
   - Establish a connection to the MongoDB server.

2. **Access Database and Collection:**
   - Create or access a specific database and collection within MongoDB.

3. **Insert Sample Data:**
   - Insert some sample data into the collection. Note that `insert_many()` runs unconditionally, so re-running the script adds duplicate documents.

4. **Use find() to Query the Database:**
   - Use the `find()` method with a query to retrieve documents that match specific criteria. In this example, the query looks for documents where the "age" field is greater than or equal to 30.

5. **Print Matching Documents:**
   - Loop through the cursor returned by `find()` and print the matching documents.

Remember to replace placeholder values and adjust the connection details based on your MongoDB server configuration. The query shown here is deliberately simple; MongoDB queries can be much more complex, supporting a wide range of conditions and operators.
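
To give a flavour of a slightly richer query, the sketch below combines two conditions, adds a projection so that only selected fields are returned, and caps the number of results. The helper names `build_filter` and `find_names_by_age_and_city` are assumptions for illustration; the second expects a PyMongo collection such as the one created earlier.

```python
def build_filter(min_age, cities):
    """Build a filter: age >= min_age AND city is one of `cities`."""
    return {
        "age": {"$gte": min_age},        # comparison operator
        "city": {"$in": list(cities)},   # matches any value in the list
    }


def find_names_by_age_and_city(collection, min_age, cities, limit=10):
    """Return up to `limit` matching documents with only name and age."""
    query = build_filter(min_age, cities)
    # Projection: 1 includes a field, 0 excludes it (_id is included by default)
    projection = {"_id": 0, "name": 1, "age": 1}
    cursor = collection.find(query, projection).limit(limit)
    return list(cursor)


print(build_filter(30, ["Another City", "Yet Another City"]))

# Usage with a live collection (requires a running server):
#   for row in find_names_by_age_and_city(collection, 30, ["Another City"]):
#       print(row)
```

Conditions placed side by side in one filter dictionary are combined with a logical AND.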

# Q6. Explain the sort() method. Give an example to demonstrate sorting in MongoDB.

The `sort()` method in MongoDB is used to sort the results of a query. It allows you to specify one or more fields by which the documents should be sorted and the order (ascending or descending). The `sort()` method is commonly applied to a `find()` query to retrieve documents in a specific order.

Here's an example to demonstrate sorting in MongoDB using the `sort()` method:

```python
import pymongo

# Replace these with your MongoDB server details
host = "your_host"
port = 27017  # default MongoDB port
username = "your_username"
password = "your_password"

# Connect to MongoDB
client = pymongo.MongoClient(host=host, port=port, username=username, password=password)

# Create or access a database
database_name = "your_database"
db = client[database_name]

# Create or access a collection
collection_name = "your_collection"
collection = db[collection_name]

# Insert some sample data (note: re-running this script inserts the same documents again)
sample_data = [
    {"name": "Alice", "age": 28, "city": "Example City"},
    {"name": "Bob", "age": 35, "city": "Another City"},
    {"name": "Charlie", "age": 40, "city": "Yet Another City"}
]
collection.insert_many(sample_data)

# Use find() with sort() to query and sort the database
query = {"age": {"$gte": 30}}  # Find documents where age is greater than or equal to 30
sort_order = [("age", pymongo.ASCENDING)]  # Sort in ascending order by the "age" field
result = collection.find(query).sort(sort_order)

# Print the sorted documents
print("Sorted Documents:")
for document in result:
    print(document)

# Close the connection
client.close()
```

Explanation of the code:

1. **Connect to MongoDB:**
   - Establish a connection to the MongoDB server.

2. **Access Database and Collection:**
   - Create or access a specific database and collection within MongoDB.

3. **Insert Sample Data:**
   - Insert some sample data into the collection. Note that `insert_many()` runs unconditionally, so re-running the script adds duplicate documents.

4. **Use find() with sort() to Query and Sort:**
   - Use the `find()` method with a query to retrieve documents that match specific criteria (`age >= 30` in this example).
   - Apply the `sort()` method to specify the sorting order. In this example, it sorts in ascending order based on the "age" field.

5. **Print Sorted Documents:**
   - Loop through the cursor returned by `find()` and print the sorted documents.

In PyMongo, the `sort()` method accepts either a single field name with a direction, or a list of `(field, direction)` tuples for sorting on several fields, where the direction is `pymongo.ASCENDING` or `pymongo.DESCENDING`. Documents are ordered by the first tuple, with later tuples breaking ties.
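
Sorting on more than one field, in descending order, and limiting the output can be sketched as follows. The helper name `oldest_people` is hypothetical, and it expects a PyMongo collection; the constants `1` and `-1` are the values of `pymongo.ASCENDING` and `pymongo.DESCENDING`, written out here so the snippet has no import of its own.

```python
ASCENDING, DESCENDING = 1, -1  # same values as pymongo.ASCENDING / pymongo.DESCENDING


def oldest_people(collection, count=2):
    """Return the `count` oldest people; ties on age are ordered by name."""
    sort_order = [("age", DESCENDING), ("name", ASCENDING)]
    cursor = collection.find({}).sort(sort_order).limit(count)
    return list(cursor)


# Usage with a live collection (requires a running server):
#   for person in oldest_people(collection, count=2):
#       print(person)
```

Combining `sort()` with `limit()` is a common way to fetch "top N" results without reading the whole collection into Python.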

# Q7. Explain why delete_one(), delete_many(), and drop() are used.

In MongoDB, the `delete_one()`, `delete_many()`, and `drop()` methods are used for different purposes related to the removal of documents or collections:

1. **`delete_one()` Method:**
   - **Purpose:** The `delete_one()` method is used to delete a single document that matches the specified filter.
   - **Syntax:**
     ```python
     collection.delete_one(filter)
     ```
   - **Example:**
     ```python
     result = collection.delete_one({"name": "Alice"})
     ```
   - This example deletes the first document that matches the filter criteria, where the "name" field is equal to "Alice."

2. **`delete_many()` Method:**
   - **Purpose:** The `delete_many()` method is used to delete all documents that match the specified filter.
   - **Syntax:**
     ```python
     collection.delete_many(filter)
     ```
   - **Example:**
     ```python
     result = collection.delete_many({"age": {"$gte": 30}})
     ```
   - This example deletes all documents where the "age" field is greater than or equal to 30.

3. **`drop()` Method:**
   - **Purpose:** The `drop()` method is used to remove an entire collection from the database.
   - **Syntax:**
     ```python
     collection.drop()
     ```
   - **Example:**
     ```python
     collection.drop()
     ```
   - This example removes the entire collection, including all its documents and indexes. Be cautious when using `drop()` as it is irreversible.

**Use Cases:**
- **`delete_one()` and `delete_many()`:** These methods are useful when you want to selectively remove specific documents from a collection based on certain criteria. For example, removing a single user's record or deleting all documents that meet a certain condition.
  
- **`drop()`:** This method is typically used when you want to completely remove a collection, perhaps because it is no longer needed or you want to start fresh with an empty collection. It is a more drastic action compared to `delete_one()` and `delete_many()` as it removes the entire collection.

**Important Note:**
   - While these methods provide powerful tools for managing data in MongoDB, it's crucial to exercise caution, especially when using `delete_many()` and `drop()`, as they can result in the permanent loss of data. Always double-check your filter criteria to avoid unintentional data loss.
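
One way to act on the advice about double-checking filters is to count the matching documents before deleting them. The helper below is a minimal sketch of that idea, not a standard PyMongo feature: `delete_with_preview` and its `confirm` flag are hypothetical, while `count_documents()`, `delete_many()`, and `deleted_count` are the PyMongo calls it relies on.

```python
def delete_with_preview(collection, filter_doc, confirm=False):
    """Report how many documents match; delete them only if confirm=True."""
    if not filter_doc:
        # An empty filter matches every document, so refuse it here
        raise ValueError("Refusing to delete with an empty filter")

    matched = collection.count_documents(filter_doc)
    if not confirm:
        # Dry run: nothing is removed
        return {"matched": matched, "deleted": 0}

    result = collection.delete_many(filter_doc)
    return {"matched": matched, "deleted": result.deleted_count}


# Usage with a live collection (requires a running server):
#   print(delete_with_preview(collection, {"age": {"$gte": 30}}))                # preview only
#   print(delete_with_preview(collection, {"age": {"$gte": 30}}, confirm=True))  # actually delete
```

The count and the delete are two separate operations, so on a busy collection the numbers can differ slightly if documents change in between.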