Q1. What is MongoDB? Explain non-relational databases in short. In which scenarios is it preferable to use
MongoDB over SQL databases?

**MongoDB** is a popular document-oriented NoSQL database designed for high performance, scalability, and flexibility. Its Community Server is source-available under the Server Side Public License (SSPL). MongoDB stores data as flexible, JSON-like documents in a binary format called BSON (Binary JSON), which allows for nested data structures and dynamic schemas. It is designed to handle large volumes of data and supports high availability through replication and horizontal scaling through sharding.
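
To make the document model concrete, here is a minimal sketch of what a single customer document might look like, written as a plain Python dictionary. The field names and values (`city`, `orders`, `items`, `sku`, `qty`) are hypothetical, and the standard-library `json` module stands in for BSON only to show the nesting; BSON itself is a binary format with additional types such as dates and ObjectIds.

```python
import json

# A hypothetical customer document: nested objects and arrays live in one record,
# where a relational design would typically split them across several tables.
customer = {
    "name": "Amy",
    "address": {"street": "Apple st 652", "city": "Springfield"},
    "orders": [
        {"order_id": 1, "items": [{"sku": "A-100", "qty": 2}]},
        {"order_id": 2, "items": [{"sku": "B-200", "qty": 1}, {"sku": "C-300", "qty": 4}]},
    ],
}

# Nested fields are reached by walking the structure; no join is involved.
total_items = sum(item["qty"] for order in customer["orders"] for item in order["items"])
print(total_items)

# Round-trip through JSON text to show that the document is self-contained.
restored = json.loads(json.dumps(customer))
print(restored["address"]["street"])
```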

### Non-Relational Databases:

Non-relational databases, also known as NoSQL databases, are database management systems that do not follow the traditional tabular structure of relational databases. Instead, they use flexible data models that can accommodate diverse data types, including structured, semi-structured, and unstructured data. NoSQL databases offer advantages such as horizontal scalability, flexible schemas, and high performance, making them suitable for handling large volumes of data and accommodating dynamic and evolving data requirements.

### Scenarios where MongoDB is preferred over SQL databases:

1. **Schema Flexibility**: MongoDB's schema-less design allows for flexible and dynamic schemas, making it well-suited for scenarios where data structures are subject to frequent changes or where the schema is not fully known in advance.

2. **Unstructured Data**: MongoDB excels at storing unstructured or semi-structured data, such as documents, JSON data, logs, and sensor data. It provides efficient storage and retrieval of such data without the need for complex joins or normalization.

3. **Scalability**: MongoDB is designed for horizontal scalability, allowing it to distribute data across multiple servers and handle large-scale deployments with ease. It supports automatic sharding, which enables seamless scaling as data volumes grow.

4. **High Performance**: MongoDB can deliver fast reads and writes, particularly when related data is stored together in one document so that a query does not need joins. Its indexing support and efficient storage engine contribute to this, although actual performance depends on the schema design and the workload.

5. **Agile Development**: MongoDB fits iterative development practices such as Agile and DevOps. Because collections do not enforce a rigid schema, teams can prototype quickly and evolve the data model with fewer schema migrations.

6. **Use Cases Requiring Geospatial Indexing**: MongoDB provides built-in support for geospatial indexing and queries, making it suitable for location-based applications, mapping, and geospatial analytics.

Q2. State and Explain the features of MongoDB.

MongoDB is a widely used NoSQL database that offers various features to meet the needs of modern applications. Here are some of the key features of MongoDB:

1. **Flexible Schema Design**:
   - MongoDB employs a flexible schema design, allowing documents within a collection to have varying structures. This schema-less approach enables easy handling of polymorphic data and facilitates agile development.

2. **Document-Oriented Storage**:
   - MongoDB stores data in a document-oriented format called BSON (Binary JSON), which is a binary representation of JSON documents. Each document can contain nested data structures and arrays, providing a natural way to represent complex relationships.

3. **High Availability**:
   - MongoDB supports replica sets, which are self-healing clusters of MongoDB nodes. Replica sets provide automatic failover and data redundancy, ensuring high availability and fault tolerance in the event of node failures.

4. **Horizontal Scalability**:
   - MongoDB's architecture allows for horizontal scaling across multiple servers through sharding. Sharding distributes data across shards (partitions) based on a shard key, which spreads both storage and workload across servers. How well this scales depends heavily on choosing a shard key that distributes reads and writes evenly.

5. **Indexing**:
   - MongoDB supports various types of indexes, including single-field, compound, multi-key, text, and geospatial indexes. Indexes improve query performance by allowing fast retrieval of data based on indexed fields.

6. **Aggregation Framework**:
   - MongoDB provides a powerful aggregation framework for performing data aggregation operations such as grouping, filtering, sorting, and computing aggregate functions (e.g., sum, average, count) on documents within a collection.

7. **Ad Hoc Queries**:
   - MongoDB supports ad hoc queries, allowing developers to perform complex queries on documents using a rich, JSON-like query language. Queries are themselves expressed as documents and can include conditions, projections, sorting, and various operators for querying nested data.

8. **Geospatial Indexing and Queries**:
   - MongoDB offers built-in support for geospatial indexing and queries, enabling storage and retrieval of location-based data. Geospatial queries allow for proximity searches, polygon queries, and other spatial operations.

9. **Text Search**:
   - MongoDB includes full-text search capabilities through text indexes, enabling efficient searching of text data within documents. Text indexes support language-specific stemming, stop words, and case-insensitive search.

10. **Security**:
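    - MongoDB provides security features such as authentication, authorization through role-based access control (RBAC), encryption in transit (TLS), encryption at rest, and auditing. Encryption at rest and auditing are features of MongoDB Enterprise and MongoDB Atlas rather than the Community edition. These features help protect sensitive data and support compliance with security standards.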

11. **Transactions**:
    - Starting with MongoDB 4.0, multi-document ACID transactions are supported on replica sets, and MongoDB 4.2 extended them to sharded clusters. Transactions allow developers to perform atomic operations across multiple documents, ensuring data consistency and integrity in complex operations involving multiple writes.

12. **Pluggable Storage Engines**:
    - MongoDB's storage architecture supports pluggable storage engines, such as WiredTiger (the default) and the In-Memory engine (MongoDB Enterprise), to accommodate diverse workload requirements. Each storage engine offers different trade-offs in terms of performance, durability, and compression.

These features make MongoDB a versatile and powerful NoSQL database solution suitable for a wide range of use cases, including web applications, content management systems, IoT (Internet of Things), real-time analytics, and more.
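
As a rough illustration of the aggregation framework described above, the sketch below builds a pipeline that counts customers per city. The `city` field and the helper names `build_city_count_pipeline` and `customers_per_city` are assumptions made for this example; `collection` is expected to be a PyMongo collection, although any object with an `aggregate()` method would work.

```python
def build_city_count_pipeline(min_customers=1):
    """Return an aggregation pipeline as a list of stage documents."""
    return [
        # Keep only documents that actually have a city (hypothetical field)
        {"$match": {"city": {"$exists": True}}},
        # Group by city and count the documents in each group
        {"$group": {"_id": "$city", "customers": {"$sum": 1}}},
        # Drop groups below the threshold
        {"$match": {"customers": {"$gte": min_customers}}},
        # Largest groups first
        {"$sort": {"customers": -1}},
    ]


def customers_per_city(collection, min_customers=1):
    """Run the pipeline and return the results as a list of dicts."""
    return list(collection.aggregate(build_city_count_pipeline(min_customers)))
```

Keeping the pipeline in its own function makes each stage easy to inspect or reuse before it is sent to the server.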

Q3. Write a code to connect MongoDB to Python. Also, create a database and a collection in MongoDB.

To connect MongoDB to Python, you can use the `pymongo` library, which provides a Python interface for interacting with MongoDB. Below is a Python code example demonstrating how to connect to MongoDB, create a database, and a collection within that database:

import pymongo
from pymongo.errors import ConnectionFailure

client = None
try:
    # MongoClient connects lazily, so use a short timeout and ping the server
    # to confirm that it is actually reachable
    client = pymongo.MongoClient("mongodb://localhost:27017/", serverSelectionTimeoutMS=5000)
    client.admin.command("ping")
    print("Connected to MongoDB")

    # Getting a handle to the database (MongoDB creates it on the first write)
    mydb = client["mydatabase"]
    print("Using database 'mydatabase'")

    # Getting a handle to a collection (MongoDB creates it on the first insert)
    mycol = mydb["customers"]
    print("Using collection 'customers'")

except ConnectionFailure as e:
    print("Error connecting to MongoDB:", e)

finally:
    # Closing the MongoDB connection
    if client is not None:
        client.close()
        print("MongoDB connection is closed")

Explanation:

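1. **Connecting to MongoDB**: We use `pymongo.MongoClient()` to create a client for the MongoDB server running on the default port `27017` on the local machine. The client connects lazily: constructing a `MongoClient` does not contact the server, so a connection problem only surfaces on the first operation that needs the server.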

2. **Creating a Database**: We create a database named `mydatabase` using the `client["mydatabase"]` syntax. If the database does not already exist, MongoDB will create it when data is first stored in it.

3. **Creating a Collection**: Within the `mydatabase` database, we create a collection named `customers` using the `mydb["customers"]` syntax. If the collection does not already exist, MongoDB will create it when data is first inserted into it.

4. **Error Handling**: We handle potential connection errors using a try-except block and print an error message if an error occurs.

5. **Closing the Connection**: Finally, we close the MongoDB connection using the `client.close()` method to release resources.

This code demonstrates how to establish a connection to MongoDB using `pymongo`, create a database, and create a collection within that database. You can further interact with the created collection by inserting, querying, updating, or deleting documents as needed.
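
Because databases and collections are created lazily, it can be useful to check what actually exists on the server. The following is a minimal sketch; the helper name `describe_namespace` is made up for this example, and `client` is assumed to be a `pymongo.MongoClient` connected to a reachable server.

```python
def describe_namespace(client, db_name, coll_name):
    """Report whether a database and a collection exist on the server yet."""
    db_exists = db_name in client.list_database_names()
    # Only ask for collection names if the database itself exists
    coll_exists = db_exists and coll_name in client[db_name].list_collection_names()
    return {"database": db_exists, "collection": coll_exists}
```

Called as `describe_namespace(client, "mydatabase", "customers")` before any insert on a fresh server, this would be expected to report both as missing; after the first document is inserted, both should be reported as present.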

Q4. Using the database and the collection created in question number 3, write a code to insert one record,
and insert many records. Use the find() and find_one() methods to print the inserted record.

Below is a Python code example that demonstrates how to insert one record and multiple records into the `customers` collection created in question number 3. It also shows how to use the `find()` and `find_one()` methods to print the inserted records:

import pymongo

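# Establishing a connection to MongoDB
try:
    client = pymongo.MongoClient("mongodb://localhost:27017/")
    # MongoClient connects lazily, so ping the server to confirm it is reachable
    client.admin.command("ping")
    print("Connected to MongoDB")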

    # Accessing the 'mydatabase' database
    mydb = client["mydatabase"]

    # Accessing the 'customers' collection
    mycol = mydb["customers"]

    # Inserting one record
    record_one = {"name": "John", "address": "Highway 37"}
    mycol.insert_one(record_one)
    print("One record inserted successfully")

    # Inserting multiple records
    records_many = [
        {"name": "Peter", "address": "Lowstreet 27"},
        {"name": "Amy", "address": "Apple st 652"},
        {"name": "Hannah", "address": "Mountain 21"},
        {"name": "Michael", "address": "Valley 345"},
        {"name": "Sandy", "address": "Ocean blvd 2"}
    ]
    mycol.insert_many(records_many)
    print("Multiple records inserted successfully")

    # Finding and printing the inserted records using find() method
    print("Inserted records using find():")
    for record in mycol.find():
        print(record)

    # Finding and printing one inserted record using find_one() method
    print("\nInserted record using find_one():")
    print(mycol.find_one({"name": "John"}))

except pymongo.errors.ConnectionFailure as e:
    print("Error connecting to MongoDB:", e)

finally:
    # Closing the MongoDB connection
    if 'client' in locals():
        client.close()
        print("MongoDB connection is closed")

Explanation:

1. **Inserting One Record**: We use the `insert_one()` method to insert one record into the `customers` collection. The record is represented as a dictionary.

2. **Inserting Multiple Records**: We use the `insert_many()` method to insert multiple records into the `customers` collection. The records are represented as a list of dictionaries.

3. **Finding and Printing Records**: We use the `find()` method to retrieve all records from the `customers` collection and print them using a loop. We also use the `find_one()` method to retrieve and print one specific record from the collection.

This code demonstrates how to insert one record and multiple records into a MongoDB collection and use the `find()` and `find_one()` methods to retrieve and print the inserted records.
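
The insert methods also return result objects that carry the generated `_id` values, which makes it possible to read back exactly the documents that were just written rather than everything in the collection. The sketch below assumes `collection` is a PyMongo collection; the helper names `insert_customers` and `fetch_by_ids` are invented for this example.

```python
def insert_customers(collection, first, others):
    """Insert one document, then several, and return the generated _id values."""
    one_result = collection.insert_one(first)
    many_result = collection.insert_many(others)
    # inserted_id is a single value; inserted_ids is a list in insertion order
    return [one_result.inserted_id, *many_result.inserted_ids]


def fetch_by_ids(collection, ids):
    """Fetch exactly the documents with the given _id values."""
    return list(collection.find({"_id": {"$in": ids}}))
```

This avoids printing documents left over from earlier runs, which a bare `find()` would include.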

Q5. Explain how you can use the find() method to query the MongoDB database. Write a simple code to
demonstrate this.

In MongoDB, the `find()` method is used to query documents from a collection based on specified criteria. It allows you to retrieve documents that match a given filter condition. The `find()` method returns a cursor object that can be iterated over to access the matched documents.

### Using the `find()` Method:

#### Syntax:
```python
cursor = collection.find(filter, projection)
```

- `collection`: The collection from which documents will be retrieved.
- `filter` (optional): Specifies the criteria used to select documents. It is a dictionary representing the query filter.
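- `projection` (optional): Specifies which fields to include or exclude in the returned documents. It is also a dictionary where 1 indicates inclusion and 0 indicates exclusion. Inclusion and exclusion cannot be mixed in one projection, except that `_id` (included by default) may be excluded alongside included fields.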

#### Example:
Consider a collection named `customers` with documents representing customer data. Here's how you can use the `find()` method to query the collection:

import pymongo

# Establishing a connection to MongoDB
client = pymongo.MongoClient("mongodb://localhost:27017/")

# Accessing the database and collection
mydb = client["mydatabase"]
mycol = mydb["customers"]

# Querying documents using the find() method
query = {"address": "Apple st 652"}
cursor = mycol.find(query)

# Iterating over the cursor to access matched documents
for document in cursor:
    print(document)

In this code:

1. We establish a connection to MongoDB and access the `mydatabase` database and `customers` collection.
2. We define a filter query specifying the criteria for selecting documents. In this example, we want to find documents where the value of the `address` field is `"Apple st 652"`.
3. We use the `find()` method to retrieve documents from the `customers` collection that match the specified query filter.
4. We iterate over the cursor returned by the `find()` method to access each matched document and print it.

This demonstrates how to use the `find()` method to query a MongoDB database and retrieve documents based on specified criteria. The `find()` method is versatile and allows for complex queries using various query operators to match documents.
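
To illustrate query operators and projections, here is a minimal sketch that builds a filter from optional criteria and passes it to `find()`. The helper names `address_filter` and `find_customers` are hypothetical, `collection` is assumed to be a PyMongo collection, and using Python's `re.escape()` to neutralise special characters in the prefix is a convenience assumption rather than a MongoDB requirement.

```python
import re


def address_filter(prefix=None, names=None):
    """Build a find() filter from optional criteria."""
    query = {}
    if prefix is not None:
        # $regex with ^ anchors the match to the start of the string
        query["address"] = {"$regex": "^" + re.escape(prefix)}
    if names:
        # $in matches any of the listed values
        query["name"] = {"$in": list(names)}
    return query


def find_customers(collection, prefix=None, names=None):
    """Return matching customers with only the name and address fields."""
    # Projection: include name and address, suppress the default _id field
    projection = {"_id": 0, "name": 1, "address": 1}
    return list(collection.find(address_filter(prefix, names), projection))
```

With no criteria the filter is an empty dictionary, which matches every document in the collection.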

Q6. Explain the sort() method. Give an example to demonstrate sorting in MongoDB.

In MongoDB, the `sort()` method is used to sort the results of a query based on one or more fields in ascending or descending order. It allows you to control the order in which documents are returned by the query. The `sort()` method modifies the cursor to return documents in the specified order.

### Using the `sort()` Method:

#### Syntax:
cursor = collection.find(filter).sort(key_or_list, direction)

- `collection`: The collection from which documents will be retrieved.
- `filter`: Specifies the criteria used to select documents. It is optional and can be omitted if no filtering is required.
- `key_or_list`: Specifies the field(s) by which to sort the documents. It is either a single field name or a list of `(field name, direction)` tuples for sorting on several fields.
- `direction`: Used only when a single field name is given. It can be `pymongo.ASCENDING` (1) for ascending order or `pymongo.DESCENDING` (-1) for descending order. If it is omitted, sorting is ascending.

#### Example:
Consider a collection named `customers` with documents representing customer data. Here's how you can use the `sort()` method to sort the retrieved documents based on the `name` field in ascending order:

import pymongo

# Establishing a connection to MongoDB
client = pymongo.MongoClient("mongodb://localhost:27017/")

# Accessing the database and collection
mydb = client["mydatabase"]
mycol = mydb["customers"]

# Sorting documents based on the 'name' field in ascending order
cursor = mycol.find().sort("name", pymongo.ASCENDING)

# Iterating over the cursor to access sorted documents
for document in cursor:
    print(document)
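
Sorting on more than one field uses the list form, where each entry pairs a field name with a direction. The sketch below is one way this might look; the helper name `customers_sorted` is made up for this example, `collection` is assumed to be a PyMongo collection, and the literal values 1 and -1 are used in place of `pymongo.ASCENDING` and `pymongo.DESCENDING`.

```python
# Direction values: pymongo.ASCENDING is 1 and pymongo.DESCENDING is -1
ASCENDING, DESCENDING = 1, -1


def customers_sorted(collection, limit=0):
    """Return customers ordered by address (A-Z), then by name (Z-A)."""
    sort_spec = [("address", ASCENDING), ("name", DESCENDING)]
    cursor = collection.find({}, {"_id": 0}).sort(sort_spec)
    if limit:
        # Apply a limit only when one was requested
        cursor = cursor.limit(limit)
    return list(cursor)
```

The second key only matters for documents that tie on the first, so here `name` decides the order among customers sharing the same `address`.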

Q7. Explain why delete_one(), delete_many(), and drop() are used.

In MongoDB, the `delete_one()`, `delete_many()`, and `drop()` methods are used to remove documents or collections from the database. Each of these methods serves a specific purpose and provides flexibility in deleting data based on different criteria:

1. **delete_one() Method**:
   - The `delete_one()` method is used to delete a single document that matches a specified filter condition from a collection.
   - It removes the first document that matches the filter criteria and stops after deleting the first matching document.
   - This method is useful when you want to delete a specific document based on certain criteria.

2. **delete_many() Method**:
   - The `delete_many()` method is used to delete multiple documents that match a specified filter condition from a collection.
   - It removes all documents that match the filter criteria and continues until all matching documents are deleted.
   - This method is suitable for bulk deletion of documents based on a common condition.

3. **drop() Method**:
   - The `drop()` method is used to drop (delete) an entire collection from the database.
   - It removes the entire collection along with all its documents and indexes.
   - This method is typically used when you want to permanently remove a collection and its data from the database.

### Example Scenarios:

- **delete_one() Example**:
  result = mycol.delete_one({"name": "John"})
  print(result.deleted_count, "document deleted")
  This code deletes the first document in the `mycol` collection where the `name` field is "John".

- **delete_many() Example**:
  result = mycol.delete_many({"status": "inactive"})
  print(result.deleted_count, "documents deleted")
  This code deletes all documents in the `mycol` collection where the `status` field is "inactive".

- **drop() Example**:
  mycol.drop()
  This code deletes the entire `mycol` collection, including all its documents and indexes.
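
Since deletions cannot be undone, it can help to count the matching documents before removing them. The following is a minimal sketch of that idea; the helper name `delete_matching` and its `dry_run` flag are assumptions for this example, and `collection` is assumed to be a PyMongo collection.

```python
def delete_matching(collection, query, dry_run=True):
    """Count matching documents and delete them unless dry_run is set."""
    if not query:
        # An empty filter matches every document; refuse it here on purpose
        raise ValueError("refusing to delete with an empty filter")
    matched = collection.count_documents(query)
    if dry_run:
        return {"matched": matched, "deleted": 0}
    result = collection.delete_many(query)
    return {"matched": matched, "deleted": result.deleted_count}
```

Running it once with the default `dry_run=True` shows how many documents a filter such as `{"status": "inactive"}` would remove; passing `dry_run=False` then performs the deletion.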