Q1. What is MongoDB? Explain non-relational databases in short. In which scenarios is MongoDB preferred
over SQL databases?

#Answer

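MongoDB is a popular document-oriented NoSQL database whose source is publicly available (the Community Server has been licensed under the SSPL since 2018). It is designed to store and manage large volumes of unstructured or semi-structured data. Data is modeled as JSON-like documents, which allows for dynamic schemas and nested data structures; for storage and transfer, documents are encoded as BSON (Binary JSON), a binary format that adds types such as dates and ObjectIds.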

Non-relational databases, also known as NoSQL (commonly expanded as "not only SQL"), are database systems that do not use the traditional tabular relational model found in SQL databases. They are designed to handle large-scale data storage and processing requirements, focusing on scalability, flexibility, and performance. The main families are key-value, document, wide-column, and graph stores.

MongoDB is preferred over SQL databases in several scenarios:

1. Flexible and evolving schemas: MongoDB allows for dynamic and flexible schemas, meaning that each document in a collection can have a different structure. This makes it easier to handle evolving data models, where the structure of data can change over time without requiring alterations to the database schema.

2. Scalability: MongoDB is designed to scale horizontally, allowing you to distribute data across multiple servers or clusters easily. This enables handling large amounts of data and high traffic loads.

3. High-performance for certain use cases: MongoDB's document-oriented model can provide high performance for certain types of queries, especially when retrieving entire documents or working with nested data structures.

4. Agile development: MongoDB's flexibility and ease of use make it well-suited for agile development methodologies. It allows developers to quickly iterate and adapt their data models during the development process.

5. Use cases with unstructured or semi-structured data: MongoDB is particularly useful when dealing with unstructured or semi-structured data, such as social media data, sensor data, logs, and content management systems.
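
To make the "flexible schema" point concrete, here is a minimal sketch using plain Python dictionaries, which is the form in which PyMongo accepts documents. The field names and the idea of a "users" collection are assumptions made up for illustration; nothing here talks to a server.

```python
import json

# Two documents destined for the same (hypothetical) "users" collection.
# They share "name" but otherwise differ in shape, which MongoDB permits.
user_basic = {"name": "Alice", "age": 25}
user_rich = {
    "name": "Bob",
    "emails": ["bob@example.com", "bob@work.example"],  # array field
    "address": {"city": "London", "zip": "N1 9GU"},     # nested document
}

documents = [user_basic, user_rich]

# Fields present in every document vs. fields that appear in only some.
common_fields = set.intersection(*(set(doc) for doc in documents))
all_fields = set.union(*(set(doc) for doc in documents))

print("Common fields:", sorted(common_fields))
print("Optional fields:", sorted(all_fields - common_fields))
print(json.dumps(user_rich, indent=2))
```

With a running server, a list like `documents` could be passed unchanged to `collection.insert_many(documents)`; no schema change is needed for the second document's extra fields.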

It's important to note that SQL databases, such as MySQL or PostgreSQL, remain a strong choice for applications built around complex multi-table transactions, joins, and strict schema enforcement. MongoDB has supported multi-document ACID transactions since version 4.0 and offers optional schema validation, but these are not the workloads it is primarily designed for. The choice between MongoDB and SQL databases depends on the specific requirements and nature of the data in your application.

                      -------------------------------------------------------------------

Q2. State and Explain the features of MongoDB.

#Answer

MongoDB offers several key features that make it a popular choice for developers and businesses:

1. Document-Oriented: MongoDB is a document-oriented database, which means it stores data in flexible, self-contained documents using a format called BSON (Binary JSON). Each document can have a different structure, allowing for dynamic schemas and easy representation of complex data.

2. Scalability: MongoDB is built to scale horizontally by distributing data across multiple servers. It does this through sharding, which partitions a collection's data across shards to increase throughput and storage capacity. When new shards are added to a cluster, the balancer migrates data onto them automatically.

3. High Performance: MongoDB provides high-performance data access through several features. It supports indexes to optimize query performance, and its default storage engine, WiredTiger, keeps frequently accessed data in an internal cache and supports compression and document-level concurrency control. Memory-mapped files were used by the legacy MMAPv1 engine, not by WiredTiger.

4. Flexible Data Model: MongoDB's flexible data model allows for easy handling of evolving schemas. You can store documents with different fields and structures within the same collection, without requiring a predefined schema. This flexibility enables developers to quickly adapt to changing data requirements.

5. Rich Query Language: MongoDB's query language supports a wide range of operations, including CRUD (create, read, update, delete) operations, aggregation, geospatial queries, and text search. The query language is expressive and allows for complex queries on documents.

6. Replication and High Availability: MongoDB offers replication to ensure data redundancy and high availability. By configuring replica sets, you can create multiple copies of your data across different servers. If the primary node fails, one of the secondary nodes can take over as the primary, minimizing downtime.

7. Automatic Sharding: MongoDB's automatic sharding feature allows for horizontal scaling without manual intervention. It splits data across multiple shards based on a chosen shard key. As the data grows, MongoDB automatically redistributes and balances the data across the shards.

8. Security: MongoDB provides robust security features, including authentication, role-based access control (RBAC), and transport layer encryption (TLS/SSL). These features help protect data from unauthorized access and ensure secure communication between clients and servers.

9. Flexibility with Storage: MongoDB has a pluggable storage engine architecture. WiredTiger has been the default engine since MongoDB 3.2. The legacy MMAPv1 engine was deprecated in 4.0 and removed in 4.2, so it is only relevant for old deployments. An in-memory storage engine is available in MongoDB Enterprise.
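
The idea behind splitting data "based on a chosen shard key" can be sketched with a toy model. This is an assumption-laden simplification: it hashes a key and takes a modulus, whereas MongoDB itself partitions the shard key space into ranges and moves them between shards with a balancer. The function name `assign_shard` and the `user-N` keys are hypothetical.

```python
import hashlib
from collections import Counter


def assign_shard(shard_key_value: str, num_shards: int) -> int:
    """Map a shard key value to a shard index by hashing it (toy model only)."""
    digest = hashlib.sha256(shard_key_value.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


# Distribute 1000 hypothetical user IDs over 3 shards and count the placement.
user_ids = [f"user-{i}" for i in range(1000)]
placement = Counter(assign_shard(uid, 3) for uid in user_ids)

for shard, count in sorted(placement.items()):
    print(f"shard {shard}: {count} documents")
```

The point of the sketch is only that a well-distributed shard key spreads documents roughly evenly, and that the same key value always lands on the same shard.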

Overall, MongoDB's features make it well-suited for applications that require flexibility, scalability, high performance, and the ability to handle diverse and evolving data structures.

                      -------------------------------------------------------------------

Q3. Write code to connect MongoDB to Python. Also, create a database and a collection in MongoDB.

#Answer

To connect to MongoDB from Python, we can use PyMongo, the official MongoDB driver for Python (installable with pip install pymongo). The following example connects to MongoDB, then creates a database and a collection by inserting a first document:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Create/connect to a database
database = client["mydatabase"]

# Create/connect to a collection
collection = database["mycollection"]

# Insert a document into the collection
document = {"name": "John", "age": 30}
inserted_document = collection.insert_one(document)
print("Inserted document ID:", inserted_document.inserted_id)

# Close the connection
client.close()



Import the pymongo library, which provides the necessary classes and methods for interacting with MongoDB.

Create a pymongo.MongoClient instance to connect to the MongoDB server. Here, we assume that MongoDB is running on the default host and port (localhost:27017). If your server runs on a different host or port, modify the connection string accordingly. Note that MongoClient connects lazily: the constructor returns immediately, and connection errors surface only on the first operation that needs the server.

Get a handle to a database with the client["mydatabase"] syntax, replacing "mydatabase" with the desired name. This does not create anything yet: MongoDB creates a database only when data is first written to it. If the database already exists, the handle simply refers to it.

Get a handle to a collection with the database["mycollection"] syntax, replacing "mycollection" with the desired name. Collections are also created lazily, on the first insert (or explicitly with database.create_collection()). If the collection already exists, the handle refers to the existing one.

Insert a document into the collection using the collection.insert_one() method. Here, we're inserting a dictionary as the document. The method returns an InsertOneResult object, and we can access the inserted document's ID using inserted_document.inserted_id.

Finally, close the connection to the MongoDB server using client.close().
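
If the server is on a different host or requires authentication, the connection string has to be adjusted. The following is a minimal sketch of assembling one; the helper name `build_mongo_uri`, the host `db.example.com`, and the credentials are all hypothetical. Percent-escaping the username and password matters because characters such as `@`, `:` and `/` are delimiters in the URI.

```python
from urllib.parse import quote_plus


def build_mongo_uri(host="localhost", port=27017, username=None, password=None):
    """Return a mongodb:// URI, percent-escaping credentials if both are given."""
    if username is not None and password is not None:
        credentials = f"{quote_plus(username)}:{quote_plus(password)}@"
    else:
        credentials = ""
    return f"mongodb://{credentials}{host}:{port}"


# Default local server, no authentication.
print(build_mongo_uri())

# Hypothetical remote server; the password contains reserved characters.
print(build_mongo_uri("db.example.com", 27018, "app_user", "p@ss:word/1"))
```

The resulting string can be passed to `pymongo.MongoClient(...)` in place of the literal used in the example above.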

                      -------------------------------------------------------------------

Q4. Using the database and the collection created in question number 3, write code to insert one record
and to insert many records. Use the find() and find_one() methods to print the inserted records.

#Answer

Here's an example that inserts one record and then several records into a collection. It then uses find_one() and find() to print the inserted records, looking them up by the _id values returned from the insert calls:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Access the database and collection
database = client["mydatabase"]
collection = database["mycollection"]

# Insert one record
document = {"name": "Alice", "age": 25}
inserted_document = collection.insert_one(document)
print("Inserted document ID:", inserted_document.inserted_id)

# Insert many records
documents = [
    {"name": "Bob", "age": 30},
    {"name": "Charlie", "age": 35},
    {"name": "David", "age": 40}
]
inserted_documents = collection.insert_many(documents)
print("Inserted document IDs:", inserted_documents.inserted_ids)

# Find and print the inserted records
print("\nInserted Records:")
print("One Record:")
print(collection.find_one({"_id": inserted_document.inserted_id}))

print("\nMany Records:")
for document in collection.find({"_id": {"$in": inserted_documents.inserted_ids}}):
    print(document)

# Close the connection
client.close()


                      -------------------------------------------------------------------

Q5. Explain how you can use the find() method to query the MongoDB database. Write simple code to
demonstrate this.

#Answer

The find() method queries a collection and returns a cursor over the documents that match the given criteria. You can narrow the result with a filter and a projection, and refine it further by chaining cursor methods such as sort() and limit(). The cursor is lazy: documents are fetched from the server as you iterate over it. Here's an explanation of how to use find(), followed by a simple code example:

To use the find() method in MongoDB, you provide a query document that specifies the conditions for matching the documents. The query document is a dictionary (or a JSON-like object) where the keys represent the field names, and the values represent the matching criteria.

In [4]:
# The basic syntax for the find() method is as follows:
collection.find(query, projection)


query (optional): Specifies the query document to filter the documents. It defines the conditions that the documents must meet to be included in the result. If no query is specified, all documents in the collection will be returned.

projection (optional): Specifies which fields to include (1) or exclude (0) in the result. By default, all fields are returned. Inclusions and exclusions cannot be mixed in one projection, with the exception of _id, which is included unless it is explicitly excluded with "_id": 0.

Here's a simple code example that demonstrates the use of the find() method:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Access the database and collection
database = client["mydatabase"]
collection = database["mycollection"]

# Insert some sample documents
documents = [
    {"name": "Alice", "age": 25, "city": "New York"},
    {"name": "Bob", "age": 30, "city": "London"},
    {"name": "Charlie", "age": 35, "city": "Paris"},
    {"name": "David", "age": 40, "city": "Tokyo"}
]
collection.insert_many(documents)

# Perform a find query
query = {"age": {"$gte": 30}}  # Find documents where age is greater than or equal to 30
projection = {"_id": 0, "name": 1, "age": 1}  # Include only name and age fields, exclude _id field
results = collection.find(query, projection)

# Print the matching documents
for document in results:
    print(document)

# Close the connection
client.close()
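
Several operators can be combined on one field to express a range. The sketch below builds such a filter and, purely for illustration, selects the matching sample documents by hand in plain Python so the expected result is visible without a server. The helper `age_range_filter` is hypothetical; `$gte` and `$lt` are standard MongoDB comparison operators.

```python
def age_range_filter(min_age, max_age):
    """Build a filter document meaning: min_age <= age < max_age."""
    return {"age": {"$gte": min_age, "$lt": max_age}}


query = age_range_filter(30, 40)
projection = {"_id": 0, "name": 1, "city": 1}  # keep only name and city

sample = [
    {"name": "Alice", "age": 25, "city": "New York"},
    {"name": "Bob", "age": 30, "city": "London"},
    {"name": "Charlie", "age": 35, "city": "Paris"},
    {"name": "David", "age": 40, "city": "Tokyo"},
]

# Hand-written equivalent of the filter and projection, for illustration only.
expected = [
    {"name": doc["name"], "city": doc["city"]}
    for doc in sample
    if 30 <= doc["age"] < 40
]

print("Filter:", query)
print("Documents the filter should select:", expected)
```

Against a collection holding the four sample documents, `collection.find(query, projection)` would be the corresponding PyMongo call.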


                       -------------------------------------------------------------------

Q6. Explain the sort() method. Give an example to demonstrate sorting in MongoDB.

#Answer

The sort() method orders the results of a query by one or more fields, in ascending or descending order per field. In PyMongo it is a method of the cursor returned by find(), not of the collection itself, and it affects only the order in which documents are returned; the stored documents are not rearranged.

The basic syntax for the sort() method is as follows:

In [None]:
collection.find(query).sort(sort_key, sort_order)


query (optional): Specifies the query document to filter the documents. If no query is provided, all documents in the collection will be sorted.

sort_key: Specifies what to sort by. It can be a single field name, or a list of (field, direction) pairs when sorting by several fields.

sort_order (optional): Specifies the direction when sort_key is a single field name: pymongo.ASCENDING (1) or pymongo.DESCENDING (-1). The default is ascending. When a list of pairs is passed, each pair carries its own direction and this argument is not needed.

Here's an example that demonstrates sorting in MongoDB:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Access the database and collection
database = client["mydatabase"]
collection = database["mycollection"]

# Insert some sample documents
documents = [
    {"name": "Alice", "age": 25},
    {"name": "Bob", "age": 30},
    {"name": "Charlie", "age": 35},
    {"name": "David", "age": 40}
]
collection.insert_many(documents)

# Perform a sort query
sort_key = "age"  # Sort by the "age" field
sort_order = pymongo.DESCENDING  # Sort in descending order
results = collection.find().sort(sort_key, sort_order)

# Print the sorted documents
for document in results:
    print(document)

# Close the connection
client.close()
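
Sorting by more than one field uses a list of (field, direction) pairs, with the first pair being the most significant. The sketch below shows such a specification and emulates the resulting order in plain Python so it can be run without a server. The helper `sort_like_cursor` and the sample data are hypothetical; the constants are redefined locally with the same values as pymongo.ASCENDING and pymongo.DESCENDING.

```python
ASCENDING, DESCENDING = 1, -1  # same values as pymongo.ASCENDING / pymongo.DESCENDING

# Oldest first; people of the same age ordered alphabetically by name.
sort_spec = [("age", DESCENDING), ("name", ASCENDING)]

people = [
    {"name": "Dana", "age": 30},
    {"name": "Alice", "age": 25},
    {"name": "Bob", "age": 30},
    {"name": "Charlie", "age": 35},
]


def sort_like_cursor(docs, spec):
    """Order docs by each (field, direction) pair, first pair most significant."""
    ordered = list(docs)
    # Python's sort is stable, so apply the least significant key first.
    for field, direction in reversed(spec):
        ordered.sort(key=lambda doc: doc[field], reverse=(direction == DESCENDING))
    return ordered


for person in sort_like_cursor(people, sort_spec):
    print(person)
```

With PyMongo, the same specification would be passed as `collection.find().sort(sort_spec)`.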


                        -------------------------------------------------------------------

Q7. Explain why delete_one(), delete_many(), and drop() are used.

#Answer


In MongoDB, the `delete_one()`, `delete_many()`, and `drop()` methods are used to remove documents or entire collections from a database. Each method serves a specific purpose:

1. `delete_one()` method:
   The `delete_one()` method is used to delete a single document that matches a specified filter. It removes the first document that satisfies the given criteria from the collection. If multiple documents match the filter, only the first one encountered will be deleted.

   Example:
   ```python
   collection.delete_one({"name": "John"})
   ```
   This code will delete the first document in the collection where the "name" field is "John".

2. `delete_many()` method:
   The `delete_many()` method allows deleting multiple documents that match a given filter. It removes all the documents that satisfy the specified criteria from the collection.

   Example:
   ```python
   collection.delete_many({"age": {"$gt": 30}})
   ```
   This code will delete all the documents in the collection where the "age" field is greater than 30.

   The `delete_many()` method is useful when you want to remove multiple documents that meet certain conditions.

3. `drop()` method:
   The `drop()` method removes an entire collection from the database: all of its documents, its indexes, and the collection itself. The operation cannot be undone; a dropped collection can be recovered only from a backup.

   Example:
   ```python
   collection.drop()
   ```
   This code will drop the entire collection.

   The `drop()` method is typically used when you want to delete an entire collection rather than individual documents.

It's important to exercise caution when using these methods, especially `drop()`, as data loss can occur. Always double-check the filters and ensure you have appropriate backups before performing irreversible operations like `drop()`.
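
One concrete reason to double-check filters: an empty filter `{}` matches every document, so `delete_many({})` empties the collection. A small guard can catch that mistake before the call is made. This is only a sketch of one possible safeguard; the helper `checked_delete_filter` is hypothetical and not part of PyMongo.

```python
def checked_delete_filter(filter_doc):
    """Return filter_doc unchanged, or raise if it would match every document."""
    if not isinstance(filter_doc, dict):
        raise TypeError("filter must be a dict")
    if not filter_doc:
        raise ValueError("refusing to delete with an empty filter: {} matches every document")
    return filter_doc


# A specific filter passes through untouched.
print(checked_delete_filter({"age": {"$gt": 30}}))

# An empty filter is rejected before it can reach delete_many().
try:
    checked_delete_filter({})
except ValueError as exc:
    print("Blocked:", exc)
```

In use, the guard would wrap the filter at the call site, for example `collection.delete_many(checked_delete_filter(my_filter))`.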

                        -------------------------------------------------------------------