Q1. What is MongoDB? Explain non-relational databases in short. In which scenarios is MongoDB preferred
over SQL databases?

MongoDB is a NoSQL database that stores data in a flexible, JSON-like format called BSON (Binary JSON). It is designed for scalability and high performance. Unlike traditional relational databases, MongoDB uses collections and documents rather than tables and rows.
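
As a rough illustration of the document model (plain Python, no MongoDB server required), the sketch below contrasts one nested document with the same data spread over relational-style rows. All names and values here are hypothetical and chosen only for the example.

```python
import json

# One self-contained document: nested fields and arrays live inside the record.
user_document = {
    "name": "Ajay",
    "age": 24,
    "address": {"city": "Patna", "state": "Bihar"},
    "skills": ["python", "mongodb"],
}

# The same data in a relational layout needs several tables joined by a user id.
users_rows = [(1, "Ajay", 24)]
addresses_rows = [(1, "Patna", "Bihar")]
skills_rows = [(1, "python"), (1, "mongodb")]

# Documents map directly to JSON text and back without losing structure.
as_json = json.dumps(user_document, indent=2)
round_tripped = json.loads(as_json)
print(as_json)
```

Reading the city takes one lookup in the document (`user_document["address"]["city"]`), whereas the relational layout would need a join between the users and addresses rows.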

Non-relational databases, or NoSQL databases, are designed to handle a variety of data models, including document-based, key-value pairs, column-family, and graph formats. They are often used for their flexibility, scalability, and ability to handle unstructured or semi-structured data.

Scenarios where MongoDB is preferred over SQL databases:

1. Unstructured or Semi-structured Data: MongoDB is ideal for applications dealing with complex, hierarchical, or semi-structured data, such as content management systems or user profiles.

2. Scalability: MongoDB excels in scenarios requiring horizontal scaling across multiple servers. Its built-in sharding allows for distributing data across multiple nodes easily.

3. Flexible Schema: When the data model is likely to evolve, MongoDB’s schema-less design allows for changes without impacting existing data.

4. Rapid Development: Its flexible data model can speed up development cycles, especially when the application requires frequent changes to the data structure.

5. Real-time Analytics: MongoDB can be used for applications needing real-time analytics or handling large volumes of data with varying structures, such as logging systems or big data applications.

In contrast, SQL databases are often preferred for applications requiring strong consistency, complex transactions, and fixed schemas.

Q2. State and Explain the features of MongoDB.

MongoDB offers several key features that make it a popular choice for modern applications:

1. Document-Oriented Storage: MongoDB stores data in flexible, JSON-like documents (BSON format), which can have varying structures. This allows for rich and hierarchical data representation.

2. Schema Flexibility: Unlike traditional relational databases with fixed schemas, MongoDB's schema-less design means that documents in a collection can have different structures, making it easy to evolve your data model as needed.

3. High Availability: MongoDB provides high availability through replica sets. A replica set is a group of MongoDB servers that maintain the same data set. If the primary server fails, a secondary server can automatically take over.

4. Scalability: MongoDB supports horizontal scaling through sharding. Sharding involves distributing data across multiple servers (shards), allowing for the management of large datasets and high throughput operations.

5. Indexing: MongoDB supports a variety of indexing options to improve query performance. This includes single-field indexes, compound indexes, geospatial indexes, and text indexes.

6. Aggregation Framework: MongoDB includes a powerful aggregation framework for performing complex queries and data processing tasks. It allows for filtering, grouping, and transforming data within the database.

7. Rich Query Language: MongoDB's query language supports a wide range of queries, including CRUD operations, as well as complex queries involving nested documents and arrays.

8. Built-in Replication and Sharding: The database supports automatic replication and sharding. Replication provides data redundancy and high availability, while sharding helps manage large datasets and distribute load.

9. GridFS: MongoDB includes GridFS for storing and retrieving large files, such as images, videos, and other binary data, including files larger than the 16 MB limit on a single BSON document. GridFS divides each file into chunks and stores every chunk as a separate document.

10. Aggregation Pipelines: MongoDB's aggregation pipelines allow for processing data in stages, enabling powerful data transformations and analytics. Each stage of the pipeline performs an operation on the data.

11. ACID Transactions: MongoDB supports multi-document ACID (Atomicity, Consistency, Isolation, Durability) transactions (on replica sets since version 4.0 and on sharded clusters since version 4.2), ensuring reliable and consistent operations across multiple documents and collections.

12. Geospatial Support: MongoDB has built-in support for geospatial data and queries, making it suitable for location-based services and applications.

13. Flexible Data Model: The ability to store complex data structures, including arrays and nested documents, allows for more natural data modeling for certain use cases.

These features make MongoDB a versatile database choice for applications that require flexible data modeling, scalability, and high performance.
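
To make the staged processing of an aggregation pipeline concrete, here is a minimal sketch that imitates three stages in plain Python. It is an illustration of the idea only, not how MongoDB executes pipelines; the `orders` data and the helper function names are hypothetical. The `pipeline` variable shows the roughly equivalent MongoDB stage documents for comparison.

```python
from collections import defaultdict

# Hypothetical input documents.
orders = [
    {"city": "Patna", "amount": 120},
    {"city": "Noida", "amount": 80},
    {"city": "Patna", "amount": 60},
    {"city": "Indore", "amount": 30},
]

# Roughly equivalent MongoDB pipeline (shown for comparison, not executed here).
pipeline = [
    {"$match": {"amount": {"$gte": 50}}},
    {"$group": {"_id": "$city", "total": {"$sum": "$amount"}}},
    {"$sort": {"total": -1}},
]

def match_stage(docs, min_amount):
    """Keep only documents whose amount is at least min_amount."""
    return [d for d in docs if d["amount"] >= min_amount]

def group_stage(docs):
    """Sum the amounts per city, one output document per city."""
    totals = defaultdict(int)
    for d in docs:
        totals[d["city"]] += d["amount"]
    return [{"_id": city, "total": total} for city, total in totals.items()]

def sort_stage(docs):
    """Order the grouped documents by total, largest first."""
    return sorted(docs, key=lambda d: d["total"], reverse=True)

# Each stage consumes the output of the previous one.
result = sort_stage(group_stage(match_stage(orders, 50)))
print(result)
```

The point of the sketch is the data flow: every stage receives the documents produced by the stage before it, which is why placing a filtering stage early reduces the work done by later stages.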

Q3. Write a code to connect MongoDB to Python. Also, create a database and a collection in MongoDB.

In [1]:
from pymongo import MongoClient

# Step 1: Connect to MongoDB server
client = MongoClient('mongodb://localhost:27017/')  # Adjust the URI if needed

# Step 2: Create a database
db = client['mydatabase']  # Replace 'mydatabase' with your desired database name

# Step 3: Create a collection
collection = db['mycollection']  # Replace 'mycollection' with your desired collection name

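# Optional: Print the names of the databases and collections.
# Note: MongoDB creates databases and collections lazily, so 'mydatabase' and
# 'mycollection' appear in these lists only after the first document is inserted.
print("Databases:", client.list_database_names())
print("Collections in 'mydatabase':", db.list_collection_names())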



Q4. Using the database and the collection created in question number 3, write a code to insert one record,
and insert many records. Use the find() and find_one() methods to print the inserted record.

In [None]:
# Code to insert one record:
from pymongo import MongoClient

# Connect to MongoDB server
client = MongoClient('mongodb://localhost:27017/')

# Access the database and collection
db = client['mydatabase']
collection = db['mycollection']

# Insert one record
single_record = {"name": "Ajay", "age": 24, "city": "Patna"}
result = collection.insert_one(single_record)

# Print the inserted record ID
print("Inserted record ID:", result.inserted_id)

# Retrieve and print the inserted record
retrieved_record = collection.find_one({"name": "Ajay"})
print("Retrieved record:", retrieved_record)

# Code to insert many records:
multiple_records = [
    {"name": "Saurav", "age": 23, "city": "Noida"},
    {"name": "Aditya", "age": 24, "city": "Mumbai"},
    {"name": "Jayant", "age": 24, "city": "Indore"}
]
result = collection.insert_many(multiple_records)

# Print the IDs of the inserted records
print("Inserted record IDs:", result.inserted_ids)

# Retrieve and print all records using find()
all_records = collection.find()
print("All records in 'mycollection':")
for record in all_records:
    print(record)


Q5. Explain how you can use the find() method to query the MongoDB database. Write a simple code to
demonstrate this.

The find() method in MongoDB is used to query a collection and retrieve documents that match a specified query filter. It returns a cursor that allows you to iterate over the resulting documents.

How to Use find():
1. Basic Query: Retrieves all documents in a collection if no query filter is provided.
2. Query with Conditions: Filters documents based on specific criteria (e.g., fields and values).
3. Projection: Limits the fields returned in the result set.
4. Sorting: Orders the results based on specified fields.
5. Limit and Skip: Controls the number of documents returned and skips over a specified number.
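
The five uses listed above can be imitated on a plain list of dictionaries, which may help show what each option does before running it against a server. This is a simplified sketch, not PyMongo's implementation: the `matches` and `find` helpers are hypothetical, only equality, `$gt`, and `$lt` conditions are handled, and the order of operations (sort, then skip, then limit) follows how a MongoDB cursor applies them.

```python
people = [
    {"name": "Ajay", "age": 24, "city": "Patna"},
    {"name": "Saurav", "age": 23, "city": "Noida"},
    {"name": "Sajan", "age": 25, "city": "New Delhi"},
    {"name": "Ankit", "age": 26, "city": "Indore"},
]

def matches(doc, query):
    """Return True if doc satisfies every condition in query."""
    for field, condition in query.items():
        value = doc.get(field)
        if isinstance(condition, dict):
            # Operator form such as {"age": {"$gt": 24}}; only $gt and $lt here.
            for op, operand in condition.items():
                if op == "$gt" and not (value is not None and value > operand):
                    return False
                if op == "$lt" and not (value is not None and value < operand):
                    return False
        elif value != condition:
            # Plain form such as {"city": "Noida"} means equality.
            return False
    return True

def find(docs, query=None, fields=None, sort_by=None,
         descending=False, skip=0, limit=None):
    """Filter, then sort, skip, limit, and finally project the documents."""
    result = [d for d in docs if matches(d, query or {})]
    if sort_by is not None:
        result = sorted(result, key=lambda d: d[sort_by], reverse=descending)
    end = None if limit is None else skip + limit
    result = result[skip:end]
    if fields is not None:
        result = [{k: d[k] for k in fields if k in d} for d in result]
    return result

# Oldest person over 24, returning only the name and city fields.
oldest = find(people, {"age": {"$gt": 24}}, fields=["name", "city"],
              sort_by="age", descending=True, limit=1)
print(oldest)
```

Calling `find(people)` with no query returns every document, mirroring a call to `find()` with no filter.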

In [1]:
# Example
from pymongo import MongoClient

# Connect to MongoDB server
client = MongoClient('mongodb://localhost:27017/')

# Access the database and collection
db = client['mydatabase']
collection = db['mycollection']

# Insert some example records (for demonstration purposes)
collection.insert_many([
    {"name": "Ajay", "age": 24, "city": "Patna"},
    {"name": "Saurav", "age": 23, "city": "Noida"},
    {"name": "Sajan", "age": 25, "city": "New Delhi"},
    {"name": "Ankit", "age": 26, "city": "Indore"}
])

# Query: Find all documents where age is greater than 24
# (the sample ages range from 23 to 26, so this matches Sajan and Ankit)
query = {"age": {"$gt": 24}}
results = collection.find(query)

# Print the results
print("Documents where age is greater than 24:")
for document in results:
    print(document)

# Query: Find documents and project only 'name' and 'city' fields
projection = {"_id": 0, "name": 1, "city": 1}
results = collection.find(query, projection)

# Print the results with projection
print("\nDocuments where age is greater than 24 with projected fields:")
for document in results:
    print(document)



Q6. Explain the sort() method. Give an example to demonstrate sorting in MongoDB.

sort() Method in MongoDB
In MongoDB, the sort() method orders the documents returned by a query; it does not change how documents are stored in the collection. It is called on the cursor returned by find(), can sort on any field in the document, and orders the results either ascending (1, smallest to largest) or descending (-1, largest to smallest).
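
Sorting on more than one field is a common follow-up. In PyMongo, sort() also accepts a list of (field, direction) pairs; the sketch below imitates that behaviour in plain Python so the ordering rule can be seen without a server. The `sort_documents` helper and the student data are hypothetical, and the approach (repeated stable sorts, last key first) is one simple way to get the same ordering, not how MongoDB does it internally.

```python
students = [
    {"name": "Ravi", "score": 85},
    {"name": "Meera", "score": 90},
    {"name": "Asha", "score": 90},
]

def sort_documents(docs, spec):
    """Sort by a list of (field, direction) pairs; 1 = ascending, -1 = descending."""
    result = list(docs)
    # Python's sort is stable, so sorting by the least significant key first
    # and the most significant key last yields the combined ordering.
    for field, direction in reversed(spec):
        result.sort(key=lambda d: d[field], reverse=(direction == -1))
    return result

# Highest score first; ties broken alphabetically by name.
ranked = sort_documents(students, [("score", -1), ("name", 1)])
for student in ranked:
    print(student)
```

The first pair in the list is the primary sort key, and later pairs only decide the order among documents that tie on the earlier keys.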

In [None]:
# Syntax in the MongoDB shell:  db.collection.find().sort({field_name: direction})
# Syntax in PyMongo:            collection.find().sort(field_name, direction)
# Example in Python using PyMongo:
from pymongo import MongoClient

# Connect to the MongoDB server
client = MongoClient("mongodb://localhost:27017/")

# Select the database
db = client["school"]

# Select the collection
collection = db["students"]

# Sort the documents by 'score' in descending order
sorted_students = collection.find().sort("score", -1)

# Print the sorted documents
for student in sorted_students:
    print(student)


Q7. Explain why delete_one(), delete_many(), and drop() are used.

In MongoDB, the methods delete_one(), delete_many(), and drop() are used to remove documents or entire collections from a database. Each method serves a specific purpose:

1. delete_one()
Purpose: Removes a single document that matches a specified filter.
Usage: This method is used when you want to delete only the first document that matches the query criteria.


In [None]:
from pymongo import MongoClient

# Connect to the MongoDB server
client = MongoClient("mongodb://localhost:27017/")

# Select the database
db = client["mydatabase"]

# Select the collection
collection = db["mycollection"]

# Delete one document where the 'name' field is 'Ajay'
result = collection.delete_one({"name": "Ajay"})

# Output the number of documents deleted
print(result.deleted_count)


2. delete_many()
Purpose: Removes all documents that match a specified filter.
Usage: This method is used when you want to delete multiple documents based on the query criteria.

In [None]:
from pymongo import MongoClient

# Connect to the MongoDB server
client = MongoClient("mongodb://localhost:27017/")

# Select the database
db = client["mydatabase"]

# Select the collection
collection = db["mycollection"]

# Delete all documents where the 'status' field is 'inactive'
result = collection.delete_many({"status": "inactive"})

# Output the number of documents deleted
print(result.deleted_count)


3. drop()
Purpose: Removes an entire collection from the database.
Usage: This method is used when you want to completely delete a collection and all its documents.

In [None]:
from pymongo import MongoClient

# Connect to the MongoDB server
client = MongoClient("mongodb://localhost:27017/")

# Select the database
db = client["mydatabase"]

# Select the collection
collection = db["mycollection"]

# Drop the entire collection
collection.drop()
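
The difference between the three operations can be seen side by side in a small plain-Python sketch that treats a list of dictionaries as a stand-in for a collection. The `delete_one` and `delete_many` helpers below are hypothetical imitations that support equality filters only; they are not PyMongo's methods.

```python
def _matches(doc, query):
    """True if every field in query equals the same field in doc."""
    return all(doc.get(k) == v for k, v in query.items())

def delete_one(docs, query):
    """Remove only the first matching document; return how many were removed."""
    for i, doc in enumerate(docs):
        if _matches(doc, query):
            del docs[i]
            return 1
    return 0

def delete_many(docs, query):
    """Remove every matching document; return how many were removed."""
    kept = [d for d in docs if not _matches(d, query)]
    deleted = len(docs) - len(kept)
    docs[:] = kept
    return deleted

records = [
    {"name": "Ajay", "status": "active"},
    {"name": "Ajay", "status": "inactive"},
    {"name": "Saurav", "status": "inactive"},
    {"name": "Aditya", "status": "active"},
]

# delete_one: two documents match, but only the first is removed.
deleted_first = delete_one(records, {"name": "Ajay"})
names_after_one = [d["name"] for d in records]

# delete_many: every remaining inactive document is removed.
deleted_inactive = delete_many(records, {"status": "inactive"})
names_after_many = [d["name"] for d in records]

# Stand-in for drop(): nothing is left afterwards.
records.clear()
print(deleted_first, deleted_inactive, len(records))
```

One difference the sketch cannot show: in MongoDB, drop() removes the collection together with its indexes, whereas deleting every document with delete_many({}) leaves the empty collection and its indexes in place.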
