# MongoDB Assignment

**Q1. What is MongoDB? Explain non-relational databases in short. In which scenarios is it preferred to use
MongoDB over SQL databases?**

Solution:

MongoDB is a popular document-oriented NoSQL (non-relational) database management system. It is designed to store and manage unstructured and semi-structured data in a flexible and scalable manner. Here's a brief explanation of non-relational databases and the scenarios where MongoDB is preferred over SQL databases:

Non-relational Databases:
Non-relational databases, also known as NoSQL databases, are a category of databases that differ from traditional SQL databases in their data model and storage approach. They are designed to handle large volumes of unstructured or semi-structured data and provide flexibility and scalability. Unlike SQL databases, which use structured tables and relationships, non-relational databases use various data models such as key-value, document, wide-column, or graph. Many are built for horizontal scalability and high throughput, and they can simplify data modeling for certain use cases.

MongoDB and its Advantages:
MongoDB is a widely used NoSQL database that follows the document data model. Here are some scenarios where MongoDB is preferred over SQL databases:

Flexible and Dynamic Schema: MongoDB does not enforce a rigid schema like SQL databases. It allows you to store documents with varying structures within the same collection, providing flexibility in handling evolving data models.

Scalability: MongoDB is designed to scale horizontally by distributing data across multiple servers. It can handle large volumes of data and high traffic loads efficiently, making it suitable for applications with rapid data growth or high scalability requirements.

High Performance: MongoDB's document-oriented model allows for efficient retrieval of related data, as it can store related information in a single document. It also supports indexing and provides features like sharding and replication to enhance performance and availability.

Rapid Development: MongoDB's flexible data model and dynamic schema make it suitable for agile development environments. It allows developers to quickly iterate and adapt to changing application requirements without the need for frequent database schema modifications.

Use Cases: MongoDB is commonly used in scenarios such as content management systems, real-time analytics, social media applications, e-commerce platforms, and applications that handle large amounts of unstructured or semi-structured data.

It's important to note that while MongoDB offers advantages for certain use cases, SQL databases excel in scenarios that require complex transactions, strict data integrity constraints, and extensive relational querying capabilities.

In summary, MongoDB is a NoSQL database that provides flexibility, scalability, and performance advantages for applications dealing with unstructured or semi-structured data, rapid development cycles, and high scalability requirements.

**Q2. State and Explain the features of MongoDB.**

Solution:

MongoDB offers several key features that make it a popular choice for data storage and management. Here are some of the prominent features of MongoDB:

Document-Oriented:
MongoDB follows a document-oriented data model, where data is stored in flexible, self-describing documents in the BSON (Binary JSON) format. Documents are similar to JSON objects and can have varying structures within the same collection. This allows for easy and efficient representation of complex data structures.

No Schema Constraints:
By default, MongoDB does not enforce rigid schema constraints, providing a flexible and dynamic data model. It allows you to store documents with different structures within the same collection. This flexibility is particularly useful in scenarios where data schemas evolve over time or when dealing with semi-structured or unstructured data. When stricter control is needed, optional validation rules can be attached to a collection.
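
As a minimal sketch of what varying structures look like in practice: PyMongo represents documents as ordinary Python dictionaries, so two documents destined for the same collection need not share fields. The field names and the `contact_line` helper below are hypothetical and chosen only for illustration.

```python
# Two documents with different shapes that could live in the same collection.
# All field names here are illustrative assumptions.
user_basic = {"name": "Alice", "age": 25}
user_extended = {
    "name": "Bob",
    "age": 30,
    "email": "bob@example.com",
    "address": {"city": "Pune", "zip": "411001"},  # nested sub-document
}


def contact_line(doc):
    """Format a document for display, tolerating missing optional fields."""
    email = doc.get("email", "no email on file")
    city = doc.get("address", {}).get("city", "unknown city")
    return f"{doc['name']} ({email}, {city})"


for doc in (user_basic, user_extended):
    print(contact_line(doc))
```

Because nothing guarantees that an optional field is present, code that reads such documents usually needs defaults like the `.get()` calls above.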

Scalability and High Performance:
MongoDB is designed to scale horizontally, allowing for easy distribution of data across multiple servers or clusters. It offers built-in sharding, which enables data partitioning and distribution across multiple machines, leading to improved performance and increased capacity to handle large data volumes and high traffic loads.

Replication and High Availability:
MongoDB provides support for replica sets, which are self-healing, highly available clusters of MongoDB instances. Replica sets ensure data redundancy and automatic failover, allowing for high availability and data durability.

Indexing and Querying:
MongoDB supports various types of indexes, including single-field, compound, geospatial, and text indexes. Indexes help improve query performance by allowing efficient data retrieval. MongoDB also provides a rich query language with support for a wide range of querying operators and expressions.
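
The index types mentioned above could be created from PyMongo roughly as follows. This is a sketch, not a recommended index design: the `ensure_indexes` helper and the `age`, `name`, and `bio` fields are assumptions. The function takes the collection as an argument so it can be read without a running server.

```python
ASCENDING = 1   # same value as pymongo.ASCENDING
TEXT = "text"   # same value as pymongo.TEXT


def ensure_indexes(collection):
    """Create a single-field, a compound, and a text index; return their names."""
    return [
        # Single-field index on a hypothetical "age" field
        collection.create_index([("age", ASCENDING)]),
        # Compound index: name first, then age
        collection.create_index([("name", ASCENDING), ("age", ASCENDING)]),
        # Text index on a hypothetical free-text "bio" field
        collection.create_index([("bio", TEXT)]),
    ]
```

With a live connection this would be called as `ensure_indexes(collection)`. `create_index()` returns the name of each index, and calling it again for an index that already exists is harmless.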

Aggregation Framework:
MongoDB's Aggregation Framework allows for advanced data processing and analysis. It offers a set of operators and stages to perform data transformations, filtering, grouping, joining, and computing aggregations. The Aggregation Framework is powerful for performing complex analytics and generating meaningful insights from data.
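
To make the idea of stages concrete, here is a small sketch of a pipeline built as a plain Python list. The `city` and `age` fields and the `average_age_by_city` helper are assumptions for illustration; the stage operators (`$match`, `$group`, `$sort`) are standard.

```python
def average_age_by_city(min_age):
    """Build a pipeline: filter by age, group by city, sort by average age."""
    return [
        # Stage 1: keep only documents at or above the minimum age
        {"$match": {"age": {"$gte": min_age}}},
        # Stage 2: one output document per city with an average and a count
        {"$group": {"_id": "$city", "avg_age": {"$avg": "$age"}, "count": {"$sum": 1}}},
        # Stage 3: highest average age first
        {"$sort": {"avg_age": -1}},
    ]


pipeline = average_age_by_city(18)
print(pipeline)
```

With a live collection, the pipeline would be run with `collection.aggregate(pipeline)`, which returns a cursor over the grouped results.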

Geospatial Capabilities:
MongoDB includes built-in geospatial indexing and querying features, making it well-suited for applications that work with geolocation data. It supports geospatial queries such as proximity searches ($near), containment within a shape or bounding box ($geoWithin), and intersection tests ($geoIntersects).
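
A proximity search can be expressed as an ordinary filter document. The sketch below assumes that documents store a GeoJSON point in a field named `location` (a hypothetical name) and that the field has a 2dsphere index; `near_point_filter` is an illustrative helper, not part of PyMongo.

```python
def near_point_filter(longitude, latitude, max_distance_m):
    """Build a filter matching documents within max_distance_m meters of a point."""
    return {
        "location": {
            "$near": {
                # GeoJSON lists longitude first, then latitude
                "$geometry": {"type": "Point", "coordinates": [longitude, latitude]},
                "$maxDistance": max_distance_m,
            }
        }
    }


print(near_point_filter(73.8567, 18.5204, 1000))
```

With a live collection, the index could be created with `collection.create_index([("location", "2dsphere")])` and the filter passed to `collection.find()`.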

Full-text Search:
MongoDB provides powerful full-text search capabilities through its text indexes. It supports language-aware text searches, stemming, and relevance-based ranking of search results. This feature is valuable for applications requiring efficient text search functionality.
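
A relevance-ranked text search might look like the following sketch. It assumes the collection already has a text index; the `search_by_relevance` helper and the `score` output field name are illustrative choices.

```python
def search_by_relevance(collection, terms, limit=5):
    """Run a $text search and return the best-scoring documents first."""
    score = {"$meta": "textScore"}
    cursor = (
        collection.find({"$text": {"$search": terms}}, {"score": score})
        .sort([("score", score)])  # order by the computed relevance score
        .limit(limit)
    )
    return list(cursor)
```

The projection adds the computed relevance score to each returned document under `score`, and the sort uses the same metadata expression so the most relevant matches come first.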

Security:
MongoDB offers security features such as authentication, role-based access control, TLS/SSL encryption of network traffic, and auditing (available in MongoDB Enterprise and Atlas). These features help protect sensitive data and support compliance with security standards.
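
On the client side, authentication and TLS are usually configured through the connection string. The sketch below builds such a URI. The `build_uri` helper, the credentials, and the choice of `admin` as the authentication database are assumptions for illustration. Credentials are percent-encoded because characters such as `@` or `/` would otherwise break the URI.

```python
from urllib.parse import quote_plus


def build_uri(username, password, host="localhost", port=27017):
    """Return a MongoDB URI with escaped credentials and TLS enabled."""
    user = quote_plus(username)
    pwd = quote_plus(password)
    return f"mongodb://{user}:{pwd}@{host}:{port}/?authSource=admin&tls=true"


print(build_uri("app_user", "p@ss/word"))
```

The resulting string can be passed to `pymongo.MongoClient()`. In real code the password would come from an environment variable or a secrets store rather than a literal.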

Rich Ecosystem and Community:
MongoDB has a vibrant and active community, which contributes to its ecosystem. It provides drivers and libraries for various programming languages, along with numerous third-party tools and frameworks that integrate well with MongoDB.

These features make MongoDB a powerful and flexible NoSQL database for a wide range of use cases, including web applications, content management systems, real-time analytics, IoT applications, and more.

**Q3. Write code to connect MongoDB to Python. Also, create a database and a collection in MongoDB.**

Solution:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Get a handle to the database (MongoDB creates it on the first write)
database = client["your_database_name"]

# Get a handle to the collection (also created on the first write)
collection = database["your_collection_name"]

# Insert a document
document = {"name": "John Doe", "age": 30}
collection.insert_one(document)

# Close the connection
client.close()


To connect MongoDB to Python, you can use the pymongo library, which provides a Python interface for working with MongoDB.

Make sure you have MongoDB installed and running on your local machine before executing the code.

Explanation of the code:

The pymongo module is imported to interact with MongoDB.

pymongo.MongoClient() creates a client for the MongoDB server running on the default host and port (localhost:27017). PyMongo connects lazily, so this call succeeds even when no server is reachable; connection errors surface on the first operation that needs the server. If you're using a different host or port, adjust the connection URL accordingly.

The client object represents the MongoDB connection.

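To create a database, you access it using client["your_database_name"]. This only returns a handle: MongoDB creates databases lazily, so the database does not exist on the server until data is first written to it. Replace "your_database_name" with the desired name for your database.

To create a collection within the database, you access it using database["your_collection_name"]. The collection is likewise created on the first write. Replace "your_collection_name" with the desired name for your collection.

You can insert a document into the collection using the insert_one() method. In the example, a document with name and age fields is inserted, and this first insert is what actually creates the database and the collection.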

Finally, the client.close() method is used to close the connection to the MongoDB server.

Remember to replace "your_database_name" and "your_collection_name" with the actual names you want to use for your database and collection, respectively.
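
Since accessing a database or collection by name does not create it on the server, it can be useful to check whether both actually exist. The following is a minimal sketch; `is_materialized` is a hypothetical helper name, while `list_database_names()` and `list_collection_names()` are PyMongo methods.

```python
def is_materialized(client, db_name, coll_name):
    """Return True once the database and collection exist on the server."""
    if db_name not in client.list_database_names():
        return False
    return coll_name in client[db_name].list_collection_names()
```

Called before the first insert this would be expected to return False, and True afterwards. With a live server, a pattern such as `with pymongo.MongoClient(uri) as client:` closes the connection automatically, and `client.admin.command("ping")` is a common way to confirm that the server is reachable.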



**Q4. Using the database and the collection created in question number 3, write code to insert one record
and to insert many records. Use the find() and find_one() methods to print the inserted records.**

Solution:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Access the database and collection
database = client["your_database_name"]
collection = database["your_collection_name"]

# Insert one record
document_one = {"name": "Alice", "age": 25}
inserted_one = collection.insert_one(document_one)
print("Inserted record ID:", inserted_one.inserted_id)

# Insert many records
documents_many = [
    {"name": "Bob", "age": 30},
    {"name": "Charlie", "age": 35},
    {"name": "David", "age": 40}
]
inserted_many = collection.insert_many(documents_many)
print("Inserted record IDs:", inserted_many.inserted_ids)

# Find and print the inserted record
found_one = collection.find_one({"name": "Alice"})
print("Found record:", found_one)

# Find and print every record in the collection (not only those just inserted)
found_many = collection.find()
print("Found records:")
for record in found_many:
    print(record)

# Close the connection
client.close()


In the above code:

After establishing the connection and accessing the desired database and collection, we proceed with inserting records.

The insert_one() method is used to insert a single record, and the inserted record's ID is printed.

The insert_many() method is used to insert multiple records, and the IDs of the inserted records are printed.

The find_one() method is used to retrieve and print the inserted record with the name "Alice".

The find() method is used to retrieve all records from the collection, and each record is printed in a loop.

Finally, the connection is closed using client.close().

By executing this code, you will insert one record and multiple records into the collection, and then retrieve and print the inserted record(s) using the find() and find_one() methods.
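
One practical caveat: every run of the code above inserts the same records again, so the collection accumulates duplicates. A simple guard, sketched below under the assumption that an empty collection means it has not been seeded yet, is to insert only when the collection has no documents. `seed_if_empty` and `PEOPLE` are hypothetical names.

```python
PEOPLE = [
    {"name": "Bob", "age": 30},
    {"name": "Charlie", "age": 35},
    {"name": "David", "age": 40},
]


def seed_if_empty(collection, documents):
    """Insert documents only when the collection has none; return count inserted."""
    if collection.count_documents({}) > 0:
        return 0  # already seeded, nothing to do
    result = collection.insert_many(documents)
    return len(result.inserted_ids)
```

The check and the insert are two separate operations, so this is not safe against concurrent writers; for a single-user notebook it is usually enough.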






**Q5. Explain how you can use the find() method to query the MongoDB database. Write simple code to
demonstrate this.**

Solution:

In MongoDB, the find() method is used to query the database and retrieve documents that match specified criteria. It allows you to specify filtering conditions, projection of fields, sorting, and more. Here's a simple code example to demonstrate the usage of the find() method:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Access the database and collection
database = client["your_database_name"]
collection = database["your_collection_name"]

# Perform a find query
query = {"age": {"$gt": 25}}  # Retrieve documents where age is greater than 25
results = collection.find(query)

# Print the retrieved documents
for document in results:
    print(document)

# Close the connection
client.close()


In the code example:

After establishing the connection and accessing the desired database and collection, we define a query using a dictionary. In this case, we want to retrieve documents where the "age" field is greater than 25.

The find() method is called on the collection, passing the query as an argument. It returns a cursor object; PyMongo sends the query to the server only when the cursor is first iterated, and the cursor then yields the matching documents.

We iterate over the cursor using a for loop to print each retrieved document.

Finally, the connection is closed using client.close().

By executing this code, you will perform a find query in MongoDB, retrieve and print the documents that match the specified criteria (age greater than 25 in this case). You can customize the query by modifying the filter conditions to suit your specific requirements.
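
Beyond a single condition, find() also accepts a projection, and the returned cursor can be limited. The sketch below combines a range filter, a projection that returns only the name field, and a limit. The `names_in_age_range` helper is an illustrative assumption.

```python
def names_in_age_range(collection, min_age, max_age, limit=10):
    """Return names of people with min_age < age <= max_age."""
    query = {"age": {"$gt": min_age, "$lte": max_age}}  # both bounds on one field
    projection = {"_id": 0, "name": 1}                  # return only the name field
    cursor = collection.find(query, projection).limit(limit)
    return [doc.get("name") for doc in cursor]
```

With the records from the previous question, `names_in_age_range(collection, 25, 35)` would be expected to return Bob and Charlie, since Alice's age of 25 is excluded by `$gt`.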

**Q6. Explain the sort() method. Give an example to demonstrate sorting in MongoDB.**

Solution:

In MongoDB, the sort() method is used to specify the sorting order for the retrieved documents in a query result. It allows you to sort the documents based on one or more fields in ascending or descending order. Here's an explanation of the sort() method and an example to demonstrate sorting in MongoDB:

Syntax of the sort() method:

In [None]:
collection.find(query).sort(sort_key, sort_direction)


query: The query that defines the filtering conditions for the documents.

sort_key: The field based on which the sorting should be performed.

sort_direction: The sorting order. Use pymongo.ASCENDING (the value 1) for ascending order and pymongo.DESCENDING (the value -1) for descending order. If omitted, ascending order is assumed.

Example:

In [None]:
import pymongo

# Establish connection
client = pymongo.MongoClient("mongodb://localhost:27017")

# Access the database and collection
database = client["your_database_name"]
collection = database["your_collection_name"]

# Perform a find query with sorting
query = {"age": {"$gt": 25}}  # Retrieve documents where age is greater than 25
sort_key = "age"  # Sort based on the "age" field
sort_direction = pymongo.ASCENDING  # Sort in ascending order
results = collection.find(query).sort(sort_key, sort_direction)

# Print the retrieved documents
for document in results:
    print(document)

# Close the connection
client.close()


In the example code:

After establishing the connection and accessing the desired database and collection, we define a query to retrieve documents where the "age" field is greater than 25.

We specify the field "age" as the sorting key.

The pymongo.ASCENDING constant is used to specify the sorting order as ascending.

The sort() method is called on the cursor returned by find(), passing the sort key and direction as arguments.

We iterate over the cursor and print each retrieved document, which will be sorted in ascending order based on the "age" field.

By executing this code, you will perform a find query in MongoDB, retrieve and print the documents that match the specified criteria, sorted in the specified order. You can modify the sorting key and direction to suit your specific requirements.
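
To sort on more than one field, sort() also accepts a list of (key, direction) pairs, applied in order. The sketch below sorts by age descending and breaks ties by name ascending. `oldest_first` is a hypothetical helper, and the constants are spelled out so the snippet reads without importing pymongo.

```python
ASCENDING, DESCENDING = 1, -1  # same values as pymongo.ASCENDING / pymongo.DESCENDING


def oldest_first(collection, query=None):
    """Return matching documents sorted by age (descending), then name (ascending)."""
    sort_spec = [("age", DESCENDING), ("name", ASCENDING)]
    return list(collection.find(query or {}).sort(sort_spec))
```

The order of the pairs matters: the first key is the primary sort, and later keys only decide the order among documents that tie on the earlier ones.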

**Q7. Explain why delete_one(), delete_many(), and drop() are used.**

Solution:

In MongoDB, the delete_one(), delete_many(), and drop() methods are used for deleting data from a collection or dropping an entire collection. 

Here's an explanation of when and how each method is used:

**delete_one()**:

Use delete_one() when you want to delete a single document that matches a specific condition.
It takes a filter as a parameter to identify the document(s) to delete. If multiple documents match the filter, only the first one encountered will be deleted.
Example usage: Deleting a specific user from a user collection based on their unique identifier.

**delete_many()**:

Use delete_many() when you want to delete multiple documents that match a specific condition.
It takes a filter as a parameter to identify the documents to delete. All documents that match the filter will be deleted.
Example usage: Deleting all documents from a log collection that have a timestamp older than a certain date.

**drop()**:

Use drop() when you want to completely remove a collection from the database.
It requires no arguments and drops the entire collection, including all its documents and indexes.
Exercise caution when using this method, as it irreversibly deletes all data in the collection.
Example usage: Dropping a temporary collection that is no longer needed.
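
The three example usages above can be sketched as small functions. The helper names and the `timestamp` field are assumptions for illustration; `delete_one()`, `delete_many()`, the `deleted_count` attribute of their results, and `drop()` are PyMongo APIs.

```python
def delete_user(collection, user_id):
    """Delete the single document whose _id matches; return 0 or 1."""
    return collection.delete_one({"_id": user_id}).deleted_count


def purge_logs_before(collection, cutoff):
    """Delete every document with a timestamp older than cutoff; return the count."""
    return collection.delete_many({"timestamp": {"$lt": cutoff}}).deleted_count


def drop_temporary(collection):
    """Remove the whole collection, including its documents and indexes."""
    collection.drop()
```

Checking `deleted_count` after a delete is a cheap way to confirm that the filter matched what you expected. Note that calling `delete_many({})` with an empty filter removes every document but keeps the collection and its indexes, unlike `drop()`.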


These methods provide different levels of granularity and flexibility in deleting data from MongoDB:

delete_one() is used when you need to delete a specific document or the first matching document.

delete_many() is used when you need to delete multiple documents that meet certain criteria.

drop() is used when you want to delete an entire collection, including all its documents.

It's important to note that deleting data from a database should be done with caution, as it permanently removes the data. Always double-check your delete operations and ensure that you have appropriate backups or safeguards in place to prevent accidental data loss.


# --------------------------------------------------END-------------------------------------------------------