Pandas is a popular open-source Python library that provides powerful tools for data manipulation and analysis.
This notebook walks through a Python program that uses Pandas to load, manipulate, and analyze a dataset. In this example, we'll load a CSV file containing information about movies and perform some basic operations:

1. Loading Data
2. Data Exploration
3. Filtering and Selecting Data
4. Grouping and Aggregation

In [1]:
%pip install pandas


Note: you may need to restart the kernel to use updated packages.


Before working with a CSV file named "movies.csv", let's build the movie data in memory.
In the cell below, the data is defined as a list of dictionaries, and then a DataFrame is created from it using the Pandas library. This should run without errors and display the DataFrame containing the movie information.

In [11]:
import pandas as pd

data = [
    {'Title': 'Movie1', 'Genre': 'Action', 'Year': 2000, 'Rating': 7.5},
    {'Title': 'Movie2', 'Genre': 'Comedy', 'Year': 2005, 'Rating': 8.2},
    {'Title': 'Movie3', 'Genre': 'Drama', 'Year': 2010, 'Rating': 6.9},
    {'Title': 'Movie4', 'Genre': 'Action', 'Year': 2002, 'Rating': 7.1},
    {'Title': 'Movie5', 'Genre': 'Comedy', 'Year': 2015, 'Rating': 8.5}
]

df = pd.DataFrame(data)
print(df)


    Title   Genre  Year  Rating
0  Movie1  Action  2000     7.5
1  Movie2  Comedy  2005     8.2
2  Movie3   Drama  2010     6.9
3  Movie4  Action  2002     7.1
4  Movie5  Comedy  2015     8.5
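
Once the DataFrame exists, it can be inspected before anything is written to disk. The following is a minimal sketch, assuming the same five rows as above; the variable names `stats` and `rating_mean` are introduced here for illustration only.

```python
import pandas as pd

# Same five rows as in the cell above
data = [
    {'Title': 'Movie1', 'Genre': 'Action', 'Year': 2000, 'Rating': 7.5},
    {'Title': 'Movie2', 'Genre': 'Comedy', 'Year': 2005, 'Rating': 8.2},
    {'Title': 'Movie3', 'Genre': 'Drama', 'Year': 2010, 'Rating': 6.9},
    {'Title': 'Movie4', 'Genre': 'Action', 'Year': 2002, 'Rating': 7.1},
    {'Title': 'Movie5', 'Genre': 'Comedy', 'Year': 2015, 'Rating': 8.5},
]
df = pd.DataFrame(data)

# Column types inferred by Pandas from the dictionaries
print(df.dtypes)

# Summary statistics for the numeric columns (Year and Rating)
stats = df.describe()
print(stats)

# Mean of a single column
rating_mean = df['Rating'].mean()
print("Average rating:", rating_mean)
```

`describe()` summarizes only the numeric columns by default, so `Title` and `Genre` do not appear in `stats`.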


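Before writing the CSV file, check the current working directory and the files it contains, since a relative path such as 'movies.csv' is resolved against it: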

In [7]:
import os
print("Current Working Directory:", os.getcwd())


Current Working Directory: /workspaces/Pandas


In [8]:
import os

files_in_directory = os.listdir()
print(files_in_directory)


['.git', 'README.md', 'Pandas.ipynb']


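The directory listing does not include movies.csv yet, so the next cell creates it from a small sample DataFrame:

In [12]: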
import pandas as pd

# Sample DataFrame
data = {'Title': ['Movie1', 'Movie2', 'Movie3'],
        'Genre': ['Action', 'Comedy', 'Drama'],
        'Year': [2000, 2005, 2010],
        'Rating': [7.5, 8.2, 6.9]}

df = pd.DataFrame(data)

# Specify the filename and write the DataFrame to CSV
df.to_csv('movies.csv', index=False)  # index=False prevents writing the row index as a column
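
To see what `index=False` changes without touching the file system, the CSV text can be produced in memory. This is a rough sketch, assuming the same three-row sample; it relies on `to_csv` returning a string when no path is given, and the names `csv_text`, `csv_with_index`, and `restored` are illustrative.

```python
import io

import pandas as pd

df = pd.DataFrame({
    'Title': ['Movie1', 'Movie2', 'Movie3'],
    'Genre': ['Action', 'Comedy', 'Drama'],
    'Year': [2000, 2005, 2010],
    'Rating': [7.5, 8.2, 6.9],
})

# With no path argument, to_csv returns the CSV content as a string
csv_text = df.to_csv(index=False)
csv_with_index = df.to_csv()  # default index=True adds an unnamed first column

print(csv_text.splitlines()[0])        # header without the index column
print(csv_with_index.splitlines()[0])  # header starts with an empty field

# Read the text back to confirm the columns survive the round trip
restored = pd.read_csv(io.StringIO(csv_text))
print(restored)
```

Writing the index and then reading the file back would produce an extra column (typically named `Unnamed: 0`), which is why the cell above passes `index=False`.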


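With movies.csv written, the next cell loads it and walks through the four steps listed at the top:

In [15]: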
import pandas as pd

# Load the CSV file into a DataFrame
df = pd.read_csv('movies.csv')

# Display the first few rows of the DataFrame
print("Data loaded:")
print(df.head())
print()

# Basic data exploration
print("Basic Stats:")
print("Number of rows:", df.shape[0])
print("Number of columns:", df.shape[1])
print("Column names:", df.columns)
print()

# Filtering and selecting data
action_movies = df[df['Genre'] == 'Action']
print("Action Movies:")
print(action_movies)
print()

# Grouping and aggregation
genre_grouped = df.groupby('Genre').agg({'Rating': 'mean', 'Year': 'max'})
print("Genre-wise Aggregation:")
print(genre_grouped)


Data loaded:
    Title   Genre  Year  Rating
0  Movie1  Action  2000     7.5
1  Movie2  Comedy  2005     8.2
2  Movie3   Drama  2010     6.9

Basic Stats:
Number of rows: 3
Number of columns: 4
Column names: Index(['Title', 'Genre', 'Year', 'Rating'], dtype='object')

Action Movies:
    Title   Genre  Year  Rating
0  Movie1  Action  2000     7.5

Genre-wise Aggregation:
        Rating  Year
Genre               
Action     7.5  2000
Comedy     8.2  2005
Drama      6.9  2010
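
With only one movie per genre, the aggregation above simply echoes each row. As a hedged sketch, the same filtering and grouping steps are repeated below on the five-row dataset defined earlier in the notebook, where two genres have two movies each. The output column names (`mean_rating`, `latest_year`, `movie_count`) and the thresholds are chosen here for illustration.

```python
import pandas as pd

df = pd.DataFrame([
    {'Title': 'Movie1', 'Genre': 'Action', 'Year': 2000, 'Rating': 7.5},
    {'Title': 'Movie2', 'Genre': 'Comedy', 'Year': 2005, 'Rating': 8.2},
    {'Title': 'Movie3', 'Genre': 'Drama', 'Year': 2010, 'Rating': 6.9},
    {'Title': 'Movie4', 'Genre': 'Action', 'Year': 2002, 'Rating': 7.1},
    {'Title': 'Movie5', 'Genre': 'Comedy', 'Year': 2015, 'Rating': 8.5},
])

# Combine conditions with & and wrap each one in parentheses
recent_high = df[(df['Year'] >= 2005) & (df['Rating'] > 8.0)]
print(recent_high)

# Select rows by membership and keep only two columns
selected = df.loc[df['Genre'].isin(['Action', 'Drama']), ['Title', 'Rating']]
print(selected)

# Named aggregation: new_column=(source_column, function)
summary = df.groupby('Genre').agg(
    mean_rating=('Rating', 'mean'),
    latest_year=('Year', 'max'),
    movie_count=('Title', 'count'),
)
print(summary)
```

Named aggregation keeps the result columns self-describing, which helps once more than one statistic is computed per column.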
