### Introduction to Pandas

Welcome to the module on Pandas, a data manipulation and analysis library for Python. This module introduces the basics of Pandas and shows how to use it to handle and analyze data efficiently.

#### What is Pandas?

Pandas is an open-source data analysis and manipulation library for Python, built on top of NumPy. It provides the data structures and functions needed to work with structured (tabular) data. The name "Pandas" is derived from "panel data," an econometrics term for datasets that record observations of the same subjects across multiple time periods.

#### Why Use Pandas?

Pandas is highly favored for its ability to:
- Handle large in-memory datasets efficiently through vectorized operations.
- Perform data cleaning and preprocessing.
- Provide powerful data aggregation and transformation tools.
- Integrate seamlessly with other Python libraries, such as NumPy, Matplotlib, and Scikit-Learn.
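
As a rough illustration of the points above, the sketch below cleans a small table with a missing value, aggregates it, and hands a column to NumPy. The data and the column names (`city`, `temp_c`) are invented for this example.

```python
import numpy as np
import pandas as pd

# Hypothetical data: temperature readings with one missing value
readings = pd.DataFrame({
    "city": ["Vigo", "Vigo", "Lugo", "Lugo"],
    "temp_c": [18.0, None, 12.0, 14.0],
})

# Cleaning: drop rows where the reading is missing
clean = readings.dropna(subset=["temp_c"])

# Aggregation: mean temperature per city
mean_by_city = clean.groupby("city")["temp_c"].mean()
print(mean_by_city)

# NumPy integration: a column converts directly to a NumPy array
temps = clean["temp_c"].to_numpy()
print(np.round(temps.mean(), 2))
```

Dropping incomplete rows is only one possible cleaning strategy; filling missing values (for example with `fillna`) may suit other datasets better.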

#### Installing Pandas

To get started with Pandas, you need to have it installed on your system. You can install it using pip, Python’s package installer, with the following command. The leading `!` runs it as a shell command from a notebook cell; in a terminal, omit the `!`.

In [None]:
!pip install pandas



Once installed, you can import it into your Python environment:

In [None]:
import pandas as pd
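
To confirm that the import worked, one quick check is to print the installed version. The version string depends on your environment, so no output is shown here.

```python
import pandas as pd

# __version__ holds the installed pandas version as a string
version = pd.__version__
print(version)
```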

#### Exploring Pandas Data Structures

Pandas primarily uses two data structures: Series and DataFrame. Let's take a quick look at each.

**Series:** A one-dimensional labeled array capable of holding any data type. It is similar to a column in an Excel spreadsheet.

In [None]:
# Creating a Series from a list; pandas assigns a default integer index (0, 1, 2, ...)
series = pd.Series([1, 2, 3, 4, 5])
print(series)

0    1
1    2
2    3
3    4
4    5
dtype: int64
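
The "labeled" part of the definition is easier to see with a custom index. The following is a minimal sketch, assuming a made-up Series of ages keyed by name; the variable name `ages` is chosen only for illustration.

```python
import pandas as pd

# Hypothetical labeled Series: ages keyed by name instead of by position
ages = pd.Series([23, 24, 22], index=["Ana", "Brais", "Carlos"])

print(ages["Brais"])   # label-based lookup
print(ages.iloc[0])    # position-based lookup
print(ages.mean())     # vectorized statistic over all values
```

Label-based access with `[]` and position-based access with `.iloc` are kept separate here to avoid ambiguity when the labels themselves are integers.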


**DataFrame:** A two-dimensional labeled data structure with columns of potentially different types. It can be thought of as a table, similar to an Excel spreadsheet or a SQL table.

In [None]:
# Creating a DataFrame
data = {
    'Name': ['Ana', 'Brais', 'Carlos', 'Diana'],
    'Age': [23, 24, 22, 25],
    'City': ['Santiago', 'Vigo', 'Ourense', 'Lugo']
}
dataframe = pd.DataFrame(data)
print(dataframe)

     Name  Age      City
0     Ana   23  Santiago
1   Brais   24      Vigo
2  Carlos   22   Ourense
3   Diana   25      Lugo
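
A few common ways to inspect a DataFrame like this one are sketched below. The block rebuilds the same data so that it runs on its own; the names `people` and `older` are chosen only for illustration.

```python
import pandas as pd

# Same data as the DataFrame example above
people = pd.DataFrame({
    "Name": ["Ana", "Brais", "Carlos", "Diana"],
    "Age": [23, 24, 22, 25],
    "City": ["Santiago", "Vigo", "Ourense", "Lugo"],
})

print(people.shape)    # (rows, columns)
print(people.dtypes)   # data type of each column

# Selecting one column returns a Series
ages = people["Age"]

# A boolean mask keeps only the rows where the condition holds
older = people[people["Age"] > 23]
print(older["Name"].tolist())
```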


#### Module Outline

Throughout this module, we will cover the following key areas:

1. **Basic Operations with Pandas:**
   - Reading and writing data to files (CSV, Excel).
   - Selecting and indexing data.
   - Filtering and modifying data.

2. **Data Analysis and Manipulation:**
   - Grouping and aggregating data.
   - Merging and joining DataFrames.
   - Performing basic statistical operations.

3. **Data Visualization:**
   - Creating basic plots using Pandas.
   - Visualizing data trends and distributions.
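
As a small preview of the first two areas, the sketch below reads CSV text, groups and aggregates it, merges it with a lookup table, and writes the result back out as CSV. All data and column names are invented for this example, and `io.StringIO` stands in for a real file. Plotting is left out because it additionally requires Matplotlib.

```python
import io
import pandas as pd

# Hypothetical CSV content; io.StringIO stands in for a file on disk
csv_text = "name,city,score\nAna,Santiago,8\nBrais,Vigo,6\nCarlos,Vigo,9\n"
scores = pd.read_csv(io.StringIO(csv_text))

# Grouping and aggregating: mean score per city
mean_scores = scores.groupby("city")["score"].mean()
print(mean_scores)

# Merging: attach a province to each row via a lookup table
provinces = pd.DataFrame({
    "city": ["Santiago", "Vigo"],
    "province": ["A Coruña", "Pontevedra"],
})
merged = scores.merge(provinces, on="city", how="left")

# Writing: to_csv returns the CSV as a string when no path is given
print(merged.to_csv(index=False))
```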

By the end of this module, you will have a solid understanding of how to use Pandas to manage and analyze data effectively.

Let's dive in and start exploring the capabilities of Pandas!