Q1: Five Functions of the Pandas Library with Examples
Here are five useful functions from the pandas library along with example code:

read_csv(): Reads a CSV file into a DataFrame.
head(): Returns the first n rows of a DataFrame (5 by default).
describe(): Computes summary statistics (count, mean, standard deviation, min, quartiles, max) for numeric columns.
groupby(): Groups a DataFrame using a mapper or by one or more columns.
merge(): Merges DataFrame objects by performing a database-style join.
Let's demonstrate these functions with code:

import pandas as pd

# Sample data
data = {'A': [1, 2, 3, 4, 5], 'B': [5, 4, 3, 2, 1], 'C': [2, 3, 4, 5, 6]}
df = pd.DataFrame(data)

# 1. head()
print("Head of DataFrame:\n", df.head())

# 2. describe()
print("Description of DataFrame:\n", df.describe())

# 3. groupby()
# Every value in 'A' is unique here, so each group holds exactly one row
grouped = df.groupby('A').sum()
print("Grouped DataFrame:\n", grouped)

# 4. merge()
# An outer join keeps every key from both frames; cells with no match become NaN
data2 = {'A': [1, 2, 6], 'D': [7, 8, 9]}
df2 = pd.DataFrame(data2)
merged_df = pd.merge(df, df2, on='A', how='outer')
print("Merged DataFrame:\n", merged_df)

# 5. read_csv() - Example (commented out as there is no actual file to read here)
# df_from_csv = pd.read_csv('file.csv')
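The read_csv() call above is commented out because there is no file to read. As a minimal sketch, assuming a small in-memory CSV (the column names and values below are made up for illustration), the same call can be run end to end by passing an io.StringIO buffer in place of a file path:

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for 'file.csv'
csv_text = "A,B,C\n1,5,2\n2,4,3\n3,3,4\n"

# read_csv() accepts a file-like object as well as a path on disk
df_from_csv = pd.read_csv(io.StringIO(csv_text))
print(df_from_csv.head())
```

With a real file, the buffer would simply be replaced by the file's path.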
Q2: Re-indexing DataFrame with a New Index
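def reindex_dataframe(df):
    """Return a copy of df whose index is 1, 3, 5, ... (start at 1, step 2)."""
    df = df.copy()  # avoid mutating the caller's DataFrame
    df.index = range(1, 2 * len(df) + 1, 2)
    return df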

# Sample DataFrame
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6], 'C': [7, 8, 9]})
new_df = reindex_dataframe(df)
print(new_df)
Q3: Calculate the Sum of the First Three Values in a Column
def sum_first_three(df):
    # .iloc[:3] selects the first three rows by position, whatever the index labels are
    total = df['Values'].iloc[:3].sum()
    print("Sum of the first three values:", total)

# Sample DataFrame
df = pd.DataFrame({'Values': [10, 20, 30, 40, 50]})
sum_first_three(df)
Q4: Create a New Column 'Word_Count'
def add_word_count(df):
    df['Word_Count'] = df['Text'].apply(lambda x: len(str(x).split()))
    return df

# Sample DataFrame
df = pd.DataFrame({'Text': ['Hello world', 'Pandas is great', 'I love programming']})
df = add_word_count(df)
print(df)
Q5: Difference between DataFrame.size and DataFrame.shape
DataFrame.size: Returns the total number of elements in the DataFrame (rows * columns).
DataFrame.shape: Returns a tuple representing the dimensionality of the DataFrame (number of rows, number of columns).
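To make the difference concrete, here is a short sketch using a hypothetical 3-row, 2-column DataFrame (the data is invented for illustration):

```python
import pandas as pd

# Hypothetical DataFrame with 3 rows and 2 columns
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})

rows, cols = df.shape   # shape is a (rows, columns) tuple
print(df.shape)         # (3, 2)
print(df.size)          # 6, i.e. rows * cols
```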
Q6: Function to Read an Excel File in Pandas
To read an Excel file, use the pd.read_excel() function. Reading .xlsx files requires an Excel engine such as openpyxl to be installed.

df = pd.read_excel('file.xlsx')
Q7: Extract Usernames from Email Addresses
def extract_usernames(df):
    # Vectorized split on '@'; missing emails stay NaN instead of raising an error
    df['Username'] = df['Email'].str.split('@').str[0]
    return df

# Sample DataFrame
df = pd.DataFrame({'Email': ['john.doe@example.com', 'jane.doe@example.com']})
df = extract_usernames(df)
print(df)
Q8: Select Rows Based on Conditions
def filter_dataframe(df):
    filtered_df = df[(df['A'] > 5) & (df['B'] < 10)]
    return filtered_df

# Sample DataFrame
df = pd.DataFrame({'A': [0, 1, 2, 3, 4, 5, 18, 26, 32, 49], 'B': [3, 5, 1, 7, 9, 4, 2, 9, 3, 12], 'C': [5, 1, 4, 3, 2, 1, 7, 4, 3, 5]})
filtered_df = filter_dataframe(df)
print(filtered_df)
Q9: Calculate Mean, Median, and Standard Deviation
def calculate_statistics(df):
    mean_value = df['Values'].mean()
    median_value = df['Values'].median()
    std_value = df['Values'].std()  # sample standard deviation (ddof=1 by default)
    print(f"Mean: {mean_value}, Median: {median_value}, Standard Deviation: {std_value}")

# Sample DataFrame
df = pd.DataFrame({'Values': [10, 20, 30, 40, 50]})
calculate_statistics(df)
Q10: Create a 'MovingAverage' Column for the Past 7 Days
def add_moving_average(df):
    # Mean of the current row and the 6 rows before it; the first 6 rows are NaN
    df['MovingAverage'] = df['Sales'].rolling(window=7).mean()
    return df

# Sample DataFrame
df = pd.DataFrame({'Date': pd.date_range(start='2023-01-01', periods=10), 'Sales': [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]})
df = add_moving_average(df)
print(df)
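Note that window=7 counts rows, not calendar days, so the two only agree when there is exactly one row per day. If dates can be missing, a time-based window may be closer to "the past 7 days". The following is a sketch under that assumption; the function name add_moving_average_by_days is hypothetical, and it assumes 'Date' already has a datetime dtype:

```python
import pandas as pd

def add_moving_average_by_days(df):
    # Hypothetical variant: average over the 7 calendar days ending at each row
    out = df.sort_values('Date').copy()
    rolled = out.set_index('Date')['Sales'].rolling('7D').mean()
    # Row order is unchanged, so assign by position rather than by index label
    out['MovingAverage'] = rolled.to_numpy()
    return out

df = pd.DataFrame({
    'Date': pd.date_range(start='2023-01-01', periods=10),
    'Sales': [10, 20, 30, 40, 50, 60, 70, 80, 90, 100],
})
result = add_moving_average_by_days(df)
print(result)
```

Unlike the integer window, an offset window such as '7D' defaults to min_periods=1, so the first rows hold partial averages rather than NaN.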
Q11: Add 'Weekday' Column Based on 'Date' Column
def add_weekday(df):
    df['Weekday'] = df['Date'].dt.day_name()
    return df

# Sample DataFrame
df = pd.DataFrame({'Date': pd.date_range(start='2023-01-01', periods=5)})
df = add_weekday(df)
print(df)
Q12: Select Rows Based on Date Range
def filter_by_date(df):
    df['Date'] = pd.to_datetime(df['Date'])
    # Keep rows dated within January 2023; both endpoints are inclusive
    filtered_df = df[(df['Date'] >= '2023-01-01') & (df['Date'] <= '2023-01-31')]
    return filtered_df

# Sample DataFrame
df = pd.DataFrame({'Date': pd.date_range(start='2023-01-01', periods=40)})
filtered_df = filter_by_date(df)
print(filtered_df)
Q13: Essential Library for Using Pandas
To use any pandas function, you must first import the pandas library itself, conventionally under the alias pd.

import pandas as pd





