#Answer 1:

Five commonly used functions in the pandas library:

pd.read_csv(): Reads a CSV file into a DataFrame.

DataFrame.head(): Returns the first n rows of a DataFrame (5 by default).

DataFrame.describe(): Generates descriptive statistics (count, mean, std, min, quartiles, max), for numeric columns by default.

DataFrame.groupby(): Groups rows by one or more columns (or a mapper) so that an aggregation can be applied per group.

pd.merge(): Joins two DataFrames (or named Series) on shared columns or indexes, database-style.
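
A minimal sketch of the five functions in use. The CSV text and the column names (`city`, `sales`, `region`) are made up for illustration, and `io.StringIO` stands in for a real file on disk.

```python
import io
import pandas as pd

# Hypothetical CSV content; StringIO lets read_csv treat the string like a file
csv_text = "city,sales\nParis,10\nRome,20\nParis,30\nRome,40\nOslo,50\nOslo,60\n"

df = pd.read_csv(io.StringIO(csv_text))       # read_csv: parse CSV into a DataFrame
first_rows = df.head(3)                        # head: first 3 rows
stats = df.describe()                          # describe: summary statistics of numeric columns
totals = df.groupby('city')['sales'].sum()     # groupby: total sales per city

# merge: attach a region to each row by joining on the shared 'city' column
regions = pd.DataFrame({'city': ['Paris', 'Rome', 'Oslo'],
                        'region': ['West', 'South', 'North']})
merged = pd.merge(df, regions, on='city')

print(first_rows)
print(stats)
print(totals)
print(merged)
```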


In [3]:
#Answer 2:
import pandas as pd

def reindex_dataframe(df):
    # Assign odd index labels 1, 3, 5, ... (note: this modifies df in place)
    df.index = range(1, 2 * len(df) + 1, 2)
    return df

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6], 'C': [7, 8, 9]})
print("Reindexed DataFrame:\n", reindex_dataframe(df))



Reindexed DataFrame:
    A  B  C
1  1  4  7
3  2  5  8
5  3  6  9


In [4]:
#Answer 3:


def sum_first_three_values(df):
    # .iloc selects by position, so this works whatever the index labels are
    sum_values = df['Values'].iloc[:3].sum()
    print("Sum of first three values:", sum_values)

# Example usage
df = pd.DataFrame({'Values': [10, 20, 30, 40, 50]})
sum_first_three_values(df)


Sum of first three values: 60


In [5]:
#Answer 4:

def add_word_count_column(df):
    df['Word_Count'] = df['Text'].apply(lambda x: len(str(x).split()))
    return df

# Example usage
df = pd.DataFrame({'Text': ['Hello world', 'Pandas is great', 'This is a test']})
print("DataFrame with Word_Count:\n", add_word_count_column(df))



DataFrame with Word_Count:
               Text  Word_Count
0      Hello world           2
1  Pandas is great           3
2   This is a test           4


#Answer 5:

DataFrame.size: Returns the number of elements in the DataFrame (rows * columns).

DataFrame.shape: Returns a tuple representing the dimensionality of the DataFrame (number of rows, number of columns).
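
A short illustration of the difference, using a small made-up DataFrame with 3 rows and 2 columns:

```python
import pandas as pd

# Illustrative data: 3 rows, 2 columns
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})

n_rows, n_cols = df.shape   # shape unpacks into (rows, columns)
print(df.shape)             # (3, 2)
print(df.size)              # 6, i.e. n_rows * n_cols
```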


#Answer 6:
The function to read an Excel file in pandas is pd.read_excel().



In [7]:
#Answer 7:

def add_username_column(df):
    # Vectorized string split: keep the part before '@'
    df['Username'] = df['Email'].str.split('@').str[0]
    return df

# Example usage
df = pd.DataFrame({'Email': ['john.doe@example.com', 'jane.smith@sample.org']})
print("DataFrame with Username:\n", add_username_column(df))



DataFrame with Username:
                    Email    Username
0   john.doe@example.com    john.doe
1  jane.smith@sample.org  jane.smith


In [9]:
#Answer 8:

def select_rows(df):
    new_df = df[(df['A'] > 5) & (df['B'] < 10)]
    return new_df


df = pd.DataFrame({'A': [3, 8, 6, 2, 9], 'B': [5, 2, 9, 3, 1], 'C': [1, 7, 4, 5, 2]})
print("Selected rows:\n", select_rows(df))



Selected rows:
    A  B  C
1  8  2  7
2  6  9  4
4  9  1  2


In [10]:
#Answer 9:

def calculate_statistics(df):
    mean = df['Values'].mean()
    median = df['Values'].median()
    std_dev = df['Values'].std()  # sample standard deviation (ddof=1), the pandas default
    return mean, median, std_dev


df = pd.DataFrame({'Values': [10, 20, 30, 40, 50]})
mean, median, std_dev = calculate_statistics(df)
print(f"Mean: {mean}, Median: {median}, Standard Deviation: {std_dev}")



Mean: 30.0, Median: 30.0, Standard Deviation: 15.811388300841896


In [12]:
#Answer 10:


def add_moving_average(df):
    # 7-row rolling mean; the first 6 rows are NaN because a full window is not yet available
    df['MovingAverage'] = df['Sales'].rolling(window=7).mean()
    return df


df = pd.DataFrame({'Date': pd.date_range(start='2023-01-01', periods=10), 'Sales': [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]})
print("DataFrame with MovingAverage:\n", add_moving_average(df))


DataFrame with MovingAverage:
         Date  Sales  MovingAverage
0 2023-01-01      1            NaN
1 2023-01-02      2            NaN
2 2023-01-03      3            NaN
3 2023-01-04      4            NaN
4 2023-01-05      5            NaN
5 2023-01-06      6            NaN
6 2023-01-07      7            4.0
7 2023-01-08      8            5.0
8 2023-01-09      9            6.0
9 2023-01-10     10            7.0


In [14]:
#Answer 11:


def add_weekday_column(df):
    df['Weekday'] = pd.to_datetime(df['Date']).dt.day_name()
    return df


df = pd.DataFrame({'Date': ['2023-01-01', '2023-01-02', '2023-01-03', '2023-01-04', '2023-01-05']})
print("DataFrame with Weekday:\n", add_weekday_column(df))


DataFrame with Weekday:
          Date    Weekday
0  2023-01-01     Sunday
1  2023-01-02     Monday
2  2023-01-03    Tuesday
3  2023-01-04  Wednesday
4  2023-01-05   Thursday


In [16]:
#Answer 12:


def select_date_range(df):
    mask = (df['Date'] >= '2023-01-01') & (df['Date'] <= '2023-01-31')
    return df.loc[mask]


df = pd.DataFrame({'Date': pd.date_range(start='2023-01-01', periods=40), 'Value': range(40)})
print("Rows with date between 2023-01-01 and 2023-01-31:\n", select_date_range(df))


Rows with date between 2023-01-01 and 2023-01-31:
          Date  Value
0  2023-01-01      0
1  2023-01-02      1
2  2023-01-03      2
3  2023-01-04      3
4  2023-01-05      4
5  2023-01-06      5
6  2023-01-07      6
7  2023-01-08      7
8  2023-01-09      8
9  2023-01-10      9
10 2023-01-11     10
11 2023-01-12     11
12 2023-01-13     12
13 2023-01-14     13
14 2023-01-15     14
15 2023-01-16     15
16 2023-01-17     16
17 2023-01-18     17
18 2023-01-19     18
19 2023-01-20     19
20 2023-01-21     20
21 2023-01-22     21
22 2023-01-23     22
23 2023-01-24     23
24 2023-01-25     24
25 2023-01-26     25
26 2023-01-27     26
27 2023-01-28     27
28 2023-01-29     28
29 2023-01-30     29
30 2023-01-31     30


#Answer 13:

To use the basic functions of pandas, the pandas library itself must be imported first. It is conventionally imported under the alias pd:

    
    import pandas as pd
