In [1]:
import warnings

warnings.simplefilter(action="ignore", category=FutureWarning)

![](../img/change_values.png)

## Change Values

### pandas.DataFrame.pipe: Increase the Readability of your Code when Applying Multiple Functions to a DataFrame

In [None]:
!pip install textblob

To keep your code readable when applying several functions to a DataFrame in sequence, use the `pandas.DataFrame.pipe` method. Each call passes the DataFrame to the next function, so the steps read top to bottom.

In [1]:
from textblob import TextBlob
import pandas as pd 

def remove_white_space(df: pd.DataFrame):
    df['text'] = df['text'].apply(lambda row: row.strip())
    return df

def get_sentiment(df: pd.DataFrame):
    df['sentiment'] = df['text'].apply(lambda row:
                                    TextBlob(row).sentiment[0])
    return df

df = pd.DataFrame({'text': ["It is a beautiful day today  ",
                        "  This movie is terrible"]})

df = (df.pipe(remove_white_space)
    .pipe(get_sentiment)
)

df

Unnamed: 0,text,sentiment
0,It is a beautiful day today,0.85
1,This movie is terrible,-1.0
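
`pipe` also forwards extra positional and keyword arguments to the function it calls. The sketch below illustrates this without the TextBlob dependency; `strip_text`, `add_length`, and the `n_chars` column are hypothetical names chosen for this example, and both helpers return a new DataFrame instead of modifying their input.

```python
import pandas as pd


def strip_text(df: pd.DataFrame, column: str) -> pd.DataFrame:
    # Work on a copy so the caller's DataFrame is left untouched
    out = df.copy()
    out[column] = out[column].str.strip()
    return out


def add_length(df: pd.DataFrame, column: str, new_column: str) -> pd.DataFrame:
    # assign returns a new DataFrame with the extra column
    return df.assign(**{new_column: df[column].str.len()})


raw = pd.DataFrame(
    {"text": ["It is a beautiful day today  ", "  This movie is terrible"]}
)

# Arguments after the function are passed through to it
piped = raw.pipe(strip_text, "text").pipe(add_length, "text", new_column="n_chars")
print(piped)
```

Because neither helper mutates its argument, `raw` still contains the untrimmed strings after the chain runs.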


### Apply a Function to a Column of a DataFrame

If you want to apply only one function to a column of a DataFrame, use `apply`.

In [3]:
import pandas as pd 

df = pd.DataFrame({"col1": [1, 2], "col2": [3, 4]})
df 

Unnamed: 0,col1,col2
0,1,3
1,2,4



In [4]:
df["col1"] = df["col1"].apply(lambda row: row * 2)
df

Unnamed: 0,col1,col2
0,2,3
1,4,4
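
As a small sketch of two related options: `Series.apply` can pass extra positional arguments through `args`, and for simple arithmetic a vectorized expression gives the same result without calling a Python function per element. The `scale` helper and the variable names here are made up for illustration.

```python
import pandas as pd


def scale(value, factor):
    # Called once per element of the Series
    return value * factor


df_apply = pd.DataFrame({"col1": [1, 2], "col2": [3, 4]})

# Extra positional arguments go in the args tuple
applied = df_apply["col1"].apply(scale, args=(2,))

# Vectorized equivalent for this particular operation
vectorized = df_apply["col1"] * 2

print(applied.tolist(), vectorized.tolist())
```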



### Apply a Function to a DataFrame Elementwise

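The `apply` method applies a function along an axis of a DataFrame, so the function receives a whole row or column at a time. If you want to apply a function to each individual value instead, use `applymap`. In pandas 2.1 and later, `applymap` is deprecated in favor of the equivalent `DataFrame.map`.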

In [8]:
import pandas as pd 

df = pd.DataFrame({"col1": [2, 9], "col2": [3, 7]})
print(df )

   col1  col2
0     2     3
1     9     7


In [9]:
print(df.applymap(lambda val: 'failed' if val < 5 else 'passed'))

     col1    col2
0  failed  failed
1  passed  passed
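
The sketch below runs the same elementwise check in a way that should work across pandas versions: it uses `DataFrame.map` when the method exists (pandas 2.1 and later) and falls back to `applymap` otherwise. The `grade` function and the `scores` and `graded` names are hypothetical.

```python
import pandas as pd


def grade(val):
    # Applied to each cell individually
    return "failed" if val < 5 else "passed"


scores = pd.DataFrame({"col1": [2, 9], "col2": [3, 7]})

# Prefer DataFrame.map when available; older pandas only has applymap
elementwise = scores.map if hasattr(scores, "map") else scores.applymap
graded = elementwise(grade)
print(graded)
```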


### Assign Values to Multiple New Columns

If you want to assign values to multiple new columns, you can do it in one chained expression with `df.assign` instead of assigning each column separately.

In the code below, I first create `col3`, then use `col3` to create `col4`, all in a single chained expression.

In [3]:
import pandas as pd 

df = pd.DataFrame({"col1": [1, 2], "col2": [3, 4]})

df = df.assign(col3=lambda x: x.col1 * 100 + x.col2).assign(
    col4=lambda x: x.col2 * x.col3
)
df

Unnamed: 0,col1,col2,col3,col4
0,1,3,103,309
1,2,4,204,816
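
As a variation, a single `assign` call can take several keyword arguments, and a later callable can refer to a column created earlier in the same call. This is a minimal sketch that assumes a reasonably recent pandas, where keyword order is preserved; `df_assign` and `single` are illustrative names.

```python
import pandas as pd

df_assign = pd.DataFrame({"col1": [1, 2], "col2": [3, 4]})

# col4's lambda sees col3 because keyword arguments are evaluated in order
single = df_assign.assign(
    col3=lambda x: x.col1 * 100 + x.col2,
    col4=lambda x: x.col2 * x.col3,
)
print(single)
```

`assign` returns a new DataFrame, so `df_assign` itself keeps only its original two columns.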


### pandas.Series.map: Change Values of a Pandas Series Using a Dictionary

If you want to change values of a pandas Series using a dictionary, use `pd.Series.map`.

In [2]:
import pandas as pd 

s = pd.Series(["a", "b", "c"])

s.map({"a": 1, "b": 2, "c": 3})

0    1
1    2
2    3
dtype: int64
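
One detail worth knowing: values that are missing from the dictionary become `NaN`. The sketch below shows that behavior and one possible way to keep unmapped values unchanged; the `mapping` dictionary and variable names are made up for this example.

```python
import pandas as pd

s_codes = pd.Series(["a", "b", "d"])
mapping = {"a": 1, "b": 2, "c": 3}

# "d" is not a key in mapping, so it maps to NaN
mapped = s_codes.map(mapping)

# Fall back to the original value when there is no matching key
kept = s_codes.map(lambda v: mapping.get(v, v))

print(mapped)
print(kept)
```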

`map` also accepts a function, so you can insert each value of a pandas Series into a string template by passing the template's `format` method.

In [3]:
import pandas as pd  

s = pd.Series(["berries", "apples", "cherries"])
s.map("Today I got some {} from my garden.".format)

0     Today I got some berries from my garden.
1      Today I got some apples from my garden.
2    Today I got some cherries from my garden.
dtype: object
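
If the Series contains missing values, the template would be filled with the text `nan`. A small sketch, assuming you would rather leave those entries missing, is to pass `na_action="ignore"` so that `map` skips them; the `template` and `fruits` names are illustrative.

```python
import numpy as np
import pandas as pd

template = "Today I got some {} from my garden."
fruits = pd.Series(["berries", np.nan, "cherries"], dtype=object)

# na_action="ignore" propagates NaN instead of passing it to the function
sentences = fruits.map(template.format, na_action="ignore")
print(sentences)
```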

### pandas.DataFrame.explode: Transform Each Element in an Iterable to a Row

When working with a pandas DataFrame, if a column holds iterables such as lists and you want each element on its own row, use `explode`. The values in the other columns are repeated for every new row.

In [4]:
import pandas as pd 

df = pd.DataFrame({"a": [[1, 2], [4, 5]], "b": [11, 13]})
df

Unnamed: 0,a,b
0,"[1, 2]",11
1,"[4, 5]",13


In [46]:
df.explode("a")

Unnamed: 0,a,b
0,1,11
0,2,11
1,4,13
1,5,13
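
Two behaviors are easy to miss here: exploded rows repeat the original index label, and an empty list produces a single row with `NaN`. The sketch below uses `ignore_index=True` (available in pandas 1.1 and later) to get a fresh 0..n-1 index; the `df_lists` data is made up for illustration.

```python
import pandas as pd

df_lists = pd.DataFrame({"a": [[1, 2], [], [4]], "b": [11, 12, 13]})

# ignore_index=True renumbers the rows instead of repeating index labels
exploded = df_lists.explode("a", ignore_index=True)
print(exploded)
```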



### Split a String into Multiple Rows

Sometimes, you might have a column whose values are strings that pack several items together, such as `"1,2"`.

In [33]:
import pandas as pd

df = pd.DataFrame({"a": ["1,2", "4,5"], "b": [11, 13]})
df


Unnamed: 0,a,b
0,"1,2",11
1,"4,5",13


To turn each string into a list, use `Series.str.split()`:

In [34]:
# Split by comma
df.a = df.a.str.split(",")
df


Unnamed: 0,a,b
0,"[1, 2]",11
1,"[4, 5]",13


Now you can split elements in the list into multiple rows using `explode`.

In [35]:
df.explode('a')

Unnamed: 0,a,b
0,1,11
0,2,11
1,4,13
1,5,13
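
The two steps can also be chained into one expression. Note that the exploded values are still strings, so this sketch converts them to integers at the end; the `df_str` and `long_df` names are hypothetical, and the final `reset_index(drop=True)` is an optional choice to avoid repeated index labels.

```python
import pandas as pd

df_str = pd.DataFrame({"a": ["1,2", "4,5"], "b": [11, 13]})

long_df = (
    df_str.assign(a=df_str["a"].str.split(","))  # "1,2" -> ["1", "2"]
    .explode("a")  # one row per list element
    .astype({"a": int})  # elements are strings until converted
    .reset_index(drop=True)
)
print(long_df)
```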


### Forward Fill in pandas: Use the Previous Value to Fill the Current Missing Value

If you want to use the previous value in a column or a row to fill the current missing value in a pandas DataFrame, use `df.ffill()`. `ffill` stands for forward fill. Older code often spells this as `df.fillna(method="ffill")`, which is deprecated as of pandas 2.1.

In [6]:
import numpy as np
import pandas as pd 

df = pd.DataFrame({"a": [1, np.nan, 3], "b": [4, 5, np.nan], "c": [1, 2, 3]})
df

Unnamed: 0,a,b,c
0,1.0,4.0,1
1,,5.0,2
2,3.0,,3


In [49]:
# Equivalent to the deprecated df.fillna(method="ffill")
df = df.ffill()
df

Unnamed: 0,a,b,c
0,1.0,4.0,1
1,1.0,5.0,2
2,3.0,5.0,3
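
A few related options, sketched on made-up data: `limit` caps how many consecutive missing values are filled, `axis=1` fills across a row instead of down a column, and `bfill` fills from the next value instead of the previous one. A leading `NaN` has no previous value, so forward fill leaves it missing.

```python
import numpy as np
import pandas as pd

df_na = pd.DataFrame(
    {"a": [1, np.nan, np.nan, 4], "b": [np.nan, 5, np.nan, np.nan]}
)

filled_down = df_na.ffill()  # previous value in the same column
filled_limited = df_na.ffill(limit=1)  # fill at most one consecutive NaN
filled_across = df_na.ffill(axis=1)  # previous value in the same row
filled_back = df_na.bfill()  # next value in the same column

print(filled_down)
print(filled_limited)
```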

