# Credit Repayment Behaviour Analysis

The goal is to find out whether factors such as a borrower's marital status and number of children affect on-time loan repayment. Bank X has provided data on its clients' creditworthiness.

The report will be taken into account when building a credit scoring model for the bank's potential clients. Credit scoring is used to evaluate the risk that a potential borrower will default on their debt obligations.

# Data Description

***Note:*** the data provided by Bank X covers clients in Russia, so some of the dataset's contents may be in Russian.

children: the number of children in the family
<br>days_employed: how long the customer has worked
<br>dob_years: the customer's age
<br>education: the customer's education level
<br>education_id: identifier for the customer's education
<br>family_status: the customer's marital status
<br>family_status_id: identifier for the customer's marital status
<br>gender: the customer's gender
<br>income_type: the customer's income type
<br>debt: whether the client has ever defaulted on a loan
<br>total_income: monthly income
<br>purpose: reason for taking out a loan

## Step 1. Data quality assessment

In [None]:
# importing Pandas library
import pandas as pd 

In [None]:
# opening the data file
df = pd.read_csv('clients_data.csv')

In [None]:
# getting the first 5 rows of the table
df.head()

In [None]:
# looking at the general information
df.info()
# checking numeric values
df.describe()

In [None]:
# checking for NaN values
df.isnull().sum()

**Summary:** The table has 21525 rows and 12 columns of client data. Data types include float64, int64, and object. 
Findings:

1. 'total_income' and 'days_employed' each have 2174 NaN values. 
2. 'days_employed': has negative and very large values, such as '-18388.949901' and '401755.400475'. These values make no sense for a column that is meant to hold the number of days of employment, so more information about this data should be requested.
3. 'children': contains the implausible values '-1' and '20'.
4. 'dob_years': some clients' ages are recorded as '0' years.
5. 'education', 'education_id': there are 15 unique values in 'education' but only 5 in 'education_id'.
6. 'gender': 3 unique values: female, male, and N/A.

Possible reasons for the missing values and errors above: clients may have concealed personal information (about employment and income level). Data entry errors are another possibility.

## Step 2. Data preprocessing

### Processing the missing values

In [None]:
# Build a dictionary with the median 'total_income', grouped by the categories in 'income_type'
income_dict = dict(df.groupby('income_type')['total_income'].median())

# Replace the missing values in the 'total_income' column with values from the 'income_dict' dictionary
df['total_income'] = df['total_income'].fillna(df['income_type'].apply(lambda x: income_dict.get(x)))
# Check:
#print(df.head(15))
#df.info()

# Do the same for the days_employed column, but without a dictionary, since the number of days employed does not depend on the other parameters in the table
df['days_employed'] = df['days_employed'].fillna(df['days_employed'].median())
# Check:
#print(df.head(15))
#df.info()

**Conclusion**: Missing values in the dataset were filled with the median, because we do not know the exact figures and cannot verify why they are missing, while deleting those rows from the table would risk losing valuable information.

1. How the missing values were filled: quantitative variables can vary widely in value. In that case the arithmetic mean may describe the data poorly, so the most effective approach is to fill the gaps with the median (the middle value of the sorted numbers), computed separately for each category in the income_type column (employment type), since salary levels can differ greatly between positions.
2. Since days_employed is not a required parameter for the task set by the bank, and since grouping it by other columns makes no sense, I filled the gaps in this column with the median of its own values to even out the table.
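
The group-wise median fill described above can also be sketched with `groupby(...).transform('median')`, which skips the intermediate dictionary. This is a minimal illustration on a made-up toy table (the income types and amounts below are assumptions, not values from the bank's data):

```python
import pandas as pd

# Toy data standing in for the bank's table (values are made up for illustration)
toy = pd.DataFrame({
    'income_type': ['employee', 'employee', 'employee', 'retiree', 'retiree'],
    'total_income': [100.0, None, 300.0, 50.0, None],
})

# Median income per income_type, broadcast back to every row of its group
group_median = toy.groupby('income_type')['total_income'].transform('median')

# Fill only the missing entries with the median of their own group
toy['total_income'] = toy['total_income'].fillna(group_median)
print(toy)
```

Because `transform` returns a Series aligned with the original index, it can be passed straight to `fillna` without mapping through a dictionary.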

### Changing the data type

In [None]:
# Convert the float data type to integer
df['days_employed'] = df['days_employed'].astype('int')
df['total_income'] = df['total_income'].astype('int')
#Check
#df.info()

<div class="alert alert-success">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

You could also have done it like this:

```python
df[['days_employed', 'total_income']] = df[['days_employed', 'total_income']].astype('int')
```

**Conclusion**: Converting the float data type to integer is not required, but it is desirable for convenience and readability.

1. The real-number data type is float. The data has 2 columns containing entries of this type: 'days_employed' (length of employment in days) and 'total_income' (monthly income). We convert the values to integers for brevity and readability.
2. The astype method was used to change the data type because it is concise and easy to use.

### Processing duplicates

In [None]:
# First, convert the str-type columns to lowercase
df['education'] = df['education'].str.lower()
# Check for case differences in the other columns (none found, so we continue)
#print(df['insert_column'].unique())

# Count the duplicates in the dataframe (71)
#df.duplicated().sum()

# Remove duplicates + check
df = df.drop_duplicates().reset_index(drop=True)
#df.duplicated().sum()

<div class="alert alert-warning">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

Try to put intermediate conclusions and comments in `Markdown` cells as well; it makes the code more readable. 

Also keep calls like `df.duplicated().sum()` in the notebook, so that the customer can see this data too.

**Conclusion**: Duplicates in the dataset were found and removed.

1. To find and remove duplicates I used four steps: first, listing the unique values in the str columns, which revealed words with the same meaning but different letter case that are counted as "unique" (this gets in the way of finding duplicates); second, using the Pandas str.lower() method to bring the strings to a single lowercase form; third, counting the duplicates with the chained call duplicated().sum(); fourth, removing the duplicates with the drop_duplicates method.
2. Reasons for duplicates. There can be many, for example the human factor (incorrect input, data entry errors, concealment of information, etc.) or technical errors, such as incorrectly merging data from different sources, or a bug.
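
The four steps above can be sketched on a small made-up table to show why the lowercase step has to come before counting duplicates. The column values here are illustrative assumptions, not rows from the bank's dataset:

```python
import pandas as pd

# Toy table: rows 0 and 1 differ only in letter case
toy = pd.DataFrame({
    'education': ['Higher', 'higher', 'SECONDARY', 'secondary'],
    'children': [1, 1, 0, 2],
})

# Before normalizing case, no row is an exact copy of another
before = toy.duplicated().sum()

# Bring the text column to a single lowercase form
toy['education'] = toy['education'].str.lower()

# Now rows 0 and 1 are identical, so one duplicate is reported
after = toy.duplicated().sum()

# Drop the duplicate and renumber the index
toy = toy.drop_duplicates().reset_index(drop=True)
print(before, after, len(toy))
```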

### Lemmatization

In [None]:
from pymystem3 import Mystem
m = Mystem()

# Find the unique lemmatized values in the 'purpose' column to use them in the function
lemmas = []
for words in df['purpose'].unique():
    lemmas_word = m.lemmatize(words)
    lemmas += lemmas_word
myset = list(set(lemmas))

# The list is small, so clean it by hand of spaces, prepositions and other junk, leaving only the descriptive nouns
myset_clean = ['семья', 'образование', 'ремонт', 'покупка', 'строительство', 'автомобиль', 'жилье', 'операция', 'недвижимость', 'свадьба']

# Function that lemmatizes the text in a column
def lemmatize_col(row):
    for word in m.lemmatize(row):
        if word in myset_clean:
            return word

# Apply the function to the column in the dataframe
df['group_purpose'] = df['purpose'].apply(lemmatize_col)

# Check for None
#df.isna().sum()

<div class="alert alert-success"> 
<h2> Reviewer's comment <a class="tocSkip"> </h2>

The implementation is correct; I would also suggest a shorter and faster solution using [join()](https://stackoverflow.com/questions/5618878/how-to-convert-list-to-string)

    from collections import Counter
    lemmas = m.lemmatize(' '.join(df['purpose']))
    Counter(lemmas)

</div>

**Conclusion**: Lemmatization is needed to make it convenient to group and search for the required values, since respondents may have entered the same purpose with different wording (e.g. "покупка автомобиля" and "приобретение автомобиля", both meaning buying a car).

<div class="alert alert-success">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

And here is my implementation, just for general knowledge. By the way, "операция" here does not mean a medical procedure but a transaction involving real estate or housing.

In [None]:
purposes_keys = {'жилье', 'недвижимость', 'автомобиль', 'образование', 'свадьба'}

def get_purpose(data):

    """Assigns a purpose category to the row"""

    intersection = list(purposes_keys & set(m.lemmatize(data['purpose'])))

    if not intersection:
        # 'категория не определена' means 'category not determined'
        return 'категория не определена'
    return intersection[0]

df_example = pd.read_csv('/datasets/data.csv').head()

df_example.apply(get_purpose, axis=1)

1. The lemmatization process is described in detail in the commented (#) lines in the code above. Note that cleaning myset by hand like this is only feasible when the list turns out small. Otherwise, I would use NLTK and add one more loop over 'myset' that picks out the words tagged 'NN' (noun).
2. The lemmatization dictionary was assembled from the unique values of the lemmatized column itself, to increase accuracy in the data analysis.
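
One design point worth illustrating: when a purpose contains several keywords, the category chosen depends on which keyword is checked first. The sketch below uses a fixed priority list so the result is deterministic. The priority order, the `pick_category` helper and the hand-typed lemma lists are all assumptions made for illustration; the lists are not actual Mystem output.

```python
# Hypothetical priority order: the first category found in the lemmas wins
CATEGORY_ORDER = ['автомобиль', 'образование', 'свадьба', 'недвижимость', 'жилье']

def pick_category(lemmas, default='unknown'):
    """Return the first category from CATEGORY_ORDER present in the lemma list."""
    present = set(lemmas)
    for category in CATEGORY_ORDER:
        if category in present:
            return category
    return default

# Lemma lists typed by hand in the shape a lemmatizer might return them
examples = [
    ['покупка', ' ', 'жилье', '\n'],
    ['на', ' ', 'покупка', ' ', 'автомобиль', '\n'],
    ['ремонт', '\n'],
]
categories = [pick_category(lemmas) for lemmas in examples]
print(categories)
```

With a priority list, "покупка жилья" and "жилье" end up in the same category regardless of word order in the original text.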

### Categorizing the data

In [None]:
# Check the number of respondents by number of children in the family
#print(df['children'].value_counts())

# Replace the values 20 and -1 with plausible ones (reasoning: 20 was meant to be 2, and -1 was meant to be 0)
df['children'] = df['children'].mask(df['children']==20, 2) 
df['children'] = df['children'].mask(df['children']==(-1), 0) 

#print(df['family_status'].unique())

# Gather all the data into a pivot table and sum the values into 'TOTAL' for clarity
data_pivot = df.pivot_table(index=['family_status'], columns='children', values='debt', aggfunc='sum',fill_value = 'N/A', margins = True, margins_name='TOTAL')
print(data_pivot)

<div class="alert alert-success">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

To replace the values you can use the [replace](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.replace.html) method

</div>

**Conclusion**: Looking at the number of children per respondent showed that the table has 76 entries with 20 children and 47 entries with -1, while all the rest fall between 0 and 5 children. From this I concluded that a data entry error occurred: those who entered 20 added an extra 0, and -1 was meant to be 0. There is no way to verify this assumption, so instead of deleting these rows I applied this logic when replacing the values. Finally, I gathered the numbers of interest into a PivotTable.

1. The main task is to work out whether a client's marital status ('family_status') and number of children ('children') affect on-time loan repayment ('debt'). The categorization process is described in the commented (#) lines in the code.
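
The replacement rule described above (20 becomes 2, -1 becomes 0) can also be written as a single `replace` call with a mapping. A minimal sketch on a made-up Series rather than the real `children` column:

```python
import pandas as pd

# Toy values containing both suspicious entries
children = pd.Series([0, 20, -1, 3, 2])

# Map each suspicious value to its assumed intended value in one call
cleaned = children.replace({20: 2, -1: 0})
print(cleaned.tolist())
```

Keeping both substitutions in one dictionary makes the assumption behind the cleaning step easy to find and change later.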

## Step 3. Answer the questions

- Is there a relationship between having children and on-time loan repayment?

In [None]:
# See Markdown
# Build a new Pivot Table that shows the number of debtors with different numbers of children, in counts and percentages

data_pivot_new = df.pivot_table(index = 'children',  
                                values = 'debt', aggfunc = ['count', 'sum', 'mean'])
data_pivot_new.columns=['Total entries', 'Total debtors', '% of debtors']
data_pivot_new.style.format({'% of debtors': '{:.1%}'})

**Conclusion**

Not necessarily. The table shows that most respondents (14,138 people) have no children, and only 1,064 of them are debtors. The highest share of debtors is among people with 4 children (9.8%, or 4 people), but that group has only 41 respondents. Interestingly, the group with 5 children (9 people in total) has no debts at all. From these results I would conclude that the share of debtors is driven by something other than children, or not by children alone, and that other factors need to be considered as well. The type of loan also matters.

<div class="alert alert-danger">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

Here you calculated absolute values, but they do not fully reflect the characteristics of these groups. It may be that the total number of people with children is far larger than the number without, which is why there are more debtors. It is hard to tell at a glance. You need to calculate relative figures, i.e. what percentage of people with children fail to repay relative to the total number of people in that group (this is a general comment and applies to all of section 3). So I suggest the following:

- build pivot tables on the columns you created and compute the mean of the `debt` column in them. This is the same as dividing the number of debtors (the sum operation) by the total number of clients (count). That figure will show the percentage of debtors in each group. 
</div>

<div class="alert alert-success">
<h2> Reviewer's comment v2 <a class="tocSkip"> </h2>

There you are calculating the percentage of users in a given group out of the total number of users. That is not quite it; to calculate the share of debtors you can do the following (take a look at [style](https://pandas.pydata.org/pandas-docs/stable/user_guide/style.html) in the `pandas` library. It has lots of neat things for making a dataframe look nicer)

In [None]:
df_example = df.pivot_table(index = 'children', values = 'debt', 
                            aggfunc = ['count', 'sum', 'mean', lambda x: 1 - x.mean()])
df_example.columns = ['Total clients', 'Total debtors', '% debtors', '% non-debtors']
df_example.style.format({'% debtors': '{:.2%}', '% non-debtors': '{:.2%}'})

- Is there a relationship between marital status and on-time loan repayment?

In [None]:
# See Markdown

**Conclusion**

Yes. Based on the PivotTable in step 2.5 and the TOTAL figures by row (that is, ignoring how many children a person has and looking only at marital status), we can conclude that married clients account for far more debtors than any other group. Unmarried clients account for more debtors than divorced or widowed clients, though fewer than those in a civil partnership. 

|family_status|TOTAL|
|---|---|
|Не женат / не замужем (unmarried)|**274**|
|в разводе (divorced)|**85**|
|вдовец / вдова (widowed)|**63**|
|гражданский брак (civil partnership)|**388**|
|женат / замужем (married)|**931**|
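
The table above holds absolute debtor counts, which depend on how large each group is. As a hedged sketch of how the share of debtors per marital status could be computed alongside the counts, the example below uses a made-up toy table (the status labels and rows are assumptions, not the bank's data):

```python
import pandas as pd

# Toy data: 8 married clients (2 debtors) and 2 single clients (1 debtor)
toy = pd.DataFrame({
    'family_status': ['married'] * 8 + ['single'] * 2,
    'debt': [1, 1, 0, 0, 0, 0, 0, 0] + [1, 0],
})

# Group size, number of debtors, and share of debtors per status
summary = toy.groupby('family_status')['debt'].agg(['count', 'sum', 'mean'])
summary.columns = ['clients', 'debtors', 'debt_rate']
print(summary)
```

In this toy case the married group has more debtors in absolute terms but a lower debt rate, which is why the two figures are worth reading together.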

<div style="border:solid purple 5px; padding: 20px"> 
<h2 align="center"> "Python Lifehacker" column <a class="tocSkip"> </h2>

<h3> The zip function <a class="tocSkip"> </h3>

The zip function creates an iterator that combines the elements of several lists. This makes it possible to walk through lists in parallel in for loops or, for example, to sort them in parallel.

![](https://i.ibb.co/MPPZ6TL/image.png)
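
A short sketch of both uses of `zip` mentioned above, parallel iteration and parallel sorting. The English status labels are illustrative; the counts are borrowed from the TOTAL table earlier in this notebook.

```python
statuses = ['married', 'unmarried', 'divorced']
debtors = [931, 274, 85]

# Parallel iteration: one element from each list per step
for status, count in zip(statuses, debtors):
    print(status, count)

# Parallel sort: pair the lists, sort by debtor count, then unzip again
pairs = sorted(zip(debtors, statuses))
sorted_debtors, sorted_statuses = zip(*pairs)
print(sorted_statuses)
```

Note that `zip(*pairs)` returns tuples, not lists, so wrap the results in `list(...)` if they need to be modified afterwards.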

- Is there a relationship between income level and on-time loan repayment?

In [None]:
# According to official Rosstat data, the average monthly salary in Russia in 2020 was 51083. 
# To count as middle class in Moscow, one needs to earn at least 121 thousand rubles

# Treat the average salary level as roughly 51,000-121,000 rubles. Below 51k is below average, above 121k is above average.
def total_income_category(value):
    if value < 51000:
        return 0
    elif 51000 <= value <= 121000:
        return 1
    return 2 
df['income_level_index'] = df['total_income'].apply(total_income_category)

# Build one more PivotTable, splitting the debtors by gender:
data_pivot = df.pivot_table(index=['income_level_index'], columns=['gender'], values='debt', aggfunc=['sum'],fill_value = 'N/A', margins = True, margins_name='TOTAL')
print(data_pivot)



<div class="alert alert-success">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

Backing up conclusions with external facts 👍. It strengthens the impression of a job well done.

</div>

<div class="alert alert-warning">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

Weren't you puzzled by the `XNA` value in the data?

<div class="alert alert-success">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

By the way, to split by income here you could have used the [cut](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.cut.html) or [qcut](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.qcut.html) method (in the latter case the split is made by percentiles). Note that this method has a `labels` parameter, which lets you present the groups more nicely 😊

</div>
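
A minimal sketch of the `cut` approach suggested above, using the same 51,000 and 121,000 thresholds on a made-up income Series. The label names are assumptions. With `right=False` the bins are closed on the left, so an income of exactly 121,000 falls into the top group here, whereas `total_income_category` puts it in the middle group:

```python
import pandas as pd

# Toy incomes, including both boundary values
income = pd.Series([30000, 51000, 90000, 121000, 250000])

# Bin edges: [-inf, 51000), [51000, 121000), [121000, inf)
bins = [-float('inf'), 51000, 121000, float('inf')]
labels = ['below average', 'average', 'above average']

# right=False makes each bin include its left edge and exclude its right edge
level = pd.cut(income, bins=bins, labels=labels, right=False)
print(level.tolist())
```

The result is a categorical Series, so it can be used directly as the `index` of a pivot table and keeps the groups in the order given in `labels`.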

**Conclusion**

The Pivot Table above supports the following conclusions: 

1. There are far more debtors with above-average income (index 2) than with lower income (1 and 0). In total there are 26 debtors with below-average income (index 0).

|income_level_index|TOTAL|
|---|---|
|0|**26**|
|1|**572**|
|2|**1143**|

2. In addition, since I split the debtors by gender, we can conclude that there are more female debtors with above-average income than male debtors with above-average income. The same holds for the average and below-average income levels. 

|income_level_index|Female|Male|XNA|
|---|---|---|---|
|0|18.0|8.0|N/A|
|1|386.0|186.0|N/A|
|2|590.0|553.0|0|
|**TOTAL**|**994.0**|**747.0**|**0**|


<div class="alert alert-success">
<h2> Reviewer's comment <a class="tocSkip"> </h2>

Tables are rarely rendered in Markdown; as you can see, in your version they come out quite unreadable. Have a read [here](https://medium.com/analytics-vidhya/the-ultimate-markdown-guide-for-jupyter-notebook-d5e5abf728fd): it has examples of how to make tables and a really good overall rundown of Markdown. I'll also leave this [cheat sheet](https://paulradzkov.com/2014/markdown_cheatsheet/), in case it comes in handy 🙂
    
</div>

<font color='blue'>***Veronika***: Thank you!</font> 

- How do different loan purposes affect on-time repayment?

In [None]:
# Build one more PivotTable
data_pivot = df.pivot_table(index=['group_purpose'], columns='gender', values='debt', aggfunc='sum',fill_value = 'N/A', margins = True, margins_name='TOTAL')
print(data_pivot[['F','M', 'TOTAL']])

**Conclusion**

The Pivot Table above supports the following conclusions: 
1. For both genders present, the most debtors are among those who take a loan to purchase something (specifically, there are more female debtors than male ones). Education is in 2nd place among loan purposes, and a car is in 3rd. In every case there are more female debtors than male ones.

|group_purpose|TOTAL|
|---|---|
|автомобиль (car)|277|
|жилье (housing)|46|
|недвижимость (real estate)|42|
|образование (education)|370|
|операция (transaction)|205|
|покупка (purchase)|436|
|ремонт (renovation)|35|
|свадьба (wedding)|186|
|строительство (construction)|144|


## Step 4. General conclusion

Let's break the results down into several stages:

1) A general assessment of the dataset revealed several obvious problems: errors in the data, inconsistent letter case, etc. Possible reasons for the missing values: most often the cause is the human factor. In this case it may be concealment of personal information (about employment and income level). Data entry errors are another possible cause.

2) Missing values in the 'total_income' column were filled with the median after splitting into groups by income type ('income_type'), because the data is unevenly distributed between small and large values, which could affect the accuracy of the analysis. Missing values in the 'days_employed' column were filled with the column's median to even out the dataset. Because of a format or data entry error in this column, it was not used in the further analysis.

3) The real-number data type (float) was converted to integer. The data has 2 columns containing entries of this type: 'days_employed' (length of employment in days) and 'total_income' (monthly income). We convert the values to integers for brevity and readability.

4) Four steps were used to find and remove duplicates: first, listing the unique values in the str columns, which revealed words with the same meaning but different letter case that are counted as "unique" (this gets in the way of finding duplicates); second, using the Pandas str.lower() method to bring the strings to a single lowercase form; third, counting the duplicates with the chained call duplicated().sum(); fourth, removing the duplicates with the drop_duplicates method.

There can be many reasons for duplicates, for example the human factor (incorrect input, data entry errors, concealment of information, etc.) or technical errors, such as incorrectly merging data from different sources, or a bug.

5) Looking at the number of children per respondent showed that the table has 76 entries with 20 children and 47 entries with -1, while all the rest fall between 0 and 5 children. From this I concluded that a data entry error occurred: those who entered 20 added an extra 0, and -1 was meant to be 0. There is no way to verify this assumption, so instead of deleting these rows I applied this logic when replacing the values. Finally, I gathered the numbers of interest into a PivotTable.

***SUMMARY***: The original task was to find out whether a client's marital status and number of children affect on-time loan repayment. From the analysis of the available data we can conclude that having children is not necessarily a factor in overdue payments: the share of debtors in each group by number of children is roughly 9% (except for the group with 5 children). At the same time, married clients without children account for more than twice as many debtors as any other category. 

|family_status/children|0|1|2|3|4|5|
|---|---|---|---|---|---|---|
|Не женат / не замужем (unmarried)|210.0|52.0|10.0|1.0|1.0|N/A|
|в разводе (divorced)|55.0|21.0|8.0|1.0|0.0|N/A|
|вдовец / вдова (widower / widow)|53.0|7.0|3.0|0.0|0.0|N/A|
|гражданский брак (civil partnership)|229.0|118.0|33.0|8.0|0.0|0|
|женат / замужем (married)|517.0|246.0|148.0|17.0|3.0|0|
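A table of this kind can be built with `pivot_table`, and the same call with a different aggregation gives the share of debtors per group rather than the raw count. The sketch below uses a tiny made-up frame with English labels; it assumes `debt` is coded as 0/1, and the data are purely illustrative.

```python
import pandas as pd

# Toy stand-in for the cleaned dataset (illustrative values only)
df = pd.DataFrame({
    "family_status": ["married", "married", "married",
                      "unmarried", "unmarried", "divorced"],
    "children": [0, 0, 1, 0, 1, 0],
    "debt": [1, 0, 0, 1, 0, 0],
})

# Number of debtors per group: debt is 0/1, so the sum counts debtors
debtors = df.pivot_table(index="family_status", columns="children",
                         values="debt", aggfunc="sum")

# Share of debtors per group: the mean of a 0/1 column is a proportion
debt_rate = df.pivot_table(index="family_status", columns="children",
                           values="debt", aggfunc="mean")

print(debtors)
print(debt_rate.round(2))
```

Combinations with no clients at all come out as NaN in both tables. Comparing shares instead of counts avoids favouring the largest groups.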

However, it would be wrong to conclude that loans should not be issued to married women or men who have no children. A much deeper analysis is needed, taking into account many other factors that could influence these results. For example, a married couple may have no children because of health problems.

**Recommendations**: 1) Clarify the data format of the 'days_employed' column and convert the values to a correct form.
2) A repeat, more detailed analysis is needed, taking into account many other factors that would help answer whether the number of children affects late loan payments. Specifically, add several more detailed questions to the survey, e.g.: 1) Does anyone in the family have a disability or serious illness? 2) Are there debts to other banks or sources? 3) What is the average monthly spending per child? etc.

<div style="border:solid purple 2px; padding: 20px"> 

You've done a very good job! Many parts are done really well. You show an excellent command of the material: you use pandas confidently and know how to prepare, clean, and enrich data. A few small revisions remain:

- Throughout the project, keep the intermediate solutions and uncomment them
- In step 3, obtain the results using pivot tables that calculate relative metrics by group

I look forward to your revisions, you can do it 😊

</div>

## Project readiness checklist

Put an 'x' next to the completed items. Then press Shift+Enter.

- [x]  the file is opened;
- [x]  the file is examined;
- [x]  missing values are identified;
- [x]  missing values are filled in;
- [x]  there is an explanation of which missing values were found;
- [x]  possible reasons for the missing values are described;
- [x]  the principle used to fill in the missing values is explained;
- [x]  the float data type is replaced with an integer type;
- [x]  there is an explanation of which method is used to change the data type and why;
- [x]  duplicates are removed;
- [x]  there is an explanation of which method is used to find and remove duplicates;
- [x]  possible reasons for the duplicates are described;
- [x]  lemmas are extracted from the values of the loan purpose column;
- [x]  the lemmatization process is described;
- [x]  the data is categorized;
- [x]  there is an explanation of the categorization principle;
- [x]  there is an answer to the question: "Is there a relationship between having children and repaying a loan on time?";
- [x]  there is an answer to the question: "Is there a relationship between marital status and repaying a loan on time?";
- [x]  there is an answer to the question: "Is there a relationship between income level and repaying a loan on time?";
- [x]  there is an answer to the question: "How do different loan purposes affect on-time repayment?";
- [x]  each step has conclusions;
- [x]  there is an overall conclusion.