# Tutorial 2: Organizing data <a id='home'></a>

Welcome to this tutorial, where you will learn how to master essential Excel skills for managing data.

The ability to effectively manipulate data is crucial whether you're analyzing trends in the media industry, tracking revenue for a music streaming service, or studying patterns in digital advertising. This notebook will guide you through practical examples, ensuring you gain hands-on experience with Excel's core functionalities. 

1. [Manage rows and columns](#managerowcolumns) ([quizzes](#quiz1-4) #1 #2 #3 #4)
2. [Tab manipulation](#tabmanip) ([quizzes](#quiz5-7) #5 #6 #7)
3. [Format your file](#format) ([quizzes](#quiz8-10) #8 #9 #10)
4. [Use conditional formatting](#conditional) ([quizzes](#quiz11-13) #11 #12 #13)
5. [Use filter and sort functions](#filter) ([quizzes](#quiz14-16) #14 #15 #16)
6. [Practice formatting an Excel file](#test)


This tutorial lets you interact by:
- modifying elements directly in the cells,
- taking quizzes,
- and testing examples directly in your own spreadsheet on your computer (Excel, LibreOffice, etc.).

## Manage rows and columns <a id="managerowcolumns"></a> ([top](#home))

In this section, we take a closer look at Excel's features for rows and columns. Keep your Excel document open to follow along with each step. Now that you're familiar with working with individual cells, let's see how to work with entire rows and columns.

### Inserting and deleting rows and columns
To insert or delete rows and columns, follow these steps:
1. Select the row or column header where you want to insert the new row/column (or the header of the row/column you want to delete).
2. Right-click to open the context menu.
3. Choose "Insert" to add the new row/column, or "Delete" to remove the selected one.

Let's take a more practical example. You are managing digital advertising data for a streaming platform and want to add a "Second Campaign" column between "Campaign Name" (column C) and "Clicks" (column D).
- Select column D (the one you want to shift to make space), and right-click to open the context menu.
- Select "Insert," and a new column will be created to the left of column D.

Try this out with the data below: download the Excel file [Tutorial2xls1-subscriptions.xlsx](https://github.com/matgithub-picardy/ExcelPythonCourse/raw/refs/heads/main/Tutorial_Excel_2/Tutorial2xls1-subscriptions.xlsx), or copy and paste the data into your own Excel document.

| ID    | First name | Last name | Birthdate   | Account type   | Sign-up date  | Monthly <br> revenue ($) | Subscription tier | Active |
|-------|------------|-----------|-------------|----------------|---------------|---------------------|-------------------|--------|
| 1001  | Alice      | Johnson   | 1985-07-12  | Free           | 2021-01-15    | 0                   | Basic             | Yes    |
| 1002  | Bob        | Smith     | 1990-04-23  | Premium        | 2020-11-03    | 45                  | Premium           | Yes    |
| 1003  | Carol      | Williams  | 1978-02-19  | Premium        | 2019-05-22    | 30                  | Premium           | No     |
| 1004  | David      | Brown     | 2001-08-05  | Free           | 2023-06-12    | 0                   | Basic             | Yes    |
| 1005  | Eve        | Davis     | 1992-10-29  | Premium        | 2020-09-14    | 25                  | Premium           | Yes    |
| 1006  | Frank      | White     | 1988-11-11  | Free           | 2022-04-01    | 0                   | Basic             | No     |
| 1007  | Grace      | Green     | 1995-06-30  | Premium        | 2018-12-05    | 60                  | Premium Plus      | Yes    |
| 1008  | Harry      | Thompson  | 1983-03-21  | Free           | 2021-08-19    | 0                   | Basic             | Yes    |
| 1009  | Irene      | Miller    | 1999-12-15  | Premium        | 2019-10-17    | 50                  | Premium Plus      | Yes    |
| 1010  | Jack       | Wilson    | 1986-01-03  | Free           | 2023-02-10    | 0                   | Basic             | No     |
| 1011  | Karen      | Harris    | 1975-05-14  | Premium        | 2020-03-25    | 35                  | Premium           | No     |
| 1012  | Liam       | Clark     | 1991-09-27  | Premium        | 2021-07-30    | 40                  | Premium Plus      | Yes    |
| 1013  | Megan      | Lewis     | 1989-07-04  | Free           | 2022-11-11    | 0                   | Basic             | Yes    |
| 1014  | Noah       | Walker    | 1994-05-19  | Premium        | 2020-06-28    | 55                  | Premium Plus      | Yes    |
| 1015  | Olivia     | Martinez  | 1987-03-16  | Free           | 2023-05-01    | 0                   | Basic             | Yes    |
| 1016  | Patrick    | Young     | 1996-11-22  | Premium        | 2021-09-14    | 25                  | Premium           | No     |
| 1017  | Quincy     | Adams     | 1998-02-09  | Premium        | 2019-04-06    | 50                  | Premium           | Yes    |
| 1018  | Rachel     | Robinson  | 1980-12-25  | Free           | 2022-07-30    | 0                   | Basic             | No     |
| 1019  | Sam        | Parker    | 1984-09-07  | Premium        | 2020-05-23    | 65                  | Premium Plus      | Yes    |
| 1020  | Tina       | Hall      | 1993-01-12  | Free           | 2023-08-16    | 0                   | Basic             | No     |
| 1021  | Ursula     | King      | 1979-10-10  | Premium        | 2018-02-05    | 70                  | Premium Plus      | Yes    |
| 1022  | Victor     | Scott     | 1997-07-20  | Premium        | 2021-05-22    | 45                  | Premium           | No     |
| 1023  | Wendy      | Turner    | 1982-04-18  | Free           | 2022-09-29    | 0                   | Basic             | Yes    |
| 1024  | Xavier     | Garcia    | 1990-12-11  | Premium        | 2020-03-03    | 55                  | Premium Plus      | Yes    |
| 1025  | Yolanda    | Johnson   | 1981-08-22  | Free           | 2023-01-10    | 0                   | Basic             | Yes    |

Let's say you want to insert a column called "Middle name" between the "First name" and "Last name" columns. As a first step, you can adjust the width of your table's columns if the cell contents are not legible, as in the screenshot below for the “Birthdate” column:
- Right-click on one or more column headers.
- Select "Column Width" or "Optimal Width" from the context menu, as you prefer.
- Choose the width you want and click "OK".

With "Optimal Width", each selected column is resized to fit its longest entry, making your table easier to read and navigate.
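The idea behind "Optimal Width" can be sketched in a few lines of Python: for each column, look for the longest entry, header included. The rows below are the first three records of the table above, and `optimal_widths` is a hypothetical helper name introduced for this illustration. Real spreadsheet software also accounts for fonts and padding, so the result is only an approximation, counted in characters.

```python
# First rows of the subscriptions table (subset of columns, for brevity)
header = ["ID", "First name", "Last name", "Birthdate"]
rows = [
    ["1001", "Alice", "Johnson", "1985-07-12"],
    ["1002", "Bob", "Smith", "1990-04-23"],
    ["1003", "Carol", "Williams", "1978-02-19"],
]

def optimal_widths(header, rows):
    """Return, for each column, the length of its longest entry (header included)."""
    widths = {}
    for i, name in enumerate(header):
        longest_entry = max(len(str(row[i])) for row in rows)
        widths[name] = max(len(name), longest_entry)
    return widths

widths = optimal_widths(header, rows)
for name, width in widths.items():
    print(f"{name}: {width} characters")
```

Here the "Birthdate" column needs more room than its header suggests, because the dates are longer than the word "Birthdate" itself.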

<img src="Tutorial_Excel_2/adjust_excel_1.jpeg" alt="Chart Image" width="900">


To insert a new column in Excel, we proceed in the same way: select the header of column C (“Last name”), right-click, and choose "Insert Column Before".

<img src="Tutorial_Excel_2/adjust_excel_2.jpeg" alt="Chart Image" width="500">

We simulate this process using the Python code below (click on the cell and press “Ctrl+Enter” (Windows) / “Shift+Enter” (Mac)). Try changing the names in the Python code to match the first three names in the table above (you'll have to invent the middle names!):

In [None]:
import pandas as pd
from IPython.display import display

# Example dataset
data = {
    'First Name': ['John', 'Alice', 'Bob'],
    'Last Name': ['Doe', 'Smith', 'Brown']
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Display the original DataFrame
print("Before adding 'Middle Name' column:")
display(df)

# Insert a new column 'Middle Name'
df.insert(1, 'Middle Name', ['Edward', 'Marie', 'Lee'])  # Insert at index 1

# Display the updated DataFrame
print("After adding 'Middle Name' column:")
display(df)
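Deleting a column can be simulated along the same lines. As a minimal sketch (using the same made-up names as in the cell above, not the subscription data), `DataFrame.drop` returns a copy of the table without the named column; the variable names are illustrative.

```python
import pandas as pd

# Illustrative data: same made-up names as in the cell above
df_names = pd.DataFrame({
    'First Name': ['John', 'Alice', 'Bob'],
    'Middle Name': ['Edward', 'Marie', 'Lee'],
    'Last Name': ['Doe', 'Smith', 'Brown'],
})

# Delete the 'Middle Name' column; drop() returns a new DataFrame
df_without_middle = df_names.drop(columns=['Middle Name'])

print("After deleting 'Middle Name' column:")
print(df_without_middle)
```

Note that `df_names` itself is left unchanged, whereas deleting a column in Excel modifies the sheet directly.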

### Hiding rows and columns

Sometimes you might want to hide certain columns to simplify your view. For example, you may want to hide the “Monthly revenue (\$)” column, whose contents are sensitive and should not be visible to everyone, while preserving the data.

In Excel, to hide a column or a row:
1. Place your mouse cursor over the column / row header.
2. Right-click and select “Hide”.

To show (“unhide”) the column / row again:
1. Select the two column / row headers that surround the hidden column / row (here the “Sign-up date” and “Subscription tier” columns).
2. Right-click and select “Unhide” (labelled “Show Columns” / “Show Rows” in LibreOffice Calc).

You will have noticed that the principle is the same as above, when we adjusted column widths and inserted columns. You can now hide the column in the Excel file:

<img src="Tutorial_Excel_2/adjust_excel_3.jpeg" alt="Chart Image" width="800">


You can continue to explore Excel on your own by looking in the drop-down menus for the features we've just seen. For example, can you find the "Hide column" feature?
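Hiding can be mimicked in pandas in the same spirit as the earlier simulation: the full table is kept, and only a view without the sensitive column is displayed. This is a rough analogy, assuming that "hidden" simply means "not displayed"; the variable names are illustrative, and the rows are the first three from the subscriptions table.

```python
import pandas as pd

# First three subscribers from the table above (subset of columns)
subscriptions = pd.DataFrame({
    'ID': [1001, 1002, 1003],
    'Sign-up date': ['2021-01-15', '2020-11-03', '2019-05-22'],
    'Monthly revenue ($)': [0, 45, 30],
    'Subscription tier': ['Basic', 'Premium', 'Premium'],
})

hidden_columns = ['Monthly revenue ($)']

# "Hide": build a view without the column; the original data is untouched
visible = subscriptions.drop(columns=hidden_columns)
print(visible)

# "Unhide": the column is still present in the original DataFrame
print(subscriptions['Monthly revenue ($)'].tolist())
```

As in Excel, hiding is a display choice rather than a protection: the data remains in the file and can be shown again at any time.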


### Grouping rows and columns
Grouping allows you to organize multiple rows or columns into a collapsible group, making large datasets more manageable.

To group columns / rows:
1. Select the columns / rows.
2. Go to the "Data" tab and click "Group."

You can collapse or expand the group as needed. Let’s simulate this by selecting multiple columns and displaying only a subset.


Example:
In the Excel spreadsheet, you may want to group all "identity" columns: "ID", "First name", "Middle name", "Last name", and "Birthdate". Select the columns, then click Data > Group. What happens? Can you ungroup these columns?
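To simulate grouping, the sketch below treats the "identity" columns as one group and shows the table with that group collapsed or expanded. The helper `show` and its `collapsed` parameter are hypothetical names introduced for illustration; pandas has no built-in equivalent of Excel's outline groups. The "Middle name" column is left out for brevity.

```python
import pandas as pd

# First two subscribers from the table above (subset of columns)
subscribers = pd.DataFrame({
    'ID': [1001, 1002],
    'First name': ['Alice', 'Bob'],
    'Last name': ['Johnson', 'Smith'],
    'Birthdate': ['1985-07-12', '1990-04-23'],
    'Account type': ['Free', 'Premium'],
    'Active': ['Yes', 'Yes'],
})

# Columns that belong to the "identity" group
identity_group = ['ID', 'First name', 'Last name', 'Birthdate']

def show(df, group, collapsed):
    """Return the table with the grouped columns hidden (collapsed) or shown."""
    if collapsed:
        return df.drop(columns=group)
    return df

print("Group collapsed:")
print(show(subscribers, identity_group, collapsed=True))

print("Group expanded:")
print(show(subscribers, identity_group, collapsed=False))
```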

### Summary
- **Adjusting column / row width:** Select the column / row header and click "Column Width" in the context menu, or double-click the separator between column / row headers to auto-resize.
- **Inserting a column / row:** Select the column / row header and click "Insert" in the context menu.
- **Hiding a column / row:** Select the column / row header and click "Hide" in the context menu.
- **Grouping columns / rows:** Select the column / row headers, and click on the Data tab > Group.

### Quiz #1<a id="quiz1-4"></a> ([top](#home))
**What is the correct way to insert a new column in Excel?**
1. Select the desired column and click "Delete".
2. Select the desired column, right-click, and choose "Insert".
3. Select the desired column, click "Cut", and paste it elsewhere.
4. Select the desired column, right-click, and choose "Hide".

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** Select the desired column, right-click, and choose "Insert".

</details>

### Quiz #2
**How would you hide multiple columns in Excel?**

1. Select the columns, right-click, and choose "Delete".
2. Select the columns, right-click, and choose "Hide".
3. Select the columns, right-click, and choose "Copy".


<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 2. Select the columns, right-click, and choose "Hide".

</details>

### Quiz #3
**What is the purpose of grouping rows or columns in Excel?**

1. To delete rows or columns
2. To collapse and expand sections of your worksheet for better visibility
3. To apply filters on the data
4. To merge cells

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 2. To collapse and expand sections of your worksheet for better visibility

</details>

### Quiz #4
**Which menu option allows you to group rows or columns in Excel?**

1. "Home" tab > Group
2. "Data" tab > Group
3. "Insert" tab > Group
4. "Review" tab > Group

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 2. "Data" tab > Group

</details>

## Tab manipulation<a id="tabmanip"></a> ([top](#home))
Here, we will manipulate Excel tabs or "sheets".<a name="cite_ref-1"></a>[<sup>[1]</sup>](#cite_note-1) You will explore how to create, delete, rename, move, color, hide, and duplicate Excel tabs. These operations are essential when working with multiple datasets.




**What is a Tab?**

In Excel, "tabs" or "sheets" ("feuilles de calcul" in French) refer to the individual pages within a workbook. Each tab in Excel is like a separate sheet in a physical file. Excel workbooks can have multiple tabs, each representing different sections of your data. For a manager, tabs can help organize different datasets or variables, such as customer data, market trends, or pricing structures.

<a name="cite_note-1"></a>[1](#cite_ref-1) A sheet is translated “feuille” in French.

### Creating / deleting tabs
- To create a tab: Click on the "+" icon next to the last tab in the workbook.
- To delete a tab: Right-click on the tab and choose Delete from the context menu. Remember that this action is permanent!

<img src="Tutorial_Excel_2/adjust_excel_5.jpeg" alt="Chart Image" width="500">

How would creating additional tabs help you organize different datasets in your digital economics research? Think about separating different variables like price trends, consumer behavior, etc.

**Example:** Imagine a workbook with two tabs: "Sheet1" and "Sheet2".
- Sheet1 contains data on online sales.
- Sheet2 holds information on customer demographics.

### Renaming tabs
Excel assigns generic names like Sheet1, Sheet2, etc. It's a good practice to rename tabs with meaningful names to make navigation easier. Please note that Excel does not allow worksheet names longer than 31 characters.

To rename a tab:
- Double-click the tab name, or
- Right-click and select "Rename" from the context menu.

Choose concise names that indicate the content of the sheet (e.g., "Online Sales Q1").
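Before renaming many tabs, it can help to check the proposed names. The sketch below tests two rules: the 31-character limit mentioned above, and the characters that Excel rejects in sheet names (`\ / ? * [ ] :`). The function name `check_sheet_name` is hypothetical, and the check is deliberately not exhaustive.

```python
MAX_SHEET_NAME_LENGTH = 31              # limit stated in the text above
FORBIDDEN_CHARACTERS = set('\\/?*[]:')  # characters Excel rejects in sheet names

def check_sheet_name(name):
    """Return a list of problems found in a proposed tab name (empty if none)."""
    problems = []
    if not name.strip():
        problems.append("name is empty")
    if len(name) > MAX_SHEET_NAME_LENGTH:
        problems.append(
            f"name has {len(name)} characters (maximum is {MAX_SHEET_NAME_LENGTH})"
        )
    bad = sorted(FORBIDDEN_CHARACTERS & set(name))
    if bad:
        problems.append("forbidden characters: " + " ".join(bad))
    return problems

# A concise name passes; a long name containing "/" is flagged twice
print(check_sheet_name("Online Sales Q1"))
print(check_sheet_name("Marketing campaign results 2023/2024 (draft)"))
```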


### Moving and organizing tabs
When dealing with many datasets, it’s important to keep your tabs organized:
- To move a tab: Click and hold the tab name, then drag it to your preferred position.
- Use a logical order: start with the data input, follow with the analysis, and conclude with the results.

**Question:** How would you organize tabs for the following datasets?
- Raw data on online sales
- Marketing campaign results
- Sales summary report

### Coloring tabs
To differentiate important tabs, color-coding can be very helpful:
- To color a tab: Right-click on the tab name, then choose "Tab Color".
- For example, use yellow for summary reports and green for important datasets.

<img src="Tutorial_Excel_2/adjust_excel_6.jpeg" alt="Chart Image" width="600">


### Hiding tabs
Sometimes you might want to hide specific data while keeping it accessible:
- To hide a tab: Right-click the tab and select Hide Sheet.
- To unhide a tab: Right-click any tab and choose Unhide, then select the tab to reveal.


### Duplicating tabs
Duplicating tabs is useful when you need a template or want to experiment with data without altering the original.

To copy a tab:
- Right-click the tab name, then choose "Move or Copy".
- Check "Create a copy" and specify where to place the duplicate.

<img src="Tutorial_Excel_2/adjust_excel_7.jpeg" alt="Chart Image" width="500">
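The tab operations from this section (rename, create, duplicate, move, delete) can be summarized with a small model in which a workbook is a Python dictionary mapping tab names to their contents. This is a teaching sketch only, not how Excel stores workbooks; all tab names and sample values are invented for illustration.

```python
import copy

# A workbook modeled as a dict (insertion-ordered): tab name -> rows of data
workbook = {
    "Sheet1": [["Product", "Units"], ["Album", 120]],
    "Sheet2": [["Customer", "Age"], ["Alice", 38]],
}

# Rename tabs by rebuilding the dict, so the tab order is preserved
new_names = {"Sheet1": "Online Sales", "Sheet2": "Demographics"}
workbook = {new_names.get(name, name): rows for name, rows in workbook.items()}

# Create a new, empty tab (added after the last one, like the "+" icon)
workbook["Summary"] = []

# Duplicate a tab: a deep copy, so edits to the copy leave the original intact
workbook["Online Sales (2)"] = copy.deepcopy(workbook["Online Sales"])
workbook["Online Sales (2)"][1][1] = 999

# Move the "Summary" tab to the last position
workbook["Summary"] = workbook.pop("Summary")

# Delete a tab
del workbook["Demographics"]

print(list(workbook))                  # remaining tabs, in order
print(workbook["Online Sales"][1][1])  # the original is unchanged by the copy's edit
```

The deep copy is the important design choice here: it mirrors the reason for duplicating a tab in Excel, namely experimenting without altering the original data.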

### Quiz #5<a id="quiz5-7"></a> ([top](#home))
**You can organize multiple tabs in Excel by dragging them in the correct order.**

1. True
2. False

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 1. True
</details>

### Quiz #6
**What is the limit for the number of characters allowed in a tab name in Excel?**

1. 255
2. 31
3. 64
4. Unlimited

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 2. 31

</details>

### Quiz #7

**Once a tab is deleted in Excel, it can be recovered using the undo button.**

1. True
2. False

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 2. False

</details>


## Format your file<a id="format"></a> ([top](#home))
Here, you will learn how to manage Excel layout options and configure print settings to enhance readability and presentation.

In this section, you will analyze data from an online marketplace where independent sellers list products for sale. Each seller has reported their sales for a given period, which includes the quantity sold, the price per unit, and the resulting revenue. The goal is to understand how each seller performs in terms of sales volume and revenue generation. We will use Excel formatting tools to adjust borders, hide gridlines, and apply consistent formatting if needed to make the data more readable:

Seller name         | Quantity sold | Price (USD) | Revenue (USD)
--------------------|---------------|-------------|---------------
BlueTech Goods      | 10            | 25          | 250
EcoTrend            | 20            | 30          | 600
GadgetPro           | 15            | 22          | 330
HomeEssentials      | 25            | 27          | 675

To access the data, download the Excel file using this link: [Tutorial2xls2-platformsales.xlsx](https://github.com/matgithub-picardy/ExcelPythonCourse/raw/refs/heads/main/Tutorial_Excel_2/Tutorial2xls2-platformsales.xlsx).
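Since revenue results from the quantity sold and the unit price, the reported figures can be cross-checked before any formatting is applied. The sketch below recomputes revenue with pandas from the four rows of the table above; the column "Computed revenue (USD)" and the variable names are introduced here for illustration.

```python
import pandas as pd

# The marketplace data from the table above
sales = pd.DataFrame({
    'Seller name': ['BlueTech Goods', 'EcoTrend', 'GadgetPro', 'HomeEssentials'],
    'Quantity sold': [10, 20, 15, 25],
    'Price (USD)': [25, 30, 22, 27],
    'Revenue (USD)': [250, 600, 330, 675],
})

# Recompute revenue from quantity and unit price
sales['Computed revenue (USD)'] = sales['Quantity sold'] * sales['Price (USD)']

# Check that the reported figures match the recomputed ones
all_match = bool((sales['Computed revenue (USD)'] == sales['Revenue (USD)']).all())

print(sales)
print("Reported revenue matches quantity x price:", all_match)
```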



### Hiding gridlines
Gridlines make the cell structure visible on screen but, by default, do not appear in print. You can toggle them on and off for easier on-screen viewing:
- Go to the **View** tab.
- In the **Show** group, uncheck **Gridlines** (Windows) / **View Grid Lines** (Mac) to hide them. To make them appear again, check **Gridlines** / **View Grid Lines** again.

<img src="Tutorial_Excel_2/adjust_excel_8.jpeg" alt="Chart Image" width="400">


### Adding borders
Borders help delineate data tables clearly. This is especially helpful for economic data tables with multiple columns and rows.

**Activity:**
1. Select cells B3 to E7.
2. Right-click on the selection and choose "Format Cells".
3. In the Borders tab, select the Outline preset or any border style you prefer to apply borders to your table. 
<div style="display: flex; justify-content: space-around;">
    <img src="Tutorial_Excel_2/adjust_excel_9.jpeg" alt="Chart Image" width="450">
    <img src="Tutorial_Excel_2/adjust_excel_10.jpeg" alt="Chart Image" width="450">
</div>

### Freezing panes
Freezing panes keeps headers visible while scrolling through long datasets:
- Go to the **View** tab.
- Select **Freeze Panes** (Windows) or **Freeze Cells** (Mac) and choose **Freeze First Column**.

<img src="Tutorial_Excel_2/adjust_excel_11.jpeg" alt="Chart Image" width="450">

Now, even when you scroll to the right, the first column remains visible, which makes the data easier to navigate. However, you may have noticed that the first row and the first column of our file are empty, and that the headers are in row 3 and column B. No problem! We just have to freeze the first 3 rows and the first 2 columns:
- First make sure that no rows or columns are still frozen.
- Select cell C4.
- Select the View menu.
- Select "Freeze Rows and Columns" (in Excel: **Freeze Panes** > **Freeze Panes**).

Now the columns and rows at the “top and left” of cell C4 are frozen! You can now scroll the table. For the rest of the tutorial, you can unfreeze the columns and rows.
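The rule at work here (everything above and to the left of the selected cell is frozen) can be written as a short function. The name `frozen_area` is hypothetical, and the sketch only computes how many rows and columns would be frozen; it does not modify any spreadsheet.

```python
import re

def frozen_area(cell_reference):
    """Return (frozen_rows, frozen_columns) for a freeze applied at the given cell.

    Everything above the cell and to its left is frozen.
    """
    match = re.fullmatch(r"([A-Z]+)([1-9][0-9]*)", cell_reference.upper())
    if match is None:
        raise ValueError(f"Not a valid cell reference: {cell_reference!r}")
    letters, row_number = match.group(1), int(match.group(2))

    # Convert column letters to a number: A -> 1, B -> 2, ..., Z -> 26, AA -> 27
    column_number = 0
    for letter in letters:
        column_number = column_number * 26 + (ord(letter) - ord("A") + 1)

    return row_number - 1, column_number - 1

print(frozen_area("C4"))  # (3, 2): the first 3 rows and the first 2 columns
```

This is why cell C4 is the right choice for a table whose headers sit in row 3 and column B: rows 1 to 3 and columns A to B stay in place.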

### Printing

When managing large datasets, you may want to print headers on each page, limit the print area, or add page numbers for easier navigation. Here’s how to do this in Excel and LibreOffice Calc.

#### Printing headers on each page
To repeat headers on each page:
- Excel:
    - Go to the Page Layout tab.
    - Select Print Titles in the Page Setup group.
    - In the "Rows to repeat at top" field, enter the row(s) containing your headers.
- LibreOffice:
    - Go to Format > Print Ranges > Edit.
    - In the Rows to repeat section, specify the row(s) with headers that you want to appear on each page.

This video shows how to print column headers on all pages in Excel (source: YouTube channel "Ready Steady Excel"):
<video controls src="Tutorial_Excel_2/Print_headers_on_all_pages_in_Excel.mp4" width="400" height="250" />
<!-- https://www.youtube.com/watch?v=hOHpBh_y54E&ab_channel=ReadySteadyExcel -->


#### Defining a print area
To print only a specific range of cells:
- Excel: Select the range of data you want to print, go to Page Layout > Print Area > Set Print Area.
- LibreOffice Calc: Select your data range, then go to Format > Print Ranges > Define.

This video shows how to set the print area in Excel (source: YouTube channel "Excel Tutorials by EasyClick Academy"):
<video controls src="Tutorial_Excel_2/Set_the_print_area_in_Excel.mp4" width="500" height="300" />
<!-- https://www.youtube.com/watch?v=Mrt4v0ysA8w -->


**Task:** Practice setting a print area by selecting cell B1 and defining it as your print area.



#### Adding page numbers to printed pages
To add page numbers:
- Excel:
    - Go to Page Layout > Page Setup.
    - Choose the Header/Footer tab and, in the Footer section, select an option with page numbers (e.g., “Page 1”).
- LibreOffice:
    - Go to Format > Page > Header/Footer tab.
    - Enable Footer and select Edit. In the footer, add the page number by inserting Page from the footer options.

With these settings, page numbers will appear on printed pages, helping organize multi-page datasets.


### Conclusion
Mastering Excel layout and print settings is essential for presenting and sharing data effectively. These skills allow for cleaner, more readable tables and data overviews, important in the field of digital economics.

### Quiz #8<a id="quiz8-10"></a> ([top](#home))
**Why might hiding gridlines be useful when preparing an Excel file for presentation?**

1. It enhances readability by reducing visual clutter.
2. It prevents changes to data.
3. It speeds up Excel calculations.

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 1. It enhances readability by reducing visual clutter.

</details>

### Quiz #9
**How does freezing panes improve usability in data analysis?**

1. It speeds up the application.
2. It filters data automatically.
3. It keeps header information visible.

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 3. It keeps header information visible.

</details>

### Quiz #10
**Where in Excel can you add page numbers for print layout?**

1. View Tab
2. Page Layout > Page Setup
3. Data Tab

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 2. Page Layout > Page Setup

</details>

## Use conditional formatting<a id="conditional"></a> ([top](#home))

### Quiz #11<a id="quiz11-13"></a> ([top](#home))
** **

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:**

</details>

### Quiz #12
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:**

</details>

### Quiz #13
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 

</details>

## Use filter and sort functions<a id="filter"></a> ([top](#home))

### Quiz #14<a id="quiz14-16"></a> ([top](#home))
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:**

</details>

### Quiz #15
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:**

</details>

### Quiz #16
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 

</details>

## Practice formatting an Excel file<a id="test"></a> ([top](#home))

### Quiz #17<a id="quiz17-19"></a> ([top](#home))
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:**

</details>

### Quiz #18
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:**

</details>

### Quiz #19
****

1. 

<details>
<summary>Click here to see the answer</summary>

**The correct answer is:** 

</details>