# One-to-Many and Many-to-Many Joins - Lab

## Introduction

In this lab, you'll practice querying one-to-many and many-to-many relationships with SQL joins.

## Objectives

You will be able to:
- Query data using one-to-many and many-to-many joins
- Predict the resulting size of one-to-many and many-to-many joins

## One-to-Many and Many-to-Many Joins
<img src='images/Database-Schema.png' width="600">

## Connect to the Database

In [5]:
import sqlite3
import pandas as pd
conn = sqlite3.connect('data.sqlite', detect_types=sqlite3.PARSE_COLNAMES)
c = conn.cursor()


In [6]:
c.execute('''SELECT * FROM employees''')
df = pd.DataFrame(c.fetchall()) 
df.columns = [i[0] for i in c.description]
df.head()

|   | employeeNumber | lastName | firstName | extension | email | officeCode | reportsTo | jobTitle |
|---|---|---|---|---|---|---|---|---|
| 0 | 1002 | Murphy | Diane | x5800 | dmurphy@classicmodelcars.com | 1 |  | President |
| 1 | 1056 | Patterson | Mary | x4611 | mpatterso@classicmodelcars.com | 1 | 1002.0 | VP Sales |
| 2 | 1076 | Firrelli | Jeff | x9273 | jfirrelli@classicmodelcars.com | 1 | 1002.0 | VP Marketing |
| 3 | 1088 | Patterson | William | x4871 | wpatterson@classicmodelcars.com | 6 | 1056.0 | Sales Manager (APAC) |
| 4 | 1102 | Bondur | Gerard | x5408 | gbondur@classicmodelcars.com | 4 | 1056.0 | Sale Manager (EMEA) |


In [7]:
c.execute('''SELECT * FROM offices''')
df = pd.DataFrame(c.fetchall()) 
df.columns = [i[0] for i in c.description]
df.head()

|   | officeCode | city | phone | addressLine1 | addressLine2 | state | country | postalCode | territory |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | San Francisco | +1 650 219 4782 | 100 Market Street | Suite 300 | CA | USA | 94080 |  |
| 1 | 2 | Boston | +1 215 837 0825 | 1550 Court Place | Suite 102 | MA | USA | 02107 |  |
| 2 | 3 | NYC | +1 212 555 3000 | 523 East 53rd Street | apt. 5A | NY | USA | 10022 |  |
| 3 | 4 | Paris | +33 14 723 4404 | 43 Rue Jouffroy D'abbans |  |  | France | 75017 | EMEA |
| 4 | 5 | Tokyo | +81 33 224 5000 | 4-1 Kioicho |  | Chiyoda-Ku | Japan | 102-8578 | Japan |


## Employees and their Office (a One-to-One join)

Return a list of all of the employees with their first name, last name and the city and state of the office that they work out of (if they have one). Include all employees and order them by their first name, then their last name.

In [8]:
# Your code here
# LEFT JOIN keeps every employee, even one without a matching office
c.execute('''SELECT firstName, lastName, city, state
             FROM employees
             LEFT JOIN offices USING(officeCode)
             ORDER BY firstName ASC, lastName ASC''')
df = pd.DataFrame(c.fetchall()) 
df.columns = [i[0] for i in c.description]
df.head()


|   | firstName | lastName | city | state |
|---|---|---|---|---|
| 0 | Andy | Fixter | Sydney |  |
| 1 | Anthony | Bow | San Francisco | CA |
| 2 | Barry | Jones | London |  |
| 3 | Diane | Murphy | San Francisco | CA |
| 4 | Foon Yue | Tseng | NYC | NY |
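
The phrase "include all employees" is what separates an inner join from a left join. The sketch below illustrates the difference on a tiny in-memory database; the table contents (`Ana`, `Bo`, `Cy` and the two offices) are made up for illustration and are not taken from `data.sqlite`.

```python
import sqlite3

# Hypothetical toy data in an in-memory database (not the lab's data.sqlite)
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
    CREATE TABLE offices (officeCode INTEGER PRIMARY KEY, city TEXT);
    CREATE TABLE employees (firstName TEXT, officeCode INTEGER);
    INSERT INTO offices VALUES (1, 'Boston'), (2, 'Paris');
    INSERT INTO employees VALUES ('Ana', 1), ('Bo', 2), ('Cy', NULL);
""")

# Inner join: only employees whose officeCode matches an office
inner_rows = cur.execute("""
    SELECT firstName, city
    FROM employees
    JOIN offices USING(officeCode)
    ORDER BY firstName
""").fetchall()

# Left join: every employee, with NULL (None) where no office matches
left_rows = cur.execute("""
    SELECT firstName, city
    FROM employees
    LEFT JOIN offices USING(officeCode)
    ORDER BY firstName
""").fetchall()

print(inner_rows)  # [('Ana', 'Boston'), ('Bo', 'Paris')]
print(left_rows)   # [('Ana', 'Boston'), ('Bo', 'Paris'), ('Cy', None)]
conn.close()
```

If every employee has an office, both joins return the same rows, so the two queries only differ when unmatched rows exist.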


## Customers and their Orders (a One-to-Many join)

Return a list of all the customers' first and last names, along with a record for each of their order numbers, order dates, and statuses.

In [9]:
c.execute('''SELECT contactFirstName, contactLastName, orderNumber, orderDate, status FROM customers JOIN orders using(customerNumber)''')
df = pd.DataFrame(c.fetchall()) 
df.columns = [i[0] for i in c.description]
df.head()

|   | contactFirstName | contactLastName | orderNumber | orderDate | status |
|---|---|---|---|---|---|
| 0 | Carine | Schmitt | 10123 | 2003-05-20 | Shipped |
| 1 | Carine | Schmitt | 10298 | 2004-09-27 | Shipped |
| 2 | Carine | Schmitt | 10345 | 2004-11-25 | Shipped |
| 3 | Jean | King | 10124 | 2003-05-21 | Shipped |
| 4 | Jean | King | 10278 | 2004-08-06 | Shipped |
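
One way to predict the size of a one-to-many join is to count rows on the "many" side. The following is a minimal sketch on made-up data (three hypothetical customers, four hypothetical orders), assuming every order points at an existing customer.

```python
import sqlite3

# Hypothetical toy data in an in-memory database (not the lab's data.sqlite)
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
    CREATE TABLE customers (customerNumber INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (orderNumber INTEGER PRIMARY KEY, customerNumber INTEGER);
    INSERT INTO customers VALUES (1, 'Ana'), (2, 'Bo'), (3, 'Cy');
    INSERT INTO orders VALUES (100, 1), (101, 1), (102, 1), (103, 2);
""")

def count(sql):
    """Return the single integer produced by a COUNT(*) query."""
    return cur.execute(sql).fetchone()[0]

n_customers = count("SELECT COUNT(*) FROM customers")
n_orders = count("SELECT COUNT(*) FROM orders")
# Inner join: one row per order, because each order has exactly one customer
n_joined = count("SELECT COUNT(*) FROM customers JOIN orders USING(customerNumber)")
# Left join: one row per order, plus one row per customer with no orders
n_left = count("SELECT COUNT(*) FROM customers LEFT JOIN orders USING(customerNumber)")

print(n_customers, n_orders, n_joined, n_left)  # 3 4 4 5
conn.close()
```

Here the inner join has as many rows as `orders`, while the left join adds one extra row for the customer who never ordered.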


## Customers and their Payments (another One-to-Many join)

Return a list of customers' first and last names along with details about their payments, including the amount and date of each payment. Sort these results in descending order by the payment amount.

In [15]:
# Your code here
c.execute('''SELECT contactFirstName, contactLastName, amount, paymentDate 
            FROM customers 
            JOIN payments 
            using(customerNumber) 
            ORDER BY amount DESC ''')
df = pd.DataFrame(c.fetchall()) 
df.columns = [i[0] for i in c.description]
df.head()

|   | contactFirstName | contactLastName | amount | paymentDate |
|---|---|---|---|---|
| 0 | Violeta | Benitez | 9977.85 | 2003-11-08 |
| 1 | Ben | Calaghan | 9821.32 | 2003-10-17 |
| 2 | Leslie | Taylor | 9658.74 | 2004-12-06 |
| 3 | Sean | Clenahan | 9415.13 | 2004-07-28 |
| 4 | Roland | Mendel | 8807.12 | 2005-05-03 |


## Orders, Order details and Product Details (a Many-to-Many Join)

Return a list of customer first and last names, product names, quantities, and date ordered for each of the customers and each of their orders. Sort these in descending order by the order date.

Note: This requires joining four tables, which can be tricky. Give it a shot, and if you're still stuck, turn to the next section, where you'll see how subqueries can make complex queries like this one much simpler.

In [17]:
# Your code here
c.execute('''SELECT contactFirstName, contactLastName, productName, quantityOrdered, orderDate
             FROM customers
             JOIN orders USING(customerNumber)
             JOIN orderdetails USING(orderNumber)
             JOIN products USING(productCode)
             ORDER BY orderDate DESC''')
df = pd.DataFrame(c.fetchall()) 
df.columns = [i[0] for i in c.description]
df.head()

|   | contactFirstName | contactLastName | productName | quantityOrdered | orderDate |
|---|---|---|---|---|---|
| 0 | Janine | Labrune | 1962 LanciaA Delta 16V | 38 | 2005-05-31 |
| 1 | Janine | Labrune | 1957 Chevy Pickup | 33 | 2005-05-31 |
| 2 | Janine | Labrune | 1998 Chrysler Plymouth Prowler | 28 | 2005-05-31 |
| 3 | Janine | Labrune | 1964 Mercedes Tour Bus | 38 | 2005-05-31 |
| 4 | Janine | Labrune | 1926 Ford Fire Engine | 19 | 2005-05-31 |
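
A many-to-many relationship is typically resolved through a linking table, which is the role `orderdetails` appears to play between `orders` and `products` here. The sketch below uses made-up rows to show that, when every linking row matches both sides, the joined result has one row per linking row.

```python
import sqlite3

# Hypothetical toy data in an in-memory database (not the lab's data.sqlite)
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
    CREATE TABLE orders (orderNumber INTEGER PRIMARY KEY, orderDate TEXT);
    CREATE TABLE products (productCode TEXT PRIMARY KEY, productName TEXT);
    CREATE TABLE orderdetails (orderNumber INTEGER, productCode TEXT, quantityOrdered INTEGER);
    INSERT INTO orders VALUES (100, '2005-01-01'), (101, '2005-02-01');
    INSERT INTO products VALUES ('A', 'Truck'), ('B', 'Bus'), ('C', 'Plane');
    INSERT INTO orderdetails VALUES (100, 'A', 5), (100, 'B', 2), (101, 'A', 1);
""")

# Each orderdetails row links one order to one product
rows = cur.execute("""
    SELECT orderNumber, productName, quantityOrdered
    FROM orders
    JOIN orderdetails USING(orderNumber)
    JOIN products USING(productCode)
    ORDER BY orderNumber, productName
""").fetchall()

n_links = cur.execute("SELECT COUNT(*) FROM orderdetails").fetchone()[0]

print(rows)                # [(100, 'Bus', 2), (100, 'Truck', 5), (101, 'Truck', 1)]
print(len(rows), n_links)  # 3 3
conn.close()
```

The product that was never ordered (`Plane`) does not appear, since an inner join drops rows with no match in the linking table.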


## Summary

In this lab, you practiced writing one-to-many and many-to-many joins.