# One-to-Many and Many-to-Many Joins - Lab

## Introduction

In this lab, you'll practice writing SQL joins across one-to-many and many-to-many relationships!

## Objectives

You will be able to:
* Explain one-to-many and many-to-many joins and their implications for the size of query results
* Query data using one-to-many and many-to-many joins
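
To make the point about result size concrete, here is a minimal sketch on a throwaway in-memory database. The tables (`customers`, `orders`, `products`, `order_items`) and their rows are hypothetical toy data, not the lab's `data.sqlite`; the `demo_` names are chosen so the lab's own `conn` and `cur` are not overwritten.

```python
import sqlite3

# Hypothetical toy tables in an in-memory database (not the lab's data.sqlite)
demo_conn = sqlite3.connect(":memory:")
demo_cur = demo_conn.cursor()
demo_cur.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER);
    CREATE TABLE products (product_id INTEGER PRIMARY KEY, title TEXT);
    CREATE TABLE order_items (order_id INTEGER, product_id INTEGER);

    INSERT INTO customers VALUES (1, 'Ann'), (2, 'Bob');
    INSERT INTO orders VALUES (10, 1), (11, 1), (12, 2);
    INSERT INTO products VALUES (100, 'Car'), (101, 'Bike');
    INSERT INTO order_items VALUES (10, 100), (10, 101), (11, 100), (12, 101);
""")

# One-to-many: each customer row repeats once per matching order
one_to_many = demo_cur.execute("""
    SELECT name, order_id
    FROM customers
    JOIN orders USING (customer_id)
""").fetchall()

# Many-to-many: orders and products are linked through order_items,
# so the result has one row per (order, product) pairing
many_to_many = demo_cur.execute("""
    SELECT order_id, title
    FROM orders
    JOIN order_items USING (order_id)
    JOIN products USING (product_id)
""").fetchall()

print(len(one_to_many))   # 3 rows from only 2 customers
print(len(many_to_many))  # 4 rows from 3 orders and 2 products
demo_conn.close()
```

In both cases the joined result has more rows than the table on the "one" side, which is why row counts after a join deserve a quick sanity check.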

## One-to-Many and Many-to-Many Joins
<img src='images/Database-Schema.png' width="600">

## Connect to the Database

In [1]:
# Your code here
import sqlite3
conn = sqlite3.connect('data.sqlite')
cur = conn.cursor()

## Employees and their Office (a One-to-One join)

Return a DataFrame listing every employee's first and last name along with the city and state of the office they work out of (if they have one). Include all employees, ordered by first name, then last name.

In [5]:
# Your code here
# LEFT JOIN keeps every employee, even one with no matching office
cur.execute("""SELECT 
                    FIRSTNAME, 
                    LASTNAME, 
                    CITY, 
                    STATE, 
                    EMPLOYEES.OFFICECODE 
                FROM EMPLOYEES 
                LEFT JOIN OFFICES 
                ON EMPLOYEES.OFFICECODE = OFFICES.OFFICECODE
                ORDER BY FIRSTNAME, LASTNAME""").fetchall()

[('Andy', 'Fixter', 'Sydney', '', 6),
 ('Anthony', 'Bow', 'San Francisco', 'CA', 1),
 ('Barry', 'Jones', 'London', '', 7),
 ('Diane', 'Murphy', 'San Francisco', 'CA', 1),
 ('Foon Yue', 'Tseng', 'NYC', 'NY', 3),
 ('George', 'Vanauf', 'NYC', 'NY', 3),
 ('Gerard', 'Bondur', 'Paris', '', 4),
 ('Gerard', 'Hernandez', 'Paris', '', 4),
 ('Jeff', 'Firrelli', 'San Francisco', 'CA', 1),
 ('Julie', 'Firrelli', 'Boston', 'MA', 2),
 ('Larry', 'Bott', 'London', '', 7),
 ('Leslie', 'Jennings', 'San Francisco', 'CA', 1),
 ('Leslie', 'Thompson', 'San Francisco', 'CA', 1),
 ('Loui', 'Bondur', 'Paris', '', 4),
 ('Mami', 'Nishi', 'Tokyo', 'Chiyoda-Ku', 5),
 ('Martin', 'Gerard', 'Paris', '', 4),
 ('Mary', 'Patterson', 'San Francisco', 'CA', 1),
 ('Pamela', 'Castillo', 'Paris', '', 4),
 ('Peter', 'Marsh', 'Sydney', '', 6),
 ('Steve', 'Patterson', 'Boston', 'MA', 2),
 ('Tom', 'King', 'Sydney', '', 6),
 ('William', 'Patterson', 'Sydney', '', 6),
 ('Yoshimi', 'Kato', 'Tokyo', 'Chiyoda-Ku', 5)]
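
The phrases "if they have one" and "include all employees" describe a left join. As a minimal sketch of why the join type matters, the following uses hypothetical `staff` and `office` tables in an in-memory database (not the lab's schema), where one person has no office assigned:

```python
import sqlite3

# Hypothetical toy tables; 'Bob' has no office (NULL office_id)
demo_conn = sqlite3.connect(":memory:")
demo_cur = demo_conn.cursor()
demo_cur.executescript("""
    CREATE TABLE staff (name TEXT, office_id INTEGER);
    CREATE TABLE office (office_id INTEGER PRIMARY KEY, city TEXT);
    INSERT INTO office VALUES (1, 'Boston');
    INSERT INTO staff VALUES ('Ann', 1), ('Bob', NULL);
""")

# Inner join: rows without a match on both sides are dropped
inner_rows = demo_cur.execute("""
    SELECT name, city FROM staff
    JOIN office USING (office_id)
    ORDER BY name
""").fetchall()

# Left join: every staff row is kept; missing office columns come back as NULL
left_rows = demo_cur.execute("""
    SELECT name, city FROM staff
    LEFT JOIN office USING (office_id)
    ORDER BY name
""").fetchall()

print(inner_rows)  # [('Ann', 'Boston')]
print(left_rows)   # [('Ann', 'Boston'), ('Bob', None)]
demo_conn.close()
```

When every row on the left has a match, as appears to be the case for the employees listed above, both join types return the same rows.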

## Customers and their Orders (a One-to-Many join)

Return a DataFrame with all of the customer contacts (first and last names) along with the order number, order date, and status of each of their orders.

In [9]:
# Your code here
cur.execute("""SELECT CONTACTFIRSTNAME, CONTACTLASTNAME, ORDERNUMBER, ORDERDATE, STATUS
                FROM CUSTOMERS 
                JOIN ORDERS 
                USING (CUSTOMERNUMBER)""").fetchall()

[('Carine ', 'Schmitt', 10123, '2003-05-20', 'Shipped'),
 ('Carine ', 'Schmitt', 10298, '2004-09-27', 'Shipped'),
 ('Carine ', 'Schmitt', 10345, '2004-11-25', 'Shipped'),
 ('Jean', 'King', 10124, '2003-05-21', 'Shipped'),
 ('Jean', 'King', 10278, '2004-08-06', 'Shipped'),
 ('Jean', 'King', 10346, '2004-11-29', 'Shipped'),
 ('Peter', 'Ferguson', 10120, '2003-04-29', 'Shipped'),
 ('Peter', 'Ferguson', 10125, '2003-05-21', 'Shipped'),
 ('Peter', 'Ferguson', 10223, '2004-02-20', 'Shipped'),
 ('Peter', 'Ferguson', 10342, '2004-11-24', 'Shipped'),
 ('Peter', 'Ferguson', 10347, '2004-11-29', 'Shipped'),
 ('Janine ', 'Labrune', 10275, '2004-07-23', 'Shipped'),
 ('Janine ', 'Labrune', 10315, '2004-10-29', 'Shipped'),
 ('Janine ', 'Labrune', 10375, '2005-02-03', 'Shipped'),
 ('Janine ', 'Labrune', 10425, '2005-05-31', 'In Process'),
 ('Jonas ', 'Bergulfsen', 10103, '2003-01-29', 'Shipped'),
 ('Jonas ', 'Bergulfsen', 10158, '2003-10-10', 'Shipped'),
 ('Jonas ', 'Bergulfsen', 10309, '2004-10-15', 
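
The raw tuples returned by `fetchall()` carry no column labels. As a minimal sketch, assuming hypothetical `customers` and `orders` toy tables in an in-memory database, the column names can be read from the cursor's `description` attribute (the same attribute this lab uses further down to label DataFrame columns):

```python
import sqlite3

# Hypothetical toy tables (not the lab's data.sqlite)
demo_conn = sqlite3.connect(":memory:")
demo_cur = demo_conn.cursor()
demo_cur.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, contact TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, status TEXT);
    INSERT INTO customers VALUES (1, 'Ann');
    INSERT INTO orders VALUES (10, 1, 'Shipped'), (11, 1, 'In Process');
""")

demo_cur.execute("""
    SELECT contact, order_id, status
    FROM customers
    JOIN orders USING (customer_id)
    ORDER BY order_id
""")

# description holds one tuple per result column; item 0 is the column name
column_names = [col[0] for col in demo_cur.description]
records = [dict(zip(column_names, row)) for row in demo_cur.fetchall()]

print(column_names)
print(records)
demo_conn.close()
```

The same `column_names` list, together with the fetched rows, could be handed to `pd.DataFrame(rows, columns=column_names)` to produce the DataFrame the task asks for.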

## Customers and their Payments (another One-to-Many join)

Return a DataFrame with all of the customer contacts (first and last names) along with the amount and date of each of their payments. Sort the results in descending order by payment amount.

In [12]:
# Your code here
cur.execute("""SELECT CONTACTFIRSTNAME, CONTACTLASTNAME, AMOUNT, PAYMENTDATE
    FROM CUSTOMERS 
    JOIN PAYMENTS 
    USING (CUSTOMERNUMBER)
    ORDER BY AMOUNT DESC""").fetchall()

[('Diego ', 'Freyre', 120166.58, '2005-03-18'),
 ('Diego ', 'Freyre', 116208.4, '2004-12-31'),
 ('Susan', 'Nelson', 111654.4, '2003-08-15'),
 ('Eric', 'Natividad', 105743, '2003-12-26'),
 ('Susan', 'Nelson', 101244.59, '2005-03-05'),
 ('Julie', 'Brown', 85559.12, '2003-11-03'),
 ('Susan', 'Nelson', 85410.87, '2004-08-28'),
 ('Veysel', 'Oeztan', 85024.46, '2003-12-03'),
 ('Susan', 'Nelson', 83598.04, '2005-04-16'),
 ('Peter', 'Ferguson', 82261.22, '2004-12-15'),
 ('Valarie', 'Thompson', 80375.24, '2004-03-15'),
 ('Mike', 'Graham', 75020.13, '2005-05-23'),
 ('Diego ', 'Freyre', 65071.26, '2005-03-25'),
 ('Diego ', 'Freyre', 63843.55, '2003-12-09'),
 ('Kelvin', 'Leong', 63357.13, '2004-09-07'),
 ('Mihael', 'Holz', 61402, '2004-09-18'),
 ('Henriette ', 'Pfalzheim', 61234.67, '2004-11-06'),
 ('Diego ', 'Freyre', 59830.55, '2004-01-30'),
 ('Sue', 'Frick', 59551.38, '2004-06-21'),
 ('Valarie', 'Franco', 59265.14, '2003-12-26'),
 ('Jeff', 'Young', 58841.35, '2003-06-18'),
 ('Jeff', 'Young', 58

## Orders, Order details, and Product Details (a Many-to-Many Join)

Return a DataFrame with all of the customer contacts (first and last names) along with the product name, quantity ordered, and order date for every product on each of their orders. Sort the results in descending order by order date.

> Note: This will require joining 4 tables! This can be tricky! Give it a shot, and if you're still stuck, turn to the next section where you'll see how to write subqueries that can make complex queries such as this much simpler!

In [21]:
# Your code here
import pandas as pd
df = pd.DataFrame(cur.execute("""SELECT CONTACTFIRSTNAME, CONTACTLASTNAME, PRODUCTNAME, QUANTITYORDERED, ORDERDATE
                FROM
                    (
                    SELECT CONTACTFIRSTNAME, CONTACTLASTNAME, ORDERDATE, ORDERNUMBER
                    FROM CUSTOMERS
                    JOIN ORDERS
                    USING (CUSTOMERNUMBER)
                    )
                JOIN 
                    (
                    SELECT ORDERNUMBER, PRODUCTNAME, QUANTITYORDERED 
                    FROM PRODUCTS 
                    JOIN ORDERDETAILS 
                    USING (PRODUCTCODE)
                    )
                USING (ORDERNUMBER)
                """).fetchall())
df.columns = [x[0] for x in cur.description]
df

| | CONTACTFIRSTNAME | CONTACTLASTNAME | PRODUCTNAME | QUANTITYORDERED | ORDERDATE |
|---|---|---|---|---|---|
| 0 | Carine | Schmitt | 1965 Aston Martin DB5 | 26 | 2003-05-20 |
| 1 | Carine | Schmitt | 1999 Indy 500 Monte Carlo SS | 46 | 2003-05-20 |
| 2 | Carine | Schmitt | 1948 Porsche Type 356 Roadster | 34 | 2003-05-20 |
| 3 | Carine | Schmitt | 1966 Shelby Cobra 427 S/C | 50 | 2003-05-20 |
| 4 | Carine | Schmitt | 1996 Moto Guzzi 1100i | 39 | 2004-09-27 |
| ... | ... | ... | ... | ... | ... |
| 2991 | Tony | Snowden | 2002 Suzuki XREO | 29 | 2005-04-01 |
| 2992 | Tony | Snowden | 1936 Harley Davidson El Knucklehead | 30 | 2005-04-01 |
| 2993 | Tony | Snowden | 1997 BMW R 1100 S | 57 | 2005-04-01 |
| 2994 | Tony | Snowden | 1960 BSA Gold Star DBD34 | 58 | 2005-04-01 |
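
The task statement also asks for the results sorted by order date in descending order, which the query in the cell above does not apply. As one possible alternative to nesting subqueries, here is a minimal sketch that chains the four joins in a single flat query and adds the sort. It runs against hypothetical toy tables in an in-memory database, so the table and column names are assumptions rather than the lab's schema:

```python
import sqlite3

# Hypothetical toy tables (not the lab's data.sqlite)
demo_conn = sqlite3.connect(":memory:")
demo_cur = demo_conn.cursor()
demo_cur.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, contact TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, order_date TEXT);
    CREATE TABLE products (product_id INTEGER PRIMARY KEY, title TEXT);
    CREATE TABLE order_items (order_id INTEGER, product_id INTEGER, quantity INTEGER);

    INSERT INTO customers VALUES (1, 'Ann'), (2, 'Bob');
    INSERT INTO orders VALUES (10, 1, '2004-01-15'), (11, 2, '2005-03-01');
    INSERT INTO products VALUES (100, 'Car'), (101, 'Bike');
    INSERT INTO order_items VALUES (10, 100, 5), (11, 100, 2), (11, 101, 7);
""")

# Each JOIN adds one table; order_items is the link between orders and products.
# ISO-formatted date strings sort chronologically, so ORDER BY works on the text.
rows = demo_cur.execute("""
    SELECT contact, title, quantity, order_date
    FROM customers
    JOIN orders USING (customer_id)
    JOIN order_items USING (order_id)
    JOIN products USING (product_id)
    ORDER BY order_date DESC, title
""").fetchall()

for row in rows:
    print(row)
demo_conn.close()
```

The secondary sort on `title` is only there to make the row order deterministic when several items share an order date.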


## Summary

In this lab, you practiced writing SQL joins across one-to-many and many-to-many relationships!