# Join Statements - Lab

## Introduction

In this lab, you'll practice your knowledge of `JOIN` statements, using various types of joins and various methods for specifying the links between them.

## Objectives

You will be able to:
* Write SQL queries that make use of various types of joins
* Compare and contrast the various types of joins
* Discuss how primary and foreign keys are used in SQL
* Decide and perform whichever type of join is best for retrieving desired data

## CRM Schema

In almost all cases, rather than just working with a single table you will typically need data from multiple tables. 
Doing this requires the use of **joins** using shared columns from the two tables. 

In this lab, you'll use the same customer relationship management (CRM) database that you saw from the previous lesson.
<img src='images/Database-Schema.png' width="600">

## Connecting to the Database
Import the necessary packages and connect to the database `'data.sqlite'`.

In [15]:
# Your code here
import sqlite3
import pandas as pd

conn = sqlite3.connect('data.sqlite')
cur = conn.cursor()

cur.execute('''SELECT * FROM offices WHERE city='Boston';''')
df = pd.DataFrame(cur.fetchall()) 
df.columns = [i[0] for i in cur.description]
df.head()

Unnamed: 0,officeCode,city,phone,addressLine1,addressLine2,state,country,postalCode,territory
0,2,Boston,+1 215 837 0825,1550 Court Place,Suite 102,MA,USA,2107,
1,27,Boston,+1 977 299 8345,105 Cambridge Street,,MA,USA,2331,


## Display the names of all the employees in Boston 

Hint: join the employees and offices tables.

In [22]:
# Your code here
cur.execute('''SELECT *
                FROM employees  
                LEFT JOIN offices 
                USING(officecode)
                WHERE offices.city = 'Boston';''')

df = pd.DataFrame(cur.fetchall()) 
df.columns = [i[0] for i in cur.description]
df.head()

Unnamed: 0,employeeNumber,lastName,firstName,extension,email,officeCode,reportsTo,jobTitle,city,phone,addressLine1,addressLine2,state,country,postalCode,territory
0,1188,Firrelli,Julie,x2173,jfirrelli@classicmodelcars.com,2,1143,Sales Rep,Boston,+1 215 837 0825,1550 Court Place,Suite 102,MA,USA,2107,
1,1216,Patterson,Steve,x4334,spatterson@classicmodelcars.com,2,1143,Sales Rep,Boston,+1 215 837 0825,1550 Court Place,Suite 102,MA,USA,2107,


## Are there any offices that have zero employees?
Hint: Combine the employees and offices tables and use a group by.

In [39]:
# Your code here
cur.execute('''SELECT * FROM employees 
                LEFT JOIN offices 
                USING(officecode)
                GROUP BY employees.lastname''').fetchall()

[(1102,
  'Bondur',
  'Gerard',
  'x5408',
  'gbondur@classicmodelcars.com',
  4,
  1056,
  'Sale Manager (EMEA)',
  'Paris',
  '+33 14 723 4404',
  "43 Rue Jouffroy D'abbans",
  '',
  '',
  'France',
  '75017',
  'EMEA'),
 (1501,
  'Bott',
  'Larry',
  'x2311',
  'lbott@classicmodelcars.com',
  7,
  1102,
  'Sales Rep',
  'London',
  '+44 20 7877 2041',
  '25 Old Broad Street',
  'Level 7',
  '',
  'UK',
  'EC2N 1HN',
  'EMEA'),
 (1143,
  'Bow',
  'Anthony',
  'x5428',
  'abow@classicmodelcars.com',
  1,
  1056,
  'Sales Manager (NA)',
  'San Francisco',
  '+1 650 219 4782',
  '100 Market Street',
  'Suite 300',
  'CA',
  'USA',
  '94080',
  'NA'),
 (1401,
  'Castillo',
  'Pamela',
  'x2759',
  'pcastillo@classicmodelcars.com',
  4,
  1102,
  'Sales Rep',
  'Paris',
  '+33 14 723 4404',
  "43 Rue Jouffroy D'abbans",
  '',
  '',
  'France',
  '75017',
  'EMEA'),
 (1076,
  'Firrelli',
  'Jeff',
  'x9273',
  'jfirrelli@classicmodelcars.com',
  1,
  1002,
  'VP Marketing',
  'San Francisc

## Write 3 Questions of your own and answer them

In [None]:
# Answers will vary
# Example: Display the htmlDescription and employee's first and last name for each product that each employee has sold

In [41]:
# Your code here
# name and payments for each customer
cur.execute('''SELECT customerName, paymentDate 
                FROM customers c
                JOIN payments p
                ON c.customerNumber = p.customerNumber''').fetchall()

[('Atelier graphique', '2003-06-05'),
 ('Atelier graphique', '2004-10-19'),
 ('Atelier graphique', '2004-12-18'),
 ('Signal Gift Stores', '2003-06-06'),
 ('Signal Gift Stores', '2004-08-20'),
 ('Signal Gift Stores', '2004-12-17'),
 ('Australian Collectors, Co.', '2003-05-20'),
 ('Australian Collectors, Co.', '2003-05-31'),
 ('Australian Collectors, Co.', '2004-03-10'),
 ('Australian Collectors, Co.', '2004-12-15'),
 ('La Rochelle Gifts', '2004-08-08'),
 ('La Rochelle Gifts', '2004-11-14'),
 ('La Rochelle Gifts', '2005-02-22'),
 ('Baane Mini Imports', '2003-02-16'),
 ('Baane Mini Imports', '2003-10-28'),
 ('Baane Mini Imports', '2004-11-04'),
 ('Baane Mini Imports', '2004-11-28'),
 ('Mini Gifts Distributors Ltd.', '2003-04-11'),
 ('Mini Gifts Distributors Ltd.', '2003-08-15'),
 ('Mini Gifts Distributors Ltd.', '2003-11-25'),
 ('Mini Gifts Distributors Ltd.', '2004-03-26'),
 ('Mini Gifts Distributors Ltd.', '2004-08-28'),
 ('Mini Gifts Distributors Ltd.', '2004-11-02'),
 ('Mini Gifts Dis

In [59]:
# Your code here
# salesrep for each order
cur.execute('''SELECT salesRepEmployeeNumber
                        FROM orders
                        LEFT JOIN customers
                        ON(customers.customerNumber)''')
employee_sales = pd.DataFrame(cur.fetchall()) 
employee_sales.columns = [i[0] for i in cur.description]
employee_sales.head()

Unnamed: 0,salesRepEmployeeNumber
0,1370
1,1166
2,1611
3,1370
4,1504


In [68]:
# Your code here
# see how many customers each employee has
cur.execute('''SELECT salesRepEmployeeNumber,
                COUNT(customerNumber)
                FROM customers
                LEFT JOIN employees
                ON customers.salesRepEmployeeNumber=employees.employeeNumber
                GROUP BY salesRepEmployeeNumber
                ORDER BY COUNT(customerNumber) DESC''')
df = pd.DataFrame(cur.fetchall()) 
df.columns = [i[0] for i in cur.description]
df

Unnamed: 0,salesRepEmployeeNumber,COUNT(customerNumber)
0,,22
1,1401.0,10
2,1504.0,9
3,1501.0,8
4,1323.0,8
5,1370.0,7
6,1286.0,7
7,1702.0,6
8,1337.0,6
9,1216.0,6


## Level Up: Display the names of every individual product that each employee has sold

In [50]:
# Your code here


## Level Up: Display the Number of Products each employee has sold

In [53]:
# Your code here
employee_sales['salesRepEmployeeNumber'].value_counts()

        7172
1401    3260
1504    2934
1501    2608
1323    2608
1370    2282
1286    2282
1702    1956
1337    1956
1216    1956
1188    1956
1166    1956
1165    1956
1621    1630
1612    1630
1611    1630
Name: salesRepEmployeeNumber, dtype: int64

## Summary

Congrats! You practiced using join statements and leveraged your foreign keys knowledge!