# Join Statements

## Introduction

In this lab, you'll practice writing SQL join statements.

## Objectives

You will be able to:
- Write queries that make use of various types of Joins
- Join tables using foreign keys

## CRM Schema

In almost all cases, rather than working with a single table, we will need data from multiple tables. 
Doing this requires the use of **joins** on columns shared between the tables. 
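
As a minimal sketch of what joining on a shared column looks like, the snippet below builds a tiny in-memory SQLite database and joins two tables on `officeCode`. The table and column names mirror the CRM schema, but the rows (and the `conn_demo` / `cur_demo` names) are made up for illustration and are not taken from **data.sqlite**.

```python
import sqlite3

# Toy in-memory database; the rows below are invented for illustration only.
conn_demo = sqlite3.connect(":memory:")
cur_demo = conn_demo.cursor()
cur_demo.executescript("""
    CREATE TABLE offices (officeCode INTEGER PRIMARY KEY, city TEXT);
    CREATE TABLE employees (
        employeeNumber INTEGER PRIMARY KEY,
        lastName TEXT,
        officeCode INTEGER REFERENCES offices(officeCode)
    );
    INSERT INTO offices VALUES (1, 'Boston'), (2, 'Paris');
    INSERT INTO employees VALUES (10, 'Rivera', 1), (11, 'Chen', 2), (12, 'Okafor', 2);
""")

# Inner join: each employee row is matched to the office that shares its officeCode.
cur_demo.execute("""
    SELECT lastName, city
    FROM employees
    JOIN offices USING (officeCode)
    ORDER BY employeeNumber;
""")
joined_rows = cur_demo.fetchall()
print(joined_rows)
conn_demo.close()
```

Here `USING (officeCode)` is shorthand for `ON employees.officeCode = offices.officeCode`; it works because the column has the same name in both tables.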

In this lab, we'll use the same Customer Relationship Management (CRM) database we used in the previous lecture.
<img src='Database-Schema.png' width=550>

## Connecting to the Database
Import the necessary packages and connect to the database **data.sqlite**.

In [1]:
#Your code here
import sqlite3
import pandas as pd
conn = sqlite3.connect('data.sqlite', detect_types=sqlite3.PARSE_COLNAMES)
cur = conn.cursor()

## Display the names of all the employees in Boston.

In [3]:
#Your code here
cur.execute("""select firstName, lastName from employees join offices using(officeCode) where city = 'Boston';""" )
cur.fetchall()

[('Julie', 'Firrelli'), ('Steve', 'Patterson')]

## Do any offices have no employees?

In [5]:
#Your code here
cur.execute('''select city, 
                    count(employeeNumber)
                    from offices
                    left join employees
                    using(officeCode)
                    group by 1;''' )
df = pd.DataFrame(cur.fetchall())
df.head()

Unnamed: 0,0,1
0,Boston,2
1,London,2
2,NYC,2
3,Paris,5
4,San Francisco,6
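
A detail worth checking in a query like the one above: after a `LEFT JOIN`, `COUNT(*)` counts joined rows, so an office with no employees still contributes one row (with NULLs on the employee side) and would show a count of 1. Counting a column from the right-hand table skips those NULLs and gives 0 instead. The sketch below shows the difference on invented data; the 'Lisbon' office and the `conn_demo` / `cur_demo` names are hypothetical and not part of **data.sqlite**.

```python
import sqlite3

# Toy data: Boston has two employees, Lisbon (hypothetical) has none.
conn_demo = sqlite3.connect(":memory:")
cur_demo = conn_demo.cursor()
cur_demo.executescript("""
    CREATE TABLE offices (officeCode INTEGER PRIMARY KEY, city TEXT);
    CREATE TABLE employees (employeeNumber INTEGER PRIMARY KEY, officeCode INTEGER);
    INSERT INTO offices VALUES (1, 'Boston'), (2, 'Lisbon');
    INSERT INTO employees VALUES (10, 1), (11, 1);
""")

# COUNT(*) counts rows; COUNT(employeeNumber) counts only non-NULL employee numbers.
cur_demo.execute("""
    SELECT city,
           COUNT(*) AS row_count,
           COUNT(employeeNumber) AS employee_count
    FROM offices
    LEFT JOIN employees USING (officeCode)
    GROUP BY city
    ORDER BY city;
""")
counts = cur_demo.fetchall()
print(counts)  # [('Boston', 2, 2), ('Lisbon', 1, 0)]
conn_demo.close()
```

Only the `employee_count` column reveals that the second office is empty, which is why counting a right-table column is the safer choice when the question is "which rows have no match?".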


## Write 3 Questions of your own and answer them

In [None]:
# Answers will vary

In [25]:
# Your code here
#are there orders over 40? 
cur.execute("""select quantityOrdered from orderdetails limit 10;""")
df = pd.DataFrame(cur.fetchall()) #Take results and create dataframe
df.columns = [i[0] for i in cur.description]
df.head()

Unnamed: 0,quantityOrdered
0,30
1,50
2,22
3,49
4,25


In [27]:
# Your code here
# who are the vendors?
cur.execute("""select productVendor from products limit 10;""")
df = pd.DataFrame(cur.fetchall()) #Take results and create dataframe
df.columns = [i[0] for i in cur.description]
df.head()


Unnamed: 0,productVendor
0,Min Lin Diecast
1,Classic Metal Creations
2,Highway 66 Mini Classics
3,Red Start Diecast
4,Motor City Art Classics


In [28]:
#what cities are the offices in?
cur.execute("""select city from offices limit 10;""")
df = pd.DataFrame(cur.fetchall()) #Take results and create dataframe
df.columns = [i[0] for i in cur.description]
df.head()

Unnamed: 0,city
0,San Francisco
1,Boston
2,NYC
3,Paris
4,Tokyo
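
The cells above repeat the same steps each time: execute, fetch, then read the column names from `cur.description`. One possible way to avoid the repetition is a small helper, sketched below. The name `run_query` and the toy data are assumptions made for this example, not part of the lab.

```python
import sqlite3

def run_query(connection, sql, params=()):
    """Run a query and return (column_names, rows)."""
    cursor = connection.execute(sql, params)
    # Each entry of cursor.description is a 7-tuple; the first item is the column name.
    column_names = [col[0] for col in cursor.description]
    return column_names, cursor.fetchall()

# Toy in-memory table so the example runs on its own.
conn_demo = sqlite3.connect(":memory:")
conn_demo.executescript("""
    CREATE TABLE offices (officeCode INTEGER PRIMARY KEY, city TEXT);
    INSERT INTO offices VALUES (1, 'Boston'), (2, 'Paris');
""")

# The ? placeholder lets sqlite3 bind the value safely instead of formatting it into the SQL.
columns, rows = run_query(conn_demo, "SELECT city FROM offices WHERE officeCode = ?;", (1,))
print(columns, rows)
conn_demo.close()
```

With pandas imported as in this lab, the result can be turned into a labelled DataFrame in one step with `pd.DataFrame(rows, columns=columns)`.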


In [None]:
# Your code here

## Level Up: Display the names of each product each employee has sold.

In [30]:
# Your code here
cur.execute("""select firstName, lastName,
                      productName
                      from employees e
                      join
                      customers c
                      on e.employeenumber = c.salesRepEmployeeNumber
                      join orders o
                      using(customernumber)
                      join orderdetails od
                      using(orderNumber)
                      join products p
                      using(productCode)""")
df = pd.DataFrame(cur.fetchall())
print(len(df))
df.head()

2996


Unnamed: 0,0,1,2
0,Leslie,Jennings,1958 Setra Bus
1,Leslie,Jennings,1940 Ford Pickup Truck
2,Leslie,Jennings,1939 Cadillac Limousine
3,Leslie,Jennings,1996 Peterbilt 379 Stake Bed with Outrigger
4,Leslie,Jennings,1968 Ford Mustang


## Level Up: Display the Number of Products Each Employee Has Sold

In [31]:
#Your code here
df.groupby([0,1]).count()

0,1,2
Andy,Fixter,185
Barry,Jones,220
Foon Yue,Tseng,142
George,Vanauf,211
Gerard,Hernandez,396
Julie,Firrelli,124
Larry,Bott,236
Leslie,Jennings,331
Leslie,Thompson,114
Loui,Bondur,177
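
The same kind of count can also be computed inside SQL with `GROUP BY`, rather than in pandas. The sketch below uses a toy in-memory copy of the join chain; all rows and the `conn_demo` / `result_cursor` names are invented for illustration. It groups by `employeeNumber` so that two employees who happen to share a name are not merged, and it reports both the number of order lines and the number of distinct products, since those can differ.

```python
import sqlite3

# Toy version of the employees -> customers -> orders -> orderdetails chain.
conn_demo = sqlite3.connect(":memory:")
conn_demo.executescript("""
    CREATE TABLE employees (employeeNumber INTEGER PRIMARY KEY, firstName TEXT, lastName TEXT);
    CREATE TABLE customers (customerNumber INTEGER PRIMARY KEY, salesRepEmployeeNumber INTEGER);
    CREATE TABLE orders (orderNumber INTEGER PRIMARY KEY, customerNumber INTEGER);
    CREATE TABLE orderdetails (orderNumber INTEGER, productCode TEXT);
    INSERT INTO employees VALUES (1, 'Ana', 'Silva'), (2, 'Ben', 'Ito');
    INSERT INTO customers VALUES (100, 1), (101, 2);
    INSERT INTO orders VALUES (500, 100), (501, 100), (502, 101);
    INSERT INTO orderdetails VALUES (500, 'A'), (500, 'B'), (501, 'A'), (502, 'C');
""")

# One output row per employee: total order lines and distinct products sold.
result_cursor = conn_demo.execute("""
    SELECT firstName, lastName,
           COUNT(productCode) AS line_items,
           COUNT(DISTINCT productCode) AS distinct_products
    FROM employees e
    JOIN customers c ON e.employeeNumber = c.salesRepEmployeeNumber
    JOIN orders o USING (customerNumber)
    JOIN orderdetails od USING (orderNumber)
    GROUP BY e.employeeNumber, firstName, lastName
    ORDER BY e.employeeNumber;
""")
per_employee = result_cursor.fetchall()
print(per_employee)  # [('Ana', 'Silva', 3, 2), ('Ben', 'Ito', 1, 1)]
conn_demo.close()
```

In the toy data, product 'A' appears in two of Ana's orders, so she has three order lines but only two distinct products.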


## Summary

Congrats! You now know how to write join statements and how to use foreign keys to connect tables.