# Selecting Data - Lab


## Introduction 

NASA wants to go to Mars! Before they build their rocket, NASA needs to track information about all of the planets in the Solar System. In this lab, you'll practice querying the database with various `SELECT` statements. This will include selecting different columns and implementing other SQL clauses like `WHERE` to return the data desired.

<img src="./images/planets.png" width="600">

## Objectives
You will be able to:
* Connect to a SQL database using Python
* Retrieve all information from a SQL table
* Retrieve a subset of records from a table using a `WHERE` clause
* Write SQL queries to filter and order results
* Retrieve a subset of columns from a table

## Connecting to the Database

To get started, import pandas and sqlite3. Then connect to the database titled `planets.db`.

Don't forget to instantiate a cursor so that you can later execute your queries.

In [1]:
# Your code here
import pandas as pd
import sqlite3

conn = sqlite3.connect('planets.db')
cur = conn.cursor()
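
One quick way to confirm that a connection and cursor work is to list the tables the database contains. The sketch below is only an illustration: it uses an in-memory database with a hypothetical, cut-down `planets` table as a stand-in for `planets.db`, so it runs without the lab's file.

```python
import sqlite3

# Stand-in for planets.db (assumption) so the sketch runs anywhere.
conn = sqlite3.connect(':memory:')
cur = conn.cursor()
cur.execute('CREATE TABLE planets (id INTEGER PRIMARY KEY, name TEXT);')

# sqlite_master is SQLite's built-in catalog of tables, indexes, and views.
cur.execute("SELECT name FROM sqlite_master WHERE type = 'table';")
tables = [row[0] for row in cur.fetchall()]
print(tables)
```

Running the same `sqlite_master` query against the real connection should show which tables are available to query.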



## Selecting Data

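Here's an overview of the `planets` table you'll be querying. As the query results later in this lab show, the database also includes an `id` column and stores `rings` as `0` (no) or `1` (yes).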

|name   |color |num_of_moons|mass|rings|
|-------|-------|-------|-------|-------|
|Mercury|gray   |0      |0.55   |no     |
|Venus  |yellow |0      |0.82   |no     |
|Earth  |blue   |1      |1.00   |no     |
|Mars   |red    |2      |0.11   |no     |
|Jupiter|orange |67     |317.90 |no     |
|Saturn |hazel  |62     |95.19  |yes    |
|Uranus |light blue|27  |14.54  |yes    |
|Neptune|dark blue|14   |17.15  |yes    |

Write SQL queries for each of the statements below using the same pandas wrapping syntax from the previous lesson.
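
The wrapping pattern used throughout this lab (execute, fetch, then name the columns from `cur.description`) can be folded into a small helper. This is a minimal sketch, not part of the lab's required solution: `run_query` is a hypothetical name, and the in-memory table holds just two rows copied from the overview above.

```python
import sqlite3
import pandas as pd


def run_query(cur, query):
    """Execute a query and return the result as a DataFrame."""
    cur.execute(query)
    # cur.description holds one tuple per result column; item 0 is the name.
    columns = [x[0] for x in cur.description]
    return pd.DataFrame(cur.fetchall(), columns=columns)


# Stand-in for planets.db (assumption): two rows from the overview table.
conn = sqlite3.connect(':memory:')
cur = conn.cursor()
cur.execute('''CREATE TABLE planets
               (name TEXT, color TEXT, num_of_moons INTEGER, mass REAL, rings INTEGER);''')
cur.executemany('INSERT INTO planets VALUES (?, ?, ?, ?, ?);',
                [('Earth', 'blue', 1, 1.00, 0), ('Mars', 'red', 2, 0.11, 0)])

df = run_query(cur, 'SELECT name, mass FROM planets;')
print(df)
```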

## Select just the name and color of each planet

In [4]:
# Your code here
cur.execute('''SELECT name, color FROM planets;
''')
df = pd.DataFrame(cur.fetchall())
df.columns = [x[0] for x in cur.description]
df

|   |name   |color     |
|---|-------|----------|
|0  |Mercury|gray      |
|1  |Venus  |yellow    |
|2  |Earth  |blue      |
|3  |Mars   |red       |
|4  |Jupiter|orange    |
|5  |Saturn |hazel     |
|6  |Uranus |light blue|
|7  |Neptune|dark blue |
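
As an alternative to building the DataFrame by hand, pandas can run the query itself through `pd.read_sql`, which takes the query string and the connection and names the columns automatically. The sketch below assumes an in-memory stand-in for `planets.db` with two rows from the overview table.

```python
import sqlite3
import pandas as pd

# Stand-in for planets.db (assumption): two rows from the overview table.
conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE planets (name TEXT, color TEXT);')
conn.executemany('INSERT INTO planets VALUES (?, ?);',
                 [('Mercury', 'gray'), ('Venus', 'yellow')])

# read_sql executes the query and returns a DataFrame with named columns.
df = pd.read_sql('SELECT name, color FROM planets;', conn)
print(df)
```

The cursor-based pattern makes each step explicit, while `pd.read_sql` is shorter; either should give the same DataFrame here.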


## Select all columns for each planet whose mass is greater than 1.00


In [12]:
# Your code here
cur.execute("""SELECT * FROM planets WHERE mass > 1;""")
df = pd.DataFrame(cur.fetchall())
df.columns = [x[0] for x in cur.description]
df

|   |id |name   |color     |num_of_moons|mass  |rings|
|---|---|-------|----------|------------|------|-----|
|0  |5  |Jupiter|orange    |68          |317.9 |0    |
|1  |6  |Saturn |hazel     |62          |95.19 |1    |
|2  |7  |Uranus |light blue|27          |14.54 |1    |
|3  |8  |Neptune|dark blue |14          |17.15 |1    |


## Select the name and mass of each planet whose mass is less than or equal to 1.00

In [14]:
# Your code here
cur.execute('''SELECT name, mass FROM planets WHERE mass <= 1.00;
''')
df = pd.DataFrame(cur.fetchall())
df.columns = [x[0] for x in cur.description]
df

|   |name   |mass|
|---|-------|----|
|0  |Mercury|0.55|
|1  |Venus  |0.82|
|2  |Earth  |1.0 |
|3  |Mars   |0.11|


## Select the name and color of each planet that has more than 10 moons

In [15]:
# Your code here
cur.execute('''SELECT name, color FROM planets WHERE num_of_moons > 10;
''')
df = pd.DataFrame(cur.fetchall())
df.columns = [x[0] for x in cur.description]
df

|   |name   |color     |
|---|-------|----------|
|0  |Jupiter|orange    |
|1  |Saturn |hazel     |
|2  |Uranus |light blue|
|3  |Neptune|dark blue |


## Select the planet that has at least one moon and a mass less than 1.00

In [19]:

# Your code here
cur.execute('''SELECT * FROM planets WHERE mass < 1 AND num_of_moons >= 1;
''')
df = pd.DataFrame(cur.fetchall())
df.columns = [x[0] for x in cur.description]
df

|   |id |name|color|num_of_moons|mass|rings|
|---|---|----|-----|------------|----|-----|
|0  |4  |Mars|red  |2           |0.11|0    |


## Select the name and color of planets that have a color of blue, light blue, or dark blue

In [22]:
# Your code here
cur.execute('''SELECT name, color FROM planets
               WHERE color = 'blue'
               OR color = 'light blue'
               OR color = 'dark blue';
''')
df = pd.DataFrame(cur.fetchall())
df.columns = [x[0] for x in cur.description]
df

|   |name   |color     |
|---|-------|----------|
|0  |Earth  |blue      |
|1  |Uranus |light blue|
|2  |Neptune|dark blue |
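
When a filter compares one column against several values, the chained conditions can also be written with SQL's `IN` operator. The sketch below is one possible alternative, not the lab's required answer; it runs on an in-memory stand-in table (an assumption) holding four rows from the overview above.

```python
import sqlite3

# Stand-in for planets.db (assumption): four rows from the overview table.
conn = sqlite3.connect(':memory:')
cur = conn.cursor()
cur.execute('CREATE TABLE planets (name TEXT, color TEXT);')
cur.executemany('INSERT INTO planets VALUES (?, ?);', [
    ('Earth', 'blue'),
    ('Mars', 'red'),
    ('Uranus', 'light blue'),
    ('Neptune', 'dark blue'),
])

# IN matches any value in the list, replacing three OR-ed comparisons.
cur.execute('''SELECT name, color FROM planets
               WHERE color IN ('blue', 'light blue', 'dark blue');
''')
rows = cur.fetchall()
print(rows)
```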


## Select the name, color, and number of moons for the 4 largest planets that don't have rings and order them from largest to smallest

In [23]:
# Your code here
cur.execute('''SELECT name, color, num_of_moons 
               FROM planets 
               WHERE rings = 0
               ORDER BY mass DESC 
               LIMIT 4;
''')

df = pd.DataFrame(cur.fetchall())
df.columns = [x[0] for x in cur.description]
df

ValueError: Length mismatch: Expected axis has 0 elements, new values have 3 elements
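
This `ValueError` comes from pandas rather than SQLite: the query ran, but `fetchall()` returned an empty list, so the DataFrame had zero columns and could not be assigned three names. The output alone does not show why the filter matched nothing. One way to investigate is to inspect the values actually stored in the column, and passing the column names to the `DataFrame` constructor keeps the pattern from crashing on an empty result. The sketch below assumes a hypothetical stand-in table in which `rings` is stored as text, chosen only so that the filter matches no rows.

```python
import sqlite3
import pandas as pd

# Hypothetical stand-in (assumption): rings stored as text, so rings = 0 matches nothing.
conn = sqlite3.connect(':memory:')
cur = conn.cursor()
cur.execute('CREATE TABLE planets (name TEXT, mass REAL, rings TEXT);')
cur.executemany('INSERT INTO planets VALUES (?, ?, ?);',
                [('Earth', 1.00, 'no'), ('Saturn', 95.19, 'yes')])

# Step 1: look at the values the column really holds.
cur.execute('SELECT DISTINCT rings FROM planets;')
stored = sorted(row[0] for row in cur.fetchall())
print(stored)

# Step 2: build the DataFrame so that an empty result still gets column names.
cur.execute('SELECT name, mass FROM planets WHERE rings = 0 ORDER BY mass DESC LIMIT 4;')
columns = [x[0] for x in cur.description]
df = pd.DataFrame(cur.fetchall(), columns=columns)
print(df)
```

With this construction an empty result produces an empty DataFrame that still carries the `name` and `mass` columns, which makes it easier to see that the filter, not the wrapping code, needs attention.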

## Summary

Congratulations! NASA is one step closer to embarking upon its mission to Mars. In this lab, you practiced writing `SELECT` statements that query a single table to get specific information. You also used other clauses and specified column names to cherry-pick the data you wanted to retrieve.