# Plotting Foreclosures

In this activity, you’ll use GeoViews to visualize and explore real-estate data.

Instructions:

1. Using the `read_csv` function and the `Path` class from the `pathlib` module, read `2018_Registered_Foreclosure_Properties.csv` from the `Resources` folder, and create the `la_foreclosures_2018` DataFrame.

2. Review the code that creates the `most_foreclosures_df` DataFrame from the `la_foreclosures_2018` DataFrame. The starter file provides this code. Be sure to run this cell as you work through the next steps.

    >**Important** The [Pandas `isin` function](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.isin.html) was used to filter the `la_foreclosures_2018` DataFrame based on a Python list of the four lenders with the most foreclosures: Ocwen Loan Servicing, LLC; Wells Fargo Bank N.A.; JP Morgan Chase NA; and Nationstar Mortgage LLC.

3. Use the `points` function to plot the data from the `most_foreclosures_df` DataFrame. Include parameters as follows:

    * Use the “Longitude” and “Latitude” columns in the DataFrame as your geospatial data.

    * Set `geo` equal to True to enable the GeoViews integration.

    * Set `color` based on the “Lender” column.

    * Set `tiles` to "OSM".

    * Set `frame_width` to 700.

    * Set `frame_height` to 500.

    * Create a `title` for the plot.

4. Use the `points` function to create a second plot from the `most_foreclosures_df` DataFrame. Keep all the parameters unchanged except as follows:

    * Set the `color` parameter to “Property Type”.

    * Change the `title` parameter to correspond to the new information that you’re plotting.

5. Use the `points` function to create a third plot from the `most_foreclosures_df` DataFrame. Keep all the parameters unchanged except as follows:

    * Set the `color` parameter to “Council District”.

    * Change the `title` parameter to correspond to the new information that you’re plotting.

6. Answer the following questions in the notebook:

    * After reviewing the visualizations, what insights can you gain about the foreclosures in Los Angeles in 2018?

    * Which lender owns the most foreclosed properties?

    * Do the lenders tend to focus on one area or council district in the city, or do they evenly distribute their properties throughout the region?


References:

[Geographic data in hvPlot](https://hvplot.holoviz.org/user_guide/Geographic_Data.html)

[Pandas `isin` function](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.isin.html) 

In [None]:
# Import the required libraries and dependencies
import pandas as pd
import os
from pathlib import Path
import hvplot.pandas

### Step 1: Using the `read_csv` function and the `Path` class from the `pathlib` module, read "2018_Registered_Foreclosure_Properties.csv" from the `Resources` folder, and create the `la_foreclosures_2018` DataFrame.
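
The pattern in this step can be tried first on a throwaway file. The following is a minimal sketch, not the activity solution: the file name `sample_foreclosures.csv`, its three columns, and its two rows are hypothetical stand-ins written to a temporary folder.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical two-row CSV standing in for the real file in Resources/.
with tempfile.TemporaryDirectory() as tmp_dir:
    csv_path = Path(tmp_dir) / "sample_foreclosures.csv"
    csv_path.write_text(
        "Lender,Longitude,Latitude\n"
        "Wells Fargo Bank N.A.,-118.25,34.05\n"
        "JP Morgan Chase NA,-118.40,34.10\n"
    )
    # read_csv accepts a Path object directly.
    sample_df = pd.read_csv(csv_path)

# Review the first rows, as the activity asks for the real DataFrame.
print(sample_df.head())
```

Building the path with `Path(...) / "file.csv"` keeps the code independent of the operating system's path separator.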

In [None]:
# Using the read_csv function and the Path class, read in the 
# "2018_Registered_Foreclosure_Properties.csv" file and create the DataFrame

# YOUR CODE HERE


# Review the first and last five rows of the DataFrame
display(# YOUR CODE HERE)
display(# YOUR CODE HERE)

### Step 2: Review the code that creates the `most_foreclosures_df` DataFrame from the `la_foreclosures_2018` DataFrame. The starter file provides this code. Be sure to run this cell as you work through the next steps.
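
To see what the `isin` filter in this step does, it may help to run it on a toy DataFrame. In the sketch below, the rows, the `Example Credit Union` lender, and the `keep` list are hypothetical; only the `Lender` and `Council District` column names come from the activity.

```python
import pandas as pd

# Hypothetical rows; "Example Credit Union" is an invented lender.
sample_df = pd.DataFrame(
    {
        "Lender": [
            "Wells Fargo Bank N.A.",
            "Example Credit Union",
            "JP Morgan Chase NA",
            "Wells Fargo Bank N.A.",
        ],
        "Council District": [1, 4, 9, 14],
    }
)

keep = ["Wells Fargo Bank N.A.", "JP Morgan Chase NA"]

# isin returns a Boolean Series: True where the value appears in the list.
mask = sample_df["Lender"].isin(keep)

# Indexing with the mask keeps only the matching rows.
filtered_df = sample_df[mask]
print(filtered_df)
```

Because `isin` compares values exactly, a lender spelled differently from the list entry (for example, with or without periods) would not match.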

In [None]:
# A list of the lenders that are to be included in the analysis.
lenders = [
    "Ocwen Loan Servicing, LLC",
    "Wells Fargo Bank N.A.",
    "JP Morgan Chase NA",
    "Nationstar Mortgage LLC",
]

# Using the Pandas `isin` function, create a subset of the original DataFrame that includes
# only the rows whose "Lender" value appears in the "lenders" list.
most_foreclosures_df = la_foreclosures_2018[la_foreclosures_2018["Lender"].isin(lenders)]

# Review the first five rows of the resulting DataFrame
most_foreclosures_df.head()

### Step 3: Use the `points` function to plot the data from the `most_foreclosures_df` DataFrame.

Include parameters as follows:

* Use the “Longitude” and “Latitude” columns in the DataFrame as your geospatial data.

* Set `geo` equal to True to enable the GeoViews integration.

* Set `color` based on the “Lender” column.

* Set `tiles` to "OSM".

* Set `frame_width` to 700.

* Set `frame_height` to 500.

* Create a `title` for the plot.
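
One way to organize the parameters listed above is to collect them in a dictionary and unpack it into the `points` call. This is a sketch on hypothetical sample rows rather than on `most_foreclosures_df`; the coordinates and the title text are placeholders, and the plot is only built if hvPlot and GeoViews are installed.

```python
import pandas as pd

# Hypothetical sample rows standing in for most_foreclosures_df.
sample_df = pd.DataFrame(
    {
        "Longitude": [-118.2437, -118.4085],
        "Latitude": [34.0522, 33.9416],
        "Lender": ["Wells Fargo Bank N.A.", "JP Morgan Chase NA"],
    }
)

# The parameters from the list above, collected in one place.
plot_options = {
    "x": "Longitude",
    "y": "Latitude",
    "geo": True,
    "color": "Lender",
    "tiles": "OSM",
    "frame_width": 700,
    "frame_height": 500,
    "title": "Sample Foreclosures by Lender",  # placeholder title
}

try:
    import hvplot.pandas  # noqa: F401  (registers the .hvplot accessor)

    lender_plot = sample_df.hvplot.points(**plot_options)
except ImportError:
    # hvPlot or GeoViews is not installed in this environment.
    lender_plot = None
```

In a notebook, placing `lender_plot` on the last line of a cell displays the interactive map.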

In [None]:
# Create the plot based on the most_foreclosures_df DataFrame.
# The color parameter should be based on Lender

# YOUR CODE HERE

### Step 4: Use the `points` function to create a second plot from the `most_foreclosures_df` DataFrame. 

Keep all the parameters unchanged except as follows:

* Set the `color` parameter to “Property Type”.

* Change the `title` parameter to correspond to the new information that you’re plotting.
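
Since only `color` and `title` change between plots, one possible approach is to keep the shared settings in a dictionary and override just those two keys. The names `base_options` and `property_type_options` and both title strings below are hypothetical.

```python
# Shared settings from Step 3 (titles are placeholders).
base_options = {
    "x": "Longitude",
    "y": "Latitude",
    "geo": True,
    "color": "Lender",
    "tiles": "OSM",
    "frame_width": 700,
    "frame_height": 500,
    "title": "Foreclosures by Lender",
}

# Later keys win when dictionaries are merged with ** unpacking,
# so only color and title differ from the base settings.
property_type_options = {
    **base_options,
    "color": "Property Type",
    "title": "Foreclosures by Property Type",
}

print(property_type_options)
```

The merged dictionary could then be unpacked into the plotting call, for example `most_foreclosures_df.hvplot.points(**property_type_options)`.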

In [None]:
# Create the plot based on the most_foreclosures_df DataFrame.
# The color parameter should be based on Property Type

# YOUR CODE HERE

### Step 5: Use the `points` function to create a third plot from the `most_foreclosures_df` DataFrame. 

Keep all the parameters unchanged except as follows:

* Set the `color` parameter to “Council District”.

* Change the `title` parameter to correspond to the new information that you’re plotting.
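
If the "Council District" column is stored as numbers (an assumption; the activity does not say), hvPlot will generally color the points on a continuous scale rather than as separate categories. Should distinct per-district colors be preferred, one option is to plot from a copy whose district values are strings. The sample rows below are hypothetical.

```python
import pandas as pd

# Hypothetical rows; the real column comes from most_foreclosures_df.
sample_df = pd.DataFrame(
    {
        "Council District": [1, 14, 9],
        "Lender": ["A Bank", "B Bank", "A Bank"],  # invented lender names
    }
)

# assign returns a new DataFrame, so the original stays numeric.
categorical_df = sample_df.assign(
    **{"Council District": sample_df["Council District"].astype(str)}
)

print(categorical_df["Council District"].tolist())
```

Whether a continuous or categorical coloring reads better depends on the question being asked, so both are worth trying.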

In [None]:
# Create the plot based on the most_foreclosures_df DataFrame.
# The color parameter should be based on Council District

# YOUR CODE HERE

### Step 6: Answer the following questions:

**Questions:** After reviewing the visualizations, what insights can you gain about the foreclosures in Los Angeles in 2018? Which lender owns the most foreclosed properties? Do the lenders tend to focus on one area or council district in the city or are their properties evenly distributed throughout the region?
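
Visual impressions can be checked against simple tallies. The sketch below uses invented lenders and districts; with the real data, the same calls could be applied to `most_foreclosures_df`.

```python
import pandas as pd

# Hypothetical rows with invented lender names.
sample_df = pd.DataFrame(
    {
        "Lender": ["A Bank", "A Bank", "B Bank", "A Bank", "B Bank"],
        "Council District": [1, 1, 2, 3, 2],
    }
)

# Number of foreclosed properties per lender, largest first.
per_lender = sample_df["Lender"].value_counts()

# Lender-by-district table of counts, to see how concentrated each lender is.
per_district = pd.crosstab(sample_df["Lender"], sample_df["Council District"])

print(per_lender)
print(per_district)
```

A lender whose counts sit mostly in one column of the cross-tabulation is concentrated in that district; counts spread across columns suggest a wider distribution.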

**Answers:** # YOUR ANSWER HERE