## Peer-graded Assignment: Analyzing Historical Stock/Revenue Data and Building a Dashboard
By: Taha Ebrahimnazari

### **Question 1:** Use `yfinance` to extract Stock Data.
Reset the index, save, and display the first five rows of the `tesla_data` dataframe using the `head` function. Upload a screenshot of the results and code from the beginning of Question 1 to the results below.

In [1]:
# Import the YFINANCE library:
import yfinance as yf

I pass the ticker symbol of the stock I want to extract data on to the `Ticker` function, which creates a ticker object named `tesla`. The stock is **Tesla** and its ticker symbol is `TSLA`.

In [2]:
# Download historical data for TESLA stock
tesla = yf.Ticker("TSLA")

I use the ticker object `tesla` and the function `history` to extract stock information and save it in a dataframe named `tesla_data`. I set the period parameter to `max` to get information for the maximum amount of time.

In [3]:
# Setting data period:
tesla_data = tesla.history(period="max")

Resetting the index and displaying the first five rows.

In [4]:
# Reset the index and display the downloaded data
tesla_data.reset_index(inplace=True)
tesla_data.head()

Unnamed: 0,Date,Open,High,Low,Close,Volume,Dividends,Stock Splits
0,2010-06-29 00:00:00-04:00,1.266667,1.666667,1.169333,1.592667,281494500,0.0,0.0
1,2010-06-30 00:00:00-04:00,1.719333,2.028,1.553333,1.588667,257806500,0.0,0.0
2,2010-07-01 00:00:00-04:00,1.666667,1.728,1.351333,1.464,123282000,0.0,0.0
3,2010-07-02 00:00:00-04:00,1.533333,1.54,1.247333,1.28,77097000,0.0,0.0
4,2010-07-06 00:00:00-04:00,1.333333,1.333333,1.055333,1.074,103003500,0.0,0.0
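
To see what `reset_index` changes, here is a minimal sketch on a tiny hand-made frame. The prices are illustrative values, not real quotes, and the name `toy` is hypothetical. `history` returns a frame indexed by `Date`; resetting the index turns that index into an ordinary column, which is what lets later code refer to `tesla_data.Date`.

```python
import pandas as pd

# Hand-made stand-in for the frame returned by `history` (illustrative values only).
toy = pd.DataFrame(
    {"Close": [1.5, 1.6, 1.4]},
    index=pd.to_datetime(["2010-06-29", "2010-06-30", "2010-07-01"]),
)
toy.index.name = "Date"

# Before the reset, Date is the index, not a column.
columns_before = list(toy.columns)

# reset_index moves the index into a regular column and installs a 0..n-1 index.
toy_reset = toy.reset_index()
columns_after = list(toy_reset.columns)
```

The notebook uses `inplace=True`, which modifies `tesla_data` directly instead of returning a new frame; the resulting columns are the same.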


### **Question 2:** Use Webscraping to Extract Tesla Revenue Data
Display the last five rows of the `tesla_revenue` dataframe using the `tail` function. Upload a screenshot of the results.

In [5]:
# Import the REQUESTS, BEAUTIFULSOUP, and PANDAS libraries:
import requests
from bs4 import BeautifulSoup
import pandas as pd

I use the `requests` library to download the webpage [https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-PY0220EN-SkillsNetwork/labs/project/revenue.htm](https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-PY0220EN-SkillsNetwork/labs/project/revenue.htm). Then, I save the text of the response as a variable named `tesla_html`.

In [6]:
url_tesla = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-PY0220EN-SkillsNetwork/labs/project/revenue.htm"
tesla_html  = requests.get(url_tesla).text

Next, I parse the html data into the `soup_tesla` object.

In [7]:
soup_tesla = BeautifulSoup(tesla_html, 'html.parser')

In this section, I use Beautiful Soup to locate the table with *Tesla Quarterly Revenue* and create an empty dataframe named `tesla_revenue` to hold its contents. The dataframe has two columns, *Date* and *Revenue*.

In [8]:
tesla_tables = soup_tesla.find_all('table')
 
for index,table in enumerate(tesla_tables):
    if ("Tesla Quarterly Revenue" in str(table)):
        tesla_table_index = index
tesla_revenue = pd.DataFrame(columns=["Date", "Revenue"])

I then loop over the table rows, strip the *comma* and *dollar sign* from each *Revenue* value, and append each row to `tesla_revenue`.

In [9]:
for row in tesla_tables[tesla_table_index].tbody.find_all("tr"):
    col = row.find_all("td")
    # Skip rows without data cells (for example, header rows)
    if col:
        date = col[0].text
        # Strip the dollar sign and thousands separators
        revenue = col[1].text.replace("$", "").replace(",", "")
        new_row = pd.DataFrame({"Date": [date], "Revenue": [revenue]})
        tesla_revenue = pd.concat([tesla_revenue, new_row], ignore_index=True)
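
The cleaning step can be sketched in isolation. The helper name `clean_revenue` and the sample strings are hypothetical; the string operations are the same `replace` calls used above. The sketch also collects rows in a plain list, which is one alternative to calling `pd.concat` once per row.

```python
def clean_revenue(text):
    """Remove the dollar sign and thousands separators from a revenue cell."""
    return text.replace("$", "").replace(",", "")

# Hypothetical cell texts in the shape the table uses: (date, revenue).
raw_rows = [("2021-03-31", "$10,389"), ("2010-06-30", "$28"), ("2009-12-31", "")]

# Build all rows first instead of growing a DataFrame inside the loop.
rows = [{"Date": date, "Revenue": clean_revenue(revenue)} for date, revenue in raw_rows]
```

A list of dictionaries like `rows` can be passed to `pd.DataFrame` in a single call. Note that an empty cell stays an empty string after cleaning.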

Finally, I display the last five rows of `tesla_revenue` using the `tail` function.

In [10]:
tesla_revenue.tail()

Unnamed: 0,Date,Revenue
49,2010-06-30,28.0
50,2010-03-31,21.0
51,2009-12-31,
52,2009-09-30,46.0
53,2009-06-30,27.0


The row for 2009-12-31 has no revenue value, so I remove rows with missing values to prevent problems when creating the graph.

In [11]:
# Drop missing values:
tesla_revenue.dropna(inplace=True)
not_empty = tesla_revenue["Revenue"]!=""
tesla_revenue = tesla_revenue[not_empty]

tesla_revenue.tail()

Unnamed: 0,Date,Revenue
48,2010-09-30,31
49,2010-06-30,28
50,2010-03-31,21
52,2009-09-30,46
53,2009-06-30,27
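
Both steps are needed because the scraped revenue values are strings, so a missing cell is an empty string rather than `NaN`, and `dropna` alone leaves it in place. The sketch below shows this on a hand-made frame (the name `toy_revenue` is hypothetical). It then shows one alternative, `pd.to_numeric` with `errors="coerce"`, which turns unparseable cells into `NaN` so that a single `dropna` handles them.

```python
import pandas as pd

toy_revenue = pd.DataFrame(
    {"Date": ["2010-03-31", "2009-12-31", "2009-09-30"], "Revenue": ["21", "", "46"]}
)

# An empty string is not NaN, so dropna keeps all three rows.
after_dropna = toy_revenue.dropna()

# The filter used above removes the empty-string row.
filtered = after_dropna[after_dropna["Revenue"] != ""]

# Alternative: coerce to numbers first; "" becomes NaN and dropna removes it.
numeric = toy_revenue.assign(
    Revenue=pd.to_numeric(toy_revenue["Revenue"], errors="coerce")
)
numeric = numeric.dropna(subset=["Revenue"])
```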


### **Question 3:** Use `yfinance` to Extract Stock Data
Reset the index, save, and display the first five rows of the `gme_data` dataframe using the `head` function. Upload a screenshot of the results and code from the beginning of Question 1 to the results below.

Using the `Ticker` function, I enter the ticker symbol of the stock I want to extract data on to create a ticker object named `gamestop`. The stock is **GameStop** and its ticker symbol is `GME`.

In [12]:
# Download historical data for GAMESTOP stock
gamestop = yf.Ticker("GME")

Using the ticker object and the function `history` I extract stock information and save it in a dataframe named `gamestop_data`. I set the period parameter to `max` to get information for the maximum amount of time.

In [13]:
# Setting data period:
gamestop_data = gamestop.history(period="max")

Resetting the index and displaying the first five rows.

In [14]:
# Reset the index and display the downloaded data
gamestop_data.reset_index(inplace=True)
gamestop_data.head()

Unnamed: 0,Date,Open,High,Low,Close,Volume,Dividends,Stock Splits
0,2002-02-13 00:00:00-05:00,1.620129,1.69335,1.603296,1.691667,76216000,0.0,0.0
1,2002-02-14 00:00:00-05:00,1.712707,1.716074,1.670626,1.68325,11021600,0.0,0.0
2,2002-02-15 00:00:00-05:00,1.68325,1.687458,1.658001,1.674834,8389600,0.0,0.0
3,2002-02-19 00:00:00-05:00,1.666418,1.666418,1.578047,1.607504,7410400,0.0,0.0
4,2002-02-20 00:00:00-05:00,1.61592,1.66221,1.603296,1.66221,6892800,0.0,0.0


### **Question 4:** Use Webscraping to Extract GME Revenue Data
Display the last five rows of the `gme_revenue` dataframe using the `tail` function. Upload a screenshot of the results.

I use the `requests` library to download the webpage [https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-PY0220EN-SkillsNetwork/labs/project/stock.html](https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-PY0220EN-SkillsNetwork/labs/project/stock.html). Then, I save the text of the response as a variable named `gamestop_html`.

In [15]:
url_gamestop = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-PY0220EN-SkillsNetwork/labs/project/stock.html"
gamestop_html  = requests.get(url_gamestop).text

Next, I parse the html data into the `soup_gamestop` object.

In [16]:
soup_gamestop = BeautifulSoup(gamestop_html, 'html.parser')

In this section, I use Beautiful Soup to locate the table with *GameStop Quarterly Revenue* and create an empty dataframe named `gamestop_revenue` to hold its contents. The dataframe has two columns, *Date* and *Revenue*.

In [17]:
gamestop_tables = soup_gamestop.find_all('table')
 
for index,table in enumerate(gamestop_tables):
    if ("GameStop Quarterly Revenue" in str(table)):
        gamestop_table_index = index
        
gamestop_revenue = pd.DataFrame(columns=["Date", "Revenue"])

I then loop over the table rows, strip the *comma* and *dollar sign* from each *Revenue* value, and append each row to `gamestop_revenue`.

In [18]:
for row in gamestop_tables[gamestop_table_index].tbody.find_all("tr"):
    col = row.find_all("td")
    # Skip rows without data cells (for example, header rows)
    if col:
        date = col[0].text
        # Strip the dollar sign and thousands separators
        revenue = col[1].text.replace("$", "").replace(",", "")
        new_row = pd.DataFrame({"Date": [date], "Revenue": [revenue]})
        gamestop_revenue = pd.concat([gamestop_revenue, new_row], ignore_index=True)
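
The Tesla and GameStop sections run the same extraction steps, so the logic could be factored into one function. The following is a sketch rather than the notebook's actual code: the function name `extract_revenue`, its parameters, and the inline HTML sample are hypothetical, and the sample figures are made up for illustration. It assumes `beautifulsoup4` and `pandas` are installed.

```python
import pandas as pd
from bs4 import BeautifulSoup

def extract_revenue(html, caption):
    """Return a Date/Revenue frame from the last table whose markup mentions `caption`."""
    soup = BeautifulSoup(html, "html.parser")
    matches = [t for t in soup.find_all("table") if caption in str(t)]
    if not matches:
        raise ValueError(f"no table mentions {caption!r}")
    rows = []
    # Like the loops above, use the last table that matches.
    for tr in matches[-1].find_all("tr"):
        cells = tr.find_all("td")
        if len(cells) >= 2:  # header rows use <th>, so they are skipped
            revenue = cells[1].text.replace("$", "").replace(",", "")
            rows.append({"Date": cells[0].text, "Revenue": revenue})
    return pd.DataFrame(rows, columns=["Date", "Revenue"])

# Made-up sample page with one unrelated table and one matching table.
sample_html = """
<table><thead><tr><th>Other Table</th></tr></thead>
<tbody><tr><td>x</td><td>y</td></tr></tbody></table>
<table><thead><tr><th colspan="2">Example Quarterly Revenue</th></tr></thead>
<tbody><tr><td>2020-04-30</td><td>$1,021</td></tr>
<tr><td>2020-01-31</td><td></td></tr></tbody></table>
"""
sample_revenue = extract_revenue(sample_html, "Example Quarterly Revenue")
```

With such a helper, each company would need only one call with its own HTML text and caption string. Raising an error when no table matches avoids the undefined-index failure that the index-tracking loop would hit in that case.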

Finally, I display the last five rows of `gamestop_revenue` using the `tail` function.

In [19]:
gamestop_revenue.tail()

Unnamed: 0,Date,Revenue
57,2006-01-31,1667
58,2005-10-31,534
59,2005-07-31,416
60,2005-04-30,475
61,2005-01-31,709


### **Question 5:** Plot Tesla Stock Graph

Use the `make_graph` function to graph the *Tesla Stock Data*, also provide a title for the graph. Upload a screenshot of your results.

In [20]:
import plotly.graph_objects as go
from plotly.subplots import make_subplots

In [21]:
# Define MAKE_GRAPH function

def make_graph(stock_data, revenue_data, stock):
    fig = make_subplots(rows=2, cols=1, shared_xaxes=True, subplot_titles=("Historical Share Price", "Historical Revenue"), vertical_spacing = .3)
    stock_data_specific = stock_data[stock_data.Date <= '2021-06-14']
    revenue_data_specific = revenue_data[revenue_data.Date <= '2021-04-30']
    fig.add_trace(go.Scatter(x=pd.to_datetime(stock_data_specific.Date), y=stock_data_specific.Close.astype("float"), name="Share Price"), row=1, col=1)
    fig.add_trace(go.Scatter(x=pd.to_datetime(revenue_data_specific.Date), y=revenue_data_specific.Revenue.astype("float"), name="Revenue"), row=2, col=1)
    fig.update_xaxes(title_text="Date", row=1, col=1)
    fig.update_xaxes(title_text="Date", row=2, col=1)
    fig.update_yaxes(title_text="Price ($US)", row=1, col=1)
    fig.update_yaxes(title_text="Revenue ($US Millions)", row=2, col=1)
    fig.update_layout(
        showlegend=False,
        height=900,
        title=stock,
        xaxis_rangeslider_visible=True,
    )
    fig.show()
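
The revenue filter inside `make_graph` compares the `Date` column with a cutoff written as a string. That works because ISO 8601 dates (`YYYY-MM-DD`) sort the same way as text and as calendar dates. A minimal sketch with plain Python strings, where the names `dates`, `cutoff`, and `kept` and the sample dates are illustrative:

```python
# Dates in ISO format, as in the scraped revenue table.
dates = ["2021-07-31", "2021-04-30", "2020-10-31", "2009-06-30"]
cutoff = "2021-04-30"

# Plain string comparison keeps exactly the dates on or before the cutoff.
kept = [d for d in dates if d <= cutoff]
```

This shortcut depends on the zero-padded year-month-day layout; dates written in another order would need to be parsed first, for example with `pd.to_datetime`.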

In [22]:
make_graph(tesla_data,tesla_revenue,'Tesla')

### **Question 6:** Plot GameStop Stock Graph
Use the `make_graph` function to graph the *GameStop Stock Data*, also provide a title for the graph. Upload a screenshot of your results.

In [23]:
make_graph(gamestop_data,gamestop_revenue,'GameStop')