<h1>Extracting and Visualizing Stock Data</h1>
<h2>Description</h2>


Extracting essential data from a dataset and displaying it is a necessary part of data science, because it lets individuals make well-informed decisions based on the data. In this assignment, I extract some stock data and then display it in a graph.


In [None]:
! pip install yfinance
! pip install bs4 html5lib

In [None]:
import yfinance as yf
import pandas as pd
import requests
from bs4 import BeautifulSoup
import plotly.graph_objects as go
from plotly.subplots import make_subplots

## 1. Use yfinance to Extract Stock Data


Using the `Ticker` function, I pass in the ticker symbol of the stock I want to extract data on to create a ticker object. The stock is Tesla and its ticker symbol is `TSLA`.


In [None]:
Tesla = yf.Ticker("TSLA")

Using the ticker object and the `history` function, I extract the stock information and save it in a dataframe named `tesla_data`. I set the `period` parameter to `max` to get information for the maximum amount of time.

In [None]:
tesla_data = Tesla.history(period="max")

**I reset the index** by calling `reset_index(inplace=True)` on the `tesla_data` dataframe, then display its first five rows using the `head` function.

In [None]:
tesla_data.reset_index(inplace = True)
tesla_data.head()
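
To illustrate what the index reset does, here is a minimal sketch on a small, hypothetical price table (the values and the name `prices` are made up for illustration and are not taken from yfinance). It assumes, as with the dataframe returned by `history`, that the dates start out as the index.

```python
import pandas as pd

# Hypothetical prices indexed by date, mimicking the shape returned by `history`
prices = pd.DataFrame(
    {"Close": [10.0, 11.5, 11.0]},
    index=pd.to_datetime(["2021-01-04", "2021-01-05", "2021-01-06"]),
)
prices.index.name = "Date"

# Before the reset, "Date" is the index and not a regular column
print(list(prices.columns))   # ['Close']

# reset_index moves the index into a column and installs a 0..n-1 integer index
prices.reset_index(inplace=True)
print(list(prices.columns))   # ['Date', 'Close']
```

After the reset, `Date` can be accessed like any other column, which is what the graphing function later relies on when it reads `stock_data.Date`.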

## 2. Webscraping to Extract Tesla Revenue Data


I use the `requests` library to download the webpage [https://www.macrotrends.net/stocks/charts/TSLA/tesla/revenue](https://www.macrotrends.net/stocks/charts/TSLA/tesla/revenue) and save the text of the response in a variable named `html_data`.


In [None]:
url = 'https://www.macrotrends.net/stocks/charts/TSLA/tesla/revenue'
html_data = requests.get(url).text

I parse the HTML data using `BeautifulSoup`.


In [None]:
soup = BeautifulSoup(html_data, "html5lib")
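
As a rough illustration of how a parsed page can be navigated, the sketch below runs on a small, hypothetical HTML string instead of the downloaded page. The table titles, the values, and the names `sample_html`, `sample_soup`, `tables`, `quarterly`, and `cells` are all assumptions made for the example. It also uses Python's built-in `html.parser` rather than `html5lib`, so it runs without extra packages.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for a downloaded page with two tables
sample_html = """
<table><thead><tr><th colspan="2">Example Annual Revenue</th></tr></thead>
<tbody><tr><td>2020</td><td>$100</td></tr></tbody></table>
<table><thead><tr><th colspan="2">Example Quarterly Revenue</th></tr></thead>
<tbody><tr><td>2020-12-31</td><td>$30</td></tr>
<tr><td>2020-09-30</td><td>$25</td></tr></tbody></table>
"""

# "html.parser" ships with Python; the notebook itself uses "html5lib"
sample_soup = BeautifulSoup(sample_html, "html.parser")

# find_all returns every <table>; keep the one whose text has the wanted title
tables = sample_soup.find_all("table")
quarterly = next(t for t in tables if "Quarterly Revenue" in t.get_text())

# Each <tr> in the body is a row; each <td> is a cell
cells = [
    [td.text for td in tr.find_all("td")]
    for tr in quarterly.find("tbody").find_all("tr")
]
print(cells)   # [['2020-12-31', '$30'], ['2020-09-30', '$25']]
```

Selecting the table by its title text, rather than taking the first `tbody` on the page, matters whenever a page holds more than one table.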

I use BeautifulSoup to extract the table titled `Tesla Quarterly Revenue` and store it in a dataframe named `tesla_revenue`. The dataframe contains the columns `Date` and `Revenue`. Commas and dollar signs are removed from the `Revenue` column.


In [None]:
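# Locate the table whose text contains "Tesla Quarterly Revenue"
# (soup.find("tbody") alone would return the first table on the page)
quarterly_table = next(t for t in soup.find_all("table") if "Tesla Quarterly Revenue" in t.get_text())

# Collect the rows first, then build the dataframe in one step
# (DataFrame.append was removed in pandas 2.0)
rows = []
for row in quarterly_table.find("tbody").find_all("tr"):
    col = row.find_all("td")
    if len(col) < 2:
        continue
    rows.append({"Date": col[0].text.strip(), "Revenue": col[1].text.strip()})
tesla_revenue = pd.DataFrame(rows, columns=["Date", "Revenue"])

# regex=False treats "$" as a literal character instead of a regex anchor
tesla_revenue["Revenue"] = tesla_revenue["Revenue"].str.replace("$", "", regex=False).str.replace(",", "", regex=False)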
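
The cleaning step can be checked in isolation. The following is a minimal sketch on hypothetical revenue strings (the values and the names `raw`, `cleaned`, and `values` are made up for illustration). It passes `regex=False` so that `$` is matched as a literal character; interpreted as a regular expression, `$` means "end of string" and the dollar sign would be left in place.

```python
import pandas as pd

# Hypothetical revenue strings in the format scraped from the page
raw = pd.Series(["$1,234", "$987", "$10,744"])

# regex=False makes "$" a literal character rather than a regex anchor
cleaned = raw.str.replace("$", "", regex=False).str.replace(",", "", regex=False)
print(cleaned.tolist())   # ['1234', '987', '10744']

# Once the symbols are gone, the strings convert cleanly to numbers
values = pd.to_numeric(cleaned)
print(values.sum())       # 12965
```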


I remove the rows in the dataframe that are empty strings or are NaN in the Revenue column.

In [None]:
tesla_revenue.dropna(subset=['Revenue'],inplace = True)
tesla_revenue = tesla_revenue[tesla_revenue['Revenue'] != ""]
tesla_revenue
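
To show the effect of the two filters, here is a small sketch on a hypothetical dataframe (the rows and the name `sample` are assumptions for illustration) that contains one empty string and one missing value in the `Revenue` column.

```python
import numpy as np
import pandas as pd

# Hypothetical scraped rows: one empty string and one missing value
sample = pd.DataFrame({
    "Date": ["2010-06-30", "2010-03-31", "2009-12-31", "2009-09-30"],
    "Revenue": ["28", "", np.nan, "46"],
})

# dropna removes the NaN row; the boolean mask then removes the empty string
sample = sample.dropna(subset=["Revenue"])
sample = sample[sample["Revenue"] != ""]

print(sample["Revenue"].tolist())   # ['28', '46']
```

Both filters are needed because `dropna` does not treat an empty string as missing.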

I display the last five rows of the `tesla_revenue` dataframe using the `tail` function.


In [None]:
tesla_revenue.tail()

## 3. Use yfinance to Extract Stock Data


I use the `Ticker` function, passing in the ticker symbol of the stock I want to extract data on, to create a ticker object. The stock is GameStop and its ticker symbol is `GME`.


In [None]:
GameStop = yf.Ticker("GME")

I use the ticker object and the `history` function to extract the stock information and save it in a dataframe named `gme_data`. I set the `period` parameter to `max` to get information for the maximum amount of time.


In [None]:
gme_data = GameStop.history(period="max")

**I reset the index** by calling `reset_index(inplace=True)` on the `gme_data` dataframe, then display its first five rows using the `head` function.

In [None]:
gme_data.reset_index(inplace=True)
gme_data.head()

## 4. Use Webscraping to Extract GME Revenue Data


I use the `requests` library to download the webpage [https://www.macrotrends.net/stocks/charts/GME/gamestop/revenue](https://www.macrotrends.net/stocks/charts/GME/gamestop/revenue) and save the text of the response in a variable named `html_data`.


In [None]:
url = 'https://www.macrotrends.net/stocks/charts/GME/gamestop/revenue'
html_data = requests.get(url).text

I parse the HTML data using `BeautifulSoup`.


In [None]:
soup = BeautifulSoup(html_data, "html5lib")

I use BeautifulSoup to extract the table titled `GameStop Quarterly Revenue` and store it in a dataframe named `gme_revenue`. The dataframe contains the columns `Date` and `Revenue`. Commas and dollar signs are removed from the `Revenue` column.


In [None]:
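# Locate the table whose text contains "GameStop Quarterly Revenue"
# (soup.find("tbody") alone would return the first table on the page)
quarterly_table = next(t for t in soup.find_all("table") if "GameStop Quarterly Revenue" in t.get_text())

# Collect the rows first, then build the dataframe in one step
# (DataFrame.append was removed in pandas 2.0)
rows = []
for row in quarterly_table.find("tbody").find_all("tr"):
    col = row.find_all("td")
    if len(col) < 2:
        continue
    rows.append({"Date": col[0].text.strip(), "Revenue": col[1].text.strip()})
gme_revenue = pd.DataFrame(rows, columns=["Date", "Revenue"])

# regex=False treats "$" as a literal character instead of a regex anchor
gme_revenue["Revenue"] = gme_revenue["Revenue"].str.replace("$", "", regex=False).str.replace(",", "", regex=False)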

I display the last five rows of the `gme_revenue` dataframe using the `tail` function.


In [None]:
gme_revenue.tail()

## 5. Define Graphing Function


I define the function `make_graph`, which plots the historical share price and the historical revenue as two stacked subplots.

In [None]:
def make_graph(stock_data, revenue_data, stock):
    # Two stacked subplots sharing the date axis: share price on top, revenue below
    fig = make_subplots(rows=2, cols=1, shared_xaxes=True, subplot_titles=("Historical Share Price", "Historical Revenue"), vertical_spacing=.3)
    # infer_datetime_format is omitted: it is deprecated since pandas 2.0
    fig.add_trace(go.Scatter(x=pd.to_datetime(stock_data.Date), y=stock_data.Close.astype("float"), name="Share Price"), row=1, col=1)
    fig.add_trace(go.Scatter(x=pd.to_datetime(revenue_data.Date), y=revenue_data.Revenue.astype("float"), name="Revenue"), row=2, col=1)
    fig.update_xaxes(title_text="Date", row=1, col=1)
    fig.update_xaxes(title_text="Date", row=2, col=1)
    fig.update_yaxes(title_text="Price ($US)", row=1, col=1)
    fig.update_yaxes(title_text="Revenue ($US Millions)", row=2, col=1)
    fig.update_layout(
        showlegend=False,
        height=900,
        title=stock,
        xaxis_rangeslider_visible=True,
    )
    fig.show()

## 6. Plot Tesla Stock Graph

I use the `make_graph` function to graph the Tesla stock data and provide a title for the graph. The call is `make_graph(tesla_data, tesla_revenue, 'Tesla')`.


In [None]:
make_graph(tesla_data, tesla_revenue, 'Tesla')

## 7. Plot GameStop Stock Graph


I use the `make_graph` function to graph the GameStop stock data and provide a title for the graph. The call is `make_graph(gme_data, gme_revenue, 'GameStop')`.


In [None]:
make_graph(gme_data, gme_revenue, 'GameStop')


<hr>

<h3 align="center"> © IBM Corporation 2020. All rights reserved. </h3>

