# The Voice of the Blockchain

Canada is at the frontier of the blockchain sector, with increasing adoption rates and favorable regulations. In this activity, you will retrieve news articles about blockchain in Canada in both English and French to capture the voice of the blockchain.

In [6]:
# Initial imports
import os
import pandas as pd
from pathlib import Path
from dotenv import load_dotenv
from newsapi import NewsApiClient

In [7]:
# Load environment variables and retrieve the News API key
load_dotenv()
api_key = os.getenv("NEWS_API_KEY")
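
`os.getenv` returns `None` when the variable is missing, so a misconfigured `.env` file may only show up later as a failed request. A minimal sketch of an early check follows; the helper name `require_env` is hypothetical and is not part of `python-dotenv` or the News API client.

```python
import os


def require_env(name):
    """Return the value of an environment variable, or raise if it is unset or empty."""
    value = os.getenv(name)
    if not value:
        raise RuntimeError(f"{name} is not set; add it to your .env file")
    return value


# Usage (assumes load_dotenv() has already run):
# api_key = require_env("NEWS_API_KEY")
```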

In [8]:
# Create the newsapi client
newsapi = NewsApiClient(api_key=api_key)

## Getting News Articles in English

In this section, use the News API to fetch all the news articles in English that match the keywords `blockchain`, `canada`, and `2020`.



In [9]:
# Fetch news about Canada and Blockchain in 2020 in the English language
blockchain_news_en = newsapi.get_everything(
    q="blockchain AND canada AND 2020",
    language="en"
)

# Show the total number of news articles
blockchain_news_en["totalResults"]

179
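
`totalResults` counts every match, while a single `get_everything` call returns only one page of `articles`. A rough sketch of collecting several pages is shown below. The names `fetch_all_articles` and `fetch_page` are hypothetical: `fetch_page` stands for any callable that takes a page number and returns a response dictionary shaped like the one above. With the real client it could wrap `newsapi.get_everything(..., page=page)`, and plan limits may cap how many results can actually be retrieved.

```python
def fetch_all_articles(fetch_page, max_pages=5):
    """Collect articles page by page until every result is read or max_pages is hit."""
    articles = []
    for page in range(1, max_pages + 1):
        response = fetch_page(page)
        batch = response.get("articles", [])
        if not batch:
            break
        articles.extend(batch)
        if len(articles) >= response.get("totalResults", 0):
            break
    return articles


# Stand-in for the API: 5 made-up results served 2 per page
def fake_fetch_page(page, page_size=2):
    data = [{"title": f"Article {i}"} for i in range(5)]
    start = (page - 1) * page_size
    return {"totalResults": len(data), "articles": data[start:start + page_size]}


all_articles = fetch_all_articles(fake_fetch_page)
print(len(all_articles))
```

The `max_pages` cap keeps the loop from issuing an unbounded number of requests against a rate-limited API.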

## Getting News Articles in French

Fetching news in French normally requires keywords in that language. The keywords `blockchain`, `canada`, and `2020` are also used in French, so retrieve the French news articles from the News API with the same query.

In [10]:
# Fetch news about Canada and Blockchain in 2020 in the French language
blockchain_news_fr = newsapi.get_everything(
    q="blockchain AND canada AND 2020",
    language="fr"
)

# Show the total number of news articles
blockchain_news_fr["totalResults"]

3

## Create a DataFrame with All the Results

The first task in this section is to create a function called `create_df(news, language)` that transforms the `articles` list into a DataFrame. The function receives two parameters: `news` is the list of articles, and `language` is a string that specifies the language of the news articles.

The resulting DataFrame should have the following columns:

* `title`: The article's title
* `description`: The article's description
* `text`: The article's content
* `date`: The date when the article was published, using the format `YYYY-MM-DD` (e.g. 2019-07-11)
* `language`: A string specifying the news language (`en` for English, `fr` for French)

In [11]:
# Function to create a dataframe for english news and french news
def create_df(news, language):
    articles = []
    for article in news:
        try:
            title = article["title"]
            description = article["description"]
            text = article["content"]
            date = article["publishedAt"][:10]

            articles.append({
                "title": title,
                "description": description,
                "text": text,
                "date": date,
                "language": language
            })
        except (KeyError, TypeError):
            # Skip articles with a missing field or a null publication date
            pass

    return pd.DataFrame(articles)
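
The `[:10]` slice relies on `publishedAt` being a timestamp whose first ten characters are the date. A small sketch, assuming a value of the form `2021-04-06T14:30:00Z` (a made-up sample, not taken from the results), compares the slice with explicit parsing:

```python
from datetime import datetime

published_at = "2021-04-06T14:30:00Z"  # made-up sample timestamp

# Slicing keeps the first ten characters, which hold the date
sliced = published_at[:10]

# Parsing checks that the value really is a timestamp before formatting it
parsed = datetime.strptime(published_at, "%Y-%m-%dT%H:%M:%SZ").strftime("%Y-%m-%d")

print(sliced == parsed)
```

Slicing is shorter, while parsing fails loudly on a malformed value instead of silently storing a wrong date.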

Use the `create_df()` function to create a DataFrame for the English news and another for the French news.

In [12]:
# Create a DataFrame with the news in English
blockchain_en_df = create_df(blockchain_news_en["articles"], "en")

# Create a DataFrame with the news in French
blockchain_fr_df = create_df(blockchain_news_fr["articles"], "fr")

Concatenate both DataFrames, with the English news at the top and the French news at the bottom.

In [13]:
# Concatenate dataframes
blockchain_df = pd.concat([blockchain_en_df, blockchain_fr_df])
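
By default, `pd.concat` keeps each frame's original index, so the row labels restart at 0 where the French rows begin. The sketch below uses two small made-up frames (`en_df` and `fr_df` are illustrative names) to show the default next to `ignore_index=True`, which renumbers the rows:

```python
import pandas as pd

# Two small made-up frames standing in for the English and French results
en_df = pd.DataFrame({"title": ["a", "b"], "language": ["en", "en"]})
fr_df = pd.DataFrame({"title": ["c"], "language": ["fr"]})

# Default behavior: each frame keeps its own row labels
kept = pd.concat([en_df, fr_df])

# ignore_index=True assigns a fresh 0..n-1 index to the result
renumbered = pd.concat([en_df, fr_df], ignore_index=True)

print(list(kept.index), list(renumbered.index))
```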

In [14]:
# Show the head articles (they are in English)
blockchain_df.head()

| | date | description | language | text | title |
|---|---|---|---|---|---|
| 0 | 2021-04-06 | \*To read more by the Thomson Reuters Regulator... | en | NEW YORK(Thomson Reuters Regulatory Intelligen... | INSIGHT: U.S. cryptocurrency regulatory path a... |
| 1 | 2021-04-24 | Summary List PlacementHere's a rundown of news... | en | Here's a rundown of news on hires, exits, and ... | Must-know promotions, exits, and hires at firm... |
| 2 | 2021-05-03 | The launch of ether exchange-traded funds in C... | en | SINGAPORE: Cryptocurrency ether broke past $3,... | Ethereum breaks past $3,000 to quadruple value |
| 3 | 2021-05-03 | Ether, the token transacted on the ethereum bl... | en | Cryptocurrency ether broke past $3,000 on Mond... | Ethereum breaks past $3,000 to quadruple in va... |
| 4 | 2021-04-09 | Blockchain technology was all the rage back in... | en | Blockchain technology was all the rage back in... | Blockchain ETF Lets Investors Cash In On Bitco... |


In [15]:
# Show the tail articles (the last three are in French)
blockchain_df.tail()

| | date | description | language | text | title |
|---|---|---|---|---|---|
| 18 | 2021-04-22 | The Canadian Association of Medical Mask Manuf... | en | A leader in vertical integration, machine lear... | Inno Lifecare Joins the Canadian Association o... |
| 19 | 2021-04-17 | \<ol>\<li>Mark Cuban Believes DOGE ‘Will Find It... | en | (Bloomberg) -- The Nordic region is losing its... | Mark Cuban Believes DOGE ‘Will Find Its Level’... |
| 0 | 2021-04-10 | Plusieurs facteurs rendent ces cyberattaques d... | fr | Ce nest plus un secret, les ransomwares sont e... | Pourquoi les ransomwares prolifèrent-ils autan... |
| 1 | 2021-05-02 | Aujourd'hui, on va parler de cryptomonnaies et... | fr | Aujourd'hui, on va parler de cryptomonnaies et... | Une approche féministe de la blockchain est-el... |
| 2 | 2021-04-09 | PEKIN, 9 avril 2021 /PRNewswire/ -- La restaur... | fr | PEKIN, 9 avril 2021 /PRNewswire/ -- La restaur... | Les projets d'alimentation et de boissons TOJO... |


Save the final DataFrame as a CSV file for further analysis in the forthcoming activities.

In [16]:
# Save to CSV
file_path = Path("../Resources/blockchain_news_en_fr.csv")
blockchain_df.to_csv(file_path, index=False, encoding='utf-8-sig')
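
The `utf-8-sig` encoding writes a byte order mark at the start of the file, which can help spreadsheet programs detect UTF-8 and display the accented characters in the French articles correctly. A minimal round-trip sketch follows; it uses a made-up row and a temporary directory rather than the real `../Resources` path.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Made-up row with accented characters, as in the French articles
sample_df = pd.DataFrame({"title": ["Une approche féministe"], "language": ["fr"]})

with tempfile.TemporaryDirectory() as tmp_dir:
    sample_path = Path(tmp_dir) / "sample_news.csv"
    sample_df.to_csv(sample_path, index=False, encoding="utf-8-sig")

    # The raw bytes start with the UTF-8 byte order mark
    raw = sample_path.read_bytes()

    # Reading with the same encoding strips the mark again
    restored_df = pd.read_csv(sample_path, encoding="utf-8-sig")

print(restored_df["title"].tolist())
```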