An umbrella project comprising several modules that scrape and crawl financial news for relevant information.
Install the following packages before using the modules::
>> pip install lxml
>> pip install requests
>> pip install pandas
>> pip install quandl
>> pip install matplotlib
>> pip install "numpy>=1.6.1"
>> pip install pillow
Alternatively, install them all from the requirements file:
>> pip install -r requirements.txt
Download the package from https://github.com/somdipdey/Scrapping_And_Crawling_FinancialNews_For_Keywords.git
Then install it from the package's root directory:
>> python setup.py install
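As a quick sanity check after installation, the sketch below reports which of the listed dependencies cannot be imported. The helper name `missing_packages` is hypothetical and not part of this project; note that pillow is imported under the name `PIL`.

```python
from importlib.util import find_spec

# Import names of the dependencies listed above (pillow is imported as PIL).
REQUIRED = ["lxml", "requests", "pandas", "quandl", "matplotlib", "numpy", "PIL"]


def missing_packages(names):
    """Return the top-level module names that cannot be found on this system."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```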
The news article on "Fast-Food Joke About Saudi Prince Backfires for Lebanon Comedian" available at https://www.bloomberg.com/news/articles/2018-01-26/fast-food-joke-about-saudi-prince-backfires-for-lebanon-comedian can be visualised as follows:
Using Word Cloud to visualise the news article with specific keywords appearing in specific colors (Module: news_word_cloud.news_word_cloud_by_color)
The news article on "Fast-Food Joke About Saudi Prince Backfires for Lebanon Comedian" available at https://www.bloomberg.com/news/articles/2018-01-26/fast-food-joke-about-saudi-prince-backfires-for-lebanon-comedian can be visualised as follows:
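A minimal sketch of how keyword-specific coloring could be done, assuming the third-party `wordcloud` package (not in the install list above) is available. The names `KeywordColorFunc` and `save_word_cloud` are hypothetical and are not taken from `news_word_cloud_by_color`; the actual module may work differently.

```python
class KeywordColorFunc:
    """Color function: fixed colors for chosen keywords, a default for the rest."""

    def __init__(self, keyword_colors, default_color="gray"):
        # Store keywords in lower case so matching is case-insensitive
        # (an assumption made for this sketch).
        self.keyword_colors = {w.lower(): c for w, c in keyword_colors.items()}
        self.default_color = default_color

    def __call__(self, word, **kwargs):
        # wordcloud also passes font_size, position, etc.; they are ignored here.
        return self.keyword_colors.get(word.lower(), self.default_color)


def save_word_cloud(text, color_func, out_path):
    """Render `text` as a word cloud image using the given color function."""
    from wordcloud import WordCloud  # third-party; assumed to be installed

    cloud = WordCloud(color_func=color_func).generate(text)
    cloud.to_file(out_path)
```

For example, `save_word_cloud(article_text, KeywordColorFunc({"Saudi": "red", "Lebanon": "green"}), "cloud.png")` would draw those two words in the given colors and every other word in gray, where `article_text` is the article body as a plain string.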
Using Keyword Crawler to extract/check if the news article contains the keyword(s) (Module: financial_news_keyword_crawler)
The news article on "Fast-Food Joke About Saudi Prince Backfires for Lebanon Comedian" available at https://www.bloomberg.com/news/articles/2018-01-26/fast-food-joke-about-saudi-prince-backfires-for-lebanon-comedian is searched for the keyword "national", and the result is as follows:
Paragraph number: 1
False
Paragraph number: 2
False
Paragraph number: 3
False
Paragraph number: 4
False
Paragraph number: 5
False
Paragraph number: 6
False
Paragraph number: 7
False
Paragraph number: 8
False
Paragraph number: 9
False
Paragraph number: 10
False
Paragraph number: 11
False
Paragraph number: 12
False
Paragraph number: 13
False
Paragraph number: 14
False
Paragraph number: 15
True
Paragraph number: 16
False
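The per-paragraph check shown above can be sketched as follows. This is an illustration only, not the code of `financial_news_keyword_crawler`: it uses the standard library's `html.parser` instead of lxml so that it runs without extra dependencies, and it assumes a case-insensitive substring match. The names `ParagraphCollector` and `keyword_hits` are hypothetical.

```python
from html.parser import HTMLParser


class ParagraphCollector(HTMLParser):
    """Collect the text content of every <p> element in an HTML document."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []
        self._in_p = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._in_p = True
            self._buffer = []

    def handle_endtag(self, tag):
        if tag == "p" and self._in_p:
            self.paragraphs.append("".join(self._buffer).strip())
            self._in_p = False

    def handle_data(self, data):
        # Text inside nested inline tags (e.g. <b>) is still part of the paragraph.
        if self._in_p:
            self._buffer.append(data)


def keyword_hits(html, keyword):
    """Return one boolean per paragraph: does it contain the keyword?"""
    collector = ParagraphCollector()
    collector.feed(html)
    needle = keyword.lower()
    return [needle in paragraph.lower() for paragraph in collector.paragraphs]


if __name__ == "__main__":
    sample = "<p>Markets opened higher.</p><p>The <b>national</b> bank cut rates.</p>"
    for number, found in enumerate(keyword_hits(sample, "national"), start=1):
        print("Paragraph number:", number)
        print(found)
```

For a live article, the HTML string could be obtained with `requests.get(url).text` before calling `keyword_hits`.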