In [2]:
from googlesearch import search  # grabs URLs from google query
from newspaper import Article    # very flexible library for getting all kinds of data from a source
from newspaper import Config
import nltk                      # NLTK; newspaper's article.nlp() relies on it (needs the 'punkt' tokenizer data)



"""
A quick way to compile all sources related to a Google search and extract data without
ever having to click on a link. You could search for a person, or a company
they are involved in, and easily collect a volume of sources for background research.

sources:
Eric D. Brown, D.Sc.
https://pythondata.com/quick-tip-consuming-google-search-results-to-use-for-web-scraping/
Super cool site


BSD 3-Clause License

Copyright (c) 2019, 
All rights reserved.

Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:

* Redistributions of source code must retain the above copyright notice, this
  list of conditions and the following disclaimer.

* Redistributions in binary form must reproduce the above copyright notice,
  this list of conditions and the following disclaimer in the documentation
  and/or other materials provided with the distribution.

* Neither the name of the copyright holder nor the names of its
  contributors may be used to endorse or promote products derived from
  this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

"""



# type what you want to search for between the quotes
query = 'Silicon Valley Criticisms'


# specify how many hits you want
amount = 25



sources = []

for url in search(query, tld='com', stop=amount):
    sources.append(url)
    
    
# Some sites return a 403 (Forbidden) error, so set a browser-like user agent
# to try to avoid being blocked.
user_agent = 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_5) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/50.0.2661.102 Safari/537.36'
config = Config()
config.browser_user_agent = user_agent


# A try/except block is used because some sites can't be accessed this way (typically a 403 error).
# Catching Exception instead of using a bare except keeps Ctrl-C working; make the except
# more specific if more features are added later.
for link in sources:
    try:
        article = Article(link, config=config)  # pass the config so the custom user agent is actually used
        article.download()
        article.parse()
        article.nlp()
        print(link)
        print(article.authors)
        print(article.publish_date)
        print(article.summary)
        print("\n")

    except Exception:
        print(link)
        print('access blocked, check link manually')
        print("\n")
    


https://www.theguardian.com/news/2018/may/03/why-silicon-valley-cant-fix-itself-tech-humanism
['Ben Tarnoff', 'Moira Weigel']
2018-05-03 00:00:00
Why Silicon Valley can’t fix itself – podcast Read moreIn the mid-2000s, as the internet moved on to mobile devices, physical and virtual life began to merge.
As suspicion of Silicon Valley grows, the tech humanists are making a bid to become tech’s loyal opposition.
The story begins in the 1960s, when Silicon Valley was still a handful of electronics firms clustered among fruit orchards.
Computing was industrial, in other words, not personal, and Silicon Valley remained dependent on a small number of big institutional clients.
Far from challenging Silicon Valley, tech humanism offers Silicon Valley a useful way to pacify public concerns without surrendering any of its enormous wealth and power.


https://www.wired.com/story/the-other-tech-bubble/
['Erin Griffith', 'Hannah Wallace', 'Katia Moskvitch', 'Peter Rubin', 'Emily Tate', 'Evan Ratlif

https://www.vox.com/conversations/2017/9/8/16266496/silicon-valley-google-apple-facebook-amazon-monopolies
['Sep']
2017-09-08 00:00:00
The tech companies have no such fear,” Foer writes.
Alexander Bisley“Like Donald Trump, Silicon Valley is part of the great American tradition of sham populism.” This is a strong comparison.
“Obama spent his presidency cheering on the tech companies, even pleading with the Europeans not to collect the taxes owed to them.” This seems like a significant Obama fail.
Elizabeth Warren has been especially good on the tech companies.
She delivered a speech about a year ago about the dangers of monopoly that singled out the tech companies.


https://www.technologyreview.com/s/533096/a-feminist-critique-of-silicon-valley/
['Jason Pontin']
None
Shanley Kane is cofounder and editor of the most interesting and original of new publications that cover technology: Model View Culture, a quarterly journal and media site that offers readers a remorseless feminist critiqu

https://qz.com/work/1415191/how-vc-mark-suster-handles-the-sting-of-criticism-when-he-speaks-up-against-sexism-in-silicon-valley/
[]
None
An entrepreneur-turned-VC, Suster, age 50, successfully founded and sold two companies before joining Upfront (he sold his second company to Salesforce).
Though he knew some founders would push back against the idea of a fund regulating job-interviewing practices, Suster wasn’t too concerned.
The biggest threat men have is the same threat that women have—existential destruction of the planet.
Modeling good behavior is the most important thing both men and women can do to change equality for women in the future.
My advice for young men: speak up.


