This repository houses several notebooks that fetch search results from Google and parse them for downstream analysis. The goal is to group similar articles into clusters and to identify the articles that stand out from those clusters for further manual analysis.
Web Scraping (API), Exploratory Data Analysis, Data Mining, Clustering
Inspired By This
Custom Google API Search
First, get your own Google API key, which is required to make requests to the search engine service.
Once you have an API key, create a custom search engine. Make sure "Search the entire web" is enabled in the control panel; if it is not, your search engine will not return any results.
Search Engine ID: Copy this value from the control panel. You will need it when setting the environment variables below.
export GOOGLE_API_KEY=YOUR_API_KEY
export GOOGLE_SEARCH_ENGINE=SEARCH_ENGINE_ID
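import os

# Read the credentials exported above
GOOGLE_API_KEY = os.environ["GOOGLE_API_KEY"]
GOOGLE_SEARCH_ENGINE = os.environ["GOOGLE_SEARCH_ENGINE"]
# Create Google Client (GoogleSearch is assumed to be defined or imported from this repository's code)
gc = GoogleSearch(api_key=GOOGLE_API_KEY, search_engine=GOOGLE_SEARCH_ENGINE)
# Get Results
results = gc.get_results(q='coffee near me')
print(results)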
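
For orientation, here is a minimal sketch of what a client like this might do with the API key and search engine ID. It is not the repository's GoogleSearch implementation: the helper names `build_search_url`, `parse_items`, and `search` are hypothetical, and the sketch assumes Google's Custom Search JSON API, where the key is sent as `key`, the search engine ID as `cx`, and each result in the response's `items` list carries `title`, `link`, and `snippet` fields.

```python
import json
import urllib.parse
import urllib.request

# Endpoint of Google's Custom Search JSON API.
API_URL = "https://www.googleapis.com/customsearch/v1"


def build_search_url(api_key, search_engine, q, start=1):
    """Build the request URL (hypothetical helper); `cx` carries the search engine ID."""
    params = {"key": api_key, "cx": search_engine, "q": q, "start": start}
    return API_URL + "?" + urllib.parse.urlencode(params)


def parse_items(payload):
    """Reduce a decoded JSON response to title/link/snippet records."""
    return [
        {
            "title": item.get("title"),
            "link": item.get("link"),
            "snippet": item.get("snippet"),
        }
        for item in payload.get("items", [])
    ]


def search(api_key, search_engine, q):
    """Run one query over the network and return the parsed records."""
    url = build_search_url(api_key, search_engine, q)
    with urllib.request.urlopen(url, timeout=10) as response:
        return parse_items(json.load(response))


# Example call (needs valid credentials and network access):
# rows = search(os.environ["GOOGLE_API_KEY"], os.environ["GOOGLE_SEARCH_ENGINE"], "coffee near me")
```

Keeping URL construction and response parsing separate from the network call means the parsing step can be exercised on saved responses, which is convenient when preparing results for the clustering notebooks.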