Skip to content

Convert Your Personal Google Data Download HTML into CSV. View the categories and specific terms you searched for years ago.

Notifications You must be signed in to change notification settings

nicholasurban/Google-Personal-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Google Personal Data


Want to explore the data associated with your Google account?

Google has a data export tool. But the data is housed inside HTML files. This project:

  • Converts your Search history into a CSV for your own EDA, model building, and miscellaneous use.
  • Clusters your Google Searches into a specified number of topics and the words that comprise each

Instructions:

  1. Visit Google Takeout, and request a copy of your data
  2. Install required libraries
  3. Change path to your Google HTML file
  4. Change OUTPUT_FILE to your desired CSV name
  5. Optional: specify start date and end date in the ModelData class to see search query topics from a previous time.
  6. Run

Features Generated From Google Data

  • Log type
  • Query (raw)
  • Date
  • URL
  • Location (coordinates)
  • Day
  • Desktop/Mobile
  • Site
  • Location (address)
  • Query (cleaned for topic modeling)

* Note: Only searches have location data

What

  • ProcessGoogleData class: creates the CSV from Google's HTML
  • GenerateFeatures class: builds additional useful features
  • ModelData class: creates topic NMF or LDA topic models from the Google Search queries*

* Only tested on Google search since I rarely use the others.

About

Convert Your Personal Google Data Download HTML into CSV. View the categories and specific terms you searched for years ago.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published