Skip to content

jelewe/button_puppeteer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

button_puppeteer

The purpose of this script is to compare 2 or more domains & their shared subpages, and return a CSV list of all anchor tags with class= primary button. This script is useful when duplicating a website and confirming that button links match without having to manually compare each subpage. Of course, the user can change the query selector to their intended purpose if they wish to compare something other than anchor tags. This script can then be ran from the CLI with node scrape.js


This script uses Puppeteer to pull anchor tag data from multiple domains & URLs.
See https://pptr.dev/

Puppeteer must be installed after cloning the repo. Run npm install puppeteer

A note: One seemingly common issue encountered is [Bug]: Error retrieving document: Error: Could not find Chrome
The solution is to change the default cache directory of Puppeteer by creating a config file. You will then need to uninstall and reinstall Puppeteer to resolve. Instructions here: https://pptr.dev/guides/configuration#changing-the-default-cache-directory
This script also uses the json2csv library to output the scraped data into a CSV file.
See https://github.com/zemirco/json2csv

json2csv library must also be installed after cloning the repo. Run npm install json2csv

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published