Used to gather Canopy Forum Authors and Articles for the CSLR Team.
- Download/Clone this repository by clicking on the "Code" green button.
- Install Python 3: Visit the python webpage and use the installer
- Install the neccessary python-3 packages:
python3 -m pip install requests beautifulsoup4 tqdm - Run the scraper from your terminal - ensure you are in the appropriate directory:
python3 CFScraper.py
Inside the CFScraper.py script, you will find a couple of flags at the top of the script:
logging = FalseSetting this toTruewill print warnings and progress information to the consolesave_to_pdf = False # Has not been fully implementedSetting this toTruewill save each script as a pdfdebugging = FalseSetting this toTruewill only export 1 contactauto_email = FalseSetting this toTruewill email the csv file to a target email you provide
If you have set the auto_email script flag to True, then you must use the following parameters when running the script:
--email- The from sender (gmail account - CSLR team, see login info for scraper email)--password- The password for the from sender--to- The to email address (can be any email address)
Example: python3 CFScraper.py --email "MY EMAIL" --password "MY PASSWORD", --to "TO EMAIL"
If you run into a Bad Credentials Error, note that the from email you are using must be a gmail account.
Furthermore, the gmail account must have "Less Secure Apps Authentication" enabled.
For any other issues, feel free to contact me or file a new issue.