Skip to content

Built in February 2020. A Node.js + Puppeteer.js application to fetch as many valid email addresses from crawling web pages sources resulted from crawling search engines sources, according to specific and random search keys.

License

Notifications You must be signed in to change notification settings

orassayag/crawler

Repository files navigation

Crawler

Built in February 2020. A Node.js + Puppeteer.js application to fetch as many valid email addresses from crawling web pages sources resulted from crawling search engines sources, according to specific and random search keys.

Getting Started

Clone the application to your computer. Steps to view the solution:

  1. Open server application on IDE (I use VSCode).
  2. In the terminal run: 'npm i'.
  3. In the terminal run: 'npm start'.

Search email addresses script:

READ THE FOLLOWING SETTINGS BEFORE YOU BEGIN:

-Instructions & How to use:

  1. Go to the 'settings.js' file and make sure that all settings are fit to needs.
  2. Go to the 'list/searchKeys.list.js' file and make sure all search keys fit to needs.
  3. Go to the 'list/filterKeys.list.js' file and make sure all domains filtered are fit to needs.
  4. Make sure the Mongo database is installed and configured properly.
  5. Run the script by 'npm start' in the terminal and let the magic begin :).

In the terminal run: 'npm start'.

Happy testing! :).

Prerequisites

You'll need to install VSCode, Node, and clone the application, and in the terminal run: 'npm i'.

Built With

  • Node.js - The web framework used - Server side.
  • Git - Source management.

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

Author

License

This application has an MIT license.

About

Built in February 2020. A Node.js + Puppeteer.js application to fetch as many valid email addresses from crawling web pages sources resulted from crawling search engines sources, according to specific and random search keys.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published