This Node application extracts all email addresses from a given HTML file. It takes a file rather than a URL because a fetched URL may miss lazily loaded content, may not reflect search terms, or may require a login. I wrote this in a hurry before going to bed, so pardon the code quality.
npm install
FILE_PATH=/Users/denizozger/Desktop/ FILE_NAME=toExtract.html DELIMETER=, node --harmony emailExtractor.js
The environment variables in the command above should be self-explanatory. DELIMETER (note the spelling used in the command) is the character used to separate email addresses in the resulting file. The resulting file is created in FILE_PATH and is called extractedEmailAddresses.txt.
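As a rough illustration of how these variables fit together, here is a minimal sketch. It is not the code in emailExtractor.js: the helper name resolveSettings, the comma fallback for a missing delimiter, and the example.com addresses are all assumptions made for the example.

```javascript
const path = require('path');

// Hypothetical helper (not from emailExtractor.js): derive the input path,
// the output path and the delimiter from an environment-like object.
function resolveSettings(env) {
  const { FILE_PATH, FILE_NAME, DELIMETER } = env;
  if (!FILE_PATH || !FILE_NAME) {
    throw new Error('FILE_PATH and FILE_NAME must both be set');
  }
  return {
    inputFile: path.join(FILE_PATH, FILE_NAME),
    // The output file lives in the same directory as the input file.
    outputFile: path.join(FILE_PATH, 'extractedEmailAddresses.txt'),
    // Assumption: fall back to a comma when DELIMETER is not set.
    delimiter: DELIMETER || ',',
  };
}

const settings = resolveSettings({
  FILE_PATH: '/Users/denizozger/Desktop/',
  FILE_NAME: 'toExtract.html',
  DELIMETER: ',',
});

// The extracted addresses are joined with the delimiter before being written.
const outputText = ['a@example.com', 'b@example.com'].join(settings.delimiter);
console.log(settings.outputFile);
console.log(outputText);
```

In a real run you would pass process.env instead of the literal object and write outputText to settings.outputFile with fs.writeFileSync.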
Email addresses of the form foo@bar@gmail.com are assumed valid. You can easily change the regex, though.
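If you want to reject addresses with more than one @, something along these lines might work. This is only a sketch: the patterns CANDIDATE and SINGLE_AT and the function extractStrict are made up for illustration and are not the regex used in emailExtractor.js, and the pattern is far from a full RFC-compliant validator.

```javascript
// Permissive first pass: any run of characters around an "@" that is not
// whitespace, markup or common punctuation.
const CANDIDATE = /[^\s<>"'(),;:]+@[^\s<>"'(),;:]+/g;

// Stricter second pass: exactly one "@", and a dot somewhere in the domain.
const SINGLE_AT = /^[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+$/;

// Hypothetical helper: return the unique addresses that pass the strict check.
function extractStrict(html) {
  const candidates = html.match(CANDIDATE) || [];
  const valid = candidates
    .map((c) => c.replace(/[.!?]+$/, '')) // drop trailing sentence punctuation
    .filter((c) => SINGLE_AT.test(c));
  return [...new Set(valid)]; // de-duplicate, keeping first-seen order
}

const sample =
  '<p>Contact alice@example.com or foo@bar@gmail.com</p>' +
  '<a href="mailto:bob@example.org">Bob</a>';

console.log(extractStrict(sample));
```

With this sample, foo@bar@gmail.com is dropped entirely rather than being trimmed down to bar@gmail.com; whether that is the behaviour you want depends on your data.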
If anyone needs this as a web application, send me an email or, better, implement it and create a pull request.