WebHarvest mirror (https://sourceforge.net/projects/web-harvest) with some code modifications of my own, including a complete port to HttpComponents 4.2.3. I would be glad if the WebHarvest maintainers wanted to merge my branch and take over maintenance of my additions.
================================================================================
To build Web-Harvest using Maven:
$ mvn clean install