Selenium-based protocol implementation #144
referenced this issue
Jun 18, 2015
@kkrugler could put that in a separate repo so that Java 8 does not become a requirement for core and the other modules.
I think Nutch has an HTMLUnit-based protocol implementation, but I'm not sure it has been used much yet, and I haven't heard any feedback on it. There's also a Selenium one.
It's very easy to use and is based on Selenium WebDriver, which means it supports every browser that has a driver implementation.
I did some very intensive integration testing with Geb (including waiting for AJAX responses etc.) and it is absolutely awesome.
Hi @raaz1234, see the branch https://github.com/DigitalPebble/storm-crawler/tree/jBrowserDriver. It is not merged yet, but please give it a try.