This project search web pages in the web by keywords using SERP API, WebDriver with Selenium to get all page content (static and dynamically content) and Jsoup to clean the web page (remove ads, images, etc).
- Execute the commands bellow:
git clone https://github.com/kaio-giovanni/webpage-cleaner.git
cd webpage-cleaner
./gradlew clean build
-
Create a Serp API account and get the API key.
-
Create an
.env
file in the project root and enter your credentials based on the.env.example
file.
- Execute the commands bellow:
./gradlew bootRun