Skip to content

smartsquare/wecky-crawl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wecky - Website Crawler

Checks a given website for modifications. Does so by generating a hash of the webpage and adding a new hash and content if the webpage changed.

Dependencies

Uses DynamoDB for persistence. Creates the table WebsiteHashes if it doesn't exist yet. Region eu-central-1 is hardcoded for now.

Build

Building is straightforward, using Maven:

mvn package

The shade plugin is already attached to the package phase, so the resulting JAR can be uploaded to AWS. Alternatively you can use the buildspec.yml file to build the project in AWS CodeStar.

Test

Localstack is used during tests, to spin up local docker containers with mocked AWS Services. If you want to skip the end-2-end tests you can run the tests using

mvn package -DargLine="-Dci-server=true"

Running locally

Using sam-cli and localstack you can run the lambda function locally.

You need to set the environment variable DYNDB_LOCAL to point to the local DynamoDB that gets started by localstack.

ℹ️ Environment variables need to be set in the template.yml file, since the Lambda Function is run in a docker container. See template_sam.yml for an example.

Then run sam local invoke "CrawlWebsite" -e event.json

About

Crawler for wecky, the website checker

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages