Serverless Crawl - Serverless Framework AWS

This was generated using the Serverless Framework AWS Python template.

This is an example of a serverless static website crawler using AWS Lambda, SQS, S3, and DynamoDB.

Note: You may be notified by AWS about recursive loops with Lambda. This is because of the Crawl SQS handler Lambda also writing to the same SQS queue that it is polling from. This is mitigated by using the DynamoDB table to track the URLs that have been queued.

Plugins

serverless-python-requirements
serverless-lift

Architecture

Usage

Deployment

In order to deploy the example, you need to run the following command:

$ serverless deploy

After running deploy, you should see output similar to:

Deploying aws-python-project to stage dev (us-east-1)

✔ Service deployed to stack aws-python-project-dev (112s)

functions:
  hello: aws-python-project-dev-start (1.5 kB)

Invocation

After successful deployment, you can invoke the deployed function by using the following command:

serverless invoke --function start

Local development

You can invoke your function locally by using the following command:

serverless invoke local --function start

Bundling dependencies

In case you would like to include third-party dependencies, you will need to use a plugin called serverless-python-requirements. You can set it up by running the following command:

serverless plugin install -n serverless-python-requirements

Running the above will automatically add serverless-python-requirements to plugins section in your serverless.yml file and add it as a devDependency to package.json file. The package.json file will be automatically created if it doesn't exist beforehand. Now you will be able to add your dependencies to requirements.txt file (Pipfile and pyproject.toml is also supported but requires additional configuration) and they will be automatically injected to Lambda package during build process. For more details about the plugin's configuration, please refer to official documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
images		images
src/handlers		src/handlers
.env.local		.env.local
.gitignore		.gitignore
README.md		README.md
handler.py		handler.py
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt
serverless.yml		serverless.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Serverless Crawl - Serverless Framework AWS

Plugins

Architecture

Usage

Deployment

Invocation

Local development

Bundling dependencies

About

Releases

Packages

Languages

rjolaverria/serverless-crawler

Folders and files

Latest commit

History

Repository files navigation

Serverless Crawl - Serverless Framework AWS

Plugins

Architecture

Usage

Deployment

Invocation

Local development

Bundling dependencies

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages