Amplify Spiders v1

Amplify Spiders v1 is an AWS Amplify project that hosts a Next.js site with real-time data and custom Lambda handlers, including Lambda containers and TensorFlow.js. It provides crawlers for several search engines and is intended for competitive analysis only.

Getting Started

To get started with Amplify Spiders v1, follow these steps:

Clone the repository to your local machine.
Install the necessary dependencies by running npm install.
Set up your AWS Amplify environment by following the instructions in the amplify/README.md file.
Run the project locally by running npm run dev.
Build and push the container. CI/CD for this step still needs some work: see here for more information.
Pre-push hook: this may need to be disabled on the first deploy.
To deploy the first time, rename the hook, run amplify push, then restore the hook's original name.
Update the following secrets with amplify update function:
- googleKey
- googleCx
- foursquareClientId
- foursquareClientSecret
- facebookAccessToken
- infogroupApiKey
- yellowpagesKey
- yelpApiToken
- foursquareApiKey
Deploy the project to the cloud with the hook enabled by running ECR_REPO_NAME="" ACCOUNT_ID="" amplify push, where ECR_REPO_NAME is the repo that CDK generates.
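
Because the final deploy step depends on two environment variables, a small pre-deploy check can catch an empty value before amplify push runs. The following is a minimal sketch and not part of this repository: the helper name missingEnv and the Env type are hypothetical, and only the variable names ECR_REPO_NAME and ACCOUNT_ID come from the step above.

```typescript
// Hypothetical pre-deploy check; this helper does not exist in the repository.
// Only the variable names below are taken from the deploy step.
const REQUIRED_ENV = ["ECR_REPO_NAME", "ACCOUNT_ID"] as const;

// A plain map of environment variables, shaped like Node's process.env.
type Env = Record<string, string | undefined>;

// Return the names of required variables that are unset or blank.
function missingEnv(
  env: Env,
  required: readonly string[] = REQUIRED_ENV
): string[] {
  return required.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}
```

In a Node script, this could be called as missingEnv(process.env), exiting with an error message when the returned list is not empty.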

Features and Functionality

Amplify Spiders v1 includes the following features and functionality:

Roadmap

Amplify Spiders v1 is currently in development. The following features and functionality are planned for future releases:

Finish the main site menu bar (login works, but logout does not yet): IN PROGRESS
Remove Lambda containers by tree-shaking this library to reduce bundle size.
Crawler for Facebook Business: the app needs to be approved by Facebook for the demo site.
Crawler for Bing?
Crawler for Yahoo?
CI/CD for Container Lambda handlers?
Find good sources of regional statistical and demographic data for cross-referencing with search results?

Contributing

See the CONTRIBUTING.md file for information on how to contribute to Amplify Spiders v1.

License

Amplify Spiders v1 is licensed under the MIT License. See LICENSE.txt for more information.

Code of Conduct

Amplify Spiders v1 has adopted the Contributor Covenant Code of Conduct. See CODE_OF_CONDUCT.md for more information.
